IEICE global.ieice.org Site

Author Search Result

[Author] Lei HU(18hit)

1-18hit

Fast Modular Reduction over Euclidean Rings and Its Application to Universal Hash Functions
Xiangyong ZENG Lei HU

LETTER

Vol:
E88-A No:1
Page(s):
305-310
In this letter, we propose a fast modular reduction method over Euclidean rings, which is a generalization of Barrett's reduction algorithm over the ring of integers. As an application, we construct new universal hash function families whose operations are modular arithmetic over a Euclidean ring, which can be any of three rings, the ring of integers, the ring of Gauss integers and the ring of Eisenstein integers. The implementation of these families is efficient by using our method.
On Linear Complexity of Kronecker Sequences
QuanLong WANG Lei HU ZongDuo DAI

PAPER-Information Security

Vol:
E86-A No:11
Page(s):
2853-2859
Recently six conjectures on linear complexities (LC) of some Kronecker sequences of two or three component sequences are proposed by Karkkainen. In, the LC of Kronecker sequences of two component sequences were studied by Uehara and Imamura, their results are true except in the case when eb 2 or when ea = eb = 1. In this paper the LC for Kronecker sequences of two component sequences are determined completely, and it is shown that all the six conjectures are true except in some special cases, which are listed and corrected.
Computationally Efficient Method of Signal Subspace Fitting for Direction-of-Arrival Estimation
Lei HUANG Dazheng FENG Linrang ZHANG Shunjun WU

PAPER-Antennas and Propagation

Vol:
E88-B No:8
Page(s):
3408-3415
It is interesting to resolve coherent signals impinging upon a linear sensor array with low computational complexity in array signal processing. In this paper, a computationally efficient method of signal subspace fitting (SSF) for direction-of-arrival (DOA) estimation is developed, based on the multi-stage wiener filter (MSWF). To find the new signal subspace, the proposed method only needs to compute the matched filters in the forward recursion of the MSWF, does not involve the estimate of an array covariance matrix or any eigendecomposition, thus implying that the proposed method is computationally efficient. Numerical results show that the proposed method provides the comparable estimation accuracy with the classical weighted subspace fitting (WSF) method for uncorrelated signals at reasonably high SNR and reasonably large samples, and surpasses the latter for coherent signals in the case of low SNR and small samples. When SNR is low and the samples are small, the proposed method is less accurate than the classical WSF method for uncorrelated signals. This drawback is balanced by the computational advantage of the proposed method.
A Micro-Code-Based IME Engine for HEVC and Its Hardware Implementation
Leilei HUANG Yibo FAN Chenhao GU Xiaoyang ZENG

PAPER-Integrated Electronics

Vol:
E102-C No:10
Page(s):
756-765
High Efficiency Video Coding (HEVC) standard is now becoming one of the most widespread video coding standards in the world. As a successor of H.264 standard, it aims to provide a much superior encoding performance. To fulfill this goal, several new notations along with the corresponding computation processes are introduced by this standard. Among those computation processes, the integer motion estimation (IME) is one of bottlenecks due to the complex partitions of the inter prediction units (PU) and the large search window commonly adopted. Many algorithms have been proposed to address this issue and usually put emphasis on a large search window and great computation amount. However, the coding efforts should be related to the scenes. To be more specific, for relatively static videos, a small search window along with a simple search scheme should be adopted to reduce the time cost and power consumption. In view of this, a micro-code-based IME engine is proposed in this paper, which could be applied with search schemes of different complexity. To test the performance, three different search schemes based on this engine are designed and evaluated under HEVC test model (HM) 16.9, achieving a B-D rate increase of 0.55/-0.07/-0.14%. Compared with our previous work, the hardware implementation is optimized to reduce 64.2% of the SRAMs bits and 32.8% of the logic gate count. The final design could support 4K×2K @139/85/37fps videos @500MHz.
Faster-ADNet for Visual Tracking
Tiansa ZHANG Chunlei HUO Zhiqiang ZHOU Bo WANG

LETTER-Image Recognition, Computer Vision

Pubricized:
2018/12/12
Vol:
E102-D No:3
Page(s):
684-687
By taking advantages of deep learning and reinforcement learning, ADNet (Action Decision Network) outperforms other approaches. However, its speed and performance are still limited by factors such as unreliable confidence score estimation and redundant historical actions. To address the above limitations, a faster and more accurate approach named Faster-ADNet is proposed in this paper. By optimizing the tracking process via a status re-identification network, the proposed approach is more efficient and 6 times faster than ADNet. At the same time, the accuracy and stability are enhanced by historical actions removal. Experiments demonstrate the advantages of Faster-ADNet.
Constructing Boolean Functions by Modifying Maiorana-McFarland's Superclass Functions
Xiangyong ZENG Lei HU

PAPER-Symmetric Key Cryptography

Vol:
E88-A No:1
Page(s):
59-66
In this study, we construct balanced Boolean functions with a high nonlinearity and an optimum algebraic degree for both odd and even dimensions. Our approach is based on modifying functions from the Maiorana-McFarland's superclass, which has been introduced by Carlet. A drawback of Maiorana-McFarland's function is that their restrictions obtained by fixing some variables in their input are affine. Affine functions are cryptographically weak functions, so there is a risk that this property will be exploited in attacks. Due to the contribution of Carlet, our constructions do not have the potential weakness that is shared by the Maiorana-McFarland construction or its modifications.
A Family of Binary Sequences with 4-Valued Optimal Out-of-Phase Correlation and Large Linear Span
Xiangyong ZENG Lei HU Wenfeng JIANG

PAPER-Information Theory

Vol:
E89-A No:7
Page(s):
2029-2035
In this paper, a new family S(r) of 2n binary sequences of period 2n-1 is proposed, where n ≡ 2 mod 4 and gcd(r, 2n/2-1)=1. The presented family takes 4-valued out-of-phase auto- and cross-correlation values -1, 2n/2-1, and 2n/2+1-1, and its correlation distribution is determined. For r=2(n-2)/4-1, each sequence in S(r), except the unique ideal autocorrelation sequence in the family, is proved to have a large linear span n2n/2-2, whilst the linear span of the latter is n2(n-2)/4-1.
A High-Throughput and Compact Hardware Implementation for the Reconstruction Loop in HEVC Intra Encoding
Yibo FAN Leilei HUANG Zheng XIE Xiaoyang ZENG

PAPER-Integrated Electronics

Vol:
E100-C No:6
Page(s):
643-654
In the newly finalized video coding standard, namely high efficiency video coding (HEVC), new notations like coding unit (CU), prediction unit (PU) and transformation unit (TU) are introduced to improve the coding performance. As a result, the reconstruction loop in intra encoding is heavily burdened to choose the best partitions or modes for them. In order to solve the bottleneck problems in cycle and hardware cost, this paper proposed a high-throughput and compact implementation for such a reconstruction loop. By “high-throughput”, it refers to that it has a fixed throughput of 32 pixel/cycle independent of the TU/PU size (except for 4×4 TUs). By “compact”, it refers to that it fully explores the reusability between discrete cosine transform (DCT) and inverse discrete cosine transform (IDCT) as well as that between quantization (Q) and de-quantization (IQ). Besides the contributions made in designing related hardware, this paper also provides a universal formula to analyze the cycle cost of the reconstruction loop and proposed a parallel-process scheme to further reduce the cycle cost. This design is verified on the Stratix IV FPGA. The basic structure achieved a maximum frequency of 150MHz and a hardware cost of 64K ALUTs, which could support the real time TU/PU partition decision for 4K×2K@20fps videos.
New Family of Binary Sequences with Low Correlation
Wenfeng JIANG Lei HU Xiangyong ZENG

LETTER-Coding Theory

Vol:
E91-A No:1
Page(s):
417-421
In this paper, a new family of binary sequences of period 2n-1 with low correlation is proposed for integer n=em and even m. The new family has family size 2n+1 and maximum nontrivial correlation +1 and +1 for even and odd e respectively. Especially, for n=2m and 3m, we obtain a new family of binary sequences with maximum nontrivial correlation +1, and the obtained family is one of the binary families with best correlation among the known families with family size no less than their period 2n-1 for even n. Moreover, the correlation distribution of the new family is also determined.
Utilizing Shape-Based Feature and Discriminative Learning for Building Detection
Shangqi ZHANG Haihong SHEN Chunlei HUO

LETTER-Image Recognition, Computer Vision

Pubricized:
2016/11/18
Vol:
E100-D No:2
Page(s):
392-395
Building detection from high resolution remote sensing images is challenging due to the high intraclass variability and the difficulty in describing buildings. To address the above difficulties, a novel approach is proposed based on the combination of shape-specific feature extraction and discriminative feature classification. Shape-specific feature can capture complex shapes and structures of buildings. Discriminative feature classification is effective in reflecting similarities among buildings and differences between buildings and backgrounds. Experiments demonstrate the effectiveness of the proposed approach.
Binary Constant Weight Codes Based on Cyclic Difference Sets
Nian LI Xiangyong ZENG Lei HU

LETTER-Coding Theory

Vol:
E91-A No:5
Page(s):
1288-1292
Based on cyclic difference sets, sequences with two-valued autocorrelation can be constructed. Using these constructed sequences, two classes of binary constant weight codes are presented. Some codes proposed in this paper are proven to be optimal.
Improved MILP Modeling for Automatic Security Evaluation and Application to FOX
Kexin QIAO Lei HU Siwei SUN Xiaoshuang MA Haibin KAN

PAPER-Symmetric Key Based Cryptography

Vol:
E98-A No:1
Page(s):
72-80
Counting the number of differentially active S-boxes is of great importance in evaluating the security of a block cipher against differential attack. Mouha et al. proposed a technique based on Mixed-Integer Linear Programming (MILP) to automatically calculate a lower bound of the number of differentially active S-boxes for word-oriented block ciphers, and applied it to symmetric ciphers AES and Enocoro-128v2. Later Sun et al. extended the method by introducing bit-level representations for S-boxes and new constraints in the MILP problem, and applied the extended method to PRESENT-80 and LBlock. This kind of methods greatly depends on the constraints in the MILP problem describing the differential propagation of the block cipher. A more accurate description of the differential propagation leads to a tighter bound on the number of differentially active S-boxes. In this paper, we refine the constraints in the MILP problem describing XOR operations, and apply the refined MILP modeling to determine a lower bound of the number of active S-boxes for the Lai-Massey type block cipher FOX in the model of single-key differential attack, and obtain a tighter bound in FOX64 than existing results. Experimental results show that 6, instead of currently known 8, rounds of FOX64 is strong enough to resist against basic single-key differential attack since the differential characteristic probability is upper bounded by 2-64, and thus the maximum differential characteristic probability of 12-round FOX64 is upper bounded by 2-128, where 128 is the key-length of FOX64. We also get the lower bound of the number of differentially active S-boxes for 5-round FOX128, and proved the security of the full-round FOX128 with respect to single-key differential attack.
On the Linear Span of a Binary Sequence Family with Optimal Correlation Properties
Xiangyong ZENG John Q. LIU Lei HU Desmond P. TAYLOR

PAPER-Information Theory

Vol:
E91-A No:2
Page(s):
664-672
A new subfamily of sequences with optimal correlation properties is constructed for the generalized Kasami set. A lower bound on the linear span is established. It is proved that with suitable choices of parameters, this subfamily has exponentially larger linear spans than either No sequences or TN sequences. A class of sequences with ideal autocorrelation is also proved to have large linear span.
Co-Saliency Detection via Local Prediction and Global Refinement
Jun WANG Lei HU Ning LI Chang TIAN Zhaofeng ZHANG Mingyong ZENG Zhangkai LUO Huaping GUAN

PAPER-Image

Vol:
E102-A No:4
Page(s):
654-664
This paper presents a novel model in the field of image co-saliency detection. Previous works simply design low level handcrafted features or extract deep features based on image patches for co-saliency calculation, which neglect the entire object perception properties. Besides, they also neglect the problem of visual similar region's mismatching when designing co-saliency calculation model. To solve these problems, we propose a novel strategy by considering both local prediction and global refinement (LPGR). In the local prediction stage, we train a deep convolutional saliency detection network in an end-to-end manner which only use the fully convolutional layers for saliency map prediction to capture the entire object perception properties and reduce feature redundancy. In the global refinement stage, we construct a unified co-saliency refinement model by integrating global appearance similarity into a co-saliency diffusion function, realizing the propagation and optimization of local saliency values in the context of entire image group. To overcome the adverse effects of visual similar regions' mismatching, we innovatively incorporates the inter-images saliency spread constraint (ISC) term into our co-saliency calculation function. Experimental results on public datasets demonstrate consistent performance gains of the proposed model over the state-of-the-art methods.
Learning Deep Dictionary for Hyperspectral Image Denoising
Leigang HUO Xiangchu FENG Chunlei HUO Chunhong PAN

LETTER-Pattern Recognition

Pubricized:
2015/04/20
Vol:
E98-D No:7
Page(s):
1401-1404
Using traditional single-layer dictionary learning methods, it is difficult to reveal the complex structures hidden in the hyperspectral images. Motivated by deep learning technique, a deep dictionary learning approach is proposed for hyperspectral image denoising, which consists of hierarchical dictionary learning, feature denoising and fine-tuning. Hierarchical dictionary learning is helpful for uncovering the hidden factors in the spectral dimension, and fine-tuning is beneficial for preserving the spectral structure. Experiments demonstrate the effectiveness of the proposed approach.
A Family of p-ary Binomial Bent Functions
Dabin ZHENG Xiangyong ZENG Lei HU

LETTER-Cryptography and Information Security

Vol:
E94-A No:9
Page(s):
1868-1872
For a prime p with p≡3 (mod 4) and an odd number m, the Bentness of the p-ary binomial function fa,b(x)=Tr1n(axpm-1)+Tr12 is characterized, where n=2m, a ∈ F*pn, and b ∈ F*p2. The necessary and sufficient conditions of fa,b(x) being Bent are established respectively by an exponential sum and two sequences related to a and b. For the special case of p=3, we further characterize the Bentness of the ternary function fa,b(x) by the Hamming weight of a sequence.
A Highly Configurable 7.62GOP/s Hardware Implementation for LSTM
Yibo FAN Leilei HUANG Kewei CHEN Xiaoyang ZENG

PAPER-Integrated Electronics

Pubricized:
2019/11/27
Vol:
E103-C No:5
Page(s):
263-273
The neural network has been one of the most useful techniques in the area of speech recognition, language translation and image analysis in recent years. Long Short-Term Memory (LSTM), a popular type of recurrent neural networks (RNNs), has been widely implemented on CPUs and GPUs. However, those software implementations offer a poor parallelism while the existing hardware implementations lack in configurability. In order to make up for this gap, a highly configurable 7.62 GOP/s hardware implementation for LSTM is proposed in this paper. To achieve the goal, the work flow is carefully arranged to make the design compact and high-throughput; the structure is carefully organized to make the design configurable; the data buffering and compression strategy is carefully chosen to lower the bandwidth without increasing the complexity of structure; the data type, logistic sigmoid (σ) function and hyperbolic tangent (tanh) function is carefully optimized to balance the hardware cost and accuracy. This work achieves a performance of 7.62 GOP/s @ 238 MHz on XCZU6EG FPGA, which takes only 3K look-up table (LUT). Compared with the implementation on Intel Xeon E5-2620 CPU @ 2.10GHz, this work achieves about 90× speedup for small networks and 25× speed-up for large ones. The consumption of resources is also much less than that of the state-of-the-art works.
Learning Deep Relationship for Object Detection
Nuo XU Chunlei HUO

LETTER-Image Recognition, Computer Vision

Pubricized:
2017/09/28
Vol:
E101-D No:1
Page(s):
273-276
Object detection has been a hot topic of image processing, computer vision and pattern recognition. In recent years, training a model from labeled images using machine learning technique becomes popular. However, the relationship between training samples is usually ignored by existing approaches. To address this problem, a novel approach is proposed, which trains Siamese convolutional neural network on feature pairs and finely tunes the network driven by a small amount of training samples. Since the proposed method considers not only the discriminative information between objects and background, but also the relationship between intraclass features, it outperforms the state-of-arts on real images.

Author Search Result

[Author] Lei HU(18hit)

Fast Modular Reduction over Euclidean Rings and Its Application to Universal Hash Functions

On Linear Complexity of Kronecker Sequences

Computationally Efficient Method of Signal Subspace Fitting for Direction-of-Arrival Estimation

A Micro-Code-Based IME Engine for HEVC and Its Hardware Implementation

Faster-ADNet for Visual Tracking

Constructing Boolean Functions by Modifying Maiorana-McFarland's Superclass Functions

A Family of Binary Sequences with 4-Valued Optimal Out-of-Phase Correlation and Large Linear Span

A High-Throughput and Compact Hardware Implementation for the Reconstruction Loop in HEVC Intra Encoding

New Family of Binary Sequences with Low Correlation

Utilizing Shape-Based Feature and Discriminative Learning for Building Detection

Binary Constant Weight Codes Based on Cyclic Difference Sets

Improved MILP Modeling for Automatic Security Evaluation and Application to FOX

On the Linear Span of a Binary Sequence Family with Optimal Correlation Properties

Co-Saliency Detection via Local Prediction and Global Refinement

Learning Deep Dictionary for Hyperspectral Image Denoising

A Family of p-ary Binomial Bent Functions

A Highly Configurable 7.62GOP/s Hardware Implementation for LSTM

Learning Deep Relationship for Object Detection

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles