Qingyun SHE Zongqing LU Weifeng LI Qingmin LIAO
The bilateral filter (BF) is a nonlinear low-pass filter that can smooth an image while preserving detail structures. However, the filter is too time-consuming for real-time processing. In this paper, we propose that bilateral filtering can be accelerated by a multigrid (MG) scheme. Our method is based on the following two facts. a) The result of filtering with a large-kernel BF at the original resolution can be approximated, to a visually acceptable degree, by applying a small-kernel (3×3) version many times at a lower resolution. Early work has shown that a BF can be viewed as nonlinear diffusion, and the desired filtering result is actually an intermediate state of the diffusion process. b) Iterative linear equation techniques, which can be accelerated by the MG scheme, are sufficiently mature to cope with the nonlinear diffusion equation. Experimental results with both simulated and real data sets are provided, and the new method is demonstrated to achieve almost twice the speed of the state of the art. Whereas previous efforts sought a generalized representation linking bilateral filtering and nonlinear diffusion through adaptive filtering, this study explores a novel relationship between the two by focusing on numerical calculus.
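The idea in fact a), repeated application of a small 3×3 bilateral filter, can be illustrated with a minimal pure-Python sketch. This is not the authors' multigrid method: it omits the downsampling and MG acceleration entirely, and the function names, the Gaussian form of the weights, and the parameters `sigma_s` and `sigma_r` are assumptions chosen for illustration.

```python
import math

def bilateral_3x3(img, sigma_s=1.0, sigma_r=0.1):
    """One pass of a 3x3 bilateral filter over a 2-D list of floats.

    Assumed weight: Gaussian in spatial distance times Gaussian in
    intensity difference (the common textbook form).
    """
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            num = den = 0.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    yy, xx = y + dy, x + dx
                    if 0 <= yy < h and 0 <= xx < w:  # ignore neighbours outside the image
                        diff = img[yy][xx] - img[y][x]
                        wgt = math.exp(-(dx * dx + dy * dy) / (2 * sigma_s ** 2)
                                       - diff * diff / (2 * sigma_r ** 2))
                        num += wgt * img[yy][xx]
                        den += wgt
            out[y][x] = num / den  # den >= 1 because the centre pixel has weight 1
    return out

def iterate_bilateral(img, n_iter, **params):
    """Apply the 3x3 filter n_iter times; each pass acts like one diffusion step."""
    for _ in range(n_iter):
        img = bilateral_3x3(img, **params)
    return img

# Hypothetical test image: a step edge at x = 4 with a small checkerboard ripple.
step = [[(0.0 if x < 4 else 1.0) + 0.02 * ((x + y) % 2) for x in range(8)]
        for y in range(8)]
smoothed = iterate_bilateral(step, n_iter=5)
```

In this toy image the ripple is flattened while the step edge survives, because the range weight suppresses averaging across large intensity differences.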
Feng LIU Shuping WANG Shengming JIANG Yanli XU
For the three-user X channel, a degrees-of-freedom (DoF) value of 9/5 has been shown to be theoretically achievable through an asymptotic model with infinite resources, which is impractical. In this article, we exploit the propagation delay (PD) feature among different links to maximize the achievable DoF at minimum cost. Since perfect interference alignment (IA) is impossible for 9 messages within 5 time-slots, at least one extra time-slot must be used. Using the cyclic polynomial approach, we propose a scheme with a maximum achievable DoF of 5/3 for 10 messages within 6 time-slots. Feasibility conditions in Euclidean space are also derived, and they admit a fairly wide range of node arrangements.
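The DoF figures quoted above follow from simple counting, sketched here with exact fractions. The helper `dof` and the reading of DoF as interference-free messages per time-slot are assumptions made for this illustration; the abstract itself gives only the numbers 9/5 and 5/3.

```python
from fractions import Fraction

def dof(messages: int, time_slots: int) -> Fraction:
    """DoF read as interference-free messages delivered per time-slot (assumed definition)."""
    return Fraction(messages, time_slots)

asymptotic = dof(9, 5)    # theoretical bound quoted in the abstract
proposed = dof(10, 6)     # 10 messages in 6 time-slots reduces to 5/3
gap = asymptotic - proposed
ratio = proposed / asymptotic
```

Under this reading the proposed scheme reaches 25/27 of the asymptotic bound, and the gap is 2/15 DoF.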
Haiyuan LIU Taiyi ZHANG Ruiping ZHANG Feng LIU
To address the performance deficiency of pilot-symbol-aided channel estimation in orthogonal frequency division multiplexing (OFDM) systems, a wavelet network interpolation channel estimator is proposed. In contrast to conventional methods, the wavelet network interpolation channel estimator can guarantee a high transmission rate and a lower bit error rate (BER). Computer simulation results demonstrate that the proposed channel estimation method exhibits improved performance compared to the conventional linear channel estimation methods and is robust to the fading rate, especially in fast fading channels.
Xiaodong LU Yefeng LIU Tatsuya TSUDA Kinji MORI
In Video-on-Demand (VoD) services, playback continuity is one of the most crucial factors by which end users judge service quality. It is even more significant than the actual video image quality, since new-generation VoD users commonly have heterogeneous service requirements that depend on their context. Moreover, managing dynamic situations in a VoD service is always a challenge, especially given unpredictable user preferences and network conditions. In this paper, i) the Autonomous Decentralized VoD System (ADVODS) is proposed to satisfy the different service quality demands of users, and ii) the Autonomous Node Allocation Technology (ANAT) is proposed to assure service continuity. With the help of autonomous nodes and mobile agents, ANAT can apply different backup policies to users with different Service Level Agreements (SLA), and dynamically update the backup schema to adapt to changing situations such as varying service times or congestion events. The evaluation results show that the proposed system architecture performs better in streaming service provision and continuity.
Liming ZHANG Defu ZHANG Weifeng LI
A new stock index trend analysis approach is proposed in this paper, based on a newly developed signal decomposition approach, adaptive Fourier decomposition (AFD). AFD can effectively extract the signal's primary trend, which specifically suits technical analysis based on the Dow theory. The proposed approach integrates two different kinds of forecasting approaches: the Dow theory and the RBF neural network. The effectiveness of the proposed approach is assessed through comparison with the direct RBF neural network approach. The results are promising.
Zhigang CHEN Taiyi ZHANG Yatong ZHOU Feng LIU
A novel blind channel estimation scheme is proposed for OFDM systems employing PSK modulation. This scheme minimizes the number of possible channels by exploiting the constant modulus property, chooses a best fit over the possible channels by exploiting the finite alphabet property of information signals, and achieves competitive performance with low computational complexity. Results comparing the new scheme with the finite-alphabet based channel estimation are presented.
Shuchao JIANG Feng LIU Shengming JIANG Xuan GENG
The X communication model with two receivers, in which each transmitter sends an independent message to each receiver, is introduced to underwater acoustic networks. Based on a distance-aligned structure, we propose a scheme that can perform perfect interference alignment. Its feasibility is also illustrated in three-dimensional Euclidean space.
Guang-Hua SONG Xin-Feng LI Zhe-Ming LU
Recently, the controllability of complex networks has become a hot topic in the field of network science, where the driver nodes play a key and central role. Therefore, studying their structural characteristics is of great significance for understanding the underlying mechanism of network controllability. In this paper, we systematically investigate the nodal centrality of driver nodes in controlling complex networks. We find that the driver nodes tend to be low in-degree but high out-degree nodes, and that most driver nodes tend to have low betweenness centrality but relatively high closeness centrality. We also find that the tendencies of driver nodes towards eigenvector centrality and Katz centrality are very similar: driver nodes avoid both high eigenvector centrality and high Katz centrality. Finally, we find that driver nodes show a polarized distribution with respect to PageRank centrality, i.e., the vast majority of driver nodes tend to be low-PageRank nodes whereas only a few tend to be high-PageRank nodes.
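As a rough illustration of how driver nodes and their degrees can be computed, the sketch below uses the widely used maximum-matching formulation of structural controllability, in which the nodes left unmatched by a maximum matching of the directed graph form a minimum driver-node set. The abstract does not state which identification method the authors use, so this choice, along with the function names, is an assumption.

```python
def driver_nodes(n, edges):
    """Return one minimum driver-node set for a directed graph on nodes 0..n-1.

    Assumption: maximum-matching formulation; a node is 'matched' when a
    matching edge points into it. The returned set is generally not unique.
    """
    succ = [[] for _ in range(n)]
    for u, v in edges:
        succ[u].append(v)
    matched_by = [None] * n  # matched_by[v] == u means edge u -> v is in the matching

    def augment(u, seen):
        # Standard augmenting-path search for bipartite matching.
        for v in succ[u]:
            if v not in seen:
                seen.add(v)
                if matched_by[v] is None or augment(matched_by[v], seen):
                    matched_by[v] = u
                    return True
        return False

    for u in range(n):
        augment(u, set())
    unmatched = {v for v in range(n) if matched_by[v] is None}
    # With a perfect matching, one (arbitrary) driver node is still required.
    return unmatched or {0}

def in_out_degree(n, edges):
    """Return (in-degree list, out-degree list) for the directed graph."""
    indeg, outdeg = [0] * n, [0] * n
    for u, v in edges:
        outdeg[u] += 1
        indeg[v] += 1
    return indeg, outdeg

# Hypothetical toy graph: a directed path 0 -> 1 -> 2.
path = [(0, 1), (1, 2)]
drivers = driver_nodes(3, path)
indeg, outdeg = in_out_degree(3, path)
```

On the toy path the single driver is the source node, which has in-degree 0 and out-degree 1; this is only an example and is not evidence for the statistical tendencies reported in the abstract.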
Feng LIU Xianlong CHENG Conggai LI Yanli XU
This letter solves the energy efficiency optimization problem for simultaneous wireless information and power transfer (SWIPT) systems with non-orthogonal multiple access (NOMA), multiple-input single-output (MISO) and power-splitting structures, where each user may have different individual quality of service (QoS) requirements for information and energy. A nonlinear energy harvesting model is used. An alternating optimization approach is adopted to find the solution, and it converges quickly. Simulation results show that the proposed scheme has higher energy efficiency than the existing dual-layer iteration and throughput maximization methods.
Jieyun ZHOU Xiaofeng LI Haitao CHEN Rutong CHEN Masayuki NUMAO
Object tracking methods have been widely used in fields such as video surveillance, motion monitoring, and robotics. The particle filter is one of the promising methods, but it is difficult to apply to real-time object tracking because of its high computation cost. In order to reduce the processing cost without sacrificing tracking quality, this paper proposes a new method for real-time 3D object tracking that uses particle filter algorithms parallelized with a MapReduce architecture running on GPGPU. Our method is as follows. First, we use a Kinect to obtain the 3D information of objects. Unlike conventional 2D-based object tracking, 3D object tracking adds depth information. It can track not only along the x and y axes but also along the z axis, and the depth information can correct some errors in 2D object tracking. Second, to solve the high computation cost problem, we use the MapReduce architecture on GPGPU to parallelize the particle filter algorithm. We implement the particle filter algorithms on a GPU and evaluate the performance by actually running a program on CUDA 5.5.
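The map/reduce structure of a particle filter's measurement update can be sketched on the CPU in plain Python. This is not the authors' CUDA implementation: the Gaussian likelihood, the parameter `sigma`, and the function names are assumptions, and the prediction and resampling steps are omitted.

```python
import math
from functools import reduce

def likelihood(particle, observation, sigma=1.0):
    """Map step: unnormalised Gaussian likelihood of one 3-D particle (x, y, z)."""
    d2 = sum((p - o) ** 2 for p, o in zip(particle, observation))
    return math.exp(-d2 / (2 * sigma ** 2))

def estimate(particles, observation, sigma=1.0):
    """Weight every particle independently (map), then combine them (reduce)."""
    # Map: each particle's weight depends only on that particle, so this
    # step is the part that parallelises naturally across GPU threads.
    weights = list(map(lambda p: likelihood(p, observation, sigma), particles))
    # Reduce: sum of weights for normalisation.
    total = reduce(lambda a, b: a + b, weights)
    # Reduce: weighted mean position along each of the three axes.
    return tuple(
        sum(w * p[k] for w, p in zip(weights, particles)) / total
        for k in range(3)
    )

# Hypothetical particles and a hypothetical depth-camera observation.
particles = [(0.0, 0.0, 0.0), (1.0, 1.0, 1.0), (2.0, 2.0, 2.0)]
position = estimate(particles, observation=(1.0, 1.0, 1.0))
```

Because the per-particle weighting has no cross-particle dependency, only the two reductions require communication between workers.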
Lu YIN Junfeng LI Yonghong YAN Masato AKAGI
Simultaneous utterances impair the abilities of both hearing-impaired persons and automatic speech recognition systems. Recently, deep neural networks have dramatically improved speech separation performance. However, most previous works only estimate the speech magnitude and use the mixture phase for speech reconstruction. The use of the mixture phase has become a critical limitation for separation performance. This study proposes a two-stage phase-aware approach for multi-talker speech separation, which jointly recovers the magnitude and the phase. For phase recovery, the Multiple Input Spectrogram Inversion (MISI) algorithm is used because of its effectiveness and simplicity. The study implements the MISI algorithm based on the mask and shows that the ideal amplitude mask (IAM) is the optimal mask for mask-based MISI phase recovery, as it introduces less phase distortion. To compensate for the error of phase recovery and minimize the signal distortion, an advanced mask is proposed for the magnitude estimation. The IAM and the proposed mask are estimated at different stages to recover the phase and the magnitude, respectively. Two neural network frameworks are evaluated for the magnitude estimation in the second stage, demonstrating the effectiveness and flexibility of the proposed approach. The experimental results demonstrate that the proposed approach significantly reduces the distortion of the separated speech.
Xiaofeng LING Rui WANG Ping WANG Yu ZHU
In this paper, we study simultaneous wireless information and power transfer (SWIPT) in two-way relay channels where two users exchange information with each other via a multi-antenna relay node. The signals forwarded by the relay node are also used to supply power to the two users. We formulate a max-min optimization problem that maximizes the minimum harvested energy between the two users to achieve fairness. We jointly optimize the relay beamforming matrix and the power allocation at the two users subject to quality of service (QoS) constraints. Specifically, we consider the amplify-and-forward (AF) relay strategy and the time-splitting SWIPT strategy, and we propose two different time-splitting protocols that enable the relay to supply power to the two users. To solve the non-convex joint optimization problem, we split the original problem into two subproblems and solve them iteratively to obtain the final solution. It is shown that the first subproblem, which deals with the beamforming matrix, can be optimally solved using semidefinite relaxation (SDR), and the second subproblem, which deals with the power allocation, can be solved via linear programming. A performance comparison of the two schemes and the one-way relaying scheme is provided, and the effectiveness of the proposed schemes is verified.
Feng LIU Taiyi ZHANG Jiancheng SUN
In this paper, a new adaptive multi-input multi-output (MIMO) channel estimation and multiuser detection algorithm based on kernel space iterative inversion is proposed. The functions of the output signals are mapped from a low-dimensional space to a high-dimensional reproducing kernel Hilbert space. The function of the output signals is represented as a linear combination of a set of basis functions, and a Mercer kernel function is constructed from the distribution function. In order to avoid finding the functions f(.) and g(.), the correlation among the output signals is calculated in the low-dimensional space by the kernel. Moreover, with practical application in mind, the algorithm is extended to online iteration for mixture systems. The computer simulation results show that the new algorithm improves the channel estimation performance, the global convergence, and the system stability.
Yu ZHOU Junfeng LI Yanqing SUN Jianping ZHANG Yonghong YAN Masato AKAGI
In this paper, we present a hybrid speech emotion recognition system exploiting both spectral and prosodic features in speech. To capture the emotional information in the spectral domain, we propose a new spectral feature extraction method that applies a novel non-uniform subband processing instead of the mel-frequency subbands used in Mel-Frequency Cepstral Coefficients (MFCC). For prosodic features, a set of features that are closely correlated with speech emotional states is selected. In the proposed hybrid emotion recognition system, because of the inherently different characteristics of these two kinds of features (e.g., data size), the newly extracted spectral features are modeled by a Gaussian Mixture Model (GMM) and the selected prosodic features are modeled by a Support Vector Machine (SVM). The final result of the proposed emotion recognition system is obtained by combining the results from these two subsystems. Experimental results show that (1) the proposed non-uniform spectral features are more effective than the traditional MFCC features for emotion recognition; (2) the proposed hybrid emotion recognition system using both spectral and prosodic features yields a relative recognition error reduction of 17.0% over the traditional recognition systems using only the spectral features, and 62.3% over those using only the prosodic features.
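A relative error reduction such as the 17.0% quoted above is conventionally computed as the drop in error rate divided by the baseline error rate. The sketch below assumes that conventional definition; the abstract does not report the underlying error rates, so the input values are hypothetical and chosen only so the arithmetic is easy to follow.

```python
def relative_error_reduction(baseline_error: float, new_error: float) -> float:
    """Fraction of the baseline error removed by the new system (assumed definition)."""
    return (baseline_error - new_error) / baseline_error

# Hypothetical error rates, NOT taken from the paper.
baseline = 0.200   # baseline system: 20.0% recognition error
hybrid = 0.166     # hybrid system: 16.6% recognition error
reduction = relative_error_reduction(baseline, hybrid)
```

With these hypothetical inputs the absolute improvement is 3.4 percentage points, which corresponds to a relative reduction of about 17%.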
Chuang WANG Zunchao LI Cheng LUO Lijuan ZHAO Yefei ZHANG Feng LIANG
A novel auto-tuning digital DC-DC converter is presented. In order to reduce the recovery time and undershoot, the auto-tuning control combines LnL, conventional PID and a predictive PID with a configurable predictive coefficient. A switch module is used to select an algorithm from the three control algorithms, according to the difference between the error signal and the two initially predefined thresholds. The detection and control logic is designed for both window delay line ADC and ΣΔ DPWM to correct the delay deviation. When the output of the converter exceeds the quantization range, the digital output of ADC is set at 0 or 1, and the delay line stops working to reduce power consumption. Theoretical analysis and simulations in the CSMC CMOS 0.5 µm process are carried out to verify the proposed DC-DC converter. It is found that the converter achieves a power efficiency of more than 90% at heavy load, and reduces the recovery time and undershoot.
Time variations of wireless multipath channels can lead to severe intercarrier interference (ICI) in orthogonal frequency division multiplexing (OFDM) systems, whereas a large Doppler frequency spread can provide time diversity gain. In order to take advantage of the time diversity and to suppress the interference and noise enhancement at the same time, the receiver normally detects the data successively. In this letter, we propose an improved detection ordering for the successive detector based on the log-likelihood ratio (LLR) rather than the signal-to-noise ratio (SNR). Using both theoretical analysis and computer simulation, it is shown that this scheme outperforms the traditional successive detection methods.
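The difference between SNR-based and LLR-based ordering can be seen in a simplified setting. The sketch below assumes real-valued BPSK symbols observed in Gaussian noise, for which the LLR is 2y/σ²; the letter's actual LLR expression under ICI is not given in the abstract, and the function names and sample values are hypothetical.

```python
def llr_bpsk(y: float, noise_var: float) -> float:
    """LLR of a BPSK symbol x in {+1, -1} observed as y = x + n, n ~ N(0, noise_var)."""
    return 2.0 * y / noise_var

def order_by_snr(noise_vars):
    """Detect the symbol with the highest SNR first (unit-energy symbols assumed)."""
    return sorted(range(len(noise_vars)),
                  key=lambda k: 1.0 / noise_vars[k], reverse=True)

def order_by_llr(ys, noise_vars):
    """Detect the symbol with the largest |LLR|, i.e. the most reliable decision, first."""
    return sorted(range(len(ys)),
                  key=lambda k: abs(llr_bpsk(ys[k], noise_vars[k])), reverse=True)

# Hypothetical soft estimates and post-equalisation noise variances.
ys = [0.05, -0.9, 1.1]
noise_vars = [0.1, 0.2, 0.5]
snr_order = order_by_snr(noise_vars)
llr_order = order_by_llr(ys, noise_vars)
```

In this example symbol 0 has the best SNR but lies almost on the decision boundary, so the LLR ordering defers it to last, whereas the SNR ordering would detect it first.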
Weifeng LI Katsunobu ITOU Kazuya TAKEDA Fumitada ITAKURA
We address issues for improving hands-free speech enhancement and speech recognition performance in different car environments using a single distant microphone. This paper describes a new single-channel in-car speech enhancement method that estimates the log spectra of speech at a close-talking microphone based on the nonlinear regression of the log spectra of noisy signal captured by a distant microphone and the estimated noise. The proposed method provides significant overall quality improvements in our subjective evaluation on the regression-enhanced speech, and performed best in most objective measures. Based on our isolated word recognition experiments conducted under 15 real car environments, the proposed adaptive nonlinear regression approach shows an advantage in average relative word error rate (WER) reductions of 50.8% and 13.1%, respectively, compared to original noisy speech and ETSI advanced front-end (ETSI ES 202 050).
Roger Yubtzuan CHEN Sheng-Feng LIN
A linear CMOS transconductor is presented. PMOS transistors are employed in the resistor-replacement and voltage-level shifting to avoid the body effect. To annihilate the non-linear voltage terms, the substrate-bias effect of MOS transistors is treated more accurately in our design. Consequently, the non-linearity of the large-signal transconductance is reduced. The fabricated circuit occupies an area of 245 µm × 176 µm (≈ 0.043 mm²) and dissipates 0.87 mW from a 3.3 V supply. For an input of 1 Vp-p, the measured output total harmonic distortion is less than 1.2%. The transconductance varies by less than 0.5% in the input range.
Shen LI Lingfeng LI Takeshi IKENAGA Shunichi ISHIWATA Masataka MATSUI Satoshi GOTO
The coexistence of MPEG-2 and its powerful successor H.264/AVC has created a huge need for MPEG-2/H.264 video transcoding. However, a traditional transcoder where an MPEG-2 decoder is simply cascaded to an H.264 encoder requires huge computational power due to the adoption of a complicated rate-distortion based mode decision process in H.264. This paper proposes a 2-D Sobel filter based motion vector domain method and a DCT domain method to measure macroblock complexity and realize content-based H.264 candidate mode decision. A new local edge based fast INTRA prediction mode decision method is also adopted to boost the encoding efficiency. Simulation results confirm that with the proposed methods the computational burden of a traditional transcoder can be reduced by 20% to 30% with only a negligible bit-rate increase for a wide range of video sequences.
Zhaoxi FANG Feng LIANG Shaozhong ZHANG Xiaolin ZHOU
Timing asynchronism strongly degrades the performance of analog network coded (ANC) bi-directional transmission. This letter investigates receiver design for asynchronous broadband bi-directional transmission over frequency selective fading channels. Based on time domain oversampling, we propose fractionally spaced frequency domain minimum mean square error (MMSE) equalizers for bi-directional ANC based on orthogonal frequency division multiplexing (OFDM) and cyclic prefixed single carrier (CP-SC) radio access. Simulation results show that the proposed fractionally spaced equalizer (FSE) can eliminate the negative effect of timing misalignment in bi-directional transmissions.