IEICE global.ieice.org Site

Keyword Search Result

[Keyword] Ada(1871hit)

161-180hit(1871hit)

Transferring Adaptive Bit Rate Streaming Quality Models from H.264/HD to H.265/4K-UHD Open Access
Pierre LEBRETON Kazuhisa YAMAGISHI

PAPER-Network

Pubricized:
2019/06/25
Vol:
E102-B No:12
Page(s):
2226-2242
In this paper the quality of adaptive bit rate video streaming is investigated and two state-of-the-art models, i.e., the NTT audiovisual quality-estimation and ITU-T P.1203 models, are considered. This paper shows how these models can be applied to new conditions, e.g., 4K ultra high definition (4K-UHD) videos encoded using H.265, considering that they were originally designed and trained for HD videos encoded with H.264. Six subjective evaluations involving up to 192 participants and a large variety of test conditions, e.g., durations from 10sec to 3min, coding-quality variation, and stalling events, were conducted on both TV and mobile devices. Using the subjective data, this paper addresses how models and coefficients can be transferred to new conditions. A comparison between state-of-the-art models is conducted, showing the performance of transferred and retrained models. It is found that other video-quality estimation models, such as VMAF, can be used as input of the NTT and ITU-T P.1203 long-term pooling modules, allowing these other video-quality-estimation models to support the specificities of adaptive bit-rate-streaming scenarios. Finally, all retrained coefficients are detailed in this paper allowing future work to directly reuse the results of this study.
Sparse Time-Varying Complex AR (TV-CAR) Speech Analysis Based on Adaptive LASSO
Keiichi FUNAKI

LETTER-Speech and Hearing

Vol:
E102-A No:12
Page(s):
1910-1914
Linear Prediction (LP) analysis is commonly used in speech processing. LP is based on Auto-Regressive (AR) model and it estimates the AR model parameter from signals with l2-norm optimization. Recently, sparse estimation is paid attention since it can extract significant features from big data. The sparse estimation is realized by l1 or l0-norm optimization or regularization. Sparse LP analysis methods based on l1-norm optimization have been proposed. Since excitation of speech is not white Gaussian, a sparse LP estimation can estimate more accurate parameter than the conventional l2-norm based LP. These are time-invariant and real-valued analysis. We have been studied Time-Varying Complex AR (TV-CAR) analysis for an analytic signal and have evaluated the performance on speech processing. The TV-CAR methods are l2-norm methods. In this paper, we propose the sparse TV-CAR analysis based on adaptive LASSO (Least absolute shrinkage and selection operator) that is l1-norm regularization and evaluate the performance on F0 estimation of speech using IRAPT (Instantaneous RAPT). The experimental results show that the sparse TV-CAR methods perform better for a high level of additive Pink noise.
Adaptive-Partial Template Update with Center-Shifting Recovery for High Frame Rate and Ultra-Low Delay Deformation Matching
Songlin DU Yuhao XU Tingting HU Takeshi IKENAGA

PAPER-Image

Vol:
E102-A No:12
Page(s):
1872-1881
High frame rate and ultra-low delay matching system plays an important role in various human-machine interactive applications, which demands better performance in matching deformable and out-of-plane rotating objects. Although many algorithms have been proposed for deformation tracking and matching, few of them are suitable for hardware implementation due to complicated operations and large time consumption. This paper proposes a hardware-oriented template update and recovery method for high frame rate and ultra-low delay deformation matching system. In the proposed method, the new template is generated in real time by partially updating the template descriptor and adding new keypoints simultaneously with the matching process in pixels (proposal #1), which avoids the large inter-frame delay. The size and shape of region of interest (ROI) are made flexible and the Hamming threshold used for brute-force matching is adjusted according to pixel position and the flexible ROI (proposal #2), which solves the problem of template drift. The template is recovered by the previous one with a relative center-shifting vector when it is judged as lost via region-wise difference check (proposal #3). Evaluation results indicate that the proposed method successfully achieves the real-time processing of 784fps at the resolution of 640×480 on field-programmable gate array (FPGA), with a delay of 0.808ms/frame, as well as achieves satisfactory deformation matching results in comparison with other general methods.
New Sub-Band Adaptive Volterra Filter for Identification of Loudspeaker
Satoshi KINOSHITA Yoshinobu KAJIKAWA

PAPER-Digital Signal Processing

Vol:
E102-A No:12
Page(s):
1946-1955
Adaptive Volterra filters (AVFs) are usually used to identify nonlinear systems, such as loudspeaker systems, and ordinary adaptive algorithms can be used to update the filter coefficients of AVFs. However, AVFs require huge computational complexity even if the order of the AVF is constrained to the second order. Improving calculation efficiency is therefore an important issue for the real-time implementation of AVFs. In this paper, we propose a novel sub-band AVF with high calculation efficiency for second-order AVFs. The proposed sub-band AVF consists of four parts: input signal transformation for a single sub-band AVF, tap length determination to improve calculation efficiency, switching the number of sub-bands while maintaining the estimation accuracy, and an automatic search for an appropriate number of sub-bands. The proposed sub-band AVF can improve calculation efficiency for which the dominant nonlinear components are concentrated in any frequency band, such as loudspeakers. A simulation result demonstrates that the proposed sub-band AVF can realize higher estimation accuracy than conventional efficient AVFs.
Adversarial Domain Adaptation Network for Semantic Role Classification
Haitong YANG Guangyou ZHOU Tingting HE Maoxi LI

PAPER-Natural Language Processing

Pubricized:
2019/09/02
Vol:
E102-D No:12
Page(s):
2587-2594
In this paper, we study domain adaptation of semantic role classification. Most systems utilize the supervised method for semantic role classification. But, these methods often suffer severe performance drops on out-of-domain test data. The reason for the performance drops is that there are giant feature differences between source and target domain. This paper proposes a framework called Adversarial Domain Adaption Network (ADAN) to relieve domain adaption of semantic role classification. The idea behind our method is that the proposed framework can derive domain-invariant features via adversarial learning and narrow down the gap between source and target feature space. To evaluate our method, we conduct experiments on English portion in the CoNLL 2009 shared task. Experimental results show that our method can largely reduce the performance drop on out-of-domain test data.
Target-Adapted Subspace Learning for Cross-Corpus Speech Emotion Recognition
Xiuzhen CHEN Xiaoyan ZHOU Cheng LU Yuan ZONG Wenming ZHENG Chuangao TANG

LETTER-Speech and Hearing

Pubricized:
2019/08/26
Vol:
E102-D No:12
Page(s):
2632-2636
For cross-corpus speech emotion recognition (SER), how to obtain effective feature representation for the discrepancy elimination of feature distributions between source and target domains is a crucial issue. In this paper, we propose a Target-adapted Subspace Learning (TaSL) method for cross-corpus SER. The TaSL method trys to find a projection subspace, where the feature regress the label more accurately and the gap of feature distributions in target and source domains is bridged effectively. Then, in order to obtain more optimal projection matrix, ℓ1 norm and ℓ2,1 norm penalty terms are added to different regularization terms, respectively. Finally, we conduct extensive experiments on three public corpuses, EmoDB, eNTERFACE and AFEW 4.0. The experimental results show that our proposed method can achieve better performance compared with the state-of-the-art methods in the cross-corpus SER tasks.
Hadamard-Type Matrices on Finite Fields and Complete Complementary Codes
Tetsuya KOJIMA

PAPER-Sequences

Vol:
E102-A No:12
Page(s):
1651-1658
Hadamard matrix is defined as a square matrix where any components are -1 or +1, and where any pairs of rows are mutually orthogonal. In this work, we consider the similar matrix on finite field GF(p) where p is an odd prime. In such a matrix, every component is one of the integers on GF(p){0}, that is, {1,2,...,p-1}. Any additions and multiplications should be executed under modulo p. In this paper, a method to generate such matrices is proposed. In addition, the paper includes the applications to generate n-shift orthogonal sequences and complete complementary codes. The generated complete complementary code is a family of multi-valued sequences on GF(p){0}, where the number of sequence sets, the number of sequences in each sequence set and the sequence length depend on the various divisors of p-1. Such complete complementary codes with various parameters have not been proposed in previous studies.
A New Formula to Compute the NLMS Algorithm at a Computational Complexity of O(2N)
Kiyoshi NISHIYAMA Masahiro SUNOHARA Nobuhiko HIRUMA

LETTER-Digital Signal Processing

Vol:
E102-A No:11
Page(s):
1545-1549
The least mean squares (LMS) algorithm has been widely used for adaptive filtering because of easily implementing at a computational complexity of O(2N) where N is the number of taps. The drawback of the LMS algorithm is that its performance is sensitive to the scaling of the input. The normalized LMS (NLMS) algorithm solves this problem on the LMS algorithm by normalizing with the sliding-window power of the input; however, this normalization increases the computational cost to O(3N) per iteration. In this work, we derive a new formula to strictly perform the NLMS algorithm at a computational complexity of O(2N), that is referred to as the C-NLMS algorithm. The derivation of the C-NLMS algorithm uses the H∞ framework presented previously by one of the authors for creating a unified view of adaptive filtering algorithms. The validity of the C-NLMS algorithm is verified using simulations.
Personalized Food Image Classifier Considering Time-Dependent and Item-Dependent Food Distribution Open Access
Qing YU Masashi ANZAWA Sosuke AMANO Kiyoharu AIZAWA

PAPER

Pubricized:
2019/06/21
Vol:
E102-D No:11
Page(s):
2120-2126
Since the development of food diaries could enable people to develop healthy eating habits, food image recognition is in high demand to reduce the effort in food recording. Previous studies have worked on this challenging domain with datasets having fixed numbers of samples and classes. However, in the real-world setting, it is impossible to include all of the foods in the database because the number of classes of foods is large and increases continually. In addition to that, inter-class similarity and intra-class diversity also bring difficulties to the recognition. In this paper, we solve these problems by using deep convolutional neural network features to build a personalized classifier which incrementally learns the user's data and adapts to the user's eating habit. As a result, we achieved the state-of-the-art accuracy of food image recognition by the personalization of 300 food records per user.
Hybridizing Dragonfly Algorithm with Differential Evolution for Global Optimization Open Access
MeiJun DUAN HongYu YANG Bo YANG XiPing WU HaiJun LIANG

PAPER-Fundamentals of Information Systems

Pubricized:
2019/07/17
Vol:
E102-D No:10
Page(s):
1891-1901
Due to its simplicity and efficiency, differential evolution (DE) has gained the interest of researchers from various fields for solving global optimization problems. However, it is prone to premature convergence at local minima. To overcome this drawback, a novel hybrid dragonfly algorithm with differential evolution (Hybrid DA-DE) for solving global optimization problems is proposed. Firstly, a novel mutation operator is introduced based on the dragonfly algorithm (DA). Secondly, the scaling factor (F) is adjusted in a self-adaptive and individual-dependent way without extra parameters. The proposed algorithm combines the exploitation capability of DE and exploration capability of DA to achieve optimal global solutions. The effectiveness of this algorithm is evaluated using 30 classical benchmark functions with sixteen state-of-the-art meta-heuristic algorithms. A series of experimental results show that Hybrid DA-DE outperforms other algorithms significantly. Meanwhile, Hybrid DA-DE has the best adaptability to high-dimensional problems.
An Adaptive Bit Allocation for Maximum Bit-Rate Tomlinson-Harashima Precoding Open Access
Shigenori KINJO Shuichi OHNO

LETTER-Communication Theory and Signals

Vol:
E102-A No:10
Page(s):
1438-1442
An adaptive bit allocation scheme for zero-forcing (ZF) Tomlinson-Harashima precoding (THP) is proposed. The ZF-THP enables us to achieve feasible bit error rate (BER) performance when appropriate substream permutations are installed at the transmitter. In this study, the number of bits in each substream is adaptively allocated to minimize the average BER in fading environments. Numerical examples are provided to compare the proposed method with eigenbeam space division multiplexing (E-SDM) method.
Adaptive Multi-Scale Tracking Target Algorithm through Drone
Qiusheng HE Xiuyan SHAO Wei CHEN Xiaoyun LI Xiao YANG Tongfeng SUN

PAPER

Pubricized:
2019/04/26
Vol:
E102-B No:10
Page(s):
1998-2005
In order to solve the influence of scale change on target tracking using the drone, a multi-scale target tracking algorithm is proposed which based on the color feature tracking algorithm. The algorithm realized adaptive scale tracking by training position and scale correlation filters. It can first obtain the target center position of next frame by computing the maximum of the response, where the position correlation filter is learned by the least squares classifier and the dimensionality reduction for color features is analyzed by principal component analysis. The scale correlation filter is obtained by color characteristics at 33 rectangular areas which is set by the scale factor around the central location and is reduced dimensions by orthogonal triangle decomposition. Finally, the location and size of the target are updated by the maximum of the response. By testing 13 challenging video sequences taken by the drone, the results show that the algorithm has adaptability to the changes in the target scale and its robustness along with many other performance indicators are both better than the most state-of-the-art methods in illumination Variation, fast motion, motion blur and other complex situations.
Improving Semi-Blind Uplink Interference Suppression on Multicell Massive MIMO Systems: A Beamspace Approach
Kazuki MARUTA Chang-Jun AHN

PAPER

Pubricized:
2019/02/20
Vol:
E102-B No:8
Page(s):
1503-1511
This paper improves our previously proposed semi-blind uplink interference suppression scheme for multicell multiuser massive MIMO systems by incorporating the beamspace approach. The constant modulus algorithm (CMA), a known blind adaptive array scheme, can fully exploit the degree of freedom (DoF) offered by massive antenna arrays to suppress inter-user interference (IUI) and inter-cell interference (ICI). Unfortunately, CMA wastes a lot of the benefit of DoF for null-steering even when the number of incoming signal is fewer than that of receiving antenna elements. Our new proposal introduces the beamspace method which degenerates the number of array input for CMA from element-space to beamspace. It can control DoF expended for subsequent interference suppression by CMA. Optimizing the array beamforming gain and null-steering ability, can further improve the output signal-to-interference and noise power ratio (SINR). Computer simulation confirmed that our new proposal reduced the required number of data symbols by 34.6%. In addition, the 5th percentile SINR was also improved by 14.3dB.
Power Allocation Scheme for Energy Efficiency Maximization in Distributed Antenna System with Discrete-Rate Adaptive Modulation
Xiangbin YU Xi WANG Tao TENG Qiyishu LI Fei WANG

PAPER-Wireless Communication Technologies

Pubricized:
2019/02/12
Vol:
E102-B No:8
Page(s):
1705-1714
In this paper, we study the power allocation (PA) scheme design for energy efficiency (EE) maximization with discrete-rate adaptive modulation (AM) in the downlink distributed antenna system (DAS). By means of the Karush-Kuhn-Tucker (KKT) conditions, an optimal PA scheme with closed-form expression is derived for maximizing the EE subject to maximum transmit power and target bit error rate (BER) constraints, where the number of active transmit antennas is also derived for attaining PA coefficients. Considering that the optimal scheme needs to calculate the PA of all transmit antennas for each modulation mode, its complexity is extremely high. For this reason, a low-complexity suboptimal PA is also presented based on the antenna selection method. By choosing one or two remote antennas, the suboptimal scheme offers lower complexity than the optimal one, and has almost the same EE performance as the latter. Besides, the outage probability is derived in a performance evaluation. Computer simulation shows that the developed optimal scheme can achieve the same EE as the exhaustive search based approach, which has much higher complexity, and the suboptimal scheme almost matches the EE of the optimal one as well. The suboptimal scheme with two-antenna selection is particularly effective in terms of balancing performance and complexity. Moreover, the derived outage probability is in good agreement with the corresponding simulation.
Adaptive FIR Filtering for PAPR Reduction in OFDM Systems
Hikaru MORITA Teruyuki MIYAJIMA Yoshiki SUGITANI

PAPER-Digital Signal Processing

Vol:
E102-A No:8
Page(s):
938-945
This study proposes a Peak-to-Average Power Ratio (PAPR) reduction method using an adaptive Finite Impulse Response (FIR) filter in Orthogonal Frequency Division Multiplexing systems. At the transmitter, an iterative algorithm that minimizes the p-norm of a transmitted signal vector is used to update the weight coefficients of the FIR filter to reduce PAPR. At the receiver, the FIR filter used at the transmitter is estimated using pilot symbols, and its effect can be compensated for by using an equalizer for proper demodulation. Simulation results show that the proposed method is superior to conventional methods in terms of the PAPR reduction and computational complexity. It also shows that the proposed method has a trade-off between PAPR reduction and bit error rate performance.
Unsupervised Cross-Database Micro-Expression Recognition Using Target-Adapted Least-Squares Regression
Lingyan LI Xiaoyan ZHOU Yuan ZONG Wenming ZHENG Xiuzhen CHEN Jingang SHI Peng SONG

LETTER-Pattern Recognition

Pubricized:
2019/03/26
Vol:
E102-D No:7
Page(s):
1417-1421
Over the past several years, the research of micro-expression recognition (MER) has become an active topic in affective computing and computer vision because of its potential value in many application fields, e.g., lie detection. However, most previous works assumed an ideal scenario that both training and testing samples belong to the same micro-expression database, which is easily broken in practice. In this letter, we hence consider a more challenging scenario that the training and testing samples come from different micro-expression databases and investigated unsupervised cross-database MER in which the source database is labeled while the label information of target database is entirely unseen. To solve this interesting problem, we propose an effective method called target-adapted least-squares regression (TALSR). The basic idea of TALSR is to learn a regression coefficient matrix based on the source samples and their provided label information and also enable this learned regression coefficient matrix to suit the target micro-expression database. We are thus able to use the learned regression coefficient matrix to predict the micro-expression categories of the target micro-expression samples. Extensive experiments on CASME II and SMIC micro-expression databases are conducted to evaluate the proposed TALSR. The experimental results show that our TALSR has better performance than lots of recent well-performing domain adaptation methods in dealing with unsupervised cross-database MER tasks.
A Tile-Based Solution Using Cubemap for Viewport-Adaptive 360-degree Video Delivery
Huyen T. T. TRAN Duc V. NGUYEN Nam PHAM NGOC Truong Cong THANG

PAPER

Pubricized:
2019/01/22
Vol:
E102-B No:7
Page(s):
1292-1300
360-degree video delivery in Virtual Reality is very challenging due to the fact that 360-degree videos require much higher bandwidth than conventional videos. To overcome this problem, viewport-adaptive streaming has been introduced. In this study, we propose a new adaptation method for tiling-based viewport-adaptive streaming of 360-degree videos. For content preparation, the Cubemap projection format is used, where faces or parts of a face are encoded as tiles. Also, the problem is formulated as an optimization problem, in which each visible tile is weighted based on how that tile overlaps with the viewport. To solve the problem, an approximation algorithm is proposed in this study. An evaluation of the proposed method and reference methods is carried out under different tiling schemes and bandwidths. Experiments show that the Cubemap format with tiling provides a lot of benefits in terms of storage, viewport quality across different viewing directions and bandwidths, and tolerance to prediction errors.
MTTF-Aware Design Methodology of Adaptively Voltage Scaled Circuit with Timing Error Predictive Flip-Flop
Yutaka MASUDA Masanori HASHIMOTO

PAPER

Vol:
E102-A No:7
Page(s):
867-877
Adaptive voltage scaling is a promising approach to overcome manufacturing variability, dynamic environmental fluctuation, and aging. This paper focuses on error prediction based adaptive voltage scaling (EP-AVS) and proposes a mean time to failure (MTTF) aware design methodology for EP-AVS circuits. Main contributions of this work include (1) optimization of both voltage-scaled circuit and voltage control logic, and (2) quantitative evaluation of power saving for practically long MTTF. Experimental results show that the proposed EP-AVS design methodology achieves 38.0% power saving while satisfying given target MTTF.
Extended Beamforming by Sum and Difference Composite Co-Array for Real-Valued Signals
Sho IWAZAKI Koichi ICHIGE

PAPER-Digital Signal Processing

Vol:
E102-A No:7
Page(s):
918-925
We have developed a novel array configuration based on the combination of sum and difference co-arrays. There have been many studies on array antenna configurations that enhance the degree of freedom (DOF) of an array, but the maximum DOF of the difference co-array configuration is often limited. With our proposed array configuration, called “sum and difference composite co-array”, we aim to further enhance the DOF by combining the concept of sum co-array and difference co-array. The performance of the proposed array configuration is evaluated through computer simulated beamforming*.
Adaptive Group Formation Scheme for Mobile Group Wireless Sensor Networks
Mochammad Zen Samsono HADI Yuichi MIYAJI Hideyuki UEHARA

PAPER-Network

Pubricized:
2019/01/09
Vol:
E102-B No:7
Page(s):
1313-1322
In this paper, we propose a novel group formation scheme which is integrated with an EMGC protocol in order to cope with dynamic group change. It uses a link expiration time and residual energy to form a stable link in a group. It also has a group merging procedure to decrease the number of groups. Furthermore, we develop two additional functions for the protocol, i.e., GL rotation and a stay connection procedure to diminish energy consumption of sensor nodes in the network. Simulation results show that the proposed protocol outperforms MBC, EMGCwoh, and EMGC protocols in terms of data delivery, network lifetime, and energy dissipation per round with various group change probabilities and percentages of groups.

161-180hit(1871hit)

Keyword Search Result

[Keyword] Ada(1871hit)

Transferring Adaptive Bit Rate Streaming Quality Models from H.264/HD to H.265/4K-UHD Open Access

Sparse Time-Varying Complex AR (TV-CAR) Speech Analysis Based on Adaptive LASSO

Adaptive-Partial Template Update with Center-Shifting Recovery for High Frame Rate and Ultra-Low Delay Deformation Matching

New Sub-Band Adaptive Volterra Filter for Identification of Loudspeaker

Adversarial Domain Adaptation Network for Semantic Role Classification

Target-Adapted Subspace Learning for Cross-Corpus Speech Emotion Recognition

Hadamard-Type Matrices on Finite Fields and Complete Complementary Codes

A New Formula to Compute the NLMS Algorithm at a Computational Complexity of O(2N)

Personalized Food Image Classifier Considering Time-Dependent and Item-Dependent Food Distribution Open Access

Hybridizing Dragonfly Algorithm with Differential Evolution for Global Optimization Open Access

An Adaptive Bit Allocation for Maximum Bit-Rate Tomlinson-Harashima Precoding Open Access

Adaptive Multi-Scale Tracking Target Algorithm through Drone

Improving Semi-Blind Uplink Interference Suppression on Multicell Massive MIMO Systems: A Beamspace Approach

Power Allocation Scheme for Energy Efficiency Maximization in Distributed Antenna System with Discrete-Rate Adaptive Modulation

Adaptive FIR Filtering for PAPR Reduction in OFDM Systems

Unsupervised Cross-Database Micro-Expression Recognition Using Target-Adapted Least-Squares Regression

A Tile-Based Solution Using Cubemap for Viewport-Adaptive 360-degree Video Delivery

MTTF-Aware Design Methodology of Adaptively Voltage Scaled Circuit with Timing Error Predictive Flip-Flop

Extended Beamforming by Sum and Difference Composite Co-Array for Real-Valued Signals

Adaptive Group Formation Scheme for Mobile Group Wireless Sensor Networks

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles