IEICE global.ieice.org Site

Keyword Search Result

[Keyword] ATI(18690hit)

9381-9400hit(18690hit)

An Irregular Search Window Reuse Scheme for MPEG-2 to H.264 Transcoding
Xiang-Hui WEI Shen LI Yang SONG Satoshi GOTO

PAPER-Image Coding and Video Coding

Vol: E91-A No: 3, Page(s): 749-755
Motion estimation (ME) is a computation-intensive module in a video coding system. In MPEG-2 to H.264 transcoding, reusing the motion vector (MV) from MPEG-2 as the search center in the H.264 encoder is a simple but effective technique to simplify ME processing. However, directly applying the MPEG-2 MV as the search center makes it difficult to apply data reuse methods in hardware design, because of the irregular overlapping of search windows between successive macroblocks (MBs). In this paper, we propose a search window reuse scheme for transcoding, especially for HDTV applications. By utilizing the similarity between neighboring MVs, the overlapping area of search windows can be regularized. Experimental results show that our method achieves an average search window reuse rate of 93.1% for HDTV720p sequences with almost no video quality degradation. Compared to a transcoding method without any data reuse scheme, the bandwidth of the proposed method is reduced to 40.6%.
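The "irregular overlapping" mentioned in this abstract can be illustrated with a small geometric sketch. It is not the paper's scheme or its reuse-rate definition. The function name, the 16-pixel macroblock size, and the ±16-pixel search range are all assumptions made for illustration. It computes how much of a right-hand neighbor's search window is already covered by the previous macroblock's window when each window is centered on its own MV.

```python
def overlap_fraction(mv_a, mv_b, mb_size=16, search_range=16):
    """Fraction of macroblock B's search window shared with macroblock A's.

    A sits at x = 0 and B is its right-hand neighbor at x = mb_size.
    Each window is centered on the macroblock position plus its MV.
    mb_size and search_range are illustrative assumptions.
    """
    side = mb_size + 2 * search_range          # window side length in pixels
    ax, ay = mv_a[0], mv_a[1]                  # window A offset
    bx, by = mb_size + mv_b[0], mv_b[1]        # window B offset
    ox = max(0, side - abs(bx - ax))           # horizontal overlap
    oy = max(0, side - abs(by - ay))           # vertical overlap
    return (ox * oy) / (side * side)


# Identical MVs give a regular, predictable overlap ...
regular = overlap_fraction((0, 0), (0, 0))
# ... while differing MVs shrink and shift the shared area.
irregular = overlap_fraction((0, 0), (8, -4))
```

With identical MVs the shared area depends only on the macroblock spacing, which is what makes a regularized window layout attractive for on-chip data reuse.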
Development of Cryopackaging and I/O Technologies for High-Speed Superconductive Digital Systems
Yoshihito HASHIMOTO Shinichi YOROZU Yoshio KAMEDA

INVITED PAPER

Vol: E91-C No: 3, Page(s): 325-332
A cryocooled system with I/O interface circuits, which enables high-speed system operation of superconductive single-flux-quantum (SFQ) circuits at over 40 GHz, and the demonstration of a 47-Gbps SFQ 2×2 switch system are presented. The cryocooled system has 32 I/Os and cools an SFQ multi-chip module (MCM) to 4 K with a two-stage 1-W Gifford-McMahon cryocooler. An SFQ 4:1 multiplexer (MUX) and an SFQ 1:4 demultiplexer (DEMUX) have been designed to interface the speed gap between the I/O (~10 Gbps/ch) and SFQ circuits (>40 GHz). An SFQ 2×2 switch chip, in which the MUX/DEMUX and an SFQ 2×2 switch are integrated, and an 8-channel superconductive voltage driver (SVD) chip have been designed with an advanced cell library for a junction critical current density of 10 kA/cm². An SFQ 2×2 switch MCM has been made by flip-chip bonding the switch chip and SVD chip on a superconductive MCM carrier with φ 50-µm InSn solder bumps. An SFQ 2×2 switch system, which is the switch MCM packaged in the cryocooled system, has been demonstrated up to a port speed of 47 Gbps for the first time.
A Sparse Decomposition Method for Periodic Signal Mixtures
Makoto NAKASHIZUKA

PAPER-Digital Signal Processing

Vol: E91-A No: 3, Page(s): 791-800
This study proposes a method to decompose a signal into a set of periodic signals. The proposed decomposition method imposes a penalty on the resultant periodic subsignals in order to improve the sparsity of decomposition and avoid the overestimation of periods. This penalty is defined as the weighted sum of the l2 norms of the resultant periodic subsignals. This decomposition is approximated by an unconstrained minimization problem. In order to solve this problem, a relaxation algorithm is applied. In the experiments, decomposition results are presented to demonstrate the simultaneous detection of periods and waveforms hidden in signal mixtures.
Single Sinusoidal Frequency Estimation Using Second and Fourth Order Linear Prediction Errors
Kenneth Wing-Kin LUI Hing-Cheung SO

LETTER-Digital Signal Processing

Vol: E91-A No: 3, Page(s): 875-878
By utilizing the second and fourth order linear prediction errors, a novel estimator for a single noisy sinusoid is devised. The frequency estimate is obtained by solving a cubic equation, and a simple root selection procedure is provided. The asymptotic variance of the estimated frequency is derived and confirmed by computer simulations. It is demonstrated that the proposed estimator is superior to the reformed Pisarenko harmonic decomposer, which is an improved version of the Pisarenko harmonic decomposer.
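For background, the second-order linear prediction property that such estimators build on can be sketched as follows. This is not the estimator proposed in the letter, which solves a cubic equation using both second and fourth order errors. It is the plain least-squares form of the identity x[n] + x[n-2] = 2 cos(ω) x[n-1], which holds exactly for a noise-free sinusoid. The function name is an assumption for illustration.

```python
import numpy as np


def lp2_frequency(x):
    """Estimate the frequency (rad/sample) of a single real sinusoid.

    Uses the second-order linear prediction identity
        x[n] + x[n-2] = 2*cos(w)*x[n-1]
    and solves for cos(w) in the least-squares sense.
    Exact for noise-free data; biased when noise is present.
    """
    x = np.asarray(x, dtype=float)
    mid = x[1:-1]                              # x[n-1]
    num = np.dot(mid, x[2:] + x[:-2])          # sum x[n-1]*(x[n] + x[n-2])
    den = 2.0 * np.dot(mid, mid)               # 2 * sum x[n-1]^2
    cos_w = np.clip(num / den, -1.0, 1.0)      # guard against rounding
    return float(np.arccos(cos_w))


n = np.arange(64)
w_hat = lp2_frequency(1.3 * np.cos(0.7 * n + 0.4))
```

The bias of this simple form under noise is the usual motivation for corrected variants such as the Pisarenko-type estimators named in the abstract.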
Improved Fading Scheme for Spatio-Temporal Error Concealment in Video Transmission
Min-Cheol HWANG Jun-Hyung KIM Chun-Su PARK Sung-Jea KO

PAPER-Image Coding and Video Coding

Vol: E91-A No: 3, Page(s): 740-748
Error concealment at a decoder is an efficient method to reduce the degradation of visual quality caused by channel errors. In this paper, we propose a novel spatio-temporal error concealment algorithm based on the recently introduced spatial-temporal fading (STF) scheme. Although STF achieves good error concealment performance, several drawbacks, including blurring, still remain in the concealed blocks. To alleviate these drawbacks, the proposed method uses hybrid approaches with adaptive weights. First, the boundary matching algorithm and decoder motion vector estimation, which are well-known temporal error concealment methods, are adaptively combined so that each compensates for the defects of the other. Then, an edge-preserving method is utilized to reduce the blurring effects caused by the bilinear interpolation for spatial error concealment. Finally, the two concealed results obtained by the hybrid spatial and temporal error concealment are blended pixel-wise with adaptive weights. Experimental results show that the proposed method outperforms conventional methods including STF in terms of PSNR as well as subjective visual quality, and that the computational complexity of the proposed method is similar to that of STF.
A Subsampling-Based Digital Image Watermarking Scheme Resistant to Permutation Attack
Chuang LIN Jeng-Shyang PAN Chia-An HUANG

LETTER-Image

Vol: E91-A No: 3, Page(s): 911-915
The letter proposes a novel subsampling-based digital image watermarking scheme resisting the permutation attack. The subsampling-based watermarking schemes have drawn great attention for their convenience and effectiveness in recent years, but the traditional subsampling-based watermarking schemes are very vulnerable to the permutation attack. In this letter, the watermark information is embedded in the average values of the 1-level DWT coefficients to resist the permutation attack. The concrete embedding process is achieved by the quantization-based method. Experimental results show that the proposed scheme can resist not only the permutation attack but also some common image processing attacks.
Filtering in Generalized Signal-Dependent Noise Model Using Covariance Information
Seiichi NAKAMORI María J. GARCIA-LIGERO Aurora HERMOSO-CARAZO Josefa LINARES-PEREZ

PAPER-Digital Signal Processing

Vol: E91-A No: 3, Page(s): 809-817
In this paper, we propose a recursive filtering algorithm to restore monochromatic images which are corrupted by general dependent additive noise. It is assumed that the equation which describes the image field is not available and a filtering algorithm is obtained using the information provided by the covariance functions of the signal, noise that affects the measurement equation, and the fourth-order moments of the signal. The proposed algorithm is obtained by an innovation approach which provides a simple derivation of the least mean-squared error linear estimators. The estimation of the grey level in each spatial coordinate is made taking into account the information provided by the grey levels located on the row of the pixel to be estimated. The proposed filtering algorithm is applied to restore images which are affected by general signal-dependent additive noise.
Basic Bifurcation of Artificial Spiking Neurons with Triangular Base Signal
Toshimitsu OHTANI Toshimichi SAITO

LETTER-Nonlinear Problems

Vol: E91-A No: 3, Page(s): 891-894
This paper studies a spiking neuron circuit with a triangular base signal. The circuit can output rich spike-trains and the dynamics can be analyzed using a one-dimensional piecewise linear map. This system exhibits period doubling bifurcation, tangent bifurcation, super-stable periodic orbit bifurcation and so on. These phenomena can be characterized based on the inter-spike intervals. Using the maps, we can analyze the phenomena precisely. By presenting a simple test circuit, typical phenomena are confirmed experimentally.
Embedded System Implementation of Sound Localization in Proximal Region
Nobuyuki IWANAGA Tomoya MATSUMURA Akihiro YOSHIDA Wataru KOBAYASHI Takao ONOYE

PAPER-Engineering Acoustics

Vol: E91-A No: 3, Page(s): 763-771
A sound localization method in the proximal region is proposed, which is based on a low-cost 3D sound localization algorithm with the use of head-related transfer functions (HRTFs). The auditory parallax model is applied to the current algorithm so that more accurate HRTFs can be used for sound localization in the proximal region. In addition, head-shadowing effects based on a rigid-sphere model are reproduced in the proximal region by means of a second-order IIR filter. A subjective listening test demonstrates the effectiveness of the proposed method. An embedded system implementation of the proposed method is also described, showing that the proposed method improves sound effects in the proximal region with only a 5.1% increase in memory capacity and an 8.3% increase in computational cost.
Impact of Channel Estimation Error on the Sum-Rate in MIMO Broadcast Channels with User Selection
Yupeng LIU Ling QIU

LETTER-Wireless Communication Technologies

Vol: E91-B No: 3, Page(s): 955-958
We investigate MIMO broadcast channels with imperfect channel knowledge due to estimation error and with many more users than transmit antennas, so as to exploit multiuser diversity. The channel estimation error causes interference among users, resulting in a sum-rate loss. A tight upper bound on this sum-rate loss based on zero-forcing beamforming is derived theoretically. This bound depends only on the channel estimation quality and the number of transmit antennas, not on the number of users. Based on this upper bound, we show that this system maintains full multiuser diversity and always benefits from increasing transmit power.
Canonicalization of Feature Parameters for Robust Speech Recognition Based on Distinctive Phonetic Feature (DPF) Vectors
Mohammad NURUL HUDA Muhammad GHULAM Takashi FUKUDA Kouichi KATSURADA Tsuneo NITTA

PAPER-Feature Extraction

Vol: E91-D No: 3, Page(s): 488-498
This paper describes a robust automatic speech recognition (ASR) system with less computation. Acoustic models of a hidden Markov model (HMM)-based classifier include various types of hidden factors such as speaker-specific characteristics, coarticulation, and the acoustic environment. If there exists a canonicalization process that can recover the degraded margin of acoustic likelihoods between correct phonemes and other ones caused by hidden factors, the robustness of ASR systems can be improved. In this paper, we introduce a canonicalization method that is composed of multiple distinctive phonetic feature (DPF) extractors corresponding to each hidden factor canonicalization, and a DPF selector which selects an optimum DPF vector as an input of the HMM-based classifier. The proposed method resolves gender factors and speaker variability, and eliminates noise factors by applying the canonicalization based on the DPF extractors and two-stage Wiener filtering. In the experiment on AURORA-2J, the proposed method provides higher word accuracy under clean training and significant improvement of word accuracy in low signal-to-noise ratio (SNR) under multi-condition training compared to a standard ASR system with mel-frequency cepstral coefficient (MFCC) parameters. Moreover, the proposed method requires only two-fifths of the Gaussian mixture components and less memory to achieve accurate ASR.
Improvements in Fabrication Process for Nb-Based Single Flux Quantum Circuits in Japan
Mutsuo HIDAKA Shuichi NAGASAWA Kenji HINODE Tetsuro SATOH

INVITED PAPER

Vol: E91-C No: 3, Page(s): 318-324
We developed an Nb-based fabrication process for single flux quantum (SFQ) circuits in a Japanese government project that began in September 2002 and ended in March 2007. Our conventional process, called the Standard Process (SDP), was improved by overhauling all the process steps and routine process checks for all wafers. Wafer yield with the improved SDP dramatically increased from 50% to over 90%. We also developed a new fabrication process for SFQ circuits, called the Advanced Process (ADP). The specifications for ADP are nine planarized Nb layers, a minimum Josephson junction (JJ) size of 1×1 µm, a line width of 0.8 µm, a JJ critical current density of 10 kA/cm², a 2.4 Ω Mo sheet resistance, and vertically stacked superconductive contact holes. We fabricated an eight-bit SFQ shift register, a one million SQUID array and a 16-kbit RAM by using the ADP. The shift register was operated up to 120 GHz and no short or open circuits were detected in the one million SQUID array. We confirmed correct memory operations by the 16-kbit RAM and a 5.7 times greater integration level compared to that possible with the SDP.
Development, Long-Term Operation and Portability of a Real-Environment Speech-Oriented Guidance System
Tobias CINCAREK Hiromichi KAWANAMI Ryuichi NISIMURA Akinobu LEE Hiroshi SARUWATARI Kiyohiro SHIKANO

PAPER-Applications

Vol: E91-D No: 3, Page(s): 576-587
In this paper, the development, long-term operation and portability of a practical ASR application in a real environment are investigated. The target application is a speech-oriented guidance system installed at the local community center. The system has been exposed to ordinary people since November 2002. More than 300 hours or more than 700,000 inputs have been collected over four years. The outcome is a rare example of a large-scale real-environment speech database. A simulation experiment is carried out with this database to investigate how the system's performance improves during the first two years of operation. The purpose is to determine empirically the amount of real-environment data which has to be prepared to build a system with reasonable speech recognition performance and response accuracy. Furthermore, the relative importance of developing the main system components, i.e., the speech recognizer and the response generation module, is assessed. Although this depends on the system's modeling capacities and domain complexity, experimental results show that overall performance stagnates after employing about 10-15 k utterances for training the acoustic model, 40-50 k utterances for training the language model and 40-50 k utterances for compiling the question and answer database. The Q&A database was most important for improving the system's response accuracy. Finally, the portability of the well-trained first system prototype for a different environment, a local subway station, is investigated. Since collection and preparation of large amounts of real data is impractical in general, only one month of data from the new environment is employed for system adaptation. While the speech recognition component of the first prototype has a high degree of portability, the response accuracy is lower than in the first environment. The main reason is a domain difference between the two systems, since they are installed in different environments. This implies that it is imperative to take the behavior of users under real conditions into account to build a system with high user satisfaction.
Stability-Guaranteed Width Control for Hot Strip Mill
Cheol Jae PARK I Cheol HWANG

LETTER-Systems and Control

Vol: E91-A No: 3, Page(s): 883-886
We propose a stability-guaranteed width control (SGWC) for the hot strip finishing mill. It is shown that the proposed SGWC guarantees the stability of the width controller by the universal approximation of the neural network. It is shown through the field test in the hot strip mill of POSCO that the stability of the width controller is guaranteed by the proposed control scheme.
Omnidirectional Audio-Visual Talker Localization Based on Dynamic Fusion of Audio-Visual Features Using Validity and Reliability Criteria
Yuki DENDA Takanobu NISHIURA Yoichi YAMASHITA

PAPER-Applications

Vol: E91-D No: 3, Page(s): 598-606
This paper proposes a robust omnidirectional audio-visual (AV) talker localizer for AV applications. The proposed localizer consists of two innovations. One of them is robust omnidirectional audio and visual features. The direction of arrival (DOA) estimation using an equilateral triangular microphone array, and human position estimation using an omnidirectional video camera extract the AV features. The other is a dynamic fusion of the AV features. The validity criterion, called the audio- or visual-localization counter, validates each audio- or visual-feature. The reliability criterion, called the speech arriving evaluator, acts as a dynamic weight to eliminate any prior statistical properties from its fusion procedure. The proposed localizer can compatibly achieve talker localization in a speech activity and user localization in a non-speech activity under the identical fusion rule. Talker localization experiments were conducted in an actual room to evaluate the effectiveness of the proposed localizer. The results confirmed that the talker localization performance of the proposed AV localizer using the validity and reliability criteria is superior to that of conventional localizers.
A High-Speed Pipelined Degree-Computationless Modified Euclidean Algorithm Architecture for Reed-Solomon Decoders
Seungbeom LEE Hanho LEE

PAPER-VLSI Design Technology and CAD

Vol: E91-A No: 3, Page(s): 830-835
This paper presents a novel high-speed low-complexity pipelined degree-computationless modified Euclidean (pDCME) algorithm architecture for high-speed RS decoders. The pDCME algorithm allows elimination of the degree computation so as to reduce hardware complexity and obtain high-speed processing. A high-speed RS decoder based on the pDCME algorithm has been designed and implemented with 0.13-µm CMOS standard cell technology at a supply voltage of 1.1 V. The proposed RS decoder operates at a clock frequency of 660 MHz and has a throughput of 5.3 Gb/s. The proposed architecture requires approximately 15% fewer gates and simpler control logic than architectures based on the popular modified Euclidean algorithm.
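The reported clock frequency and throughput are consistent with the decoder processing one 8-bit symbol per clock cycle. That symbol width is an assumption here (it is typical of RS codes over GF(2^8)) and is not stated in the abstract. The helper name below is likewise hypothetical. A quick arithmetic check:

```python
def rs_throughput_gbps(clock_mhz, bits_per_symbol=8, symbols_per_cycle=1):
    """Decoder throughput in Gb/s, assuming a fixed number of symbols per clock.

    bits_per_symbol=8 and symbols_per_cycle=1 are assumptions,
    not figures taken from the abstract.
    """
    return clock_mhz * 1e6 * bits_per_symbol * symbols_per_cycle / 1e9


throughput = rs_throughput_gbps(660)   # 660 MHz clock from the abstract
```

Under these assumptions the result is 5.28 Gb/s, which rounds to the 5.3 Gb/s quoted above.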
TCP Flow Level Performance Evaluation on Error Rate Aware Scheduling Algorithms in Evolved UTRA and UTRAN Networks
Yan ZHANG Masato UCHIDA Masato TSURU Yuji OIE

PAPER-Wireless Communication Technologies

Vol: E91-B No: 3, Page(s): 761-771
We present a TCP flow level performance evaluation on error rate aware scheduling algorithms in Evolved UTRA and UTRAN networks. With the introduction of the error rate, which is the probability of transmission failure under a given wireless condition and the instantaneous transmission rate, the transmission efficiency can be improved without sacrificing the balance between system performance and user fairness. The performance comparison with and without error rate awareness is carried out depending on various TCP traffic models, user channel conditions, schedulers with different fairness constraints, and automatic repeat request (ARQ) types. The results indicate that error rate awareness can make the resource allocation more reasonable and effectively improve the system and individual performance, especially for users with poor channel conditions.
Bilingual Cluster Based Models for Statistical Machine Translation
Hirofumi YAMAMOTO Eiichiro SUMITA

PAPER-Applications

Vol: E91-D No: 3, Page(s): 588-597
We propose a domain specific model for statistical machine translation. It is well-known that domain specific language models perform well in automatic speech recognition. We show that domain specific language and translation models also benefit statistical machine translation. However, there are two problems with using domain specific models. The first is the data sparseness problem. We employ an adaptation technique to overcome this problem. The second issue is domain prediction. In order to perform adaptation, the domain must be provided; however, in many cases the domain is not known or changes dynamically. For these cases, not only the translation target sentence but also the domain must be predicted. This paper focuses on the domain prediction problem for statistical machine translation. In the proposed method, a bilingual training corpus is automatically clustered into sub-corpora. Each sub-corpus is deemed to be a domain. The domain of a source sentence is predicted by using its similarity to the sub-corpora. The predicted domain (sub-corpus) specific language and translation models are then used for the translation decoding. This approach gave an improvement of 2.7 in BLEU score on the IWSLT05 Japanese to English evaluation corpus (improving the score from 52.4 to 55.1). This is a substantial gain and indicates the validity of the proposed bilingual cluster based models.
RSFQ Baseband Digital Signal Processing
Anna Yurievna HERR

INVITED PAPER

Vol: E91-C No: 3, Page(s): 293-305
The ultrafast switching speed of superconducting digital circuits enables the realization of digital signal processors with performance unattainable by any other technology. Based on rapid single-flux-quantum (RSFQ) logic, these integrated circuits are capable of delivering high computation capacity up to 30 GOPS on a single processor and very short latency of 0.1 ns. There are two main applications of such hardware for practical telecommunication systems: filters for superconducting ADCs operating with digital RF data and recursive filters at baseband. The latter allows functions such as multiuser detection for 3G WCDMA, equalization and channel precoding for 4G OFDM MIMO, and general blind detection. The performance gain is an increase in the cell capacity, quality of service, and transmitted data rate. The current status of the development of the RSFQ baseband DSP is discussed. Major components with an operating speed of 30 GHz have been developed. Designs, test results, and future development of the complete systems including cryopackaging and CMOS interface are reviewed.
Underwater Transient Signal Classification Using Binary Pattern Image of MFCC and Neural Network
Taegyun LIM Keunsung BAE Chansik HWANG Hyeonguk LEE

LETTER-Engineering Acoustics

Vol: E91-A No: 3, Page(s): 772-774
This paper presents a new method for classification of underwater transient signals, which employs a binary image pattern of the mel-frequency cepstral coefficients as a feature vector and a feed-forward neural network as a classifier. The feature vector is obtained by taking DCT and 1-bit quantization for the square matrix of the mel-frequency cepstral coefficients that is derived from the frame based cepstral analysis. The classifier is a feed-forward neural network having one hidden layer and one output layer, and a back propagation algorithm is used to update the weighting vector of each layer. Experimental results with underwater transient signals demonstrate that the proposed method is very promising for classification of underwater transient signals.
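The feature extraction step described in this abstract (DCT followed by 1-bit quantization of a square MFCC matrix) can be sketched roughly as below. This is a minimal illustration, not the authors' implementation. It assumes the MFCC matrix is already computed, that the DCT is a two-dimensional orthonormal DCT-II, and that 1-bit quantization means thresholding at zero. None of these details are given in the abstract, and the function names are hypothetical.

```python
import numpy as np


def dct_matrix(n):
    """Orthonormal DCT-II transform matrix of size n x n."""
    k = np.arange(n)[:, None]                  # frequency index (rows)
    i = np.arange(n)[None, :]                  # sample index (columns)
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] = np.sqrt(1.0 / n)                 # DC row scaling
    return m


def binary_pattern(mfcc):
    """Binary pattern image from a square MFCC matrix.

    Applies a 2-D DCT, then keeps one bit per coefficient
    (1 if positive, else 0). The zero threshold is an assumption.
    """
    mfcc = np.asarray(mfcc, dtype=float)
    n = mfcc.shape[0]
    if mfcc.shape != (n, n):
        raise ValueError("expected a square MFCC matrix")
    d = dct_matrix(n)
    coeffs = d @ mfcc @ d.T                    # separable 2-D DCT
    return (coeffs > 0).astype(np.uint8)


# Placeholder input: a random 12 x 12 matrix standing in for real MFCCs.
rng = np.random.default_rng(0)
pattern = binary_pattern(rng.standard_normal((12, 12)))
```

The flattened pattern (`pattern.ravel()`) would then serve as the input vector to the feed-forward network mentioned in the abstract.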
