IEICE global.ieice.org Site

Author Search Result

[Author] Hongwu YANG(2hit)

1-2hit

Dynamic Bayesian Network Inversion for Robust Speech Recognition
Lei XIE Hongwu YANG

LETTER-Speech and Hearing

Vol:
E90-D No:7
Page(s):
1117-1120
This paper presents an inversion algorithm for dynamic Bayesian networks towards robust speech recognition, namely DBNI, which is a generalization of hidden Markov model inversion (HMMI). As a dual procedure of expectation maximization (EM)-based model reestimation, DBNI finds the 'uncontaminated' speech by moving the input noisy speech to the Gaussian means under the maximum likelihood (ML) sense given the DBN models trained on clean speech. This algorithm can provide both the expressive advantage from DBN and the noise-removal feature from model inversion. Experiments on the Aurora 2.0 database show that the hidden feature model (a typical DBN for speech recognition) with the DBNI algorithm achieves superior performance in terms of word error rate reduction.
Perceptually Weighted Mel-Cepstrum Analysis of Speech Based on Psychoacoustic Model
Hongwu YANG Dezhi HUANG Lianhong CAI

LETTER-Speech and Hearing

Vol:
E89-D No:12
Page(s):
2998-3001
This letter proposes a novel approach for mel-cepstral analysis based on the psychoacoustic model of MPEG. A perceptual weighting function is developed by applying cubic spline interpolation on the signal-to-mask ratios (SMRs) which are obtained from the psychoacoustic model. Experiments on speaker identification and speech re-synthesis showed that the proposed method not only improved the speaker recognition performance, but also improved the speech quality of the re-synthesized speech.

Author Search Result

[Author] Hongwu YANG(2hit)

Dynamic Bayesian Network Inversion for Robust Speech Recognition

Perceptually Weighted Mel-Cepstrum Analysis of Speech Based on Psychoacoustic Model

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles