The search functionality is under construction.
The search functionality is under construction.

Speaker-Independent Isolated Word Recognition Based on Dynamics-Emphasized Cepstrum

Sadaoki FURUI

  • Full Text Views

    0

  • Cite this

Summary :

A new analysis technique applicable to speech recognition is proposed considering the auditory mechanism of speech perception which emphasizes spectral dynamics as well as compensates for the spectral undershoot associated with coarticulation. A speech wave is represented by the LPC cepstrum and logarithmic energy sequences, and the time sequences over short periods are expanded by the first- and second-order polynomial functions at every frame period. The dynamics of the cepstrum sequences are then emphasized by the linear combination of their polynomial expansion coefficients, that is, derivatives, and their instantaneous values. Speaker-independent word recognition experiments using time functions of the dynamics-emphasized cepstrum and the polynomial coefficient for energy indicate that the error rate can be largely reduced by this method. The experimental results are compared with those obtained by the previous method in which the polynomial coefficients for the cepstrum and energy time functions were used in combination with the original time functions of these parameters as independent parameters.

Publication
IEICE TRANSACTIONS on transactions Vol.E69-E No.12 pp.1310-1317
Publication Date
1986/12/25
Publicized
Online ISSN
DOI
Type of Manuscript
PAPER
Category
Acoustics

Authors

Keyword