The search functionality is under construction.
The search functionality is under construction.

Efficient Combination of Likelihood Recycling and Batch Calculation for Fast Acoustic Likelihood Calculation

Atsunori OGAWA, Satoshi TAKAHASHI, Atsushi NAKAMURA

  • Full Text Views

    0

  • Cite this

Summary :

This paper proposes an efficient combination of state likelihood recycling and batch state likelihood calculation for accelerating acoustic likelihood calculation in an HMM-based speech recognizer. Recycling and batch calculation are each based on different technical approaches, i.e. the former is a purely algorithmic technique while the latter fully exploits computer architecture. To accelerate the recognition process further by combining them efficiently, we introduce conditional fast processing and acoustic backing-off. Conditional fast processing is based on two criteria. The first potential activity criterion is used to control not only the recycling of state likelihoods at the current frame but also the precalculation of state likelihoods for several succeeding frames. The second reliability criterion and acoustic backing-off are used to control the choice of recycled or batch calculated state likelihoods when they are contradictory in the combination and to prevent word accuracies from degrading. Large vocabulary spontaneous speech recognition experiments using four different CPU machines under two environmental conditions showed that, compared with the baseline recognizer, recycling and batch calculation, our combined acceleration technique further reduced both of the acoustic likelihood calculation time and the total recognition time. We also performed detailed analyses to reveal each technique's acceleration and environmental dependency mechanisms by classifying types of state likelihoods and counting each of them. The analysis results comfirmed the effectiveness of the combined acceleration technique.

Publication
IEICE TRANSACTIONS on Information Vol.E94-D No.3 pp.648-658
Publication Date
2011/03/01
Publicized
Online ISSN
1745-1361
DOI
10.1587/transinf.E94.D.648
Type of Manuscript
PAPER
Category
Speech and Hearing

Authors

Keyword