SUSKIT---A Speech Understanding System Based on Robust Phone Spotting--

Yutaka KOBAYASHI, Masanori OMOTE, Hidenori ENDO, Yasuhisa NIIMI

Summary:

This paper gives an overview of our speech understanding system and reports recent results of sentence recognition experiments. The system, which we call SUSKIT-, recognizes database queries spoken as natural Japanese sentences. The user is expected to speak sentence by sentence. Among the difficult problems to overcome, this study paid primary attention to two: how to cope with contextual variations in pronunciation, and how to verify partial sentence hypotheses in a hierarchical system. SUSKIT- predicts word strings in a top-down manner; however, hypotheses are verified against the input speech using a unit that is independent of word boundaries. Words are not suitable units of verification because the smoothing effect of phonetic context makes short words difficult to recognize. To avoid misrecognition caused by the smoothing effect across word boundaries, SUSKIT- dynamically extracts, from the predicted word string, those phoneme strings bounded by easily detectable phonemes and uses them as verification templates. A left-to-right time-synchronous beam-search strategy was adopted for searching likely sentences. We carried out sentence recognition experiments using a speech corpus consisting of 159 sentences read by three Japanese male speakers. The task perplexity was 8.3. Using speaker-dependent HMM parameters, we obtained sentence recognition rates of 83.0-92.5%.
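
The template extraction described in the summary can be illustrated with a minimal sketch. It is not the authors' implementation: the summary does not say which phonemes count as easily detectable, nor whether the bounding phonemes belong to the templates, so the anchor set, the function name extract_templates, and the choice to share each anchor between neighbouring templates are all assumptions made for illustration.

```python
from typing import List, Sequence, Set

# Hypothetical anchor set: the summary does not list the "easily detectable"
# phonemes, so these are placeholders chosen only for the example.
ANCHORS: Set[str] = {"k", "s"}


def extract_templates(
    words: Sequence[Sequence[str]], anchors: Set[str] = ANCHORS
) -> List[List[str]]:
    """Cut a predicted word string into phoneme strings bounded by anchors.

    Assumption: each anchor phoneme closes one template and opens the next,
    so templates can span word boundaries.
    """
    # Flatten the word string so that word boundaries play no role.
    phonemes = [p for word in words for p in word]

    templates: List[List[str]] = []
    current: List[str] = []
    for p in phonemes:
        current.append(p)
        # Close the template at an anchor, unless the anchor is its only phoneme.
        if p in anchors and len(current) > 1:
            templates.append(current)
            current = [p]  # the anchor also starts the following template
    # Keep the trailing stretch if it holds more than a lone anchor.
    if len(current) > 1:
        templates.append(current)
    return templates


# Three predicted words, given as phoneme lists: "aka", "no", "sora".
predicted = [["a", "k", "a"], ["n", "o"], ["s", "o", "r", "a"]]
for template in extract_templates(predicted):
    print(" ".join(template))
```

In this example the middle template, k a n o s, runs across both word boundaries, which is the property the summary attributes to the verification unit: short words such as "no" are never matched in isolation.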

Publication
IEICE TRANSACTIONS on Fundamentals Vol.E74-A No.7 pp.1863-1869
Publication Date
1991/07/25
Type of Manuscript
Special Section PAPER (Special Issue on Continuous Speech Recognition and Understanding)
Category
Speech Understanding