Selected Topics from LVCSR Research for Asian Languages at Tokyo Tech

Sadaoki FURUI

doi:10.1587/transinf.E95.D.1182

Summary:

This paper presents our recent work in regard to building Large Vocabulary Continuous Speech Recognition (LVCSR) systems for the Thai, Indonesian, and Chinese languages. For Thai, since there is no word boundary in the written form, we have proposed a new method for automatically creating word-like units from a text corpus, and applied topic and speaking style adaptation to the language model to recognize spoken-style utterances. For Indonesian, we have applied proper noun-specific adaptation to acoustic modeling, and rule-based English-to-Indonesian phoneme mapping to solve the problem of large variation in proper noun and English word pronunciation in a spoken-query information retrieval system. In spoken Chinese, long organization names are frequently abbreviated, and abbreviated utterances cannot be recognized if the abbreviations are not included in the dictionary. We have proposed a new method for automatically generating Chinese abbreviations, and by expanding the vocabulary using the generated abbreviations, we have significantly improved the performance of spoken query-based search.

Publication: IEICE TRANSACTIONS on Information Vol.E95-D No.5 pp.1182-1194

Publication Date: 2012/05/01

Online ISSN: 1745-1361

DOI: 10.1587/transinf.E95.D.1182

Type of Manuscript: Special Section PAPER (Special Section on Recent Advances in Multimedia Signal Processing Techniques and Applications)

Category: Speech Processing

Sadaoki FURUI, "Selected Topics from LVCSR Research for Asian Languages at Tokyo Tech" in IEICE TRANSACTIONS on Information, vol. E95-D, no. 5, pp. 1182-1194, May 2012, doi: 10.1587/transinf.E95.D.1182.
URL: https://global.ieice.org/en_transactions/information/10.1587/transinf.E95.D.1182/_p

@ARTICLE{e95-d_5_1182,
author={Sadaoki FURUI},
journal={IEICE TRANSACTIONS on Information},
title={Selected Topics from LVCSR Research for Asian Languages at Tokyo Tech},
year={2012},
volume={E95-D},
number={5},
pages={1182-1194},
abstract={This paper presents our recent work in regard to building Large Vocabulary Continuous Speech Recognition (LVCSR) systems for the Thai, Indonesian, and Chinese languages. For Thai, since there is no word boundary in the written form, we have proposed a new method for automatically creating word-like units from a text corpus, and applied topic and speaking style adaptation to the language model to recognize spoken-style utterances. For Indonesian, we have applied proper noun-specific adaptation to acoustic modeling, and rule-based English-to-Indonesian phoneme mapping to solve the problem of large variation in proper noun and English word pronunciation in a spoken-query information retrieval system. In spoken Chinese, long organization names are frequently abbreviated, and abbreviated utterances cannot be recognized if the abbreviations are not included in the dictionary. We have proposed a new method for automatically generating Chinese abbreviations, and by expanding the vocabulary using the generated abbreviations, we have significantly improved the performance of spoken query-based search.},
keywords={},
doi={10.1587/transinf.E95.D.1182},
ISSN={1745-1361},
month={May}
}

TY - JOUR
TI - Selected Topics from LVCSR Research for Asian Languages at Tokyo Tech
T2 - IEICE TRANSACTIONS on Information
SP - 1182
EP - 1194
AU - Sadaoki FURUI
PY - 2012
DO - 10.1587/transinf.E95.D.1182
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E95-D
IS - 5
JA - IEICE TRANSACTIONS on Information
Y1 - May 2012
AB - This paper presents our recent work in regard to building Large Vocabulary Continuous Speech Recognition (LVCSR) systems for the Thai, Indonesian, and Chinese languages. For Thai, since there is no word boundary in the written form, we have proposed a new method for automatically creating word-like units from a text corpus, and applied topic and speaking style adaptation to the language model to recognize spoken-style utterances. For Indonesian, we have applied proper noun-specific adaptation to acoustic modeling, and rule-based English-to-Indonesian phoneme mapping to solve the problem of large variation in proper noun and English word pronunciation in a spoken-query information retrieval system. In spoken Chinese, long organization names are frequently abbreviated, and abbreviated utterances cannot be recognized if the abbreviations are not included in the dictionary. We have proposed a new method for automatically generating Chinese abbreviations, and by expanding the vocabulary using the generated abbreviations, we have significantly improved the performance of spoken query-based search.
ER -