This paper presents a single-chip speech dialogue module and its evaluation on a personal robot. This module is implemented on an application processor that was developed primarily for mobile phones to provide a compact size, low power consumption, and low cost. It performs speech recognition with preprocessing functions such as direction-of-arrival (DOA) estimation, noise cancellation, beamforming with an array of microphones, and echo cancellation. Text-to-speech (TTS) conversion is also provided. Evaluation results obtained on a new personal robot, PaPeRo-mini, which is a scaled-down version of PaPeRo, demonstrate an 85% correct rate in DOA estimation, and as much as 54% and 30% higher speech recognition rates in noisy environments and during robot utterances, respectively. These results are shown to be comparable to those obtained by PaPeRo.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Miki SATO, Toru IWASAWA, Akihiko SUGIYAMA, Toshihiro NISHIZAWA, Yosuke TAKANO, "A Single-Chip Speech Dialogue Module and Its Evaluation on a Personal Robot, PaPeRo-Mini" in IEICE TRANSACTIONS on Fundamentals,
vol. E93-A, no. 1, pp. 261-271, January 2010, doi: 10.1587/transfun.E93.A.261.
Abstract: This paper presents a single-chip speech dialogue module and its evaluation on a personal robot. This module is implemented on an application processor that was developed primarily for mobile phones to provide a compact size, low power consumption, and low cost. It performs speech recognition with preprocessing functions such as direction-of-arrival (DOA) estimation, noise cancellation, beamforming with an array of microphones, and echo cancellation. Text-to-speech (TTS) conversion is also provided. Evaluation results obtained on a new personal robot, PaPeRo-mini, which is a scaled-down version of PaPeRo, demonstrate an 85% correct rate in DOA estimation, and as much as 54% and 30% higher speech recognition rates in noisy environments and during robot utterances, respectively. These results are shown to be comparable to those obtained by PaPeRo.
URL: https://global.ieice.org/en_transactions/fundamentals/10.1587/transfun.E93.A.261/_p
@ARTICLE{e93-a_1_261,
author={Miki SATO and Toru IWASAWA and Akihiko SUGIYAMA and Toshihiro NISHIZAWA and Yosuke TAKANO},
journal={IEICE TRANSACTIONS on Fundamentals},
title={A Single-Chip Speech Dialogue Module and Its Evaluation on a Personal Robot, PaPeRo-Mini},
year={2010},
volume={E93-A},
number={1},
pages={261--271},
abstract={This paper presents a single-chip speech dialogue module and its evaluation on a personal robot. This module is implemented on an application processor that was developed primarily for mobile phones to provide a compact size, low power consumption, and low cost. It performs speech recognition with preprocessing functions such as direction-of-arrival (DOA) estimation, noise cancellation, beamforming with an array of microphones, and echo cancellation. Text-to-speech (TTS) conversion is also provided. Evaluation results obtained on a new personal robot, PaPeRo-mini, which is a scaled-down version of PaPeRo, demonstrate an 85% correct rate in DOA estimation, and as much as 54% and 30% higher speech recognition rates in noisy environments and during robot utterances, respectively. These results are shown to be comparable to those obtained by PaPeRo.},
keywords={},
doi={10.1587/transfun.E93.A.261},
ISSN={1745-1337},
month={January},
}
TY - JOUR
TI - A Single-Chip Speech Dialogue Module and Its Evaluation on a Personal Robot, PaPeRo-Mini
T2 - IEICE TRANSACTIONS on Fundamentals
SP - 261
EP - 271
AU - Miki SATO
AU - Toru IWASAWA
AU - Akihiko SUGIYAMA
AU - Toshihiro NISHIZAWA
AU - Yosuke TAKANO
PY - 2010
DO - 10.1587/transfun.E93.A.261
JO - IEICE TRANSACTIONS on Fundamentals
SN - 1745-1337
VL - E93-A
IS - 1
JA - IEICE TRANSACTIONS on Fundamentals
Y1 - January 2010
AB - This paper presents a single-chip speech dialogue module and its evaluation on a personal robot. This module is implemented on an application processor that was developed primarily for mobile phones to provide a compact size, low power consumption, and low cost. It performs speech recognition with preprocessing functions such as direction-of-arrival (DOA) estimation, noise cancellation, beamforming with an array of microphones, and echo cancellation. Text-to-speech (TTS) conversion is also provided. Evaluation results obtained on a new personal robot, PaPeRo-mini, which is a scaled-down version of PaPeRo, demonstrate an 85% correct rate in DOA estimation, and as much as 54% and 30% higher speech recognition rates in noisy environments and during robot utterances, respectively. These results are shown to be comparable to those obtained by PaPeRo.
ER -