This paper describes a low-power VLSI chip for speaker-independent 60-kWord continuous speech recognition based on a context-dependent Hidden Markov Model (HMM). It features a compression-decoding scheme to reduce the external memory bandwidth for Gaussian Mixture Model (GMM) computation and multi-path Viterbi transition units. We optimize the internal SRAM size using the max-approximation GMM calculation and adjusting the number of look-ahead frames. The test chip, fabricated in 40 nm CMOS technology, occupies 1.77 mm
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Copy
Guangji HE, Takanobu SUGAHARA, Yuki MIYAMOTO, Shintaro IZUMI, Hiroshi KAWAGUCHI, Masahiko YOSHIMOTO, "A 168-mW 2.4-Real-Time 60-kWord Continuous Speech Recognition Processor VLSI" in IEICE TRANSACTIONS on Electronics,
vol. E96-C, no. 4, pp. 444-453, April 2013, doi: 10.1587/transele.E96.C.444.
Abstract: This paper describes a low-power VLSI chip for speaker-independent 60-kWord continuous speech recognition based on a context-dependent Hidden Markov Model (HMM). It features a compression-decoding scheme to reduce the external memory bandwidth for Gaussian Mixture Model (GMM) computation and multi-path Viterbi transition units. We optimize the internal SRAM size using the max-approximation GMM calculation and adjusting the number of look-ahead frames. The test chip, fabricated in 40 nm CMOS technology, occupies 1.77 mm
URL: https://global.ieice.org/en_transactions/electronics/10.1587/transele.E96.C.444/_p
Copy
@ARTICLE{e96-c_4_444,
author={Guangji HE, Takanobu SUGAHARA, Yuki MIYAMOTO, Shintaro IZUMI, Hiroshi KAWAGUCHI, Masahiko YOSHIMOTO, },
journal={IEICE TRANSACTIONS on Electronics},
title={A 168-mW 2.4-Real-Time 60-kWord Continuous Speech Recognition Processor VLSI},
year={2013},
volume={E96-C},
number={4},
pages={444-453},
abstract={This paper describes a low-power VLSI chip for speaker-independent 60-kWord continuous speech recognition based on a context-dependent Hidden Markov Model (HMM). It features a compression-decoding scheme to reduce the external memory bandwidth for Gaussian Mixture Model (GMM) computation and multi-path Viterbi transition units. We optimize the internal SRAM size using the max-approximation GMM calculation and adjusting the number of look-ahead frames. The test chip, fabricated in 40 nm CMOS technology, occupies 1.77 mm
keywords={},
doi={10.1587/transele.E96.C.444},
ISSN={1745-1353},
month={April},}
Copy
TY - JOUR
TI - A 168-mW 2.4-Real-Time 60-kWord Continuous Speech Recognition Processor VLSI
T2 - IEICE TRANSACTIONS on Electronics
SP - 444
EP - 453
AU - Guangji HE
AU - Takanobu SUGAHARA
AU - Yuki MIYAMOTO
AU - Shintaro IZUMI
AU - Hiroshi KAWAGUCHI
AU - Masahiko YOSHIMOTO
PY - 2013
DO - 10.1587/transele.E96.C.444
JO - IEICE TRANSACTIONS on Electronics
SN - 1745-1353
VL - E96-C
IS - 4
JA - IEICE TRANSACTIONS on Electronics
Y1 - April 2013
AB - This paper describes a low-power VLSI chip for speaker-independent 60-kWord continuous speech recognition based on a context-dependent Hidden Markov Model (HMM). It features a compression-decoding scheme to reduce the external memory bandwidth for Gaussian Mixture Model (GMM) computation and multi-path Viterbi transition units. We optimize the internal SRAM size using the max-approximation GMM calculation and adjusting the number of look-ahead frames. The test chip, fabricated in 40 nm CMOS technology, occupies 1.77 mm
ER -