We propose two methods to fuse auditory information and visual information for accurate sppech recognition. The first method fuses two kinds of information by using linear combination after calculating two kinds of probabilities by HMM for each word. The second method fuses two kinds of information by using the histogram which expresses the correlation of them. We have performed experiments comparing the proposed methods with the conventional method and confirmed the validity of the proposed methods.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Copy
Akira SHINTANI, Akio OGIHARA, Yoshikazu YAMAGUCHI, Yasuhisa HAYASHI, Kunio FUKUNAGA, "Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information" in IEICE TRANSACTIONS on Fundamentals,
vol. E77-A, no. 11, pp. 1875-1878, November 1994, doi: .
Abstract: We propose two methods to fuse auditory information and visual information for accurate sppech recognition. The first method fuses two kinds of information by using linear combination after calculating two kinds of probabilities by HMM for each word. The second method fuses two kinds of information by using the histogram which expresses the correlation of them. We have performed experiments comparing the proposed methods with the conventional method and confirmed the validity of the proposed methods.
URL: https://global.ieice.org/en_transactions/fundamentals/10.1587/e77-a_11_1875/_p
Copy
@ARTICLE{e77-a_11_1875,
author={Akira SHINTANI, Akio OGIHARA, Yoshikazu YAMAGUCHI, Yasuhisa HAYASHI, Kunio FUKUNAGA, },
journal={IEICE TRANSACTIONS on Fundamentals},
title={Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information},
year={1994},
volume={E77-A},
number={11},
pages={1875-1878},
abstract={We propose two methods to fuse auditory information and visual information for accurate sppech recognition. The first method fuses two kinds of information by using linear combination after calculating two kinds of probabilities by HMM for each word. The second method fuses two kinds of information by using the histogram which expresses the correlation of them. We have performed experiments comparing the proposed methods with the conventional method and confirmed the validity of the proposed methods.},
keywords={},
doi={},
ISSN={},
month={November},}
Copy
TY - JOUR
TI - Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information
T2 - IEICE TRANSACTIONS on Fundamentals
SP - 1875
EP - 1878
AU - Akira SHINTANI
AU - Akio OGIHARA
AU - Yoshikazu YAMAGUCHI
AU - Yasuhisa HAYASHI
AU - Kunio FUKUNAGA
PY - 1994
DO -
JO - IEICE TRANSACTIONS on Fundamentals
SN -
VL - E77-A
IS - 11
JA - IEICE TRANSACTIONS on Fundamentals
Y1 - November 1994
AB - We propose two methods to fuse auditory information and visual information for accurate sppech recognition. The first method fuses two kinds of information by using linear combination after calculating two kinds of probabilities by HMM for each word. The second method fuses two kinds of information by using the histogram which expresses the correlation of them. We have performed experiments comparing the proposed methods with the conventional method and confirmed the validity of the proposed methods.
ER -