Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information

Akira SHINTANI; Akio OGIHARA; Yoshikazu YAMAGUCHI; Yasuhisa HAYASHI; Kunio FUKUNAGA

Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information

Akira SHINTANI, Akio OGIHARA, Yoshikazu YAMAGUCHI, Yasuhisa HAYASHI, Kunio FUKUNAGA

Full Text Views

0

Cite this

Summary :

We propose two methods to fuse auditory information and visual information for accurate sppech recognition. The first method fuses two kinds of information by using linear combination after calculating two kinds of probabilities by HMM for each word. The second method fuses two kinds of information by using the histogram which expresses the correlation of them. We have performed experiments comparing the proposed methods with the conventional method and confirmed the validity of the proposed methods.

Publication: IEICE TRANSACTIONS on Fundamentals Vol.E77-A No.11 pp.1875-1878

Publication Date: 1994/11/25

Publicized

Online ISSN

DOI

Type of Manuscript: Special Section LETTER (Special Section of Letters Selected from the 1994 IEICE Spring Conference)

Category

Cite this

Copy

Akira SHINTANI, Akio OGIHARA, Yoshikazu YAMAGUCHI, Yasuhisa HAYASHI, Kunio FUKUNAGA, "Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information" in IEICE TRANSACTIONS on Fundamentals, vol. E77-A, no. 11, pp. 1875-1878, November 1994, doi: .
Abstract: We propose two methods to fuse auditory information and visual information for accurate sppech recognition. The first method fuses two kinds of information by using linear combination after calculating two kinds of probabilities by HMM for each word. The second method fuses two kinds of information by using the histogram which expresses the correlation of them. We have performed experiments comparing the proposed methods with the conventional method and confirmed the validity of the proposed methods.
URL: https://global.ieice.org/en_transactions/fundamentals/10.1587/e77-a_11_1875/_p

Copy

@ARTICLE{e77-a_11_1875,
author={Akira SHINTANI, Akio OGIHARA, Yoshikazu YAMAGUCHI, Yasuhisa HAYASHI, Kunio FUKUNAGA, },
journal={IEICE TRANSACTIONS on Fundamentals},
title={Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information},
year={1994},
volume={E77-A},
number={11},
pages={1875-1878},
abstract={We propose two methods to fuse auditory information and visual information for accurate sppech recognition. The first method fuses two kinds of information by using linear combination after calculating two kinds of probabilities by HMM for each word. The second method fuses two kinds of information by using the histogram which expresses the correlation of them. We have performed experiments comparing the proposed methods with the conventional method and confirmed the validity of the proposed methods.},
keywords={},
doi={},
ISSN={},
month={November},}

Copy

TY - JOUR
TI - Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information
T2 - IEICE TRANSACTIONS on Fundamentals
SP - 1875
EP - 1878
AU - Akira SHINTANI
AU - Akio OGIHARA
AU - Yoshikazu YAMAGUCHI
AU - Yasuhisa HAYASHI
AU - Kunio FUKUNAGA
PY - 1994
DO -
JO - IEICE TRANSACTIONS on Fundamentals
SN -
VL - E77-A
IS - 11
JA - IEICE TRANSACTIONS on Fundamentals
Y1 - November 1994
AB - We propose two methods to fuse auditory information and visual information for accurate sppech recognition. The first method fuses two kinds of information by using linear combination after calculating two kinds of probabilities by HMM for each word. The second method fuses two kinds of information by using the histogram which expresses the correlation of them. We have performed experiments comparing the proposed methods with the conventional method and confirmed the validity of the proposed methods.
ER -