Consolidation-Based Speech Translation and Evaluation Approach

Chiori HORI; Bing ZHAO; Stephan VOGEL; Alex WAIBEL; Hideki KASHIOKA; Satoshi NAKAMURA

doi:10.1587/transinf.E92.D.477

IEICE TRANSACTIONS on Information

Consolidation-Based Speech Translation and Evaluation Approach

Chiori HORI, Bing ZHAO, Stephan VOGEL, Alex WAIBEL, Hideki KASHIOKA, Satoshi NAKAMURA

Full Text Views

0

Cite this

Summary :

The performance of speech translation systems combining automatic speech recognition (ASR) and machine translation (MT) systems is degraded by redundant and irrelevant information caused by speaker disfluency and recognition errors. This paper proposes a new approach to translating speech recognition results through speech consolidation, which removes ASR errors and disfluencies and extracts meaningful phrases. A consolidation approach is spun off from speech summarization by word extraction from ASR 1-best. We extended the consolidation approach for confusion network (CN) and tested the performance using TED speech and confirmed the consolidation results preserved more meaningful phrases in comparison with the original ASR results. We applied the consolidation technique to speech translation. To test the performance of consolidation-based speech translation, Chinese broadcast news (BN) speech in RT04 were recognized, consolidated and then translated. The speech translation results via consolidation cannot be directly compared with gold standards in which all words in speech are translated because consolidation-based translations are partial translations. We would like to propose a new evaluation framework for partial translation by comparing them with the most similar set of words extracted from a word network created by merging gradual summarizations of the gold standard translation. The performance of consolidation-based MT results was evaluated using BLEU. We also propose Information Preservation Accuracy (IPAccy) and Meaning Preservation Accuracy (MPAccy) to evaluate consolidation and consolidation-based MT. We confirmed that consolidation contributed to the performance of speech translation.

Publication: IEICE TRANSACTIONS on Information Vol.E92-D No.3 pp.477-488

Publication Date: 2009/03/01

Publicized

Online ISSN: 1745-1361

DOI: 10.1587/transinf.E92.D.477

Type of Manuscript: PAPER

Category: Speech and Hearing

Cite this

Copy

Chiori HORI, Bing ZHAO, Stephan VOGEL, Alex WAIBEL, Hideki KASHIOKA, Satoshi NAKAMURA, "Consolidation-Based Speech Translation and Evaluation Approach" in IEICE TRANSACTIONS on Information, vol. E92-D, no. 3, pp. 477-488, March 2009, doi: 10.1587/transinf.E92.D.477.
Abstract: The performance of speech translation systems combining automatic speech recognition (ASR) and machine translation (MT) systems is degraded by redundant and irrelevant information caused by speaker disfluency and recognition errors. This paper proposes a new approach to translating speech recognition results through speech consolidation, which removes ASR errors and disfluencies and extracts meaningful phrases. A consolidation approach is spun off from speech summarization by word extraction from ASR 1-best. We extended the consolidation approach for confusion network (CN) and tested the performance using TED speech and confirmed the consolidation results preserved more meaningful phrases in comparison with the original ASR results. We applied the consolidation technique to speech translation. To test the performance of consolidation-based speech translation, Chinese broadcast news (BN) speech in RT04 were recognized, consolidated and then translated. The speech translation results via consolidation cannot be directly compared with gold standards in which all words in speech are translated because consolidation-based translations are partial translations. We would like to propose a new evaluation framework for partial translation by comparing them with the most similar set of words extracted from a word network created by merging gradual summarizations of the gold standard translation. The performance of consolidation-based MT results was evaluated using BLEU. We also propose Information Preservation Accuracy (IPAccy) and Meaning Preservation Accuracy (MPAccy) to evaluate consolidation and consolidation-based MT. We confirmed that consolidation contributed to the performance of speech translation.
URL: https://global.ieice.org/en_transactions/information/10.1587/transinf.E92.D.477/_p

Copy

@ARTICLE{e92-d_3_477,
author={Chiori HORI, Bing ZHAO, Stephan VOGEL, Alex WAIBEL, Hideki KASHIOKA, Satoshi NAKAMURA, },
journal={IEICE TRANSACTIONS on Information},
title={Consolidation-Based Speech Translation and Evaluation Approach},
year={2009},
volume={E92-D},
number={3},
pages={477-488},
abstract={The performance of speech translation systems combining automatic speech recognition (ASR) and machine translation (MT) systems is degraded by redundant and irrelevant information caused by speaker disfluency and recognition errors. This paper proposes a new approach to translating speech recognition results through speech consolidation, which removes ASR errors and disfluencies and extracts meaningful phrases. A consolidation approach is spun off from speech summarization by word extraction from ASR 1-best. We extended the consolidation approach for confusion network (CN) and tested the performance using TED speech and confirmed the consolidation results preserved more meaningful phrases in comparison with the original ASR results. We applied the consolidation technique to speech translation. To test the performance of consolidation-based speech translation, Chinese broadcast news (BN) speech in RT04 were recognized, consolidated and then translated. The speech translation results via consolidation cannot be directly compared with gold standards in which all words in speech are translated because consolidation-based translations are partial translations. We would like to propose a new evaluation framework for partial translation by comparing them with the most similar set of words extracted from a word network created by merging gradual summarizations of the gold standard translation. The performance of consolidation-based MT results was evaluated using BLEU. We also propose Information Preservation Accuracy (IPAccy) and Meaning Preservation Accuracy (MPAccy) to evaluate consolidation and consolidation-based MT. We confirmed that consolidation contributed to the performance of speech translation.},
keywords={},
doi={10.1587/transinf.E92.D.477},
ISSN={1745-1361},
month={March},}

Copy

TY - JOUR
TI - Consolidation-Based Speech Translation and Evaluation Approach
T2 - IEICE TRANSACTIONS on Information
SP - 477
EP - 488
AU - Chiori HORI
AU - Bing ZHAO
AU - Stephan VOGEL
AU - Alex WAIBEL
AU - Hideki KASHIOKA
AU - Satoshi NAKAMURA
PY - 2009
DO - 10.1587/transinf.E92.D.477
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E92-D
IS - 3
JA - IEICE TRANSACTIONS on Information
Y1 - March 2009
AB - The performance of speech translation systems combining automatic speech recognition (ASR) and machine translation (MT) systems is degraded by redundant and irrelevant information caused by speaker disfluency and recognition errors. This paper proposes a new approach to translating speech recognition results through speech consolidation, which removes ASR errors and disfluencies and extracts meaningful phrases. A consolidation approach is spun off from speech summarization by word extraction from ASR 1-best. We extended the consolidation approach for confusion network (CN) and tested the performance using TED speech and confirmed the consolidation results preserved more meaningful phrases in comparison with the original ASR results. We applied the consolidation technique to speech translation. To test the performance of consolidation-based speech translation, Chinese broadcast news (BN) speech in RT04 were recognized, consolidated and then translated. The speech translation results via consolidation cannot be directly compared with gold standards in which all words in speech are translated because consolidation-based translations are partial translations. We would like to propose a new evaluation framework for partial translation by comparing them with the most similar set of words extracted from a word network created by merging gradual summarizations of the gold standard translation. The performance of consolidation-based MT results was evaluated using BLEU. We also propose Information Preservation Accuracy (IPAccy) and Meaning Preservation Accuracy (MPAccy) to evaluate consolidation and consolidation-based MT. We confirmed that consolidation contributed to the performance of speech translation.
ER -

IEICE TRANSACTIONS on Information

Consolidation-Based Speech Translation and Evaluation Approach

Summary :

Authors

Keyword

Latest Issue

Contents

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles

IEICE TRANSACTIONS on Information

Consolidation-Based Speech Translation and Evaluation Approach

Summary :

Authors

Keyword

Latest Issue

Contents

Copyrights notice of machine-translated contents

Cite this

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles