Class-Dependent Modeling for Dialog Translation

Andrew FINCH; Eiichiro SUMITA; Satoshi NAKAMURA

doi:10.1587/transinf.E92.D.2469

IEICE TRANSACTIONS on Information

Summary :

This paper presents a technique for class-dependent decoding for statistical machine translation (SMT). The approach differs from previous methods of class-dependent translation in that the class-dependent forms of all models are integrated directly into the decoding process. We employ probabilistic mixture weights between models that can change dynamically on a sentence-by-sentence basis depending on the characteristics of the source sentence. The effectiveness of this approach is demonstrated by evaluating its performance on travel conversation data. We used this approach to tackle the translation of questions and declarative sentences using class-dependent models. To achieve this, our system integrated two sets of models specifically built to deal with sentences that fall into one of two classes of dialog sentence: questions and declarations, with a third set of models built with all of the data to handle the general case. The technique was thoroughly evaluated on data from 16 language pairs using 6 machine translation evaluation metrics. We found the results were corpus-dependent, but in most cases our system was able to improve translation performance, and for some languages the improvements were substantial.
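The sentence-dependent mixture weighting described in the summary can be illustrated with a small sketch. This is not the authors' implementation: the punctuation-based classifier, the linear interpolation, the `general_share` parameter, and all probability values are hypothetical and chosen only to show how the weights over the question, declarative, and general models might shift from one source sentence to the next.

```python
from typing import Dict

# The three model sets named in the summary.
CLASSES = ("question", "declarative", "general")


def class_weights(source_sentence: str, general_share: float = 0.4) -> Dict[str, float]:
    """Return mixture weights (summing to 1) for one source sentence.

    Assumption: a toy classifier that looks only at the final token stands in
    for whatever sentence classifier a real system would use.
    """
    is_question = source_sentence.rstrip().endswith("?")
    p_question = 0.9 if is_question else 0.1  # hypothetical class posterior
    specific = 1.0 - general_share            # mass left for class-specific models
    return {
        "question": specific * p_question,
        "declarative": specific * (1.0 - p_question),
        "general": general_share,
    }


def mixed_probability(model_probs: Dict[str, float], weights: Dict[str, float]) -> float:
    """Linearly interpolate one model score across the three model sets."""
    return sum(weights[c] * model_probs[c] for c in CLASSES)


# Hypothetical probabilities that each model set assigns to the same phrase pair.
phrase_probs = {"question": 0.6, "declarative": 0.2, "general": 0.3}

for sentence in ("where is the station ?", "the station is over there ."):
    weights = class_weights(sentence)
    print(sentence, round(mixed_probability(phrase_probs, weights), 3))
```

Because the weights are recomputed for every input, the same phrase pair receives a higher interpolated score when the source sentence looks like a question than when it looks like a declaration, while the general model always contributes a fixed share.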

Publication: IEICE TRANSACTIONS on Information Vol.E92-D No.12 pp.2469-2477

Publication Date: 2009/12/01

Online ISSN: 1745-1361

DOI: 10.1587/transinf.E92.D.2469

Type of Manuscript: PAPER

Category: Speech and Hearing

Cite this

Andrew FINCH, Eiichiro SUMITA, Satoshi NAKAMURA, "Class-Dependent Modeling for Dialog Translation" in IEICE TRANSACTIONS on Information, vol. E92-D, no. 12, pp. 2469-2477, December 2009, doi: 10.1587/transinf.E92.D.2469.
URL: https://global.ieice.org/en_transactions/information/10.1587/transinf.E92.D.2469/_p

@ARTICLE{e92-d_12_2469,
author={Andrew FINCH and Eiichiro SUMITA and Satoshi NAKAMURA},
journal={IEICE TRANSACTIONS on Information},
title={Class-Dependent Modeling for Dialog Translation},
year={2009},
volume={E92-D},
number={12},
pages={2469--2477},
abstract={This paper presents a technique for class-dependent decoding for statistical machine translation (SMT). The approach differs from previous methods of class-dependent translation in that the class-dependent forms of all models are integrated directly into the decoding process. We employ probabilistic mixture weights between models that can change dynamically on a sentence-by-sentence basis depending on the characteristics of the source sentence. The effectiveness of this approach is demonstrated by evaluating its performance on travel conversation data. We used this approach to tackle the translation of questions and declarative sentences using class-dependent models. To achieve this, our system integrated two sets of models specifically built to deal with sentences that fall into one of two classes of dialog sentence: questions and declarations, with a third set of models built with all of the data to handle the general case. The technique was thoroughly evaluated on data from 16 language pairs using 6 machine translation evaluation metrics. We found the results were corpus-dependent, but in most cases our system was able to improve translation performance, and for some languages the improvements were substantial.},
keywords={},
doi={10.1587/transinf.E92.D.2469},
ISSN={1745-1361},
month={December}
}

TY - JOUR
TI - Class-Dependent Modeling for Dialog Translation
T2 - IEICE TRANSACTIONS on Information
SP - 2469
EP - 2477
AU - Andrew FINCH
AU - Eiichiro SUMITA
AU - Satoshi NAKAMURA
PY - 2009
DO - 10.1587/transinf.E92.D.2469
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E92-D
IS - 12
JA - IEICE TRANSACTIONS on Information
Y1 - December 2009
AB - This paper presents a technique for class-dependent decoding for statistical machine translation (SMT). The approach differs from previous methods of class-dependent translation in that the class-dependent forms of all models are integrated directly into the decoding process. We employ probabilistic mixture weights between models that can change dynamically on a sentence-by-sentence basis depending on the characteristics of the source sentence. The effectiveness of this approach is demonstrated by evaluating its performance on travel conversation data. We used this approach to tackle the translation of questions and declarative sentences using class-dependent models. To achieve this, our system integrated two sets of models specifically built to deal with sentences that fall into one of two classes of dialog sentence: questions and declarations, with a third set of models built with all of the data to handle the general case. The technique was thoroughly evaluated on data from 16 language pairs using 6 machine translation evaluation metrics. We found the results were corpus-dependent, but in most cases our system was able to improve translation performance, and for some languages the improvements were substantial.
ER -
