IEICE TRANSACTIONS on Information

A Variable Break Prediction Method Using CART in a Japanese Text-to-Speech System

Deok-Su NA, Myung-Jin BAE

Summary :

Break prediction is an important step in text-to-speech systems because break indices (BIs) strongly influence how correctly prosodic phrase boundaries are represented. However, accurate prediction is difficult, since BIs often depend on the meaning of a sentence or the reading style of the speaker. In Japanese, predicting accentual phrase boundaries (APBs) and major phrase boundaries (MPBs) is particularly difficult. This paper therefore presents a method that compensates for APB and MPB prediction errors. First, we define a subtle BI, for which it is difficult to decide clearly between an APB and an MPB, as a variable break (VB), and an explicit BI as a fixed break (FB). The VB is chosen using a classification and regression tree (CART), and multiple prosodic targets for pitch and duration are then generated. Finally, unit selection is conducted using the multiple prosodic targets. The experimental results show that the proposed method improves the naturalness of synthesized speech.
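
The idea of keeping a variable break open until unit selection can be illustrated with a minimal Python sketch. It is not the authors' implementation: the label strings, the function names `expand_variable_breaks` and `select_best`, and the toy cost function are all assumptions made for illustration, and the CART classifier and the pitch and duration models are replaced by stand-ins.

```python
from itertools import product

# Hypothetical labels: "APB" and "MPB" are fixed breaks (FB); "VB" marks a
# boundary where the classifier could not decide clearly between the two.

def expand_variable_breaks(breaks):
    """Return every break sequence obtained by resolving each VB to APB or MPB."""
    options = [("APB", "MPB") if b == "VB" else (b,) for b in breaks]
    return [list(seq) for seq in product(*options)]

def select_best(breaks, cost_fn):
    """Pick the candidate with the lowest cost (a stand-in for unit selection)."""
    return min(expand_variable_breaks(breaks), key=cost_fn)

def toy_cost(seq):
    # Assumed cost: pretend the unit database matches an MPB better at index 1.
    return 0.0 if seq[1] == "MPB" else 1.0

predicted = ["APB", "VB", "MPB"]   # assumed output of the break predictor
candidates = expand_variable_breaks(predicted)
best = select_best(predicted, toy_cost)
print(candidates)
print(best)
```

In a real system each candidate sequence would yield its own pitch and duration targets, and the cost would come from the unit-selection search rather than a hand-written function.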

Publication
IEICE TRANSACTIONS on Information Vol.E92-D No.2 pp.349-352
Publication Date
2009/02/01
Online ISSN
1745-1361
DOI
10.1587/transinf.E92.D.349
Type of Manuscript
LETTER
Category
Speech and Hearing