The search functionality is under construction.

IEICE TRANSACTIONS on Information

Open Access
Prosody Correction Preserving Speaker Individuality for Chinese-Accented Japanese HMM-Based Text-to-Speech Synthesis

Daiki SEKIZAWA, Shinnosuke TAKAMICHI, Hiroshi SARUWATARI

  • Full Text Views

    47

  • Cite this
  • Free PDF (507.5KB)

Summary :

This article proposes a prosody correction method based on partial model adaptation for Chinese-accented Japanese hidden Markov model (HMM)-based text-to-speech synthesis. Although text-to-speech synthesis built from non-native speech accurately reproduces the speaker's individuality in synthetic speech, the naturalness of the synthetic speech is strongly degraded. In the proposed model, to improve the naturalness while preserving the speaker individuality of Chinese-accented Japanese text-to-speech synthesis, we partially utilize HMM parameters of native Japanese speech to synthesize prosody-corrected synthetic speech. Results of an experimental evaluation demonstrate that duration and F0 correction are significantly effective for improving naturalness.

Publication
IEICE TRANSACTIONS on Information Vol.E102-D No.6 pp.1218-1221
Publication Date
2019/06/01
Publicized
2019/03/11
Online ISSN
1745-1361
DOI
10.1587/transinf.2018EDL8264
Type of Manuscript
LETTER
Category
Speech and Hearing

Authors

Daiki SEKIZAWA
  University of Tokyo
Shinnosuke TAKAMICHI
  University of Tokyo
Hiroshi SARUWATARI
  University of Tokyo

Keyword