The search functionality is under construction.
The search functionality is under construction.

HMM-Based Maximum Likelihood Frame Alignment for Voice Conversion from a Nonparallel Corpus

Ki-Seung LEE

  • Full Text Views

    0

  • Cite this

Summary :

One of the problems associated with voice conversion from a nonparallel corpus is how to find the best match or alignment between the source and the target vector sequences without linguistic information. In a previous study, alignment was achieved by minimizing the distance between the source vector and the transformed vector. This method, however, yielded a sequence of feature vectors that were not well matched with the underlying speaker model. In this letter, the vectors were selected from the candidates by maximizing the overall likelihood of the selected vectors with respect to the target model in the HMM context. Both objective and subjective evaluations were carried out using the CMU ARCTIC database to verify the effectiveness of the proposed method.

Publication
IEICE TRANSACTIONS on Information Vol.E100-D No.12 pp.3064-3067
Publication Date
2017/12/01
Publicized
2017/08/23
Online ISSN
1745-1361
DOI
10.1587/transinf.2017EDL8144
Type of Manuscript
LETTER
Category
Speech and Hearing

Authors

Ki-Seung LEE
  Konkuk University

Keyword