The search functionality is under construction.
The search functionality is under construction.

Fast Gated Recurrent Network for Speech Synthesis

Bima PRIHASTO, Tzu-Chiang TAI, Pao-Chi CHANG, Jia-Ching WANG

  • Full Text Views

    0

  • Cite this

Summary :

The recurrent neural network (RNN) has been used in audio and speech processing, such as language translation and speech recognition. Although RNN-based architecture can be applied to speech synthesis, the long computing time is still the primary concern. This research proposes a fast gated recurrent neural network, a fast RNN-based architecture, for speech synthesis based on the minimal gated unit (MGU). Our architecture removes the unit state history from some equations in MGU. Our MGU-based architecture is about twice faster, with equally good sound quality than the other MGU-based architectures.

Publication
IEICE TRANSACTIONS on Information Vol.E105-D No.9 pp.1634-1638
Publication Date
2022/09/01
Publicized
2022/06/10
Online ISSN
1745-1361
DOI
10.1587/transinf.2021EDL8032
Type of Manuscript
LETTER
Category
Speech and Hearing

Authors

Bima PRIHASTO
  National Central University
Tzu-Chiang TAI
  Providence University
Pao-Chi CHANG
  National Central University
Jia-Ching WANG
  National Central University

Keyword