The search functionality is under construction.
The search functionality is under construction.

Multi-Task Learning for Improved Recognition of Multiple Types of Acoustic Information

Jae-Won KIM, Hochong PARK

  • Full Text Views

    0

  • Cite this

Summary :

We propose a new method for improving the recognition performance of phonemes, speech emotions, and music genres using multi-task learning. When tasks are closely related, multi-task learning can improve the performance of each task by learning common feature representation for all the tasks. However, the recognition tasks considered in this study demand different input signals of speech and music at different time scales, resulting in input features with different characteristics. In addition, a training dataset with multiple labels for all information sources is not available. Considering these issues, we conduct multi-task learning in a sequential training process using input features with a single label for one information source. A comparative evaluation confirms that the proposed method for multi-task learning provides higher performance for all recognition tasks than individual learning for each task as in conventional methods.

Publication
IEICE TRANSACTIONS on Information Vol.E104-D No.10 pp.1762-1765
Publication Date
2021/10/01
Publicized
2021/07/14
Online ISSN
1745-1361
DOI
10.1587/transinf.2021EDL8029
Type of Manuscript
LETTER
Category
Speech and Hearing

Authors

Jae-Won KIM
  Kwangwoon University
Hochong PARK
  Kwangwoon University

Keyword