IEICE TRANSACTIONS on Information

Open Domain Continuous Filipino Speech Recognition: Challenges and Baseline Experiments

Federico ANG, Rowena Cristina GUEVARA, Yoshikazu MIYANAGA, Rhandley CAJOTE, Joel ILAO, Michael Gringo Angelo BAYONA, Ann Franchesca LAGUNA

Summary :

This paper describes a new database suitable for HMM-based automatic Filipino speech recognition, built to train a domain-independent, large-vocabulary continuous speech recognition system. High-performance speech recognition systems are known to depend on a high-quality speech database at the training stage; lacking such a database, previous reports on Filipino speech recognition had to contend with serious data sparsity. We alleviate this sparsity through appropriate data analysis, which makes the evaluation results more reliable. The best system is identified by its low word-error rate on a cross-validation set containing almost three hours of previously unseen speech data. Language-dependent problems are discussed, and their impact on accuracy is analyzed. The approach is currently data-driven, but it serves as a competent baseline for future development.
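
The summary ranks systems by word-error rate (WER). As a rough illustration of that metric only, and not the authors' evaluation tooling, WER is commonly computed as the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and the recognizer output, divided by the number of reference words. The function name `word_error_rate` and the sample sentences below are hypothetical, and whitespace tokenization is an assumption.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumes whitespace-separated words; real evaluations usually
    normalize case and punctuation first.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[len(hyp)] / len(ref)


# Made-up example pair, not taken from the paper's data:
# one of five reference words is dropped by the recognizer.
example_wer = word_error_rate("magandang umaga po sa inyo",
                              "magandang umaga sa inyo")
```

Because insertions count as errors, WER computed this way can exceed 1.0 when the hypothesis is much longer than the reference.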

Publication
IEICE TRANSACTIONS on Information Vol.E97-D No.9 pp.2443-2452
Publication Date
2014/09/01
Online ISSN
1745-1361
DOI
10.1587/transinf.2013EDP7442
Type of Manuscript
PAPER
Category
Speech and Hearing

Authors

Federico ANG
  University of the Philippines
Rowena Cristina GUEVARA
  University of the Philippines
Yoshikazu MIYANAGA
  Hokkaido University
Rhandley CAJOTE
  University of the Philippines
Joel ILAO
  University of the Philippines
Michael Gringo Angelo BAYONA
  University of the Philippines
Ann Franchesca LAGUNA
  University of the Philippines

Keyword