The search functionality is under construction.

IEICE TRANSACTIONS on Information

Deep Learning Approaches for Pathological Voice Detection Using Heterogeneous Parameters

JiYeoun LEE, Hee-Jin CHOI

  • Full Text Views

    0

  • Cite this

Summary :

We propose a deep learning-based model for classifying pathological voices using a convolutional neural network and a feedforward neural network. The model uses combinations of heterogeneous parameters, including mel-frequency cepstral coefficients, linear predictive cepstral coefficients and higher-order statistics. We validate the accuracy of this model using the Massachusetts Eye and Ear Infirmary (MEEI) voice disorder database and the Saarbruecken Voice Database (SVD). Our model achieved an accuracy of 99.3% for MEEI and 75.18% for SVD. This model achieved an accuracy that is 7.18% higher than that of competitive models in previous studies.

Publication
IEICE TRANSACTIONS on Information Vol.E103-D No.8 pp.1920-1923
Publication Date
2020/08/01
Publicized
2020/05/14
Online ISSN
1745-1361
DOI
10.1587/transinf.2020EDL8031
Type of Manuscript
LETTER
Category
Speech and Hearing

Authors

JiYeoun LEE
  Jungwon University
Hee-Jin CHOI
  KAIST

Keyword