IEICE global.ieice.org Site

Author Search Result

[Author] Deokgyu YUN(2hit)

1-2hit

A Non-Intrusive Speech Intelligibility Estimation Method Based on Deep Learning Using Autoencoder Features
Yoonhee KIM Deokgyu YUN Hannah LEE Seung Ho CHOI

LETTER-Speech and Hearing

Pubricized:
2019/12/11
Vol:
E103-D No:3
Page(s):
714-715
This paper presents a deep learning-based non-intrusive speech intelligibility estimation method using bottleneck features of autoencoder. The conventional standard non-intrusive speech intelligibility estimation method, P.563, lacks intelligibility estimation performance in various noise environments. We propose a more accurate speech intelligibility estimation method based on long-short term memory (LSTM) neural network whose input and output are an autoencoder bottleneck features and a short-time objective intelligence (STOI) score, respectively, where STOI is a standard tool for measuring intrusive speech intelligibility with reference speech signals. We showed that the proposed method has a superior performance by comparing with the conventional standard P.563 and mel-frequency cepstral coefficient (MFCC) feature-based intelligibility estimation methods for speech signals in various noise environments.
A Deep Learning-Based Approach to Non-Intrusive Objective Speech Intelligibility Estimation
Deokgyu YUN Hannah LEE Seung Ho CHOI

LETTER-Speech and Hearing

Pubricized:
2018/01/09
Vol:
E101-D No:4
Page(s):
1207-1208
This paper proposes a deep learning-based non-intrusive objective speech intelligibility estimation method based on recurrent neural network (RNN) with long short-term memory (LSTM) structure. Conventional non-intrusive estimation methods such as standard P.563 have poor estimation performance and lack of consistency, especially, in various noise and reverberation environments. The proposed method trains the LSTM RNN model parameters by utilizing the STOI that is the standard intrusive intelligibility estimation method with reference speech signal. The input and output of the LSTM RNN are the MFCC vector and the frame-wise STOI value, respectively. Experimental results show that the proposed objective intelligibility estimation method outperforms the conventional standard P.563 in various noisy and reverberant environments.

Author Search Result

[Author] Deokgyu YUN(2hit)

A Non-Intrusive Speech Intelligibility Estimation Method Based on Deep Learning Using Autoencoder Features

A Deep Learning-Based Approach to Non-Intrusive Objective Speech Intelligibility Estimation

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles