IEICE global.ieice.org Site

Author Search Result

[Author] Shin Jae KANG(3hit)

1-3hit

Supervised Denoising Pre-Training for Robust ASR with DNN-HMM
Shin Jae KANG Kang Hyun LEE Nam Soo KIM

LETTER-Speech and Hearing

Pubricized:
2015/09/07
Vol:
E98-D No:12
Page(s):
2345-2348
In this letter, we propose a novel supervised pre-training technique for deep neural network (DNN)-hidden Markov model systems to achieve robust speech recognition in adverse environments. In the proposed approach, our aim is to initialize the DNN parameters such that they yield abstract features robust to acoustic environment variations. In order to achieve this, we first derive the abstract features from an early fine-tuned DNN model which is trained based on a clean speech database. By using the derived abstract features as the target values, the standard error back-propagation algorithm with the stochastic gradient descent method is performed to estimate the initial parameters of the DNN. The performance of the proposed algorithm was evaluated on Aurora-4 DB, and better results were observed compared to a number of conventional pre-training methods.
Implementation of HMM-Based Human Activity Recognition Using Single Triaxial Accelerometer
Chang Woo HAN Shin Jae KANG Nam Soo KIM

LETTER-Digital Signal Processing

Vol:
E93-A No:7
Page(s):
1379-1383
In this letter, we propose a novel approach to human activity recognition. We present a class of features that are robust to the tilt of the attached sensor module and a state transition model suitable for HMM-based activity recognition. In addition, postprocessing techniques are applied to stabilize the recognition results. The proposed approach shows significant improvements in recognition experiments over a variety of human activity DB.
Estimation of Phone Mismatch Penalty Matricesfor Two-Stage Keyword Spotting
Chang Woo HAN Shin Jae KANG Nam Soo KIM

LETTER-Speech and Hearing

Vol:
E93-D No:8
Page(s):
2331-2335
In this letter, we propose a novel approach to estimate three different kinds of phone mismatch penalty matrices for two-stage keyword spotting. When the output of a phone recognizer is given, detection of a specific keyword is carried out through text matching with the phone sequences provided by the specified keyword using the proposed phone mismatch penalty matrices. The penalty matrices associated with substitution, insertion and deletion errors are estimated from the training data through deliberate error generation. The proposed approach has shown a significant improvement in a Korean continuous speech recognition task.

Author Search Result

[Author] Shin Jae KANG(3hit)

Supervised Denoising Pre-Training for Robust ASR with DNN-HMM

Implementation of HMM-Based Human Activity Recognition Using Single Triaxial Accelerometer

Estimation of Phone Mismatch Penalty Matricesfor Two-Stage Keyword Spotting

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles