IEICE global.ieice.org Site

Author Search Result

[Author] Sangmin LEE(4hit)

1-4hit

Voice Activity Detection Based on Generalized Normal-Laplace Distribution Incorporating Conditional MAP
Ji-Hyun SONG Sangmin LEE

LETTER-Speech and Hearing

Vol:
E96-D No:12
Page(s):
2888-2891
In this paper, we propose a novel voice activity detection (VAD) algorithm based on the generalized normal-Laplace (GNL) distribution to provide enhanced performance in adverse noise environments. Specifically, the probability density function (PDF) of a noisy speech signal is represented by the GNL distribution; the variance of the speech and noise of the GNL distribution are estimated using higher-order moments. After in-depth analysis of estimated variances, a feature that is useful for discrimination between speech and noise at low SNRs is derived and compared to a threshold to detect speech activity. To consider the inter-frame correlation of speech activity, the result from the previous frame is employed in the decision rule of the proposed VAD algorithm. The performance of our proposed VAD algorithm is evaluated in terms of receiver operating characteristics (ROC) and detection accuracy. Results show that the proposed method yields better results than conventional VAD algorithms.
Speech/Music Classification Enhancement for 3GPP2 SMV Codec Based on Deep Belief Networks
Ji-Hyun SONG Hong-Sub AN Sangmin LEE

LETTER-Speech and Hearing

Vol:
E97-A No:2
Page(s):
661-664
In this paper, we propose a robust speech/music classification algorithm to improve the performance of speech/music classification in the selectable mode vocoder (SMV) of 3GPP2 using deep belief networks (DBNs), which is a powerful hierarchical generative model for feature extraction and can determine the underlying discriminative characteristic of the extracted features. The six feature vectors selected from the relevant parameters of the SMV are applied to the visible layer in the proposed DBN-based method. The performance of the proposed algorithm is evaluated using the detection accuracy and error probability of speech and music for various music genres. The proposed algorithm yields better results when compared with the original SMV method and support vector machine (SVM) based method.
A Variable Step-Size Feedback Cancellation Algorithm Based on GSAP for Digital Hearing Aids
Hongsub AN Hyeonmin SHIM Jangwoo KWON Sangmin LEE

LETTER-Digital Signal Processing

Vol:
E97-A No:7
Page(s):
1615-1618
Acoustic feedback is a major complaint of hearing aid users. Adaptive filters are a common method for suppressing acoustic feedback in digital hearing aids. In this letter, we propose a new variable step-size algorithm for normalized least mean square and an affine projection algorithm to combine with a variable step-size affine projection algorithm and global speech absence probability in an adaptive filter. The computer simulation used to test the proposed algorithm results in a lower misalignment error than the comparison algorithm at a similar convergence rate. Therefore, the proposed algorithm suggests an effective solution for the feedback suppression system of digital hearing aids.
Voice Activity Detection Using Global Speech Absence Probability Based on Teager Energy for Speech Enhancement
Yun-Sik PARK Sangmin LEE

LETTER-Speech and Hearing

Vol:
E95-D No:10
Page(s):
2568-2571
In this paper, we propose a novel voice activity detection (VAD) algorithm using global speech absence probability (GSAP) based on Teager energy (TE) for speech enhancement. The proposed method provides a better representation of GSAP, resulting in improved decision performance for speech and noise segments by the use of a TE operator which is employed to suppress the influence of noise signals. The performance of our approach is evaluated by objective tests under various environments, and it is found that the suggested method yields better results than conventional schemes.

Author Search Result

[Author] Sangmin LEE(4hit)

Voice Activity Detection Based on Generalized Normal-Laplace Distribution Incorporating Conditional MAP

Speech/Music Classification Enhancement for 3GPP2 SMV Codec Based on Deep Belief Networks

A Variable Step-Size Feedback Cancellation Algorithm Based on GSAP for Digital Hearing Aids

Voice Activity Detection Using Global Speech Absence Probability Based on Teager Energy for Speech Enhancement

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles