The search functionality is under construction.

Author Search Result

[Author] Hyoung-Gook KIM(3hit)

1-3hit
  • Adaptive Single-Channel Speech Enhancement Method for a Push-To-Talk Enabled Wireless Communication Device

    Hyoung-Gook KIM  Jin Young KIM  

     
    PAPER-Multimedia Systems for Communications

      Vol:
    E99-B No:8
      Page(s):
    1745-1753

    In this paper, we propose a single-channel speech enhancement method for a push-to-talk enabled wireless communication device. The proposed method is based on adaptive weighted β-order spectral amplitude estimation under speech presence uncertainty and enhanced instantaneous phase estimation in order to achieve flexible and effective noise reduction while limiting the speech distortion due to different noise conditions. Experimental results confirm that the proposed method delivers higher voice quality and intelligibility than the reference methods in various noise environments.

  • Gradient-Flow Tensor Divergence Feature for Human Action Recognition

    Ngoc Nam BUI  Jin Young KIM  Hyoung-Gook KIM  

     
    LETTER-Vision

      Vol:
    E99-A No:1
      Page(s):
    437-440

    Current research trends in computer vision have tended towards achieving the goal of recognizing human action, due to the potential utility of such recognition in various applications. Among many potential approaches, an approach involving Gaussian Mixture Model (GMM) supervectors with a Support Vector Machine (SVM) and a nonlinear GMM KL kernel has been proven to yield improved performance for recognizing human activities. In this study, based on tensor analysis, we develop and exploit an extended class of action features that we refer to as gradient-flow tensor divergence. The proposed method has shown a best recognition rate of 96.3% for a KTH dataset, and reduced processing time.

  • A Visual Signal Reliability for Robust Audio-Visual Speaker Identification

    Md. TARIQUZZAMAN  Jin Young KIM  Seung You NA  Hyoung-Gook KIM  Dongsoo HAR  

     
    LETTER-Human-computer Interaction

      Vol:
    E94-D No:10
      Page(s):
    2052-2055

    In this paper, a novel visual signal reliability (VSR) measure is proposed to consider video degradation at the signal level in audio-visual speaker identification (AVSI). The VSR estimation is formulated using a~ Gaussian fuzzy membership function (GFMF) to measure lighting variations. The variance parameters of GFMF are optimized in order to maximize the performance of the overall AVSI. The experimental results show that the proposed method outperforms the score-based reliability measuring technique.