The search functionality is under construction.

Author Search Result

[Author] Jin Soo SEO(3hit)

1-3hit
  • An Informative Feature Selection Method for Music Genre Classification

    Jin Soo SEO  

     
    LETTER-Music Information Processing

      Vol:
    E94-D No:6
      Page(s):
    1362-1365

    This letter presents a new automatic musical genre classification method based on an informative song-level representation, in which the mutual information between the feature and the genre label is maximized. By efficiently combining distance-based indexing with informative features, the proposed method represents a song as one vector instead of complex statistical models. Experiments on an audio genre DB show that the proposed method can achieve the classification accuracy comparable or superior to the state-of-the-art results.

  • A Music Similarity Function Based on the Centroid Model

    Jin Soo SEO  

     
    LETTER-Music Information Processing

      Vol:
    E96-D No:7
      Page(s):
    1573-1576

    Music-similarity computation is an essential building block for the browsing, retrieval, and indexing of digital music archives. This paper proposes a music similarity function based on the centroid model, which divides the feature space into non-overlapping clusters for the efficient computation of the timber distance of two songs. We place particular emphasis on the centroid deviation as a feature for music-similarity computation. Experiments show that the centroid-model representation of the auditory features is promising for music-similarity computation.

  • Speaker Change Detection Based on a Weighted Distance Measure over the Centroid Model

    Jin Soo SEO  

     
    LETTER-Speech and Hearing

      Vol:
    E95-D No:5
      Page(s):
    1543-1546

    Speaker change detection involves the identification of the time indices of an audio stream, where the identity of the speaker changes. This paper proposes novel measures for speaker change detection over the centroid model, which divides the feature space into non-overlapping clusters for effective speaker-change comparison. The centroid model is a computationally-efficient variant of the widely-used mixture-distribution based background models for speaker recognition. Experiments on both synthetic and real-world data were performed; the results show that the proposed approach yields promising results compared with the conventional statistical measures.