
Author Search Result

[Author] Xinzhou XU(1hit)

  • Speech Emotion Detection Using Fusion on Multi-Source Low-Level Information Based Recurrent Branches Open Access

    Jiaxin WU  Bing LI  Li ZHAO  Xinzhou XU  

     
    PAPER-Speech and Hearing

      Publicized: 2024/07/05
      Vol: E107-A No: 11
      Page(s): 1641-1649

    The task of Speech Emotion Detection (SED) aims to distinguish the positive class from the negative class when a speaker expresses emotions. SED performance depends heavily on the diversity and prominence of the emotional features extracted from speech. However, most existing research focuses on the effects of a single feature source and hand-crafted features. We therefore propose an SED approach using recurrent branches based on multi-source low-level information. Fusing the multi-source low-level information yields varied and discriminative representations of speech emotion signals. In addition, a focal loss function helps with imbalanced classes by reducing the contribution of well-classified samples and increasing the weights of difficult samples in SED tasks. Experiments on the IEMOCAP corpus demonstrate the effectiveness of the proposed method. Compared with the baselines, the proposed method (MSIR) achieves significant performance improvements in terms of Unweighted Average Recall and F1-score.
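    As a rough illustration of two ideas named in the abstract, the sketch below shows the standard binary focal loss and Unweighted Average Recall (the mean of per-class recalls). It is not the authors' implementation: the function names are hypothetical, and the defaults gamma=2.0 and alpha=0.25 are assumptions, since the abstract does not state the values used in the paper.

```python
import math


def binary_focal_loss(p, y, gamma=2.0, alpha=0.25):
    """Focal loss for one sample (hypothetical helper, not from the paper).

    p: predicted probability of the positive class, y: true label (0 or 1).
    gamma and alpha are assumed defaults; the abstract gives no values.
    """
    eps = 1e-12
    # p_t is the probability the model assigns to the true class.
    p_t = p if y == 1 else 1.0 - p
    alpha_t = alpha if y == 1 else 1.0 - alpha
    p_t = min(max(p_t, eps), 1.0 - eps)  # keep log() finite
    # (1 - p_t) ** gamma shrinks the loss of well-classified samples.
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recalls, so every class counts equally."""
    recalls = []
    for c in sorted(set(y_true)):
        idx = [i for i, t in enumerate(y_true) if t == c]
        correct = sum(1 for i in idx if y_pred[i] == c)
        recalls.append(correct / len(idx))
    return sum(recalls) / len(recalls)
```

    With gamma set to 0 the modulating factor disappears and the loss reduces to alpha-weighted cross-entropy. For Unweighted Average Recall, a classifier that labels every sample of a two-class set as the majority class scores 0.5 no matter how imbalanced the set is, which is why the metric suits imbalanced corpora.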