IEICE global.ieice.org Site

Author Search Result

[Author] Ji XI(1hit)

1-1hit

A CNN-Based Feature Pyramid Segmentation Strategy for Acoustic Scene Classification Open Access
Ji XI Yue XIE Pengxu JIANG Wei JIANG

LETTER-Speech and Hearing

Pubricized:
2024/03/26
Vol:
E107-D No:8
Page(s):
1093-1096
Currently, a significant portion of acoustic scene categorization (ASC) research is centered around utilizing Convolutional Neural Network (CNN) models. This preference is primarily due to CNN’s ability to effectively extract time-frequency information from audio recordings of scenes by employing spectrum data as input. The expression of many dimensions can be achieved by utilizing 2D spectrum characteristics. Nevertheless, the diverse interpretations of the same object’s existence in different positions on the spectrum map can be attributed to the discrepancies between spectrum properties and picture qualities. The lack of distinction between different aspects of input information in ASC-based CNN networks may result in a decline in system performance. Considering this, a feature pyramid segmentation (FPS) approach based on CNN is proposed. The proposed approach involves utilizing spectrum features as the input for the model. These features are split based on a preset scale, and each segment-level feature is then fed into the CNN network for learning. The SoftMax classifier will receive the output of all feature scales, and these high-level features will be fused and fed to it to categorize different scenarios. The experiment provides evidence to support the efficacy of the FPS strategy and its potential to enhance the performance of the ASC system.

Author Search Result

[Author] Ji XI(1hit)

A CNN-Based Feature Pyramid Segmentation Strategy for Acoustic Scene Classification Open Access

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles