The search functionality is under construction.

IEICE TRANSACTIONS on Information

Auxiliary Loss for BERT-Based Paragraph Segmentation

Binggang ZHUO, Masaki MURATA, Qing MA

  • Full Text Views

    0

  • Cite this

Summary :

Paragraph segmentation is a text segmentation task. Iikura et al. achieved excellent results on paragraph segmentation by introducing focal loss to Bidirectional Encoder Representations from Transformers. In this study, we investigated paragraph segmentation on Daily News and Novel datasets. Based on the approach proposed by Iikura et al., we used auxiliary loss to train the model to improve paragraph segmentation performance. Consequently, the average F1-score obtained by the approach of Iikura et al. was 0.6704 on the Daily News dataset, whereas that of our approach was 0.6801. Our approach thus improved the performance by approximately 1%. The performance improvement was also confirmed on the Novel dataset. Furthermore, the results of two-tailed paired t-tests indicated that there was a statistical significance between the performance of the two approaches.

Publication
IEICE TRANSACTIONS on Information Vol.E106-D No.1 pp.58-67
Publication Date
2023/01/01
Publicized
2022/10/20
Online ISSN
1745-1361
DOI
10.1587/transinf.2022EDP7083
Type of Manuscript
PAPER
Category
Natural Language Processing

Authors

Binggang ZHUO
  Tottori University
Masaki MURATA
  Tottori University
Qing MA
  Ryukoku University

Keyword