The search functionality is under construction.

IEICE TRANSACTIONS on Information

SVM-Based Multi-Document Summarization Integrating Sentence Extraction with Bunsetsu Elimination

Tsutomu HIRAO, Kazuhiro TAKEUCHI, Hideki ISOZAKI, Yutaka SASAKI, Eisaku MAEDA

  • Full Text Views

    0

  • Cite this

Summary :

In this paper, we propose a machine learning-based method of multi-document summarization integrating sentence extraction with bunsetsu elimination. We employ Support Vector Machines for both of the modules used. To evaluate the effect of bunsetsu elimination, we participated in the multi-document summarization task at TSC-2 by the following two approaches: (1) sentence extraction only, and (2) sentence extraction + bunsetsu elimination. The results of subjective evaluation at TSC-2 show that both approaches are superior to the Lead-based method from the viewpoint of information coverage. In addition, we made extracts from given abstracts to quantitatively examine the effectiveness of bunsetsu elimination. The experimental results showed that our bunsetsu elimination makes summaries more informative. Moreover, we found that extraction based on SVMs trained by short extracts are better than the Lead-based method, but that SVMs trained by long extracts are not.

Publication
IEICE TRANSACTIONS on Information Vol.E86-D No.9 pp.1702-1709
Publication Date
2003/09/01
Publicized
Online ISSN
DOI
Type of Manuscript
Special Section PAPER (Special Issue on Text Processing for Information Access)
Category

Authors

Keyword