The search functionality is under construction.

IEICE TRANSACTIONS on Information

A Hybrid Topic Model for Multi-Document Summarization

JinAn XU, JiangMing LIU, Kenji ARAKI

  • Full Text Views

    0

  • Cite this

Summary :

Topic features are useful in improving text summarization. However, independency among topics is a strong restriction on most topic models, and alleviating this restriction can deeply capture text structure. This paper proposes a hybrid topic model to generate multi-document summaries using a combination of the Hidden Topic Markov Model (HTMM), the surface texture model and the topic transition model. Based on the topic transition model, regular topic transition probability is used during generating summary. This approach eliminates the topic independence assumption in the Latent Dirichlet Allocation (LDA) model. Meanwhile, the results of experiments show the advantage of the combination of the three kinds of models. This paper includes alleviating topic independency, and integrating surface texture and shallow semantic in documents to improve summarization. In short, this paper attempts to realize an advanced summarization system.

Publication
IEICE TRANSACTIONS on Information Vol.E98-D No.5 pp.1089-1094
Publication Date
2015/05/01
Publicized
2015/02/09
Online ISSN
1745-1361
DOI
10.1587/transinf.2014EDP7229
Type of Manuscript
PAPER
Category
Natural Language Processing

Authors

JinAn XU
  Beijing Jiaotong University
JiangMing LIU
  Beijing Jiaotong University
Kenji ARAKI
  Hokkaido University

Keyword