The search functionality is under construction.

IEICE TRANSACTIONS on Information

Speech Summarization: An Approach through Word Extraction and a Method for Evaluation

Chiori HORI, Sadaoki FURUI

  • Full Text Views

    0

  • Cite this

Summary :

In this paper, we propose a new method of automatic speech summarization for each utterance, where a set of words that maximizes a summarization score is extracted from automatic speech transcriptions. The summarization score indicates the appropriateness of summarized sentences. This extraction is achieved by using a dynamic programming technique according to a target summarization ratio. This ratio is the number of characters/words in the summarized sentence divided by the number of characters/words in the original sentence. The extracted set of words is then connected to build a summarized sentence. The summarization score consists of a word significance measure, linguistic likelihood, and a confidence measure. This paper also proposes a new method of measuring summarization accuracy based on a word network expressing manual summarization results. The summarization accuracy of each automatic summarization is calculated by comparing it with the most similar word string in the network. Japanese broadcast-news speech, transcribed using a large-vocabulary continuous-speech recognition (LVCSR) system, is summarized and evaluated using our proposed method with 20, 40, 60, 70 and 80% summarization ratios. Experimental results reveal that the proposed method can effectively extract relatively important information by removing redundant or irrelevant information.

Publication
IEICE TRANSACTIONS on Information Vol.E87-D No.1 pp.15-25
Publication Date
2004/01/01
Publicized
Online ISSN
DOI
Type of Manuscript
Special Section PAPER (Special Section on the 2002 IEICE Excellent Paper Award)
Category

Authors

Keyword