The Effect of Corpus Size on Case Frame Acquisition for Predicate-Argument Structure Analysis

Ryohei SASANO, Daisuke KAWAHARA, Sadao KUROHASHI

Summary:

This paper reports the effect of corpus size on case frame acquisition for predicate-argument structure analysis in Japanese. For this study, we collect a Japanese corpus consisting of up to 100 billion words, and construct case frames from corpora of six different sizes. Then, we apply these case frames to syntactic and case structure analysis, and zero anaphora resolution, in order to investigate the relationship between the corpus size for case frame acquisition and the performance of predicate-argument structure analysis. We obtained better analyses by using case frames constructed from larger corpora; the performance was not saturated even with a corpus size of 100 billion words.

Publication
IEICE TRANSACTIONS on Information and Systems Vol.E93-D No.6 pp.1361-1368
Publication Date
2010/06/01
Online ISSN
1745-1361
DOI
10.1587/transinf.E93.D.1361
Type of Manuscript
Special Section PAPER (Special Section on Info-Plosion)
Category
Natural Language Processing
