The search functionality is under construction.
The search functionality is under construction.

Minimizing Human Intervention for Constructing Korean Part-of-Speech Tagged Corpus

Do-Gil LEE, Gumwon HONG, Seok Kee LEE, Hae-Chang RIM

  • Full Text Views

    0

  • Cite this

Summary :

The construction of annotated corpora requires considerable manual effort. This paper presents a pragmatic method to minimize human intervention for the construction of Korean part-of-speech (POS) tagged corpus. Instead of focusing on improving the performance of conventional automatic POS taggers, we devise a discriminative POS tagger which can selectively produce either a single analysis or multiple analyses based on the tagging reliability. The proposed approach uses two decision rules to judge the tagging reliability. Experimental results show that the proposed approach can effectively control the quality of corpus and the amount of manual annotation by the threshold value of the rule.

Publication
IEICE TRANSACTIONS on Information Vol.E93-D No.8 pp.2336-2338
Publication Date
2010/08/01
Publicized
Online ISSN
1745-1361
DOI
10.1587/transinf.E93.D.2336
Type of Manuscript
LETTER
Category
Natural Language Processing

Authors

Keyword