IEICE global.ieice.org Site

Author Search Result

[Author] Yoshiyuki TAKEDA (1 hit)

Determining Indexing Strings with Statistical Analysis
Yoshiyuki TAKEDA, Kyoji UMEMURA, Eiko YAMAMOTO

PAPER

Vol: E86-D, No: 9
Page(s): 1781-1787

The choice of indexing strings is an important factor in information retrieval. Ideally, the strings should be words that represent documents or queries. Although a single word is the natural first candidate for an indexing string in an English corpus, it may not be ideal, because compound nouns are often good indexing strings and often depend on the genre of the corpus used. The situation is even worse in Japanese or Chinese, where words are not separated by spaces. In this paper, we propose a method of determining indexing strings based on statistical analysis. The novel features of our method are that it makes the most of a statistical measure called "adaptation" and that it uses no language-dependent resources such as dictionaries or stop word lists. In an evaluation on a Japanese test collection, we found that the method improves the precision of information retrieval systems.
