The search functionality is under construction.

IEICE TRANSACTIONS on Information

A Ranking Approach to Source Retrieval of Plagiarism Detection

Leilei KONG, Zhimao LU, Zhongyuan HAN, Haoliang QI

  • Full Text Views

    0

  • Cite this

Summary :

This paper addresses the issue of source retrieval in plagiarism detection. The task of source retrieval is retrieving all plagiarized sources of a suspicious document from a source document corpus whilst minimizing retrieval costs. The classification-based methods achieved the best performance in the current researches of source retrieval. This paper points out that it is more important to cast the problem as ranking and employ learning to rank methods to perform source retrieval. Specially, it employs RankBoost and Ranking SVM to obtain the candidate plagiarism source documents. Experimental results on the dataset of PAN@CLEF 2013 Source Retrieval show that the ranking based methods significantly outperforms the baseline methods based on classification. We argue that considering the source retrieval as a ranking problem is better than a classification problem.

Publication
IEICE TRANSACTIONS on Information Vol.E100-D No.1 pp.203-205
Publication Date
2017/01/01
Publicized
2016/09/29
Online ISSN
1745-1361
DOI
10.1587/transinf.2016EDL8090
Type of Manuscript
LETTER
Category
Data Engineering, Web Information Systems

Authors

Leilei KONG
  Harbin Engineering University,Heilongjiang Institute of Technology
Zhimao LU
  Harbin Engineering University
Zhongyuan HAN
  Heilongjiang Institute of Technology,Harbin Institute of Technology
Haoliang QI
  Heilongjiang Institute of Technology

Keyword