Web person search often returns web pages related to several distinct namesakes. This paper proposes a new web page model for template-free person data extraction and uses a Dirichlet Process Mixture model to solve name disambiguation. The results show that our method works best on web pages with complex structure.
Yuliang WEI
Harbin Institute of Technology
Guodong XIN
Harbin Institute of Technology
Wei WANG
Harbin Institute of Technology
Fang LV
Harbin Institute of Technology
Bailing WANG
Harbin Institute of Technology
Yuliang WEI, Guodong XIN, Wei WANG, Fang LV, Bailing WANG, "Personal Data Retrieval and Disambiguation in Web Person Search" in IEICE TRANSACTIONS on Information, vol. E102-D, no. 2, pp. 392-395, February 2019, doi: 10.1587/transinf.2018EDL8172.
URL: https://global.ieice.org/en_transactions/information/10.1587/transinf.2018EDL8172/_p
@ARTICLE{e102-d_2_392,
author={Yuliang WEI and Guodong XIN and Wei WANG and Fang LV and Bailing WANG},
journal={IEICE TRANSACTIONS on Information},
title={Personal Data Retrieval and Disambiguation in Web Person Search},
year={2019},
volume={E102-D},
number={2},
pages={392--395},
abstract={Web person search often returns web pages related to several distinct namesakes. This paper proposes a new web page model for template-free person data extraction and uses a Dirichlet Process Mixture model to solve name disambiguation. The results show that our method works best on web pages with complex structure.},
keywords={},
doi={10.1587/transinf.2018EDL8172},
ISSN={1745-1361},
month={February},}
TY - JOUR
TI - Personal Data Retrieval and Disambiguation in Web Person Search
T2 - IEICE TRANSACTIONS on Information
SP - 392
EP - 395
AU - Yuliang WEI
AU - Guodong XIN
AU - Wei WANG
AU - Fang LV
AU - Bailing WANG
PY - 2019
DO - 10.1587/transinf.2018EDL8172
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E102-D
IS - 2
JA - IEICE TRANSACTIONS on Information
Y1 - February 2019
AB - Web person search often returns web pages related to several distinct namesakes. This paper proposes a new web page model for template-free person data extraction and uses a Dirichlet Process Mixture model to solve name disambiguation. The results show that our method works best on web pages with complex structure.
ER -