For Full-Text PDF, please login, if you are a member of IEICE,|
or go to Pay Per View on menu list, if you are a nonmember of IEICE.
Personal Data Retrieval and Disambiguation in Web Person Search
Yuliang WEI Guodong XIN Wei WANG Fang LV Bailing WANG
IEICE TRANSACTIONS on Information and Systems
Publication Date: 2019/02/01
Online ISSN: 1745-1361
Type of Manuscript: LETTER
Category: Data Engineering, Web Information Systems
sequential block model, deep learning, web extraction, name disambiguation,
Full Text: PDF(868.2KB)>>
Web person search often return web pages related to several distinct namesakes. This paper proposes a new web page model for template-free person data extraction, and uses Dirichlet Process Mixture model to solve name disambiguation. The results show that our method works best on web pages with complex structure.