Query Expansion and Text Mining for ChronoSeeker -- Search Engine for Future/Past Events --

Hideki KAWAI  Adam JATOWT  Katsumi TANAKA  Kazuo KUNIEDA  Keiji YAMADA  

IEICE TRANSACTIONS on Information and Systems   Vol.E94-D   No.3   pp.552-563
Publication Date: 2011/03/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.E94.D.552
Print ISSN: 0916-8532
Type of Manuscript: Special Section PAPER (Special Section on Data Engineering)
futurology,  strategic foresight,  futuristics,  scenario planning,  vertical search,  text mining,  information retrieval,  

Full Text: PDF(1.3MB)>>
Buy this Article

This paper introduces a future and past search engine, ChronoSeeker, which can help users to develop long-term strategies for their organizations. To provide on-demand searches, we tackled two technical issues: (1) organizing efficient event searches and (2) filtering out noises from search results. Our system employed query expansion with typical expressions related to event information such as year expressions, temporal modifiers, and context terms for efficient event searches. We utilized a machine-learning technique of filtering noise to classify candidates into information or non-event information, using heuristic features and lexical patterns derived from a text-mining approach. Our experiment revealed that filtering achieved an 85% F-measure, and that query expansion could collect dozens more events than those without expansion.