Results Merging with the OASIS System: An Experimental Comparison of Two Techniques

Vitaliy KLUEV  

IEICE TRANSACTIONS on Information and Systems   Vol.E86-D   No.9   pp.1773-1780
Publication Date: 2003/09/01
Online ISSN: 
Print ISSN: 0916-8532
Type of Manuscript: Special Section PAPER (Special Issue on Text Processing for Information Access)
distributed system,  search engine,  ranking,  

Full Text: PDF(198.3KB)>>
Buy this Article

Mechanisms used for results merging are very important for distributed search systems. They are to select the most relevant documents retrieved by different servers and put them on the top of the list returned to the end user. There are several approaches to solve key problems of this task such as eliminating duplicates and ranking results combined. But it is still not clear how to achieve this. We use the clustering technique to divide retrieved results into several groups and a metric on the base of the vector space model to arrange items inside each group. Preliminary tests were conducted using the OASIS system and several collections of real Internet data. They showed relatively superior results when compared to the neural network clustering and LSI calculation. Proposed mechanisms can be applied to metasearch systems and to distributed search systems as well because such mechanisms do not require any special information except standard de facto data received from servers.