For Full-Text PDF, please login, if you are a member of IEICE,|
or go to Pay Per View on menu list, if you are a nonmember of IEICE.
User Feedback-Driven Document Clustering Technique for Information Organization
Han-joon KIM Sang-goo LEE
IEICE TRANSACTIONS on Information and Systems
Publication Date: 2002/06/01
Print ISSN: 0916-8532
Type of Manuscript: LETTER
semi-supervised clustering, hierarchical agglomerative clustering, relevance feedback, fuzzy information retrieval,
Full Text: PDF>>
This paper discusses a new type of semi-supervised document clustering that uses partial supervision to partition a large set of documents. Most clustering methods organizes documents into groups based only on similarity measures. In this paper, we attempt to isolate more semantically coherent clusters by employing the domain-specific knowledge provided by a document analyst. By using external human knowledge to guide the clustering mechanism with some flexibility when creating the clusters, clustering efficiency can be considerably enhanced. Experimental results show that the use of only a little external knowledge can considerably enhance the quality of clustering results that satisfy users' constraint.