A Data Cleansing Method for Clustering Large-Scale Transaction Databases

Woong-Kee LOH  Yang-Sae MOON  Jun-Gyu KANG  

IEICE TRANSACTIONS on Information and Systems   Vol.E93-D   No.11   pp.3120-3123
Publication Date: 2010/11/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.E93.D.3120
Print ISSN: 0916-8532
Type of Manuscript: LETTER
Category: Data Engineering, Web Information Systems
clustering,  data cleansing,  large-scale transaction databases,  

Full Text: PDF>>
Buy this Article

In this paper, we emphasize the need for data cleansing when clustering large-scale transaction databases and propose a new data cleansing method that improves clustering quality and performance. We evaluate our data cleansing method through a series of experiments. As a result, the clustering quality and performance were significantly improved by up to 165% and 330%, respectively.