Home > Papers

 
 
K-ANMI: A Mutual Information Based Clustering Algorithm for Categorical Data
Zengyou He * #,Xiaofei Xu,Shengchun Deng
Department of Computer Science and Engineering, Harbin Institute of Technology
*Correspondence author
#Submitted by
Subject:
Funding: 863(No.Grant No. 2003AA4Z3370, Grant )
Opened online: 4 November 2005
Accepted by: none
Citation: Zengyou He,Xiaofei Xu,Shengchun Deng.K-ANMI: A Mutual Information Based Clustering Algorithm for Categorical Data[OL]. [ 4 November 2005] http://en.paper.edu.cn/en_releasepaper/content/3506
 
 
Clustering categorical data is an integral part of data mining and has attracted much attention recently. In this paper, we present k-ANMI, a new efficient algorithm for clustering categorical data. The k-ANMI algorithm works in a way that is similar to the popular k-means algorithm, and the goodness of clustering in each step is evaluated using a mutual information based criterion (namely, Average Normalized Mutual Information-ANMI) borrowed from cluster ensemble. Experimental results on real datasets show that k-ANMI algorithm is competitive with those state-of-art categorical data clustering algorithms with respect to clustering accuracy.
Keywords:Clustering, Categorical Data, Mutual Information, Cluster Ensemble, Data Mining
 
 
 

For this paper

  • PDF (0B)
  • ● Revision 0   
  • ● Print this paper
  • ● Recommend this paper to a friend
  • ● Add to my favorite list

    Saved Papers

    Please enter a name for this paper to be shown in your personalized Saved Papers list

Tags

Add yours

Related Papers

Statistics

PDF Downloaded 851
Bookmarked 0
Recommend 5
Comments Array
Submit your papers