Survey of Clustering Algorithms

Rui Xu, Missouri University of Science and Technology
Donald C. Wunsch, Missouri University of Science and Technology

This document has been relocated to

There were 1730 downloads as of 23 Jun 2016.


Data analysis plays an indispensable role for understanding various phenomena. Cluster analysis, primitive exploration with little or no prior knowledge, consists of research developed across a wide variety of communities. The diversity, on one hand, equips us with many tools. On the other hand, the profusion of options causes confusion. We survey clustering algorithms for data sets appearing in statistics, computer science, and machine learning, and illustrate their applications in some benchmark data sets, the traveling salesman problem, and bioinformatics, a new field attracting intensive efforts. Several tightly related topics, proximity measure, and cluster validation, are also discussed.