The clustering problem has been widely studied because it arises in many knowledge management oriented applications. It aims at identifying the distribution of patterns and intrinsic correlations in data sets by partitioning the data points into similarity clusters. Traditional clustering algorithms use distance functions to measure similarity centroid, which subside the influences of data points. Hence, in this article a novel non-distance based clustering algorithm is proposed which uses Combined Standard Deviation (CSD) as measure of similarity. The performance of CSD based K-means approach, called K-CSD clustering algorithm, is tested on synthetic data sets. It compared favorably to widely used K-means clustering algorithm.
Thangavel, Kuttiannan and Kumar, Durairaj Ashok
"A Combined Standard Deviation Based Data Clustering Algorithm,"
Journal of Modern Applied Statistical Methods: Vol. 5
, Article 21.
Available at: http://digitalcommons.wayne.edu/jmasm/vol5/iss1/21