A Combined Standard Deviation Based Data Clustering Algorithm

Kuttiannan Thangavel, Gandhigram Rural Institute, Deemed UniversityFollow
Durairaj Ashok Kumar, Government Arts CollegeFollow

Abstract

The clustering problem has been widely studied because it arises in many knowledge management oriented applications. It aims at identifying the distribution of patterns and intrinsic correlations in data sets by partitioning the data points into similarity clusters. Traditional clustering algorithms use distance functions to measure similarity centroid, which subside the influences of data points. Hence, in this article a novel non-distance based clustering algorithm is proposed which uses Combined Standard Deviation (CSD) as measure of similarity. The performance of CSD based K-means approach, called K-CSD clustering algorithm, is tested on synthetic data sets. It compared favorably to widely used K-means clustering algorithm.

DOI

10.22237/jmasm/1146457200

Download

Included in

Applied Statistics Commons, Social and Behavioral Sciences Commons, Statistical Theory Commons

COinS

Journal of Modern Applied Statistical Methods

A Combined Standard Deviation Based Data Clustering Algorithm

Authors

Abstract

DOI

Included in

Share