Article

An Improved K-Means Algorithm Based on Contour Similarity

1 Key Laboratory of Industrial Automation and Machine Vision of Qiannan, School of Mathematics and Statistics, Qiannan Normal University for Nationalities, Duyun 558000, China
2 College of Science, Liaoning Technical University, Fuxin 123000, China
* Author to whom correspondence should be addressed.
Mathematics 2024, 12(14), 2211; https://doi.org/10.3390/math12142211
Submission received: 10 June 2024 / Revised: 30 June 2024 / Accepted: 6 July 2024 / Published: 15 July 2024
(This article belongs to the Special Issue Optimization Algorithms in Data Science: Methods and Theory)

Abstract: The traditional k-means algorithm is widely used in large-scale data clustering because of its easy implementation and efficient process, but it also suffers from the disadvantages of local optimality and poor robustness. In this study, a Csk-means algorithm based on contour similarity is proposed to overcome the drawbacks of the traditional k-means algorithm. For the traditional k-means algorithm, which results in local optimality due to the influence of outliers or noisy data and random selection of the initial clustering centers, the Csk-means algorithm overcomes both drawbacks by combining data lattice transformation and dissimilar interpolation. In particular, the Csk-means algorithm employs Fisher optimal partitioning of the similarity vectors between samples for the process of determining the number of clusters. To improve the robustness of the k-means algorithm to the shape of the clusters, the Csk-means algorithm utilizes contour similarity to compute the similarity between samples during the clustering process. Experimental results show that the Csk-means algorithm provides better clustering results than the traditional k-means algorithm and other comparative algorithms.

1. Introduction

Different researchers proposed the k-means algorithm in the 1950s and 1960s [1], including Lloyd [2], MacQueen [3], Jancey [4], and Steinhaus [5]. Due to its simplicity of implementation and low computational complexity, the k-means algorithm has become one of the top ten most popular algorithms for clustering and data mining problems [6,7]. Its application areas include data mining [8], cluster analysis [9], data clustering [10], pattern recognition [11], financial risk control [12], data science [13], intelligent marketing [14], and data compression [15,16].
From the recent works [17,18], the following problems with the traditional k-means algorithm remain a hot research topic:
(1)
Clustering results are susceptible to noisy data and outliers;
(2)
Random selection of initial centers tends to make clustering results locally optimal;
(3)
In realistic clustering application scenarios, it is difficult to predetermine the number of clusters in advance;
(4)
Furthermore, the reliance on Euclidean distance in the classic k-means algorithm limits its capacity to discern non-linear or irregularly shaped clusters.
In this case, the measure of similarity between samples is the core of the k-means clustering algorithm [19,20]. Therefore, this research attempts to improve the traditional k-means algorithm through a series of data processing methods and innovative similarity metrics to solve the above problems.
The contributions of this research work are as follows: 1. We solve the aforementioned problems (1), (2), and (3) using data lattice transformation, isometric interpolation, and Fisher optimal segmentation, respectively; 2. We improve the traditional k-means algorithm using contour similarity to solve the aforementioned problem (4); and 3. We design numerical clustering experiments to verify the feasibility of the work performed. This research provides several valuable and implementable methods for handling data outliers and missing values, predetermining the number of clusters, selecting initial centers in the clustering process, and measuring similarity between samples.
The subsequent sections of this paper are structured as follows: Section 2 categorically discusses the research on improving the k-means algorithm with different sample similarity metrics. Section 3 describes in detail the similarity basics used in this paper. Section 4 presents algorithmic improvement ideas and processes. Section 5 conducts numerical experiments to verify the performance of the proposed algorithm in this paper. Section 6 discusses the limitations and implications of the model proposed in this paper. Finally, Section 7 concludes the paper.

2. Literature Survey

The focus of this study is to improve the robustness of the classical k-means algorithm in recognizing alternative clustering result shapes. To this end, this section summarizes some of the studies relevant to the work of this manuscript and divides these studies into three categories.
The first type is based on improving typical similarity distance determination methods, including the Euclidean metric, Manhattan distance, Minkowski distance, city block distance, cosine distance, correlation distance, and so on. Among them, Singh et al. [21] compared the performance of the k-means algorithm using Euclidean, Manhattan, and Minkowski distances in the classification process. Their conclusion showed that the k-means algorithm using Minkowski distance had the smallest distortion in describing the similarity of the samples, the best classification results were obtained using Euclidean distance, and the k-means algorithm using Manhattan distance had the worst classification results; they therefore stressed the need for careful selection of similarity metrics. A. Chakraborty et al. [22] reported the performance of the k-means algorithm using city block, cosine, and correlation distances in classifying the Iris dataset, and the k-means algorithm using city block distance and cosine distance achieved 98% accuracy. F. A. Sebayang et al. [23] used Canberra, city block, and Euclidean distances with the k-means algorithm; the results show that the k-means algorithm with different distances shows different degrees of superiority on different datasets. This kind of research aims to improve the robustness of the k-means algorithm across clustering shapes in different application scenarios and to achieve better real-world results, but it ignores the impact of disadvantages (1)–(3) of the k-means algorithm, and such improvements also require the researcher to have some prior knowledge of the data distribution.
The second type improves the k-means algorithm based on weighted distances, typically weighted Euclidean distances. Among them, Tang et al. [24] introduced the minimum and maximum principles based on weighted data to balance the effects of distance and density on clustering, as a way to automatically determine the clustering centers and their number. Wang et al. [25] used dimensionally weighted Euclidean distance to measure the degree of similarity between samples when improving the k-means algorithm, and to ensure that the algorithm does not fall into a local optimum, they also proposed a new algorithm that escapes local optima using stochastic and evolutionary processes based on the idea of evolutionary algorithms [26]. Addressing the effect of initial centers and outliers on clustering results, Zhang et al. [27] proposed an adaptive k-means variant based on weighted Gaussian distances, which better handled the effect of outliers on the clustering results. This kind of research revolves around avoiding the local optimum of the k-means algorithm and improving robustness to the clustering shape: the initial clustering centers are discovered automatically through the process of determining the weights, and the accuracy of clustering is then improved by the weighted distances. It achieves a certain degree of practical applicability, but it ignores the effects of outliers and noisy data on the clustering results and does not determine the number of clusters in advance very well.
The third type studies non-conventional sample similarity or dissimilarity measures, or variants of the k-means algorithm combined with other methods. Among them, Zhu et al. [17] proposed to improve the k-means algorithm using evidence distance: they first replaced the original values of the sample data with a probability assignment, then clustered using evidence distance; compared with the classical k-means algorithm, the aggregated-distance-parameter k-means algorithm, and the Gaussian mixture model, the clustering results are better and the algorithm converges well. Chen and Yang [28] proposed a diffusion k-means algorithm based on diffusion distance to maximize the proximity of sample data from the same cluster, which has advantages in dealing with nonlinear datasets and mixed-dimensional data with non-Euclidean geometric features. Dinh et al. [29] proposed K-CMM, which fuses interpolation into the clustering process based on means and a kernel approach, and uses an information-theoretic dissimilarity together with the squared Euclidean distance as the distance metric. Research in this category redefines the similarity metric and uses it in conjunction with the k-means algorithm for clustering, which greatly improves the robustness of the algorithm in detecting the shape of the clusters, but it still fails to address problems (1), (2), and (3).
Through recent studies [17,18], it has been found that the traditional k-means algorithm still suffers from four problems that have not been solved systematically, and existing studies have proposed solutions around some of the four problems. Therefore, our goal is to solve the four problems that exist in traditional k-means using different methods, and ultimately integrate them to form a systematic solution to help weaken the influence of outliers and missing values on the clustering results, avoid the choice of initial centers falling into the trap of local optimum, and enhance the robustness of the algorithm to identify the shape of the clustering results.

3. Pan-Factor Space Theory and Contour Similarity

Pan-factor space is a mathematical theory of concept discovery and representation based on theories, factors, and their operations. Therefore, we give the following definition:
Definition 1. 
The domain, represented by $U = \{u_i\}_{i=1}^{n}$, is a non-empty countable set composed of all objects studied in a problem, where $u_i$ is the $i$-th study object.
In the process of studying a problem with mathematical tools, it is necessary to resort to some indicators or variables, which in the pan-factor space we call factors; a factor is generally denoted as $f$.
Definition 2. 
For $u_i \in U$, $i = 1, 2, \ldots, n$, there exists a value $d_i$ satisfying $d_i = f(u_i)$. The set $I_f = \{ d_i \mid u_i \in U,\ d_i = f(u_i) \}$ is called the phase space of the factor $f$.
The set consisting of all factors defined on $U$ is denoted as $F$. The factor $f$ is a surjection from the domain $U$ to the phase space $I_f$, and the feature $d_i$ may be vacant in applications ($d_i$ is a missing value); missing values are treated specially in the theory of pan-factor spaces.
Since the factor $f$ is a special mapping, it admits a pre-image mapping $f^{-1}$, defined mathematically as follows:
Definition 3. 
The $f^{-1}$ is a mapping from the phase space $I_f$ to the power set of $U$ satisfying
$$\forall d \in I_f,\quad f^{-1}(d) = [d]_f \subseteq U,\qquad [d]_f \in U/f \subseteq 2^{U},$$
where $[d]_f = \{ u_i \mid f(u_i) = d \in I_f,\ u_i \in U \} \subseteq U$, $i = 1, 2, \ldots, n$, and $U/f$ is the quotient set. Then $f^{-1}$ is called the Recall.
The purpose of Recall is to form a perception of class in the discourse. Obviously:
$$1.\ \forall d \in I_f,\ f(f^{-1}(d)) = d. \qquad 2.\ \forall [d]_f \subseteq U,\ f^{-1}(f([d]_f)) = [d]_f. \qquad 3.\ \forall x, y \in I_f,\ x \neq y,\ f^{-1}(x) \cap f^{-1}(y) = \emptyset.$$
For domains with a finite number of samples, assuming $I_f = \{d_1, d_2, \ldots, d_s\}$, the quotient set is
$$U/f = \big\{ [d_1]_f, [d_2]_f, \ldots, [d_s]_f \big\},$$
and $U/f$ is a partition of $U$.
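As a small illustrative example (ours, not from the original paper): take five objects and a single grading factor $f$ with three phases; the Recall of each phase collects the objects sharing that phase, and the quotient set partitions $U$:
$$U = \{u_1, \ldots, u_5\},\quad I_f = \{A, B, C\},\quad f(u_1) = f(u_3) = A,\ f(u_2) = f(u_5) = B,\ f(u_4) = C,$$
$$f^{-1}(A) = [A]_f = \{u_1, u_3\},\quad [B]_f = \{u_2, u_5\},\quad [C]_f = \{u_4\},\quad U/f = \big\{[A]_f, [B]_f, [C]_f\big\}.$$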
Based on the definitions of the factor $f$ and the Recall $f^{-1}$, two special factors are adopted by convention:
1. The zero-factor $o$: the phase space of $o$ is $I_o = \{\mathrm{NoN}\}$, and the quotient set of $o$ is $U/o = \{U\}$.
2. The full-factor $e$: the phase space of $e$ is $I_e = U$, and the quotient set of $e$ is $U/e = \{\{u\} \mid u \in U\}$.
Thus, the following axiom can be obtained:
Axiom 1. 
(Discovering Axiom) $o^{-1}(\mathrm{NoN}) = U$, and $e^{-1}(u) = \{u\}$ for every $u \in U$.
Further, based on the foregoing, we define the logical operations of the factors as follows:
Definition 4. 
Two factors $f$ and $g$ are said to be equal, iff $I_f = I_g$ and $\forall u \in U$, $f^{-1}(f(u)) = g^{-1}(g(u))$; this is denoted as $f = g$.
Definition 5. 
For two factors $f$ and $g$ and $u \in U$, let $f(u) = x \in I_f$ and $g(u) = y \in I_g$. If $\forall x \in I_f$, $\exists y \in I_g$ s.t. $f^{-1}(x) \subseteq g^{-1}(y)$, then $g$ is said to be less than $f$, denoted as $g < f$.
According to Definitions 4 and 5, $f = g$ and $g < f$ can be combined and denoted as $g \le f$.
Definition 6. 
For $u \in U$, let $f(u) = x \in I_f$, $g(u) = y \in I_g$, and $h(u) = (x, y) \in I_f \times I_g$. If $\forall (x, y) \in I_f \times I_g$,
$$h^{-1}(x, y) = f^{-1}(x) \cap g^{-1}(y),$$
then $h$ is called the Analysis-Factor of $f$ and $g$, denoted as $h = f \vee g$.
Definition 7. 
For $u \in U$, let $f(u) = x \in I_f$, $g(u) = y \in I_g$, and $l(u) = (x, y) \in I_f \times I_g$. If $\forall (x, y) \in I_f \times I_g$,
$$l^{-1}(x, y) = f^{-1}(x) \cup g^{-1}(y),$$
then $l$ is called the Composite-Factor of $f$ and $g$, denoted as $l = f \wedge g$.
Theorem 1. 
The algebraic system $(F; \vee, \wedge)$ is a lattice.
Proof. 
Firstly, we prove that $F$ with “$\le$” is a partially ordered set. According to Definitions 4 and 5, obviously: $f \le f$ (reflexivity); if $f \le g$ and $g \le f$, then $f = g$ (antisymmetry); if $f \le g$ and $g \le h$, then $f \le h$ (transitivity). Hence $F$ with “$\le$” is a partially ordered set.
Secondly, for $f, g \in F$, we prove the existence of $\mathrm{l.u.b.}\{f, g\}$ and $\mathrm{g.l.b.}\{f, g\}$. According to the calculation rules for quotient sets,
$$U/(f \vee g) = U/f \cap U/g, \qquad U/(f \wedge g) = U/f + U/g,$$
together with the existence theorem for least upper and greatest lower bounds on quotient sets and Definition 5, it follows that $f \vee g$ is the supremum, that is, $\mathrm{l.u.b.}\{f, g\} = f \vee g$, and $f \wedge g$ is the infimum, that is, $\mathrm{g.l.b.}\{f, g\} = f \wedge g$.
So the algebraic system $(F; \vee, \wedge)$ is a lattice. □
From Theorem 1, the Analysis-Factor and Composite-Factor are the natural operations $\vee$ and $\wedge$ on factors. Further, the first absorption law, the law of order, the idempotent law, the commutative law, the associative law, and the second absorption law hold for factors $f$ and $g$ under the $\vee$ and $\wedge$ operations. It can further be shown that the algebraic system $(F; \vee, \wedge)$ is a bounded lattice.
Definition 8. 
Let’s refer to the combination of the four elements  U , F , D , 2 U  as a pan-factor space, where  F  is a set that consists of all factors defined on  U D  is the data space that be constructed by all factors, denoted as:
D = f i F I f i
In practice, the number of factors and the number of objects of study in the domain are finite. Denote the set of finitely many factors taken out of $F$ as $F_n = \{ f_i \mid f_i \in F \}_{i=1}^{n}$, and denote by $\overline{F_n}$ the set consisting of the factors in $F_n$ together with all factors obtained from them under the operations $\vee$ and $\wedge$. It can be proved that $(\overline{F_n}; \vee, \wedge)$ is a sublattice of $(F; \vee, \wedge)$.
Based on the foregoing, we give the following definition:
Definition 9. 
$(o; F_n)$ is called the finite factor scalar frame, or lattice coordinate frame. The corresponding Cartesian product $D^{*} = I_1^{*} \times I_2^{*} \times \cdots \times I_n^{*}$ is the lattice coordinate system, where $I_j^{*}$, $j = 1, 2, \ldots, n$, is the phase space $I_{f_j}$ normalized by $\mathrm{NoN}$.
According to Definition 9, if $I_{f_j} = \{x_1, x_2, \ldots, x_{n_j}\}$, then the normalized phase space is $I_j^{*} = \{0, 1, 2, \ldots, n_j\}$.
The difference between the lattice coordinate system and the affine coordinate system is that the affine coordinate system cannot uniformly handle data of different metric scales.
Definition 10. 
For $f_1, f_2, \ldots, f_n \in F_n$ and $u \in U$,
$$(f_1, f_2, \ldots, f_n)(u) = (x_1, x_2, \ldots, x_n) \in D^{*}$$
is called the lattice coordinates of the object $u$ (abbr. L.c.).
The L.c has the following properties:
(1)
The coordinate $f_j(u_i) = x_{ij}$ of the object $u_i \in U$ on the factor $f_j$ is a natural number, and $x_{ij} = 0$ represents no actual statistical value, i.e., a missing value.
(2)
For $f_k(u_i) = x_{ik}$ and $f_k(u_j) = x_{jk} \in I_k^{*}$, the magnitudes of $x_{ik}$ and $x_{jk}$ represent only an order of precedence and have no quantitative meaning. In general, $x_{ik} + x_{jk}$ is meaningless, but $x_{ik} - x_{jk}$ represents the potential difference between the two objects $u_i$ and $u_j$ on the factor $f_k$.
(3)
For $f_j(u_i) = x_{ij} \in I_j^{*}$ and $f_k(u_i) = x_{ik} \in I_k^{*}$, $x_{ij}$ and $x_{ik}$ are not directly comparable.
Therefore, before similarity analyses are performed, the data corresponding to the original samples need to be converted to the corresponding "lattice point" data in L.c. This step of data preprocessing is known as the lattice transformation. The detailed conversion process is described in detail in Section 4.1.
Reducing multidimensional data to lower dimensions is a common means of data analysis, whereas mapping data from low to high dimensions, for example via kernel transformations, makes it possible to obtain more information. Such a means has been lacking for similarity metrics.
In describing the similarity of samples, there are essentially four perspectives: the first describes the degree of mutual influence between samples, such as the inner product. The second compares the sizes, shapes, volumes, or areas of the samples and then determines the degree of similarity between two samples, e.g., the 2-norm. The third describes the similarity between samples by their distance within a coordinate system, such as the Minkowski distance. The fourth describes the similarity of samples from the perspective of the sameness of the objects with which the samples interact, e.g., the correlation coefficient. In this paper, following the idea of simple shapes in topology and working within the lattice coordinate frame, we construct factor contour indexes that describe the basic shape of the sample data, and on this basis we construct the degree of similarity between the factor contours of samples and its computation.
Definition 11. 
For $x_{1 \times n} = (x_1, x_2, \ldots, x_n) \in D$, define
$$q_{1 \times n} = x_{1 \times n} P_{n \times n}$$
as the factor contour transformation of the data $x_{1 \times n}$ (abbr. $q_x$), where $q_{1 \times n}$ is the factor contour that represents the shape of the data $x_{1 \times n}$ in L.c., and $P_{n \times n}$ is the contour operator matrix with the following form:
$$P_{n \times n} = \begin{pmatrix} -1 & 0 & \cdots & 0 & 1 \\ 1 & -1 & \cdots & 0 & 0 \\ 0 & 1 & \cdots & 0 & 0 \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & \cdots & 1 & -1 \end{pmatrix}$$
Obviously, for $x_{1 \times n} = (x_1, x_2, \ldots, x_n) \in D$, the factor contour transformation $q_x = (x_2 - x_1, x_3 - x_2, \ldots, x_n - x_{n-1}, x_1 - x_n)$ is a dimension-raising transform of the data. The set $L = \{ q_x \mid q_x = x_{1 \times n} P_{n \times n},\ x_{1 \times n} \in D \}$ is called the Factor Contour Space.
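A minimal numerical sketch of Definition 11 (our illustration; the signs of $P_{n \times n}$ follow the reconstruction above): multiplying a sample in lattice coordinates by the contour operator yields the cyclic first differences that make up its factor contour.

```python
import numpy as np

def contour_operator(n: int) -> np.ndarray:
    """Contour operator P = R - E: -1 on the diagonal, 1 on the subdiagonal,
    and 1 in the top-right corner (cyclic difference matrix)."""
    R = np.zeros((n, n))
    R[np.arange(1, n), np.arange(n - 1)] = 1.0   # cyclic shift part
    R[0, n - 1] = 1.0
    return R - np.eye(n)

x = np.array([3.0, 1.0, 4.0, 2.0])      # a sample in lattice coordinates
q_x = x @ contour_operator(4)           # its factor contour
print(q_x)                              # [-2.  3. -2.  1.] = (x2-x1, x3-x2, x4-x3, x1-x4)
```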
Definition 12. 
Let $q_x, q_y \in L$ and $\lambda = (\lambda_1, \lambda_2, \ldots, \lambda_n) \in \mathbb{R}_{+}^{n}$. If
$$q_y = \lambda \circ q_x = \big( \lambda_1 (x_2 - x_1),\ \lambda_2 (x_3 - x_2),\ \ldots,\ \lambda_n (x_1 - x_n) \big),$$
then $q_y$ is the zoom of $q_x$, and the vector $\lambda$ is the zoom-operator.
Definition 13. 
Let $q_y = \lambda \circ q_x$, $\lambda = (\lambda_1, \lambda_2, \ldots, \lambda_n) \in \mathbb{R}_{+}^{n}$. Define:
  • If $\lambda_k = 1$, $k = 1, 2, \ldots, n$, then $q_y = q_x$.
  • If $\lambda_k = m > 0$, $k = 1, 2, \ldots, n$, then $q_y \sim q_x$.
  • If $\lambda_k > 1$, $k = 1, 2, \ldots, n$, then $q_y \succ q_x$.
  • If $\lambda_k < 1$, $k = 1, 2, \ldots, n$, then $q_y \prec q_x$.
  • Otherwise, $q_y$ and $q_x$ are not comparable.
The ordinal structure of the factor contour space will not be discussed in depth.
Definition 14. 
Let $x, y, z \in D$ and $q_x, q_y, q_z \in L$, and consider the functional
$$\rho : L \times L \to [0, 1].$$
If $\rho$ satisfies the following conditions:
(1)
Boundedness: $0 \le \rho(q_x, q_y) \le 1$;
(2)
Regularity: $\rho(q_x, q_x) = 1$;
(3)
Symmetry: $\rho(q_x, q_y) = \rho(q_y, q_x)$;
(4)
Rank preservation: if $q_y \prec q_x \prec q_z$, then $\rho(q_y, q_x) > \rho(q_y, q_z)$;
then $\rho$ is said to be a similarity measure on $L$. $\rho(q_x, q_y)$ is the contour similarity between $q_x$ and $q_y$, denoted $\rho_{xy}$.
In Definition 14, the closer the value of $\rho_{xy}$ is to 1, the more similar the two factor contours $q_x$ and $q_y$. A variety of contour similarities conforming to Definition 12 and Definition 14 can be constructed. In Section 4.2, we give an algorithm for the computation of factor contour similarity.

4. Algorithmic Ideas and Improvements

The design thought process of the improved Csk-means algorithm is shown below (Figure 1).
In particular, the basic data format obtained herein is illustrated by the following images (Figure 2).

4.1. The Lattice Transformation Process

The data lattice transformation is a data preprocessing step applied to the original data matrix $X = (x_{ij})_{m \times n}$, aiming at establishing a unified reference point for each factor, transforming the missing values of the data into the origin of the lattice coordinate system, and integerizing the original data. That is, there exists a one-to-one mapping
$$I_{f_j} \to N_j = \{0, 1, 2, \ldots\},\quad j = 1, 2, \ldots, n.$$
In this paper, the steps to perform the data lattice transformation are as follows:
(1)
For the original data matrix $X = (x_{ij})_{m \times n}$, extract the number of rows $m$ and the number of columns $n$.
(2)
Find the reference point vector
$$\mathrm{INF} = \big( \inf I_{f_1}, \inf I_{f_2}, \ldots, \inf I_{f_n} \big),$$
where $\inf I_{f_j}$, $j = 1, 2, \ldots, n$, represents the infimum of the phase space $I_{f_j}$ of the factor $f_j$. If the factor $f_j$ is a continuous variable and $\inf I_{f_j}$ is unknown, let
$$\inf I_{f_j} = \min_{i} x_{ij} - \varepsilon_j,\quad \varepsilon_j \in (0, 1),\ i = 1, 2, \ldots, m,\ j = 1, 2, \ldots, n,$$
where $\varepsilon_j$ is a slack variable.
(3)
For $x_{ij} \in I_{f_j}$, let
$$y_{ij} = \begin{cases} 0, & \text{if } x_{ij} = \mathrm{NoN}, \\ (x_{ij} - \inf I_{f_j}) \times 10^{k_j} + 1, & \text{if } x_{ij} \neq \mathrm{NoN}, \end{cases} \qquad i = 1, 2, \ldots, m,\ j = 1, 2, \ldots, n,$$
where $\mathrm{NoN}$ indicates that $x_{ij}$ is a missing value and $k_j$ is the scale-accuracy parameter (it can be viewed as the number of decimal places in the fractional part of the data).
After the previous three steps, the original data matrix $X = (x_{ij})_{m \times n}$ is lattice transformed into a matrix $Y = (y_{ij})_{m \times n}$.
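As a minimal illustration of steps (1)–(3), the following NumPy sketch (ours, not the authors' MATLAB code) lattice-transforms a small matrix; the scalar slack eps, the per-factor accuracy vector k, and the use of np.nan to encode NoN are illustrative assumptions.

```python
import numpy as np

def lattice_transform(X: np.ndarray, k: np.ndarray, eps: float = 0.5) -> np.ndarray:
    """Lattice transformation of an m-by-n data matrix X (steps (1)-(3)).
    Missing values (NoN) are encoded as np.nan and mapped to the origin 0;
    k[j] is the scale-accuracy parameter (decimal places) of factor j."""
    m, n = X.shape                              # step (1): matrix dimensions
    inf_I = np.nanmin(X, axis=0) - eps          # step (2): reference points
    Y = (X - inf_I) * 10.0 ** k + 1.0           # step (3): shift, scale, offset
    Y[np.isnan(X)] = 0.0                        # missing values -> lattice origin
    return np.rint(Y)

X = np.array([[5.1, 3.5],
              [4.9, np.nan],
              [6.3, 2.8]])
print(lattice_transform(X, k=np.array([1, 1])))   # [[ 8. 13.] [ 6.  0.] [20.  6.]]
```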

4.2. The Similarity Calculation Process of Contour Similarity

For the data matrix $Y = (y_{ij})_{m \times n}$ that has been lattice transformed, the procedure for calculating the degree of similarity between samples based on contour similarity is as follows:
(1)
Calculate the scale vector
$$S = \big( \sup M_1, \sup M_2, \ldots, \sup M_n \big),$$
where $\sup M_j = \max_{i} y_{ij} + \delta_j$, $i = 1, 2, \ldots, m$, $j = 1, 2, \ldots, n$. If the factor $f_j$ is a continuous variable, then $\delta_j > 0$; otherwise $\delta_j = 0$.
(2)
Generate the contour data matrix
$$Z_{m \times n} = Y (R - E),$$
where $R_{n \times n}$ is the cyclic shift matrix
$$R_{n \times n} = \begin{pmatrix} 0 & 0 & \cdots & 0 & 1 \\ 1 & 0 & \cdots & 0 & 0 \\ 0 & 1 & \cdots & 0 & 0 \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & \cdots & 1 & 0 \end{pmatrix}_{n \times n}$$
and $E$ is the identity matrix of dimension $n \times n$, so that $R - E$ is the contour operator $P_{n \times n}$ of Definition 11.
(3)
Measure the potential of each sample:
$$W = \lvert D Y \rvert \, \operatorname{diag}(S)^{-1},$$
where $\lvert D Y \rvert$ represents taking the absolute value of each element of the matrix $DY$, and $D$ is the pairwise difference matrix with $\tfrac{m(m-1)}{2}$ rows and $m$ columns, whose rows enumerate all sample pairs $(i, j)$ with $i < j$, with $-1$ in column $i$ and $+1$ in column $j$ (arranged in successive blocks of $m-1, m-2, \ldots, 2, 1$ rows):
$$D = \begin{pmatrix} -1 & 1 & 0 & \cdots & 0 \\ -1 & 0 & 1 & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ -1 & 0 & 0 & \cdots & 1 \\ 0 & -1 & 1 & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & -1 & 1 \end{pmatrix}_{\frac{m(m-1)}{2} \times m},$$
and $\operatorname{diag}(S)$ represents the expansion of the scale vector $S$ into a diagonal matrix.
(4)
Calculate the pose metric for each sample:
$$H = \lvert D Z \rvert \, \operatorname{diag}\!\big( S (R + E) \big)^{-1}.$$
(5)
Calculate the factor contour distance matrix for the samples:
$$P = \operatorname{sum}\big( (W \circ H)^{T} \big) / n,$$
where $W \circ H$ represents the Hadamard product of the matrices $W$ and $H$, and $\operatorname{sum}\big((W \circ H)^{T}\big)$ stands for summing the columns of the matrix $(W \circ H)^{T}$.
(6)
Calculate the contour similarity matrix between individual samples:
$$M = \operatorname{sum}\big( ((1 - W) \circ (1 - H))^{T} \big) / n.$$
The concept of contour similarity was first proposed in [30], where a simple calculation case is given; a specific application is presented in [31]. The contour similarity calculation steps proposed here are an improvement and enhancement that is more convenient to compute.
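To make steps (1)–(6) concrete, the following sketch (ours, based on the formulas as reconstructed above, not the authors' reference code) computes the contour similarity of a single pair of lattice-transformed samples; the full matrix $M$ is obtained by applying it to every pair.

```python
import numpy as np

def contour_similarity(y_i: np.ndarray, y_j: np.ndarray, S: np.ndarray) -> float:
    """Contour similarity of two lattice-transformed samples y_i and y_j,
    given the scale vector S from step (1); follows steps (2)-(6) above."""
    n = len(S)
    R = np.roll(np.eye(n), 1, axis=0)              # cyclic shift matrix R
    P = R - np.eye(n)                              # contour operator R - E
    z_i, z_j = y_i @ P, y_j @ P                    # step (2): contour data
    w = np.abs(y_i - y_j) / S                      # step (3): scaled potential difference
    h = np.abs(z_i - z_j) / (S @ (R + np.eye(n)))  # step (4): scaled pose difference
    # step (5) would give the contour distance: np.sum(w * h) / n
    return float(np.sum((1.0 - w) * (1.0 - h)) / n)   # step (6): contour similarity

y1 = np.array([8.0, 13.0, 6.0])     # lattice-transformed samples (illustrative)
y2 = np.array([6.0, 11.0, 7.0])
S = np.array([25.0, 20.0, 15.0])    # scale vector (illustrative)
print(contour_similarity(y1, y2, S))   # approx. 0.87
```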
In particular, for predetermining the number of clusters, the following strategy is adopted to build a sample similarity neighbor vector: find the two most similar samples a and b using contour similarity and record their contour similarity as the initial value of the similarity neighbor vector; take one of the sample points (say b) as the reference point, find the most similar sample point c among the remaining samples using contour similarity, and append the contour similarity of these two to the similarity neighbor vector; then update the reference point to c and repeat the process until all samples have been traversed. The number of clusters is found by applying Fisher optimal partitioning to the sample similarity neighbor vector.
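A sketch of this neighbor-vector construction, assuming a precomputed contour similarity matrix M (symmetric, with ones on the diagonal); the Fisher optimal partitioning routine is only referenced as a hypothetical helper.

```python
import numpy as np

def similarity_neighbor_vector(M: np.ndarray) -> np.ndarray:
    """Greedy chain of nearest-neighbor contour similarities used to
    predetermine the number of clusters. M is a symmetric m-by-m contour
    similarity matrix with ones on the diagonal."""
    m = M.shape[0]
    sim = M - np.eye(m)                     # ignore self-similarity
    a, b = np.unravel_index(np.argmax(sim), sim.shape)   # most similar pair
    visited = {int(a), int(b)}
    chain = [sim[a, b]]
    b = int(b)
    while len(visited) < m:
        row = sim[b].copy()
        row[list(visited)] = -np.inf        # restrict to unvisited samples
        c = int(np.argmax(row))             # most similar remaining sample to b
        chain.append(sim[b, c])
        visited.add(c)
        b = c                               # move the reference point forward
    return np.array(chain)

# The number of clusters k would then come from Fisher optimal partitioning
# of this ordered vector, e.g. (hypothetical helper, not shown):
# k = fisher_optimal_partition(similarity_neighbor_vector(M))
```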

4.3. Design of k-Means Algorithm Based on Contour Similarity

The classical k-means algorithm uses Euclidean distance to determine the similarity between samples and assumes that the samples need to be divided into k categories; its basic algorithmic flow is as follows:
(1)
Randomly select k samples in the sample data set as the initial center of mass;
(2)
The distance from each sample to these k centers of mass is calculated using the Euclidean distance;
(3)
Divide each sample into the nearest center of mass to form a collection of classes;
(4)
Update the center of mass of the k classes;
(5)
Repeat steps (2)–(4) until the maximum number of iterations is reached or the centers of mass no longer change between consecutive iterations;
(6)
Output the k classes of the division.
The idea of the algorithmic improvement in this paper lies in using contour similarity instead of Euclidean distance to judge the similarity between samples. We call this improved algorithm CSk-means. It should be noted that the contour similarity between samples must be calculated on the matrix $Y = (y_{ij})_{m \times n}$ obtained from the original data matrix by the lattice transformation. The pseudo-code of CSk-means is summarized as Algorithm 1.
Algorithm 1: The CSk-means.
Input: The lattice transformed matrix $Y = (y_1, y_2, \ldots, y_m)^{T}$ (where $y_i = (y_{i1}, y_{i2}, \ldots, y_{in})$, $i = 1, 2, \ldots, m$, represents a vector of sample data), the value of $k$, and the maximum number of iterations.
Output: The k classes that have been classified.
1. Select k sample points as the initial clustering centers.
2.  repeat
3.  for $i = 1, 2, \ldots, m$
4.    Calculate the contour similarity from each sample point $y_i$ to each class center.
5.  Class labeling is decided based on maximum contour similarity.
6.  Divide the sample points into the appropriate clusters.
7.  end for
8. Updating the clustering center.
9. until the maximum number of iterations is reached or the clustering center no longer changes.
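A compact Python sketch of Algorithm 1 (our illustration, not the authors' MATLAB implementation); contour_similarity is the pairwise routine sketched in Section 4.2, S is the scale vector, and the initial centers are assumed to come from the isometric interpolation strategy described next.

```python
import numpy as np

def csk_means(Y: np.ndarray, centers: np.ndarray, S: np.ndarray,
              max_iter: int = 100) -> np.ndarray:
    """CSk-means (Algorithm 1): assign each sample to the center with the
    maximum contour similarity, then update the centers, until convergence.
    contour_similarity(...) is the pairwise routine sketched in Section 4.2."""
    labels = np.zeros(len(Y), dtype=int)
    for _ in range(max_iter):
        # assignment step: maximum contour similarity instead of minimum distance
        sims = np.array([[contour_similarity(y, c, S) for c in centers] for y in Y])
        labels = sims.argmax(axis=1)
        # update step: mean of each cluster (keep the old center if a cluster is empty)
        new_centers = np.array([Y[labels == k].mean(axis=0) if np.any(labels == k)
                                else centers[k] for k in range(len(centers))])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels
```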
Algorithm 1 is also affected by the initial clustering centers; for this reason, the initial clustering center selection in this paper takes the following strategy:
Use contour similarity to find the two least similar sample points and take them as reference points. If the number of clusters needed is two, use these two points as the initial set of clustering centers; if the number of clusters needed is three, use contour similarity to find the sample point that is similar to both of these two reference points and insert it into the initial set of clustering centers; if the number of clusters needed is four, take the three sample points found so far as reference points, use contour similarity to find the sample point that is similar to all three of them, and insert it into the initial set of clustering centers. Repeat this process until the initial set of clustering centers reaches the required number of clusters. We call this process isometric interpolation.
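A sketch of isometric interpolation on the precomputed contour similarity matrix M; interpreting "similar to all chosen centers" as a maximin choice (the candidate whose smallest similarity to the current centers is largest) is our reading, not a detail given in the paper.

```python
import numpy as np

def isometric_interpolation(M: np.ndarray, k: int) -> list:
    """Select k initial cluster centers (as sample indices) from the contour
    similarity matrix M: start with the two least similar samples, then
    repeatedly insert the sample most evenly similar to all chosen centers."""
    m = M.shape[0]
    off_diag = M + np.eye(m)                 # push the diagonal out of the argmin
    a, b = np.unravel_index(np.argmin(off_diag), off_diag.shape)
    centers = [int(a), int(b)]
    while len(centers) < k:
        candidates = [i for i in range(m) if i not in centers]
        scores = [M[i, centers].min() for i in candidates]   # worst-case similarity
        centers.append(candidates[int(np.argmax(scores))])   # maximin insertion
    return centers

# Usage with the CSk-means sketch above (hypothetical):
# centers_idx = isometric_interpolation(M, k)
# labels = csk_means(Y, Y[centers_idx], S)
```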

5. Experimental Validation of the Csk-means Algorithm

To evaluate the performance of the proposed CSk-means algorithm, this study downloaded six datasets used as testing data from the UCI Machine Learning Repository (https://archive.ics.uci.edu/, accessed on 21 April 2024). The experimental environment is a 12th Gen Intel(R) Core(TM) i5-12500 3.00 GHz processor, an NVIDIA GeForce RTX 3080 graphics card, 16.0 GB of memory, the Windows 11 operating system, and MATLAB 2023b (64-bit), on which the whole experiment was conducted. The comparison algorithms are the traditional k-means algorithm, a hierarchical clustering model, spectral clustering, and k-means++. The CSk-means algorithm to be validated was implemented as custom code.
Table 1 describes the basic statistical characterization information of the above six datasets. In this paper, these datasets are used for validating the performance of the proposed algorithm, and in this process, there will be no division between the training set and the testing set. The main reasons for the selection of the reported datasets were: (1) the ability to characterize the validity and reliability of the algorithm using validated metrics; and (2) the need for the improved clustering algorithm to focus on real clustering scenarios, taking into account both varying degrees of complexity as well as varying degrees of distribution, which is reflected in the categorical and sample sizes of the dataset as well as in the characteristics of the data distribution.
The metrics for evaluating the performance of the model were chosen as Precision, Recall, Accuracy, Running Time, Number of Iterations, and Normalized Mutual Information (NMI).
Table 2 reports the running time, the number of iterations, and the accuracy of the clustering results of the Csk-means algorithm and the classical k-means algorithm on the six datasets. The results show that the Csk-means algorithm takes more time than the k-means algorithm for the clustering process, since the lattice transformation of the Csk-means algorithm takes additional time. However, its number of iterations and clustering results are better than those of the classical k-means algorithm.
Table 3 reports the F-measure, Precision, and Recall values for the clustering results of the CSk-means algorithm and the classical k-means algorithm on the six datasets. It can be seen that the CSk-means algorithm is more stable than the classical k-means algorithm, and its overall clustering performance is better. It is noteworthy that accuracy is lower on the Yeast and Abalone datasets; analyzing the reasons for this in depth, these two datasets contain a large number of classes, the degree of differentiation between samples of different classes is not clear enough, and the data distributions of the classes are strongly consistent. The methods used in this analysis include, but are not limited to, principal component analysis, inspection of the data distribution, and comparison with other clustering models.
Figure 3 and Table 4 show the Accuracy, F-measure, Precision, Recall, and NMI values of the clustering results of the Csk-means algorithm, hierarchical clustering, spectral clustering, and k-means++ on the six datasets. In comparison, the Csk-means algorithm shows better stability and more accurate clustering results.

6. Discussion

To systematically solve the four problems involved in the traditional k-means algorithm proposed in the recent research literature, we propose the Csk-means algorithm. Experimental results demonstrate the effectiveness and potential of the model. Based on the process and related techniques involved in the proposed Csk-means algorithm, in this section, we discuss the corresponding key findings, significance, and limitations of each of the techniques involved in the Csk-means algorithm in the order of the process.
To eliminate the negative effects of outliers and missing values on the clustering results, we redefine the preprocessing of the data through the lattice transformation. The lattice transformation essentially constructs a new system of lattice coordinates, transforming the original continuous data into discrete lattice coordinates and mapping missing values to the origin of the lattice coordinate system; it can be interpreted as an integer or discrete transform. The lattice transformation may provide a new strategy and method for discretizing data and handling missing values and outliers. Its advantage is that it can reduce, to a certain extent, the adverse effects of outliers and missing values on the model results, which is supported by some of the experimental results of this study. However, a limitation that cannot be ignored is that the lattice transformation is a homomorphic shift transformation, so the adverse effect of outliers still exists to a certain extent. In addition, the degree of discretization of the lattice transformation is limited, which may lead to poor generalization of the mined knowledge if it is applied in the data preprocessing of rule learning.
Following the idea of hierarchical clustering, for predetermining the number of clusters our strategy is to use contour similarity to build a vector of contour similarities between samples and then apply Fisher optimal segmentation to predetermine the number of clusters. From the results, the number of clusters predetermined by this strategy is consistent with the actual number. This strategy also provides a new idea and method for predetermining the number of classes in clustering applications. However, its limitation is obvious: predetermining the number of classes on a large sample dataset takes more time, which is not conducive to deploying the algorithm online.
For the selection of initial clustering centers, we adopt the method of isometric interpolation, which is a new method for determining the initial clustering centers. Isometric interpolation embodies the idea of dissimilarity between classes and is essentially a balanced iterative search. It can prevent the clustering model from falling into the trap of local optimality and also provides a new clustering strategy, which will be the focus of our next work. However, its limitation is obvious: in determining the initial clustering centers, each selected center needs to be evaluated for its balance with the centers of the other classes. The balancing strategy deeply affects the selection of clustering centers, which in turn leads to fluctuations in the clustering results. Although the clustering centers can be determined by a trial-and-error strategy, the time cost will increase.
We improved the traditional k-means algorithm by using contour similarity instead of Euclidean distance to enhance its ability to recognize differently shaped clusters. From the experimental results, this improvement is effective. Contour similarity is a new measure of similarity between samples and is essentially a dimension-raising transformation. Compared with other similarity measures, contour similarity does not suffer the information loss caused by dimensionality reduction, which would make the description of similarity inaccurate. We believe that contour similarity enriches the methodological theory of similarity measures. However, it has limitations: contour similarity requires at least three samples, and its calculation process is relatively complicated, which increases the time cost, as can be seen from the results of the designed comparison experiments.
Although the improved Csk-means algorithm improves the accuracy of the clustering results to a certain degree, we must also recognize its limitations, especially the increase in time cost during the clustering process. The lattice transformation of the data, the selection of the initial clustering centers, and the computation of the similarity between samples all increase the time cost of the Csk-means algorithm. If the calculation of contour similarity between samples is further optimized, and the effect of outliers on the clustering results can be ignored, then the lattice transformation of the data can be removed from the Csk-means algorithm, which would reduce its time cost.
In addition, the choice of comparison algorithms in our designed comparison experiments contains the traditional k-means algorithm, hierarchical clustering, and spectral clustering, which aims at verifying the feasibility of the proposed method. However, the comparison with other clustering models, such as SOM, was neglected. Therefore, in future studies, we will include these models to provide a more comprehensive analysis of the practical applications of clustering models.

7. Conclusions

In the clustering process using the traditional k-means algorithm, in order to predetermine the number of clusters, weaken the adverse effects of outliers and noise on the clustering results, avoid falling into a local optimum due to the random selection of the initial centers, and improve the robustness of the algorithm in detecting the shape of the clusters, this paper proposes the Csk-means algorithm based on the k-means algorithm. The experimental results show that predetermining the number of clusters is still a thorny problem and that the adverse effect of the selection of the initial centers on the clustering results still exists; however, the adverse effects of outliers and noisy data on the clustering results have been weakened to a certain extent, and the Csk-means algorithm effectively improves the accuracy of the clustering results.

Author Contributions

J.Z.: Writing—original draft, Writing—review and editing, Methodology. Y.B.: Editing, Methodology; D.L. and X.G.: Investigation. J.Z., Y.B., D.L. and X.G. report financial support was provided by the Department of Education of Guizhou Province and the Department of Science and Technology of Guizhou. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Project for Growing Youth Talents of the Department of education of Guizhou Province (Qianjiaoji [2022] No.378,377,380,386; Qianjiaoji [2024] No.234,236,238); the Foundation Project for Talents of Qiannan Science and Technology Cooperation Platform Supported by the Department of Science and Technology, Guizhou ([2019]QNSYXM-05); the Guizhou Provincial Department of Education 2024 Humanities and Social Sciences Research Program for Colleges and Universities (2024RW101).

Data Availability Statement

These data were derived from the following resources available in the public domain: https://archive.ics.uci.edu/, accessed on 21 April 2024.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Pérez-Ortega, J.; Almanza-Ortega, N.N.; Vega-Villalobos, A.; Pazos-Rangel, R.; Zavala-Díaz, C.; Martínez-Rebollar, A. The K-means algorithm evolution. Introd. Data Sci. Mach. Learn. 2019, 69–90. [Google Scholar] [CrossRef]
  2. Lloyd, S. Least squares quantization in PCM. IEEE Trans. Inf. Theory 1982, 28, 129–137. [Google Scholar] [CrossRef]
  3. MacQueen, J. Some methods for classification and analysis of multivariate observations. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability; University of California Press: Berkeley, CA, USA, 1967; Volume 1, No. 14; pp. 281–297. [Google Scholar]
  4. Jancey, R.C. Multidimensional group analysis. Aust. J. Bot. 1966, 14, 127–130. [Google Scholar] [CrossRef]
  5. Steinhaus, H. Sur la division des corps matériels en parties. Bull. Acad. Polon. Sci. 1956, 1, 801. [Google Scholar]
  6. Kapoor, A.; Singhal, A. A comparative study of K-Means, K-Means++ and Fuzzy C-Means clustering algorithms. In Proceedings of the 2017 3rd International Conference on Computational Intelligence & Communication Technology (CICT), Ghaziabad, India, 9–10 February 2017; pp. 1–6. [Google Scholar] [CrossRef]
  7. Ezugwu, A.E.-S.; Agbaje, M.B.; Aljojo, N.; Els, R.; Chiroma, H.; Elaziz, M.A. A Comparative Performance Study of Hybrid Firefly Algorithms for Automatic Data Clustering. IEEE Access 2020, 8, 121089–121118. [Google Scholar] [CrossRef]
  8. Annas, M.; Wahab, S.N. Data Mining Methods: K-Means Clustering Algorithms. Int. J. Cyber IT Serv. Manag. 2023, 3, 40–47. [Google Scholar] [CrossRef]
  9. Hu, H.; Liu, J.; Zhang, X.; Fang, M. An Effective and Adaptable K-means Algorithm for Big Data Cluster Analysis. Pattern Recognit. 2023, 139, 109404. [Google Scholar] [CrossRef]
  10. Mussabayev, R.; Mladenovic, N.; Jarboui, B.; Mussabayev, R. How to use K-means for big data clustering? Pattern Recognit. 2023, 137, 109269. [Google Scholar] [CrossRef]
  11. Theodoridis, S.; Koutroumbas, K. Pattern Recognition, 3rd ed.; Academic Press: Cambridge, MA, USA, 2006. [Google Scholar]
  12. Guedes, P.C.; Müller, F.M.; Righi, M.B. Risk measures-based cluster methods for finance. Risk Manag. 2023, 25, 4. [Google Scholar] [CrossRef]
  13. Yudhistira, A.; Andika, R. Pengelompokan Data Nilai Siswa Menggunakan Metode K-Means Clustering. J. Artif. Intell. Technol. Inf. 2023, 1, 20–28. [Google Scholar] [CrossRef]
  14. Navarro, M.M.; Young, M.N.; Prasetyo, Y.T.; Taylar, J.V. Stock market optimization amidst the COVID-19 pandemic: Technical analysis, K-means algorithm, and mean-variance model (TAKMV) approach. Heliyon 2023, 9, 2–3. [Google Scholar] [CrossRef]
  15. Foster, J.; Gray, R.; Dunham, M. Finite-state vector quantization for waveform coding. IEEE Trans. Inf. Theory 1985, 31, 348–359. [Google Scholar] [CrossRef]
  16. Liaw, Y.-C.; Lo, W.; Lai, J.Z. Image restoration of compressed image using classified vector quantization. Pattern Recognit. 2002, 35, 329–340. [Google Scholar] [CrossRef]
  17. Zhu, A.; Hua, Z.; Shi, Y.; Tang, Y.; Miao, L. An improved K-means algorithm based on evidence distance. Entropy 2021, 23, 1550. [Google Scholar] [CrossRef] [PubMed]
  18. Ikotun, A.M.; Ezugwu, A.E.; Abualigah, L.; Abuhaija, B.; Heming, J. K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data. Inf. Sci. 2023, 622, 178–210. [Google Scholar] [CrossRef]
  19. Jain, A.K. Data clustering: 50 years beyond K-means. Pattern Recognit. Lett. 2010, 31, 651–666. [Google Scholar] [CrossRef]
  20. Li, Y.; Wu, H. A clustering method based on K-means algorithm. Phys. Procedia 2012, 25, 1104–1109. [Google Scholar] [CrossRef]
  21. Singh, A.; Yadav, A.; Rana, A. K-means with Three different Distance Metrics. Int. J. Comput. Appl. 2013, 67, 1–3. [Google Scholar] [CrossRef]
  22. Chakraborty, A.; Faujdar, N.; Punhani, A.; Saraswat, S. Comparative Study of K-Means Clustering Using Iris Data Set for Various Distances. In Proceedings of the 2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India, 29–31 January 2020. [Google Scholar]
  23. Sebayang, F.A.; Lydia, M.S.; Nasution, B.B. Optimization on Purity K-Means Using Variant Distance Measure. In Proceedings of the 2020 3rd International Conference on Mechanical, Electronics, Computer, and Industrial Technology (MECnIT), Medan, Indonesia, 25–27 June 2020; pp. 143–147. [Google Scholar]
  24. Tang, Z.K.; Zhu, Z.Y.; Yang, Y.; Caihong, L.; Lian, L. DK-means algorithm based on distance and density. Appl. Res. Comput. 2020, 37, 1719–1723. [Google Scholar]
  25. Wang, Z.L.; Li, J.; Song, Y.F. Improved K-means algorithm based on distance and weight. Comput. Eng. Appl. 2020, 56, 87–94. [Google Scholar]
  26. Wang, Y.; Luo, X.; Zhang, J.; Zhao, Z.; Zhang, J. An Improved Algorithm of K-means Based on Evolutionary Computation. Intell. Autom. Soft Comput. 2020, 26, 961–971. [Google Scholar] [CrossRef]
  27. Zhang, Y.; Zhang, D.; Shi, H. K-means clustering based on self-adaptive weight. In Proceedings of the 2012 2nd International Conference on Computer Science and Network Technology, Changchun, China, 29–31 December 2012; IEEE: Piscataway, NJ, USA, 2012. [Google Scholar]
  28. Chen, A.; Yang, Y. Diffusion K-means clustering on manifolds: Provable exact recovery via semidefinite relaxations. Appl. Comput. Harmon. Anal. 2021, 52, 303–347. [Google Scholar] [CrossRef]
  29. Dinh, D.-T.; Huynh, V.-N.; Sriboonchitta, S. Clustering mixed numerical and categorical data with missing values. Inf. Sci. 2021, 571, 418–442. [Google Scholar] [CrossRef]
  30. Bao, Y. Contour similarity and metric of samples of finite dimensional state vector. J. Liaoning Tech. Univ. 2011, 30, 603–660. [Google Scholar]
  31. Zhao, F.; Sun, M.; Bao, Y. Similarity Measure of Geometric Contours about Multi-Sale Data and Its Application. Math. Pract. Theory 2013, 43, 178–182. [Google Scholar]
  32. Fisher, R.A. Iris. UCI Machine Learning Repository. 1988. Available online: https://archive.ics.uci.edu/dataset/53/iris (accessed on 21 April 2024).
  33. Aeberhard, S.; Forina, M. Wine. UCI Machine Learning Repository. 1991. Available online: https://archive.ics.uci.edu/dataset/109/wine (accessed on 21 April 2024).
  34. Nakai, K. Ecoli. UCI Machine Learning Repository. 1996. Available online: https://archive.ics.uci.edu/dataset/39/ecoli (accessed on 21 April 2024).
  35. Charytanowicz, M.; Niewczas, J.; Kulczycki, P.; Kowalski, P.; Lukasik, S. Seeds. UCI Machine Learning Repository. 2012. Available online: https://archive.ics.uci.edu/dataset/236/seeds (accessed on 21 April 2024).
  36. Nakai, K. Yeast. UCI Machine Learning Repository. 1996. Available online: https://archive.ics.uci.edu/dataset/110/yeast (accessed on 21 April 2024).
  37. Nash, W.; Sellers, T.; Talbot, S.; Cawthorn, A.; Ford, W. Abalone. UCI Machine Learning Repository. 1995. Available online: https://archive.ics.uci.edu/dataset/1/abalone (accessed on 21 April 2024).
Figure 1. The Csk-means algorithm design process.
Figure 2. Basic data format image presentation. The dark red dashed box represents the number of the sample data. The red rounded box represents the jth conditional factor described in the previous section. The red box represents the phase distribution of the ith sample under each factor; and the blue dashed box represents the original data matrix consisting of all the samples.
Figure 3. Comparison of NMI of the Csk-means algorithm with other clustering algorithms on 6 datasets.
Table 1. The information of the six datasets downloaded from the UCI Machine Learning Repository.
Datasets | No. of Samples | No. of Features | No. of Clusters
Iris [32] | 150 | 4 | 3
Wine [33] | 178 | 13 | 3
Ecoli [34] | 336 | 8 | 8
Seeds [35] | 420 | 7 | 3
Yeast [36] | 1484 | 8 | 10
Abalone [37] | 4177 | 8 | 28
Table 2. The number of iterations, the running time (s), and the accuracy for classifying the six datasets.
Datasets | Time Spent (CSk-means) | Time Spent (k-means) | Iterations (CSk-means) | Iterations (k-means) | Accuracy (CSk-means) | Accuracy (k-means)
Iris | 0.0076 | 0.0064 | 3 | 4 | 0.92 | 0.89
Wine | 0.0754 | 0.0182 | 7 | 5 | 0.94 | 0.70
Ecoli | 0.0822 | 0.0418 | 10 | 23 | 0.76 | 0.58
Seeds | 0.4331 | 0.2781 | 6 | 7 | 0.90 | 0.89
Yeast | 0.1946 | 0.0358 | 10 | 40 | 0.42 | 0.40
Abalone | 0.2286 | 0.2092 | 30 | 54 | 0.18 | 0.22
Table 3. The F-measure (F), Precision (P), and Recall (R) for classifying the six datasets.
Datasets | F (CSk-means) | F (k-means) | P (CSk-means) | P (k-means) | R (CSk-means) | R (k-means)
Iris | 0.94 | 0.92 | 0.94 | 0.88 | 0.94 | 0.98
Wine | 0.94 | 0.78 | 0.95 | 0.83 | 0.97 | 0.74
Ecoli | 0.76 | 0.57 | 0.76 | 0.90 | 0.76 | 0.42
Seeds | 0.91 | 0.91 | 0.92 | 0.91 | 0.91 | 0.92
Yeast | 0.40 | 0.35 | 0.35 | 0.34 | 0.50 | 0.36
Abalone | 0.30 | 0.24 | 0.33 | 0.29 | 0.28 | 0.21
Table 4. Evaluation results of the Csk-means algorithm, hierarchical clustering, spectral clustering, and k-means++ on the six datasets.
Datasets | Csk-Means (Accuracy, F, P, R) | Hierarchical Clustering (Accuracy, F, P, R) | Spectral Clustering (Accuracy, F, P, R) | k-Means++ (Accuracy, F, P, R)
Iris | 0.92, 0.94, 0.94, 0.94 | 0.35, 0.51, 0.34, 1.00 | 0.91, 0.94, 0.88, 1.00 | 0.91, 0.92, 0.88, 0.98
Wine | 0.94, 0.96, 0.95, 0.97 | 0.43, 0.60, 0.61, 0.58 | 0.32, 0.49, 0.33, 0.93 | 0.74, 0.80, 0.86, 0.76
Ecoli | 0.76, 0.76, 0.76, 0.76 | 0.45, 0.61, 0.44, 0.98 | 0.56, 0.55, 0.85, 0.41 | 0.60, 0.57, 0.91, 0.43
Seeds | 0.90, 0.91, 0.92, 0.91 | 0.92, 0.94, 0.93, 0.95 | 0.90, 0.92, 0.91, 0.94 | 0.91, 0.93, 0.93, 0.94
Yeast | 0.42, 0.40, 0.35, 0.50 | 0.32, 0.48, 0.31, 0.97 | 0.29, 0.01, 0.05, 0.00 | 0.41, 0.36, 0.35, 0.36
Abalone | 0.18, 0.30, 0.33, 0.28 | 0.22, 0.33, 0.29, 0.36 | 0.13, 0.22, 0.31, 0.17 | 0.23, 0.25, 0.30, 0.21
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhao, J.; Bao, Y.; Li, D.; Guan, X. An Improved K-Means Algorithm Based on Contour Similarity. Mathematics 2024, 12, 2211. https://doi.org/10.3390/math12142211

AMA Style

Zhao J, Bao Y, Li D, Guan X. An Improved K-Means Algorithm Based on Contour Similarity. Mathematics. 2024; 12(14):2211. https://doi.org/10.3390/math12142211

Chicago/Turabian Style

Zhao, Jing, Yanke Bao, Dongsheng Li, and Xinguo Guan. 2024. "An Improved K-Means Algorithm Based on Contour Similarity" Mathematics 12, no. 14: 2211. https://doi.org/10.3390/math12142211
