Next Article in Journal
Colour Choice as a Strategic Instrument in Neuromarketing
Next Article in Special Issue
Predicting the Performance of Ensemble Classification Using Conditional Joint Probability
Previous Article in Journal
Complex Network Model of Global Financial Time Series Based on Different Distance Functions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Improved K-Means Algorithm Based on Contour Similarity

1
Key Laboratory of Industrial Automation and Machine Vision of Qiannan, School of Mathematics and Statistics, Qiannan Normal University for Nationalities, Duyun 558000, China
2
College of Science, Liaoning Technical University, Fuxin 123000, China
*
Author to whom correspondence should be addressed.
Mathematics 2024, 12(14), 2211; https://doi.org/10.3390/math12142211
Submission received: 10 June 2024 / Revised: 30 June 2024 / Accepted: 6 July 2024 / Published: 15 July 2024
(This article belongs to the Special Issue Optimization Algorithms in Data Science: Methods and Theory)

Abstract

The traditional k-means algorithm is widely used in large-scale data clustering because of its easy implementation and efficient process, but it also suffers from the disadvantages of local optimality and poor robustness. In this study, a Csk-means algorithm based on contour similarity is proposed to overcome the drawbacks of the traditional k-means algorithm. For the traditional k-means algorithm, which results in local optimality due to the influence of outliers or noisy data and random selection of the initial clustering centers, the Csk-means algorithm overcomes both drawbacks by combining data lattice transformation and dissimilar interpolation. In particular, the Csk-means algorithm employs Fisher optimal partitioning of the similarity vectors between samples for the process of determining the number of clusters. To improve the robustness of the k-means algorithm to the shape of the clusters, the Csk-means algorithm utilizes contour similarity to compute the similarity between samples during the clustering process. Experimental results show that the Csk-means algorithm provides better clustering results than the traditional k-means algorithm and other comparative algorithms.
Keywords: k-means algorithm; degree of similarity; contour similarity; improved algorithm k-means algorithm; degree of similarity; contour similarity; improved algorithm

Share and Cite

MDPI and ACS Style

Zhao, J.; Bao, Y.; Li, D.; Guan, X. An Improved K-Means Algorithm Based on Contour Similarity. Mathematics 2024, 12, 2211. https://doi.org/10.3390/math12142211

AMA Style

Zhao J, Bao Y, Li D, Guan X. An Improved K-Means Algorithm Based on Contour Similarity. Mathematics. 2024; 12(14):2211. https://doi.org/10.3390/math12142211

Chicago/Turabian Style

Zhao, Jing, Yanke Bao, Dongsheng Li, and Xinguo Guan. 2024. "An Improved K-Means Algorithm Based on Contour Similarity" Mathematics 12, no. 14: 2211. https://doi.org/10.3390/math12142211

APA Style

Zhao, J., Bao, Y., Li, D., & Guan, X. (2024). An Improved K-Means Algorithm Based on Contour Similarity. Mathematics, 12(14), 2211. https://doi.org/10.3390/math12142211

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop