Next Article in Journal
Bioinformatic Analysis of the Anti-Cancer Effects of Safranal via the p53 Signaling Pathway in Cervical Cancer Cell Lines
Previous Article in Journal
Classical and Non-Classical Compound Combinations for Treatment of Prostate Cancer Cell Line PC3
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Abstract

Optimizing Breast Cancer Classification: A Comparative Analysis of Supervised and Unsupervised Machine Learning Techniques †

by
Prithwish Ghosh
1 and
Debolina Banerjee
2,*
1
Department of Statistics, North Carolina State University, Raleigh, NC 27607, USA
2
Department of Botany, Institute of Science, Visva Bharati, Bolpur 731204, India
*
Author to whom correspondence should be addressed.
Presented at the 3rd International Electronic Conference on Biomolecules, 23–25 April 2024; Available online: https://sciforum.net/event/IECBM2024.
Proceedings 2024, 103(1), 40; https://doi.org/10.3390/proceedings2024103040
Published: 12 April 2024
(This article belongs to the Proceedings of The 3rd International Electronic Conference on Biomolecules)
This study focuses on the comprehensive analysis of machine learning algorithms for the classification of breast cancer into benign and malignant categories using the Wisconsin breast cancer dataset. Two distinct approaches, supervised and unsupervised, were employed to evaluate the effectiveness of various algorithms in discerning the nature of cancerous growths based on diverse physical properties.
In the supervised learning realm, the study employed three powerful algorithms: Support Vector Machines (SVMs), Random Forests, and XGBoost. Additionally, unsupervised learning techniques, specifically Linear Discriminant Analysis and the Gaussian Finite Mixture Model for classification, were investigated. Notably, the XGBoost algorithm emerged as the most promising candidate in the supervised category, exhibiting superior classification performance when applied to the testing dataset (20% of the total data). The results indicated that XGBoost achieved an impressive precision of 98.23%, outperforming both the Gaussian Finite Mixture Model for classification (97.5%) and the Linear Discriminant Analysis (96.48%). The XGBoost algorithm, implemented on a subset of the data, demonstrated its efficacy in accurately identifying the nature of breast cancer, highlighting its potential as a robust tool for predicting malignancy based on distinct physical properties.
This study underscores the significance of supervised and unsupervised learning, particularly the XGBoost algorithm and the Gaussian Finite Mixture Model for classification, in optimizing breast cancer classification. The findings contribute valuable insights into the selection of appropriate machine learning techniques for the accurate and efficient identification of benign and malignant breast cancer, thereby facilitating improved diagnostic practices.

Author Contributions

Conceptualization, P.G. and D.B.; methodology, P.G.; software, P.G.; validation, D.B.; formal analysis, P.G.; investigation, D.B.; resources, D.B.; data curation, P.G.; writing—original draft preparation, P.G. and D.B.; writing—review and editing, P.G.; visualization, P.G.; supervision, P.G.; project administration, P.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Informed Consent Statement

Not Applicable.

Data Availability Statement

The data is taken from: https://doi.org/10.24432/C5DW2B.

Conflicts of Interest

The authors declare no conflict of Interest.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ghosh, P.; Banerjee, D. Optimizing Breast Cancer Classification: A Comparative Analysis of Supervised and Unsupervised Machine Learning Techniques. Proceedings 2024, 103, 40. https://doi.org/10.3390/proceedings2024103040

AMA Style

Ghosh P, Banerjee D. Optimizing Breast Cancer Classification: A Comparative Analysis of Supervised and Unsupervised Machine Learning Techniques. Proceedings. 2024; 103(1):40. https://doi.org/10.3390/proceedings2024103040

Chicago/Turabian Style

Ghosh, Prithwish, and Debolina Banerjee. 2024. "Optimizing Breast Cancer Classification: A Comparative Analysis of Supervised and Unsupervised Machine Learning Techniques" Proceedings 103, no. 1: 40. https://doi.org/10.3390/proceedings2024103040

Article Metrics

Back to TopTop