Next Article in Journal
ProtectingSmall and Medium Enterprises: A Specialized Cybersecurity Risk Assessment Framework and Tool
Previous Article in Journal
The Generation Rights Trading between Self-Owned Power Plants and New Energy Enterprises under the Conditions of Price Difference and Time-of-Use Pricing Settlement
Previous Article in Special Issue
Picture Fuzzy Soft Matrices and Application of Their Distance Measures to Supervised Learning: Picture Fuzzy Soft k-Nearest Neighbor (PFS-kNN)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Review

Bias Mitigation via Synthetic Data Generation: A Review

by
Mohamed Ashik Shahul Hameed
,
Asifa Mehmood Qureshi
and
Abhishek Kaushik
*
Dundalk Institute of Technology, A91 K584 Dundalk, Ireland
*
Author to whom correspondence should be addressed.
Electronics 2024, 13(19), 3909; https://doi.org/10.3390/electronics13193909
Submission received: 29 August 2024 / Revised: 23 September 2024 / Accepted: 24 September 2024 / Published: 2 October 2024

Abstract

Artificial intelligence (AI) is widely used in healthcare applications to perform various tasks. Although these models have great potential to improve the healthcare system, they have also raised significant ethical concerns, including biases that increase the risk of health disparities in medical applications. The under-representation of a specific group can lead to bias in the datasets that are being replicated in the AI models. These disadvantaged groups are disproportionately affected by bias because they may have less accurate algorithmic forecasts or underestimate the need for treatment. One solution to eliminate bias is to use synthetic samples or artificially generated data to balance datasets. Therefore, the purpose of this study is to review and evaluate how synthetic data can be generated and used to mitigate biases, specifically focusing on the medical domain. We explored high-quality peer-reviewed articles that were focused on synthetic data generation to eliminate bias. These studies were selected based on our defined inclusion criteria and exclusion criteria and the quality of the content. The findings reveal that generated synthetic data can help improve accuracy, precision, and fairness. However, the effectiveness of synthetic data is closely dependent on the quality of the data generation process and the initial datasets used. The study also highlights the need for continuous improvement in synthetic data generation techniques and the importance of evaluation metrics for fairness in AI models.
Keywords: synthetic data; artificial data; bias; fairness; AI; data generation synthetic data; artificial data; bias; fairness; AI; data generation

Share and Cite

MDPI and ACS Style

Shahul Hameed, M.A.; Qureshi, A.M.; Kaushik, A. Bias Mitigation via Synthetic Data Generation: A Review. Electronics 2024, 13, 3909. https://doi.org/10.3390/electronics13193909

AMA Style

Shahul Hameed MA, Qureshi AM, Kaushik A. Bias Mitigation via Synthetic Data Generation: A Review. Electronics. 2024; 13(19):3909. https://doi.org/10.3390/electronics13193909

Chicago/Turabian Style

Shahul Hameed, Mohamed Ashik, Asifa Mehmood Qureshi, and Abhishek Kaushik. 2024. "Bias Mitigation via Synthetic Data Generation: A Review" Electronics 13, no. 19: 3909. https://doi.org/10.3390/electronics13193909

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop