Evaluating AI Methods for Pulse Oximetry: Performance, Clinical Accuracy, and Comprehensive Bias Analysis
Abstract
:1. Introduction
2. Methodology
2.1. Literature Search
- “oximetry” according to MeSH terms.
- “pulse oximetry” OR “oximet*” OR “oxygen saturation” to include all the relative references.
- “photoplethysmography” according to MeSH terms.
- “photoplethysmography” OR “PPG” as a broad term used to describe the optical imaging method for detecting arterial pulsations.
- “Artificial Intelligence” OR “AI” OR “Machine Learning” OR “Deep Learning” OR “Neural Networks” OR “Support Vector Machines” OR “Genetic Algorithms” OR “supervised learning” OR “unsupervised learning” to capture the broad spectrum of AI methods used in the analysis.
- “accuracy” OR “precision” OR “measurement Accuracy” OR “error” OR “bias” OR “reliability” to identify studies that assess the performance metrics of AI methods in SpO2 estimation.
2.2. Inclusion and Exclusion Criteria
- Studies do not involve human subjects;
- Inadequately detailed AI methodologies;
- Literature mentions PPG signals but not in the context of estimating SpO2.
- Review articles.
2.3. Data Extraction and Analysis
2.4. Data Collection and Outcomes
3. Results and Discussions
3.1. Characteristics of the Selected Studies
3.2. Estimation Methods
- Neural Network Models (NNMs): Neural networks, particularly Artificial Neural Networks (ANNs) [48,56] and Convolutional Neural Networks (CNNs) [23,42,57,58,62], are among the most widely used models for SpO2 estimation. These networks excel at identifying complex patterns in physiological data, making them highly suited for clinical monitoring tasks. Variants like 3D CNNs [31], U-Net [61], and CL-SPO2Net [63] are tailored for specific datasets or device inputs (e.g., smartphone sensors like those found in iPhone models) [42]. Neural networks, particularly CNNs, have shown great potential in analyzing both raw PPG signals and video-based data for real-time SpO2 estimation, a crucial aspect in high-stakes environments such as emergency medicine, operating rooms, and ICUs.
- Support Vector Machines (SVMs): Diverse SVM applications include Support Vector Machines Regression for traditional regression tasks [44,60] and specialized implementations such as SVMs for classification across subjects [46] or SVMs with linear kernels [22], highlighting their versatility in handling both linear and non-linear data in high-stakes clinical settings.
- Ensemble Methods (EMs): Techniques like Random Forests [43,47,50], Gradient Boosting [43,47,58], and AdaBoost [58] leverage the power of combining multiple models to enhance prediction accuracy and stability. This is crucial for medical applications, where reliability and robustness are paramount. More sophisticated ensemble approaches like XGBoost [47] and Extra Trees Regressor [58] reflect the need to optimize performance for critical clinical decision-making.
- Gaussian Processes (GPs): Gaussian Process regression models have demonstrated strong performance in estimating respiratory rate (RR) and oxygen saturation from PPG signals, outperforming other machine learning models for both RR and SpO2 estimation [60]. This probabilistic approach provides a measure of uncertainty in predictions, crucial for clinical applications thanks to their ability to handle small datasets and provide uncertainty estimates.
- Explainable AI (XAI): Although not an AI model itself, XAI is a framework or set of techniques aimed at making AI model outputs more understandable to humans. XAI is included in our review due to its importance in medical AI systems, where explainability is critical. Clinicians need to understand how decisions are made, and XAI techniques help interpret model outputs by providing insights into the contribution of each feature to the prediction. This enhances transparency and builds trust in AI-based clinical monitoring systems [59].
- Transformer Models, (TFM): Transformer models have shown great potential across various clinical applications due to their ability to handle sequential data efficiently, making them highly suitable for continuous monitoring in intensive care units (ICUs) and operating rooms, where real-time data analysis is crucial [61]. They are increasingly being used in tasks such as patient monitoring, medical imaging, and even predicting clinical outcomes, thanks to their powerful ability to model long-term dependencies and process large datasets with high accuracy. An example of this is the Vision Transformer (ViViT) architecture, which has been adapted for video-based SpO2 estimation, demonstrating the versatility of transformers in handling visual and time-series data for physiological monitoring applications [58]. Transformers are particularly valuable for their scalability and ability to integrate multimodal data, making them a versatile tool in modern healthcare settings.
3.3. Key Performance Metrics
3.4. Feature Extraction
3.5. Software Libraries and Tools
3.6. Study Quality Assessment
- D1: Patient selection. This domain assessed whether studies provided sufficient demographic information (age, gender, number of participants, skin color, or ancestry reported) and type of participants (clinical/controlled environment). For instance, Kim et al. had a small sample size of 10 healthy participants [25]. In contrast, larger studies such as Guo et al. (reported 513 participants) or Chu et al. (included 1732 participants) reported the largest datasets, providing highly robust data for clinical applications [61].
- D2: Index test (AI model and PPG analysis). This domain evaluated the methods or tools employed, preprocessing techniques for PPG analysis, and extracted features. In several studies, the use of filters and preprocessing tools such as Butterworth filters was highlighted. For instance, Shuzan et al. extracted features from a filtered PPG signal for accurate SpO2 estimation [60]. The AI models used across the studies varied, with some relying on traditional machine learning techniques, such as Support Vector Machines (SVMs) in the study by Liu et al. [22], while others employed advanced deep learning models.
- D3: Reference standard. This domain analyzed whether the studies employed a ground truth method for SpO2 estimation and considered sample size. Some studies used traditional pulse oximeters as the reference standard. However, others, such as Aguirregomezcorta et al., used arterial blood gas analysis (BGA), a more accurate, gold-standard method for reference [43]. The variety of reference standards emphasizes the importance of validating AI models against reliable, consistent methods to ensure clinical accuracy.
- D4: Flow and timing. This domain evaluated the range of SpO2 saturation intervals tested and the reporting of performance metrics such as MAE and RMSE. Chu et al. included a large range of saturation intervals with over 1732 participants, providing key performance metrics and robust data for SpO2 estimation [61]. By contrast, Vijayarangan et al. tested reflectance pulse oximetry with a narrower saturation interval and a smaller sample size [50].
3.7. Study Limitations and Future Research Perspectives
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Jubran, A. Pulse oximetry. Crit. Care 2015, 19, 272. [Google Scholar] [CrossRef] [PubMed]
- Khanna, A.K.; Banga, A.; Rigdon, J.; White, B.N.; Cuvillier, C.; Ferraz, J.; Olsen, F.; Hackett, L.; Bansal, V.; Kaw, R. Role of continuous pulse oximetry and capnography monitoring in the prevention of postoperative respiratory failure, postoperative opioid-induced respiratory depression and adverse outcomes on hospital wards: A systematic review and meta-analysis. J. Clin. Anesth. 2024, 94, 111374. [Google Scholar] [CrossRef] [PubMed]
- Division of Industry and Consumer Education (DICE). United States Food and Drug Administration. FDA. Pulse Oximeter Accuracy and Limitations: FDA Safety Communication. 2021. Available online: https://www.fda.gov/news-events/fda-brief/fda-brief-fda-warns-about-limitations-and-accuracy-pulse-oximeters (accessed on 30 September 2024).
- Anesthesiology Devices Advisory Committee; CDRH–Center for Devices and Radiological Health; U.S. Food and Drug Administration. FDA Executive Summary Review of Pulse Oximeters and Factors That Can Impact Their Accuracy; Technical report; U.S. Food and Drug Administration: Silver Spring, MA, USA, 2022.
- Nitzan, M.; Nitzan, I.; Arieli, Y. The Various Oximetric Techniques Used for the Evaluation of Blood Oxygenation. Sensors 2020, 20, 4844. [Google Scholar] [CrossRef] [PubMed]
- Allen, J. Photoplethysmography and its application in clinical physiological measurement. Physiol. Meas. 2007, 28, R1–R39. [Google Scholar] [CrossRef]
- Webster, J. Design of Pulse Oximeters; Institute of Physics Publishing: Bristol, UK, 1997; p. 260. [Google Scholar] [CrossRef]
- Moço, A.; Verkruysse, W. Pulse oximetry based on photoplethysmography imaging with red and green light: Calibratability and challenges. J. Clin. Monit. Comput. 2021, 35, 123–133. [Google Scholar] [CrossRef]
- Tobin, R.M.; Pologe, J.A.; Batchelder, P.B. A characterization of motion affecting pulse oximetry in 350 patients. Anesth. Analg. 2002, 94, S54. [Google Scholar]
- Santos, M.; Vollam, S.; Pimentel, M.A.; Areia, C.; Young, L.; Roman, C.; Ede, J.; Piper, P.; King, E.; Harford, M.; et al. The Use of Wearable Pulse Oximeters in the Prompt Detection of Hypoxemia and During Movement: Diagnostic Accuracy Study. J. Med. Internet Res. 2022, 24, e28890. [Google Scholar] [CrossRef]
- Cabanas, A.M.; Martín-Escudero, P.; Shelley, K.H. Improving pulse oximetry accuracy in dark-skinned patients: Technical aspects and current regulations. Br. J. Anaesth. 2023, 131, 640–644. [Google Scholar] [CrossRef]
- Aoki, K.C.; Barrant, M.; Gai, M.J.; Handal, M.; Xu, V.; Mayrovitz, H.N. Impacts of Skin Color and Hypoxemia on Noninvasive Assessment of Peripheral Blood Oxygen Saturation: A Scoping Review. Cureus 2023, 15, e46078. [Google Scholar] [CrossRef]
- Gudelunas, M.K.; Lipnick, M.; Hendrickson, C.; Vanderburg, S.; Okunlola, B.; Auchus, I.; Feiner, J.R.; Bickler, P.E. Low Perfusion and Missed Diagnosis of Hypoxemia by Pulse Oximetry in Darkly Pigmented Skin: A Prospective Study. Anesth. Analg. 2024, 138, 552–561. [Google Scholar] [CrossRef]
- León-Valladares, D.; Barrio-Mateu, L.A.; Cortés-Carmona, N.; Fuentes-Lizana, G.; Cabanas, A.M.; Latorre-Progulakis, K.; Fuentes-Guajardo, M.; Martín-Escudero, P. Determining factors of pulse oximetry accuracy: A literature review. Rev. Clínica Española 2024, 224, 314–330. [Google Scholar] [CrossRef]
- Khan, M.; Pretty, C.G.; Amies, A.C.; Elliott, R.; Shaw, G.M.; Chase, J.G. Investigating the Effects of Temperature on Photoplethysmography. IFAC-PapersOnLine 2015, 48, 360–365. [Google Scholar] [CrossRef]
- Kyriacou, P.A.; Charlton, P.H.; Al-Halawani, R.; Shelley, K.H. Inaccuracy of pulse oximetry with dark skin pigmentation: Clinical implications and need for improvement. Br. J. Anaesth. 2023, 130, e33–e36. [Google Scholar] [CrossRef] [PubMed]
- Halm, M.A. Skin Pigmentation and Accuracy of Pulse Oximetry Values. Am. J. Crit. Care 2023, 32, 459–462. [Google Scholar] [CrossRef] [PubMed]
- Cabanas, A.M.; Fuentes-Guajardo, M.; Latorre, K.; León, D.; Martín-Escudero, P. Skin Pigmentation Influence on Pulse Oximetry Accuracy: A Systematic Review and Bibliometric Analysis. Sensors 2022, 22, 3402. [Google Scholar] [CrossRef]
- U.S. Food and Drug Administration. Approach for Improving the Performance Evaluation of Pulse Oximeter Devices Taking into Consideration Skin Pigmentation, Race and Ethnicity Discussion Paper and Request for Feedback Approach for Improving the Performance Evaluation of Pulse Oximeter Devi; Technical report; FDA Safety, Communication: Silver Spring, MD, USA, 2023.
- Khanna, A.K.; Beard, J.; Lamminmäki, S.; Närväinen, J.; Antaki, N.; Yapici, H.O. Assessment of skin pigmentation-related bias in pulse oximetry readings among adults. J. Clin. Monit. Comput. 2024, 38, 113–120. [Google Scholar] [CrossRef]
- Marlar, A.I.; Knabe, B.K.; Taghikhan, Y.; Applegate, R.L.; Fleming, N.W. Performance of pulse oximeters as a function of race compared to skin pigmentation: A single center retrospective study. J. Clin. Monit. Comput. 2024. [Google Scholar] [CrossRef]
- Liu, S.H.; Liu, H.C.; Chen, W.; Tan, T.H. Evaluating quality of photoplethymographic signal on wearable forehead pulse oximeter with supervised classification approaches. IEEE Access 2020, 8, 185121–185135. [Google Scholar] [CrossRef]
- Priem, G.; Martinez, C.; Bodinier, Q.; Carrault, G. Clinical grade SpO2 prediction through semi-supervised learning. In Proceedings of the IEEE 20th International Conference on Bioinformatics and Bioengineering, BIBE 2020, Cincinnati, OH, USA, 26–28 October 2020; Institute of Electrical and Electronics Engineers Inc.: Piscataway, NJ, USA, 2020; pp. 914–921. [Google Scholar] [CrossRef]
- Center for Devices and Radiological Health. Pulse Oximeters—Premarket Notification Submissions; U.S. Department of Health and Human Services: Washington, DC, USA, 2013; Volume 510, pp. 1–19.
- Kim, N.H.; Yu, S.G.; Kim, S.E.; Lee, E.C. Non-Contact Oxygen Saturation Measurement Using YCgCr Color Space with an RGB Camera. Sensors 2021, 21, 6120. [Google Scholar] [CrossRef]
- Van de Louw, A.; Cracco, C.; Cerf, C.; Harf, A.; Duvaldestin, P.; Lemaire, F.; Brochard, L. Accuracy of pulse oximetry in the intensive care unit. Intensive Care Med. 2001, 27, 1606–1613. [Google Scholar] [CrossRef]
- Martín-Escudero, P. La Oximetría en Registro Continuo en el Esfuerzo Máximo en Distintas Especialidades Deportivas. Ph.D. Thesis, Universidad Complutense de Madrid, Madrid, Spain, 2003. [Google Scholar]
- Marca Fuertes, C.; Canales, M.G.; Miguel-Tobal, F.; Escudero, P.M. La pulsioximetría y su aplicación en pruebas de esfuerzo máximo. Apunt. Med. De L’Esport 2011, 46, 23–27. [Google Scholar] [CrossRef]
- Department of Health and Social Care. Equity in Medical Devices: Independent Review; Technical report; Department of Health and Social Care: London, UK, 2024.
- Akbilgic, O. Principles of Artificial Intellgence for Medicine. J. Am. Heart Assoc. 2024, 13, e035815. [Google Scholar] [CrossRef] [PubMed]
- Stogiannopoulos, T.; Cheimariotis, G.A.; Mitianoudis, N. A non-contact SpO2 estimation using video magnification and infrared data. In Proceedings of the ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, Rhodes Island, Greece, 4–10 June 2023; Institute of Electrical and Electronics Engineers Inc.: Piscataway, NJ, USA, 2023. [Google Scholar] [CrossRef]
- Whiteside, L.A.; Roy, M.E. Use of an Artificial Intelligence Device for Evaluating Blood Loss in Complex Major Orthopaedic Surgery Procedures. J. Arthroplast. 2024, 39, S53–S58. [Google Scholar] [CrossRef] [PubMed]
- Martins, I.; Matos, J.; Gonçalves, T.; Celi, L.A.; Wong, A.I.; Cardoso, J.S. Evaluating the Impact of Pulse Oximetry Bias in Machine Learning under Counterfactual Thinking. arXiv 2024, arXiv:2408.04396. [Google Scholar]
- Hammond, A.; Jain, B.; Celi, L.A.; Stanford, F.C. An extension to the FDA approval process is needed to achieve AI equity. Nat. Mach. Intell. 2023, 5, 96–97. [Google Scholar] [CrossRef]
- Zhang, Z.; Khatami, R. Can we trust the oxygen saturation measured by consumer smartwatches? Lancet Respir. Med. 2022, 10, e47–e48. [Google Scholar] [CrossRef]
- Tranfield, D.; Denyer, D.; Smart, P. Towards a Methodology for Developing Evidence-Informed Management Knowledge by Means of Systematic Review* Introduction: The need for an evidence-informed approach. Br. J. Manag. 2003, 14, 207–222. [Google Scholar] [CrossRef]
- Merigó, J.M.; Blanco-Mesa, F.; Gil-Lafuente, A.M.; Yager, R.R. Thirty Years of the International Journal of Intelligent Systems: A Bibliometric Review. Int. J. Intell. Syst. 2017, 32, 526–554. [Google Scholar] [CrossRef]
- Lopez-Perez, L.; Georga, E.; Conti, C.; Vicente, V.; García, R.; Pecchia, L.; Fotiadis, D.; Licitra, L.; Cabrera, M.F.; Arredondo, M.T.; et al. Statistical and machine learning methods for cancer research and clinical practice: A systematic review. Biomed. Signal Process. Control 2024, 92, 106067. [Google Scholar] [CrossRef]
- Moher, D.; Liberati, A.; Tetzlaff, J.; Altman, D.G. Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. PLoS Med. 2009, 6, e1000097. [Google Scholar] [CrossRef]
- Aria, M.; Cuccurullo, C. bibliometrix: An R-tool for comprehensive science mapping analysis. J. Inf. 2017, 11, 959–975. [Google Scholar] [CrossRef]
- McGuinness, L.A.; Higgins, J.P.T. Risk-of-bias VISualization (robvis): An R package and Shiny web app for visualizing risk-of-bias assessments. Res. Synth. Methods 2021, 12, 55–61. [Google Scholar] [CrossRef] [PubMed]
- DIng, X.; Nassehi, D.; Larson, E.C. Measuring Oxygen Saturation with Smartphone Cameras Using Convolutional Neural Networks. IEEE J. Biomed. Health Inform. 2019, 23, 2603–2610. [Google Scholar] [CrossRef]
- Badiola Aguirregomezcorta, I.; Blazek, V.; Leonhardt, S.; Hoog Antink, C. Learning about reflective PPG for SpO2 determination using Machine Learning. Curr. Dir. Biomed. Eng. 2021, 7, 33–36. [Google Scholar] [CrossRef]
- Ogawa, M.; Yamakoshi, Y.; Nogawa, M.; Yamakoshi, T.; Motoi, K.; Tanaka, S.; Yamakoshi, K. A new calibration method with support vector machines for pulse oximetry. In IFMBE Proceedings; Springer: Berlin/Heidelberg, Germany, 2008; Volume 22, pp. 1125–1127. [Google Scholar] [CrossRef]
- Ogawa, M.; Yamakoshi, Y.; Nogawa, M.; Yamakoshi, T.; Motoi, K.; Tanaka, S.; Yamakoshi, K. An attempt of new calibration method with support vector machines regression for pulse oximetry. In IFMBE Proceedings; Springer: Berlin/Heidelberg, Germany, 2009; Volume 25, pp. 465–468. [Google Scholar] [CrossRef]
- Liou, S.W.; Kurniadi, D.; Zheng, B.R.; Xie, W.Q.; Tien, C.J.; Jong, G.J. Classification of biomedical signal on IoT platform using support vector machine. In Proceedings of the 4th IEEE International Conference on Applied System Innovation 2018, ICASI 2018, Taiwan, China, 13–17 April 2018; Lam, A.D.K.-T., Prior, S.D., Meen, T.-H., Eds.; Institute of Electrical and Electronics Engineers Inc.: Piscataway, NJ, USA, 2018; pp. 50–53. [Google Scholar] [CrossRef]
- Guo, J.; Yang, L.; Bie, R.; Yu, J.; Gao, Y.; Shen, Y.; Kos, A. An XGBoost-based physical fitness evaluation model using advanced feature selection and Bayesian hyper-parameter optimization for wearable running monitoring. Comput. Netw. 2019, 151, 166–180. [Google Scholar] [CrossRef]
- Ghazal, S.; Id, M.S.; Id, D.B.; Bouachir, W.; Id, A.J.; Noumeir, R. Using machine learning models to predict oxygen saturation following ventilator support adjustment in critically ill children: A single center pilot study. PLoS ONE 2019, 2, e0198921. [Google Scholar] [CrossRef]
- Venkat, S.; P.S., M.T.P.S.A.; Alex, A.; S.P., P.; Balamugesh; D.J., C.; Joseph, J.; Sivaprakasam, M. Machine Learning based SpO2 Computation Using Reflectance Pulse Oximetry. In Proceedings of the 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Berlin, Germany, 23–27 July 2019; pp. 482–485. [Google Scholar] [CrossRef]
- Vijayarangan, S.; Suresh, P.; Sp, P.; Joseph, J.; Sivaprakasam, M. Robust Modelling of Reflectance Pulse Oximetry for SpO2 Estimation. In Proceedings of the 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Montreal, QC, Canada, 20–24 July 2020; pp. 374–377. [Google Scholar] [CrossRef]
- Rodriguez-Labra, J.I.; Kosik, C.J.; Maddipatla, D.; Narakathu, B.B.; Atashbar, M.Z. Development of a PPG Sensor Array as a Wearable Device for Monitoring Cardiovascular Metrics. IEEE Sens. J. 2021, 21, 26320–26327. [Google Scholar] [CrossRef]
- Kim, J.W.; Park, S.M.; Choi, S.W. Reliability Assessment of Telemedicine Data by Analyzing Photoplethysmography with Deep Neural Network Technology. Trans. Korean Soc. Mech. Eng. B 2021, 45, 261–269. [Google Scholar] [CrossRef]
- Devi, G.; Balasubramanian, K.; Ngoc, L. Machine Learning and Deep Learning Techniques for Medical Science; CRC Press: Boca Raton, FL, USA, 2022. [Google Scholar] [CrossRef]
- Qiao, D.; Ayesha, A.H.; Zulkernine, F.; Jaffar, N.; Masroor, R. ReViSe: Remote Vital Signs Measurement Using Smartphone Camera. IEEE Access 2022, 10, 131656–131670. [Google Scholar] [CrossRef]
- Zhu, L.; Vatanparvar, K.; Gwak, M.; Kuang, J.; Gao, A. Contactless SpO2 Detection from Face Using Consumer Camera. In Proceedings of the BHI-BSN 2022—IEEE-EMBS International Conference on Biomedical and Health Informatics and IEEE-EMBS International Conference on Wearable and Implantable Body Sensor Networks—Proceedings, Ioannina, Greece, 27–30 September 2022; Institute of Electrical and Electronics Engineers Inc.: Piscataway, NJ, USA, 2022. [Google Scholar] [CrossRef]
- Koteska, B.; Bodanova, A.M.; Mitrova, H.; Sidorenko, M.; Lehocki, F. A Deep Learning Approach to Estimate SpO2 from PPG Signals. In Proceedings of the 9th International Conference on Bioinformatics Research and Applications 2022, Berlin Germany, 18–20 September 2022; pp. 142–148. [Google Scholar] [CrossRef]
- Mathew, J.; Tian, X.; Wong, C.W.; Ho, S.; Milton, D.K.; Wu, M. Remote Blood Oxygen Estimation From Videos Using Neural Networks. IEEE J. Biomed. Health Inform. 2023, 27, 3710–3720. [Google Scholar] [CrossRef]
- Stogiannopoulos, T.; Cheimariotis, G.A.; Mitianoudis, N. A Study of Machine Learning Regression Techniques for Non-Contact SpO2 Estimation from Infrared Motion-Magnified Facial Video. Information 2023, 14, 301. [Google Scholar] [CrossRef]
- Zhong, Y.; Jatav, A.; Afrin, K.; Shivaram, T.; Bukkapatnam, S.T.S. Enhanced SpO2 estimation using explainable machine learning and neck photoplethysmography. Artif. Intell. Med. 2023, 145, 102685. [Google Scholar] [CrossRef] [PubMed]
- Shuzan, M.N.I.; Chowdhury, M.H.; Chowdhury, M.E.H.; Murugappan, M.; Hoque Bhuiyan, E.; Arslane Ayari, M.; Khandakar, A. Machine Learning-Based Respiration Rate and Blood Oxygen Saturation Estimation Using Photoplethysmogram Signals. Bioengineering 2023, 10, 167. [Google Scholar] [CrossRef]
- Chu, Y.; Tang, K.; Hsu, Y.C.; Huang, T.; Wang, D.; Li, W.; Savitz, S.I.; Jiang, X.; Shams, S. Non-invasive arterial blood pressure measurement and SpO2 estimation using PPG signal: A deep learning framework. BMC Med. Inform. Decis. Mak. 2023, 23, 131. [Google Scholar] [CrossRef]
- Gammariello, M.C.; Sergi, I.; Montanaro, T.; Shumba, A.T.; Mazzeo, P.L.; Patrono, L.; Distante, C. SpO2 Estimation Using Deep Neural Networks: A Comparative Study. In Proceedings of the 2023 8th International Conference on Smart and Sustainable Technologies (SpliTech), Split, Croatia, 4–7 July 2023; pp. 1–6. [Google Scholar] [CrossRef]
- Peng, J.; Su, W.; Chen, H.; Sun, J.; Tian, Z. CL-SPO2Net: Contrastive Learning Spatiotemporal Attention Network for Non-Contact Video-Based SpO2 Estimation. Bioengineering 2024, 11, 113. [Google Scholar] [CrossRef]
- Martín-Escudero, P.; Cabanas, A.M.; Dotor-Castilla, M.L.; Galindo-Canales, M.; Miguel-Tobal, F.; Fernández-Pérez, C.; Fuentes-Ferrer, M.; Giannetti, R. Are Activity Wrist-Worn Devices Accurate for Determining Heart Rate during Intense Exercise? Bioengineering 2023, 10, 254. [Google Scholar] [CrossRef]
- Artemyev, M.; Churikova, M.; Grinenko, M.; Perepelkina, O. Robust algorithm for remote photoplethysmography in realistic conditions. Digit. Signal Process. 2020, 104, 102737. [Google Scholar] [CrossRef]
- Liu, Y.; Xu, C.; Qi, L.; Li, Y. A robust non-contact heart rate estimation from facial video based on a non-parametric signal extraction model. Biomed. Signal Process. Control 2024, 93, 106186. [Google Scholar] [CrossRef]
- Gupta, A.; Ravelo-García, A.G.; Dias, F.M. Availability and performance of face based non-contact methods for heart rate and oxygen saturation estimations: A systematic review. Comput. Methods Programs Biomed. 2022, 219, 106771. [Google Scholar] [CrossRef]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788. [Google Scholar] [CrossRef]
- Fu, E.S.; Downs, J.B.; Schweiger, J.W.; Miguel, R.V.; Smith, R.A. Supplemental oxygen impairs detection of hypoventilation by pulse oximetry. Chest 2004, 126, 1552–1558. [Google Scholar] [CrossRef]
- Hoffman, J.S.; Viswanath, V.K.; Tian, C.; Ding, X.; Thompson, M.J.; Larson, E.C.; Patel, S.N.; Wang, E.J. Smartphone camera oximetry in an induced hypoxemia study. NPJ Digit. Med. 2022, 5, 146. [Google Scholar] [CrossRef] [PubMed]
- Scikit-learn.org. Scikit-Learn. Available online: https://scikit-learn.org/stable/ (accessed on 30 September 2024).
- Gerboni, G.; Comunale, G.; Chen, W.; Lever Taylor, J.; Migliorini, M.; Picard, R.; Cruz, M.; Regalia, G. Prospective clinical validation of the Empatica EmbracePlus wristband as a reflective pulse oximeter. Front. Digit. Health 2023, 5, 1258915. [Google Scholar] [CrossRef]
- Feli, M.; Azimi, I.; Anzanpour, A.; Rahmani, A.M.; Liljeberg, P. An energy-efficient semi-supervised approach for on-device photoplethysmogram signal quality assessment. Smart Health 2023, 28, 100390. [Google Scholar] [CrossRef]
- Sarkar, S.; Ghosh, A. Schrödinger spectrum and slim CNN architecture-based signal quality estimation for Photoplethysmogram signals. Biomed. Signal Process. Control 2024, 94, 106240. [Google Scholar] [CrossRef]
- Veiga, P.; Varandas, R.; Gamboa, H. Machine Learning Algorithm Development and Metrics Extraction from PPG Signal for Improved Robustness in Wearables. In Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2023), Lisbon, Portugal, 16–18 February 2023; Volume 4, pp. 178–185. [Google Scholar] [CrossRef]
- Satter, S.; Kwon, T.H.; Kim, K.D. A Comparative Analysis of Various Machine Learning Algorithms to Improve the Accuracy of HbA1c Estimation Using Wrist PPG Data. Sensors 2023, 23, 7231. [Google Scholar] [CrossRef]
- Cabanas, A.M.; Fuentes-Guajardo, M.; Sáez, N.; Catalán, D.D.; Collao-Caiconte, P.O.; Martín-Escudero, P. Exploring the Hidden Complexity: Entropy Analysis in Pulse Oximetry of Female Athletes. Biosensors 2024, 14, 52. [Google Scholar] [CrossRef]
- Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Int. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
- Twilio. Available online: https://www.twilio.com/en-us (accessed on 30 September 2024).
- Whiting, P.F.; Rutjes, A.W.S.; Westwood, M.E.; Mallett, S.; Deeks, J.J.; Reitsma, J.B.; Leeflang, M.M.G.; Sterne, J.A.C.; Bossuyt, P.M.M. QUADAS-2: A revised tool for the quality assessment of diagnostic accuracy studies. Ann. Intern. Med. 2011, 155, 529–536. [Google Scholar] [CrossRef]
- Bierman, A.; Benner, K.; Rea, M.S. Melanin bias in pulse oximetry explained by light source spectral bandwidth. Br. J. Anaesth. 2024, 132, 957–963. [Google Scholar] [CrossRef]
- Landry, C.; Hedge, E.T.; Hughson, R.L.; Peterson, S.D.; Arami, A. Cuffless Blood Pressure Estimation During Moderate- and Heavy-Intensity Exercise Using Wearable ECG and PPG. IEEE J. Biomed. Health Inform. 2022, 26, 5942–5952. [Google Scholar] [CrossRef]
- Ghorbani, R.; Reinders, M.J.T.; Tax, D.M.J. Personalized anomaly detection in PPG data using representation learning and biometric identification. Biomed. Signal Process. Control 2024, 94, 106216. [Google Scholar] [CrossRef]
Indexing Terms | Items (n) |
---|---|
Web of Science | |
#1 “oximet*” OR “oxygen saturation” OR “SpO2” | 46,561 |
#2 “photoplethysmography” OR PPG | 15,740 |
#3 “Artificial Intelligence” OR “AI” OR “Machine Learning” OR “Deep Learning” OR “Neural Networks” OR “Support Vector Machines” | |
OR “Genetic Algorithms” OR “supervised learning” OR “unsupervised learning” | 1,246,213 |
#4 “accuracy” OR “precision” OR “measurement accuracy” OR “error” OR “bias” OR “reliability” | 3,322,177 |
#5 #1 AND #2 AND #3 AND #4 | 68 |
PubMed | |
#1 “oximet*” OR “oxygen saturation” OR “SpO2” | 60,310 |
#2 “photoplethysmography” OR “PPG” | 8764 |
#3 “Artificial Intelligence” OR “AI” OR “Machine Learning” OR “Deep Learning” OR “Neural Networks” OR “Support Vector Machines” | |
OR “Genetic Algorithms” OR “supervised learning” OR “unsupervised learning” | 1,246,213 |
#4 “accuracy” OR “precision” OR “measurement accuracy” OR “error” OR “bias” OR “reliability” | 1,419,845 |
#5 #1 AND #2 AND #3 AND #4 | 47 |
Scopus | |
#1 “oximet*” OR “oxygen saturation” OR “SpO2” | 126,043 |
#2 “photoplethysmography” OR “PPG” | 17,684 |
#3 “Artificial Intelligence” OR “AI” OR “Machine Learning” OR “Deep Learning” OR “Neural Networks” OR “Support Vector Machines” | |
OR “Genetic Algorithms” OR “supervised learning” OR “unsupervised learning” | 771,104 |
#4 “accuracy” OR “precision” OR “measurement accuracy” OR “error” OR “bias” OR “reliability” | 6,992,893 |
#5 #1 AND #2 AND #3 AND #4 | 151 |
Description | Results |
---|---|
Timespan | 2008–2024 |
Sources (journals, books, etc.) | 22 |
Documents | 26 |
Average citations per document | 11.64 |
Average citations per year per doc | 4.14 |
References | 797 |
Articles | 17 |
Conference papers | 11 |
Keywords | 391 |
Authors | 131 |
Single-authored documents | 0 |
Documents per author | 0.23 |
Reference | Database/Type of Oximeter | SpO2 Range |
---|---|---|
Ogawa M., 2008 [44] | Masimo Radical with LNOP DCI sensor | 83–100% |
Ogawa M., 2009 [45] | Masimo Radical with LNOP DCI sensor | 76–98% |
Liou S.W., 2018 [46] | MIMIC Database | – |
Ding X., 2019 [42] | Nellcor PM10N and iPhone 7 Plus | – |
Guo J., 2019 [47] | Wearable smartband system | – |
Ghazal S., 2019 [48] | ERB Database 2016–1210, 4061 | 84–100% |
Venkat S., 2019 [49] | Reflectance DAQ, Nellcor, Masimo Radical 7/BGA test by GEM Premier 3000 | 80–100% |
Priem G., 2020 [23] | Wrist-worn reflectance pulse oximeter | 70–100% |
Vijayarangan S., 2020 [50] | Reflectance DAQ, Nellcor, Masimo Radical 7/BGA test by GEM Premier 3000 | 60–100% |
Liu S.H., 2020 [22] | Reflectance Forehead MAX30102/Finger Rossmax SA310 | – |
Badiola-Aguirregomezcorta I., 2021 [43] | Reflective ear pulse oximeter LAVIMO/BGA test | 70–100% |
Rodriguez-Labra J.I., 2021 [51] | A wearable multi-channel PPG prototype | – |
Kim J.W., 2021 [52] | 58 PPG datasets | – |
Kim N.H., 2021 [25] | Lab-made rPPG/CMS-50E pulse oximeter | 90–98% |
Gayathri D., 2022 [53] | Raspberry Pi-4 board | – |
Qiao D., 2022 [54] | PURE Dataset/Face rPPG/CMS-50E pulse oximeter | 90–100% |
Zhu L., 2022 [55] | Finger EMAY EMO-80 pulse oximeter | 94–99% |
Koteska B., 2022 [56] | BIDMC Dataset | 85–100% |
Mathew J., 2023 [57] | Self-collected dataset/CMS-50E pulse oximeter | – |
Stogiannopoulos T., 2023 [58] | JPD-500D ControlBios Oxicore | – |
Stogiannopoulos T., 2023 [31] | JPD-500D ControlBios Oxicore | – |
Zhong Y., 2023 [59] | Smart wearable SleepEaze with MAX30105 sensor/CMS-50E pulse oximeter | 90–100% |
Shuzan M.N.I., 2023 [60] | BIDMC dataset | 84–100% |
Chu Y., 2023 [61] | MIMIC III dataset | – |
Gammariello M.C., 2023 [62] | MTHS dataset/M70 pulse oximeter | – |
Peng J., 2024 [63] | UBFC Dataset/CMS-50E pulse oximeter | 93–100% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cabanas, A.M.; Sáez, N.; Collao-Caiconte, P.O.; Martín-Escudero, P.; Pagán, J.; Jiménez-Herranz, E.; Ayala, J.L. Evaluating AI Methods for Pulse Oximetry: Performance, Clinical Accuracy, and Comprehensive Bias Analysis. Bioengineering 2024, 11, 1061. https://doi.org/10.3390/bioengineering11111061
Cabanas AM, Sáez N, Collao-Caiconte PO, Martín-Escudero P, Pagán J, Jiménez-Herranz E, Ayala JL. Evaluating AI Methods for Pulse Oximetry: Performance, Clinical Accuracy, and Comprehensive Bias Analysis. Bioengineering. 2024; 11(11):1061. https://doi.org/10.3390/bioengineering11111061
Chicago/Turabian StyleCabanas, Ana María, Nicolás Sáez, Patricio O. Collao-Caiconte, Pilar Martín-Escudero, Josué Pagán, Elena Jiménez-Herranz, and José L. Ayala. 2024. "Evaluating AI Methods for Pulse Oximetry: Performance, Clinical Accuracy, and Comprehensive Bias Analysis" Bioengineering 11, no. 11: 1061. https://doi.org/10.3390/bioengineering11111061
APA StyleCabanas, A. M., Sáez, N., Collao-Caiconte, P. O., Martín-Escudero, P., Pagán, J., Jiménez-Herranz, E., & Ayala, J. L. (2024). Evaluating AI Methods for Pulse Oximetry: Performance, Clinical Accuracy, and Comprehensive Bias Analysis. Bioengineering, 11(11), 1061. https://doi.org/10.3390/bioengineering11111061