A Review on Machine Learning Deployment Patterns and Key Features in the Prediction of Preeclampsia
Abstract
:1. Introduction
- Which ML models have been included in the prediction of PE?
- Which ML model demonstrates the highest predictive capability?
- Which features are integrated into the individual ML models?
- Which features did the individual ML model identify to be of high predictive value?
- When are the individual ML models intended to be used during pregnancy?
- How frequently are the individual ML models intended to be deployed throughout pregnancy?
2. Materials and Methods
2.1. Study Design
2.2. Eligibility Criteria
2.3. Search Strategy
(pregn* OR obstetrics) AND (early OR surveillance OR monitor*) AND (detect* OR program OR predict* OR intervention OR screen*) AND (Artificial intelligence OR AI OR machine learning OR deep learning) AND (first trimester OR intelligent OR automat*) AND (preeclampsia [Title/Abstract])
2.4. Selection Process
2.5. Data Collection
- Study characteristics: Study type, year of publication, and country.
- Dataset and participant information: Type and quality of the dataset. Number of participants and the incidence of PE cases used for training, validation, and test sets in the ML models.
- Features: Variables used for training the ML model.
- ML models employed in the study.
- Best performance: Identifying the best-performing ML model and its prediction of PE subgroups. For those studies, where the prediction of PE has not been specified other than predicting PE, it has been denoted as predicting “All PE” within this review to compare across studies. The performance is evaluated using performance metrics (Area Under the Curve (AUC), Receiver Operating Characteristic (ROC), accuracy, average accuracy, sensitivity, recall, specificity, precision, F1-score, Brier score, negative prediction value (NPV), positive prediction value (PPV), kappa, Matthew’s correlation coefficient, G-mean, screen-positive rate (SPR), true positive (TP), true-positive rate (TPR), detection rate (DR), false detection rate (FDR), false-negative rate (FNR), false positive (FP), and false-positive rate (FPR)). Among the listed terms, sensitivity, recall, and TPR refer to the same metric value, describing the prediction of positive cases from all the positive cases within the dataset [11].
- Top predictive features: The five most important features identified by the individual ML model for predicting PE among its included features.
- The intended use of the ML model: Is either reported or interpreted from the study. Including the number of times the ML model is intended to be used and which gestational week within the pregnancy, if this has been denoted by the authors.
- ML deployment details: Patterns and strategies for deployment, configuration within deployment environments, ecosystems integration, monitoring, maintenance, security and protection of data, scalability, load balancing, resource management, versioning, and tracking ML models.
2.6. Risk of Bias
3. Results
Performance of Machine Learning Models
4. Discussion
4.1. Best-Performing Machine Learning Model
4.2. Datasets Used
4.3. Feature Selection
4.4. Key Features Used
4.5. Machine Learning Deployment Pattern
4.6. Machine Learning Deployment, Monitoring, and Maintenance
4.6.1. Patterns and Strategies of ML Deployment
4.6.2. Configuration Within Deployment Environments
4.6.3. Ecosystem Integration
4.6.4. Monitoring of ML Models
4.6.5. Maintenance of ML Models
4.6.6. Security and Data Protection
4.6.7. Scalability, Load Balancing, and Resource Management
4.6.8. Versioning and Tracking of ML Models
4.7. Explainability
4.8. Practical Considerations
4.9. Necessary Steps to Establish the Use of ML in Real Clinical Practice
4.10. Limitations
4.11. Future Research
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A
Study | Features Used in the Machine Learning Model |
---|---|
A predictive Bayesian network model for home management of preeclampsia [13] | Values taken at each of the following gestational week: 12, 16, 20, 24, 28, 32, 36, 38, 40, and 42: Age Smoking Obese Chronic hypertension Parity-history PE Treatment Systolic BP Diastolic BP Hemoglobin Creatinine Protein/creatinine |
Machine learning approach for preeclampsia risk factors association [14] | Duration of completed pregnancy in weeks. Toxemia Education (completed years of schooling) Highest completed year school or degree Pregnancy outcome Labor force status Poverty Water retention/edema Race Anemia Sex Birth order Birth weight One-minute and five-minute APGAR scores Month of pregnancy when prenatal care began Number of prenatal visits Weight gained during pregnancy Medical risk factors for the pregnancy Obstetric procedures performed Delivery complications Congenital anomalies and abnormalities Mother’s marital status Number of live births now living The parents’ age Hispanic origin State/country of birth |
Preeclampsia Prediction Using machine learning and Polygenic Risk Scores From Clinical and Genetic Risk Factors in Early and Late Pregnancies [15] | Maternal age at delivery Self-reported race Relf-reported ethnicity (Hispanic or non-Hispanic) Hospital (tertiary or community) Gravidity Parity Gestational age at delivery Gestational age at preeclampsia diagnosis Last BMI before pregnancy BMI at delivery Maximal diastolic BP during pregnancy Maximal systolic BP during pregnancy Family history of chronic hypertension Family history of preeclampsia Interpregnancy interval In vitro fertilization Multiple gestation Smoking before pregnancy Drugs of abuse before pregnancy Drugs of abuse during pregnancy Alcohol use before pregnancy High-risk pregnancy Maximal BMI before pregnancy Mean BMI in the period 0–14 gestational weeks Systolic BP at first prenatal visit Diastolic BP at first prenatal visit History of pregestational diabetes History of kidney disease before pregnancy History of gestational diabetes in a prior pregnancy History of a prior high-risk pregnancy History of autoimmune disease History of preeclampsia in a prior pregnancy Family history of hypertension Family history of PE Minimal platelets count in the period 0–14 gestational weeks and in pregnancy before preeclampsia diagnosis or delivery Maximal uric acid in the period 0–14 gestational weeks and in pregnancy before preeclampsia diagnosis or delivery Presence of proteinuria in the period 0–14 gestational weeks and in pregnancy before preeclampsia diagnosis or delivery Systolic BP polygenic risk score Small for gestational age or intrauterine growth restriction Last BMI during pregnancy before preeclampsia diagnosis or delivery Maximal BMI before pregnancy Prescription of antihypertensive medication during pregnancy Diagnosis of gestational hypertension during pregnancy |
Performance of a machine learning approach for the prediction of pre-eclampsia in a middle-income country [24] | Maternal age Nulliparity Spontaneous pregnancy Induction of ovulation In vitro fertilization Gestation age at screening Smoker Alcohol intake Other drugs (heroin or cocaine) Pre-existing diabetes Chronic hypertension Lupus Antiphospholipid syndrome Polycystic ovary syndrome Hypothyroidism Congenital heart disease PE in a previous pregnancy Fetal growth restriction in a previous pregnancy Mother of the patient had PE BMI MAP MAP (MoM) UtA-PI UtA-PI (MoM) PlGF PlGF (MoM) PAPP-A Gestational age at delivery |
Validation of machine-learning model for first-trimester prediction of pre-eclampsia using cohort from PREVAL study [25]. Based on the machine learning model trained by Ansbacher-Feldman et al. [26] | Maternal age Maternal weight Maternal height Gestation age at screening Racial origin Medical history: Chronic hypertension Diabetes type I Diabetes type II Systemic lupus erythematosus/antiphospholipid syndrome Smoker Family history of PE Method of conception: Spontaneous In vitro fertilization Use of ovulation drugs Obstetric history: Nulliparous Parous, no previous PE Parous, previous PE Interpregnancy interval Aspirin MAP UtA-PI Serum concentration of pregnancy-associated plasma protein-A (PAPP-A) Serum concentration of PlGF |
An interpretable longitudinal preeclampsia risk prediction using machine learning [16] | Maternal age Self-reported race Self-reported ethnicity (Hispanic or non-Hispanic) Private insurance Public insurance Alcohol use history Smoking history Illicit drugs history Gravidity Parity In vitro fertilization Nulliparous Interpregnancy interval Multiple gestation Maximal systolic BP: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal diastolic BP: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal heart rate: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal BMI: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal weight: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Family history of chronic hypertension Family history of preeclampsia Family history of diabetes Family history of heart disease Family history of hyperlipidemia Family history of stroke Past history of diabetes Past history of gestational diabetes Past history of cesarean delivery Past history of preterm birth Past history of gynecologic surgery Past history of asthma Past history of chronic hypertension Past history of gestational hypertension Past history of high-risk pregnancy Past history of hyperemesis gravidarum Past history of migraine Past history of obesity Past history of PE Past history of pregnancy related fatigue Past history of sexually transmitted disease Chronic hypertension Anemia during pregnancy Headaches during pregnancy Autoimmune disease High-risk pregnancy Hyperemesis gravidarum Pregnancy related fatigue Oligohydramnios: At week 39 and admission Proteinuria: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal aspartate transferase: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal white blood count: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal alanine transaminase: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal serum calcium: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal serum creatinine: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal eosinophils: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal serum glucose: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal hemoglobin: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal lymphocytes: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Maximal platelets: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Minimal red blood count: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission Antihypertensive medications: 0–14 weeks 0–20 weeks 0–24 weeks 0–28 weeks 0–32 weeks 0–36 weeks 0–39 weeks 0 weeks—admission |
Predictive Performance of machine learning-Based Methods for the Prediction of Preeclampsia-A Prospective Study [27] | Maternal age BMI Medium: Urban Rural Parity: Nulliparity Multiparity Smoking status during pregnancy The use of assisted reproductive technologies Personal or family history of PE Personal history of hypertension Personal history of renal disease Personal history of diabetes Personal history of systemic lupus erythematosus/antiphospholipid syndrome Hyperglycemia in pregnancy Obesity Interpregnancy interval MAP (MoM) UtA-PI (MoM) PAPP-A (MoM) PLGF (MoM) Placental protein-13 (MoM) |
Dynamic gestational week prediction model for pre-eclampsia based on ID3 algorithm [17] | Static parameters: Multiple births Spontaneous miscarriage history History of hypertension in pregnancy History of diabetes mellitus Family history of hypertension Preconception BMI Dynamic parameters: Gestational week BMI during pregnancy Systolic BP Diastolic BP Pulse pressure MAP Pulse waveform area parameters Cardiac output Cardiac index Total peripheral resistance Hematocrit Mean platelet volume Platelet count Alanine aminotransferase Aspartate aminotransferase Creatinine Uric acid PlGF |
Development of a prediction model on preeclampsia using machine learning-based method: a retrospective cohort study in China [18] | Maternal age Height Weight BMI Parity Method of conception Previous diagnosis of hypertension History of diabetes mellitus History of gestational diabetes History of PE History of fetal growth restriction MAP β-human chorionic gonadotropin PAPP-A Gestational age at screening Chronic hypertension Left uterine artery PI Right uterine artery PI Mean uterine artery PI |
Novel electronic health records applied for prediction of pre-eclampsia: Machine-learning algorithms [19] | All features: Maternal age BMI Mean BP Maternal abdominal circumference Gravidity Parity PE in a previous pregnancy Prior cesarean delivery Pregnancy interval Nulliparity Multifetal gestations Assisted reproductive technology Pre-pregnancy diabetes Heart disease Thyroid disease Renal disease Autoimmune diseases Mental disorder Uterine leiomyoma Adenomyosis Uterine malfunctions History of seizure disorder Family history of hypertension Hemoglobin White blood cell count Platelet counts Direct bilirubin Total bilirubin Alanine aminotransferase Γ-glutamyl transferase Total protein Albumin Globulin Fasting plasma glucose Total bile acid Creatinine Serum urea nitrogen Serum uric acid Baseline risk features: Nulliparity Multifetal gestations PE in a previous pregnancy Pre-gestational diabetes BMI Maternal age Assisted reproductive technology Kidney diseases Autoimmune diseases Questionnaire features: Family history of hypertension Nulliparity Prior cesarean delivery Pregnancy interval Multifetal gestations Assisted reproductive technology Gravidity Parity Pre-gestational diabetes Heart disease Thyroid disease Renal disease Autoimmune diseases Mental disorder Uterine leiomyoma Adenomyosis Uterine malfunctions History of seizure disorder Maternal age BMI |
Early prediction of preeclampsia via machine learning [28] | Maternal age Height weight Blood pressure: Mean systolic Mean diastolic Maximum systolic Maximum diastolic Race Ethnicity: Hispanic Non-Hispanic unknown Gravida: Nulliparous Multiparous Number of babies Medical history: PE Assisted reproductive treatment Chronic hypertension Diabetes (type I or type II) Obesity Renal disease Autoimmune conditions: Systemic lupus erythematosus Discoid lupus erythematosus Systemic sclerosis Rheumatoid arthritis Dermatomyositis Polymyositis Undifferentiated connective tissue disease Celiac disease Antiphospholipid syndrome Sexually transmitted diseases (human papillomavirus, chlamydia, genital herpes) Hyperemesis gravidarum Headache Migraine Poor obstetrics history Poor obstetrics history Medical history at 17 weeks of gestation: Gestational diabetes Anemia High-risk pregnancy Routine prenatal laboratory results: Protein from urine Glucose from urine Platelet count Red blood cells White blood cells Creatinine Hemoglobin Hematocrit Monocytes Lymphocytes Eosinophils Neutrophils Basophils Blood type with Rh Uric acid Rubella Varicella Hepatitis B Syphilis Chlamydia Gonorrhea Intake of medication: Aspirin Nifedipine Aldomet Labetalol Insulin Glyburide Prednisone Azathioprine Plaquenil Heparin Levothyroxine Doxylamine Acyclovir |
Clinical risk assessment in early pregnancy for preeclampsia in nulliparous women: A population based cohort study [30] | Multivariable regression model: Family history of PE Country of birth Method of conception Gestational length Maternal age Height Weight Smoking in early pregnancy Pre-existing diabetes mellitus Chronic hypertension Systemic lupus erythematosus MAP Backward selection model and RF model: Gestational length first examination in weeks Maternal age BMI MAP Capillary glucose Protein in urine Hemoglobin Previous miscarriage Previous ectopic pregnancy Infertility duration Family situation: Single Living together with partner Other Region of birth: Sweden Nordic countries (except Sweden) Europe (except of Nordic countries) Africa North America South America Asia Oceania Smoking 3 months before pregnancy Smoking at registration Snuff 3 months before pregnancy Snuff at registration Alcohol consumption three months before registration Alcohol consumption at registration Family history of PE Family history of hypertension Infertility: Without treatment Ovary simulation In vitro fertilization Cardiovascular disease Endocrine disease Pre-existing diabetes Thrombosis Psychiatric disease systemic lupus erythematosus Epilepsy Chronic hypertension Morbus Chron/ulcerous colitis Lung disease or asthma Chronic kidney disease Hepatitis Gynecological disease or operation Recurrent urinary tract infections Blood group |
Artificial intelligence-assisted prediction of preeclampsia: Development and external validation of a nationwide health insurance dataset of the BPJS Kesehatan in Indonesia [20] | Demographic: Age Marriage Family role Member strata Member type International Classification of Diseases 10th Revision coded diagnoses: A codes B codes C codes D codes E codes F codes G codes H codes I codes J codes K codes L codes M codes N codes Infection-related codes: G0, H00, H01, H10, H15, H16, H20, H30, H60, H65, H66, H67, H68, H70, I0, J0, J1, J2, J40, J41, J42, J85, J86, K12, K2, K35, K36, K37, K5, K65, K67, K73, K80, K81, L0, M00, M01, M02, N7 Immune-related codes: B20, D8, E10, G35, G61, G70, I0, J30, J31, J32, J35, J45, L2, L50, M04, M05, M06, M15, M16, M17, M18, M19, M3, M65, N00, N01, N03, N04 Nervous system-related codes: A8, C7, G Eye-related codes: C69, H0, H1, H2, H3, H4, H5 Ear-related codes: C30, D02, H6, H7, H8, H9 Heat-related codes: C38, I2, I3, I4, I5 Respiratory system-related codes: A1, C0, C3, J Digestive system-related codes: A0, C0, C1, K0, K1, K3, K4, K5, K6 Skin and subcutaneous-related codes: B0, B1, B8, C43, C44, L Musculoskeletal system-related codes: C40, C41, M Urinary system-related codes: C64, C65, C66, C67, C68, N0, N1, N2, N3 Reproduction system-related codes: A5, A60, A61, A62, A63, A64, C51, C52, C53, C54, C55, C56, C57, C58, N7, N8 Liver and pancreas-related codes: B15, B16, B17, B19, C22, C23, C24, C25, K7, K8 Breast-related codes: C50, N6 Vascular-related codes: I1, I7, I8 |
Ethnicity as a Factor for the Estimation of the Risk for Preeclampsia: A Neural Network Approach [21] | MAP Uterine pulsatility index PAPP-A Ethnicity Weight Height Smoking Alcohol consumption Previous PE Conception: Spontaneous Ovulation drug In vitro fertilization Medical condition of pregnant woman Drugs taken by the pregnant woman Gestation age Crown rump length Mother had PE |
An early screening model for preeclampsia: utilizing zero-cost maternal predictors exclusively [29] | Pre-gestational BMI Age Height Chronic hypertension Nausea and vomiting in pregnancy Previous PE Gravidity Pre-gestational diabetes Multifetal pregnancy Menstrual cycle irregularity Previous miscarriage Scarred uterus Previous stillbirth Family history of hypertension Chronic renal disease Assisted reproductive technology |
An imbalance-aware deep neural network for early prediction of preeclampsia [22] | Dataset 1: The International Classification of Diseases, 9th Revision, Clinical Modification: Obesity (V853, V854, 27800, 27801, 27803, 6491) Pregnancy resulting from assisted reproductive technology (V2385) Cocaine dependence (3042, 3056) Amphetamine dependence (3044, 3057) Gestational diabetes (6488) Pre-existing diabetes (250, 6480) Anxiety (3000) Anemia NOS (2859) Iron deficiency anemia (280) Other anemia (281) Depression (311) Primigravida at the extremes of maternal age (6595, V2381, V2383) Hemorrhagic disorders due to intrinsic circulating antibodies (2865) Systemic lupus erythematosus (7100) Lupus erythematosus (6954) Autoimmune disease not elsewhere classified (27949) Pure hypercholesterolemia (2720) Unspecified vitamin D deficiency (2689) Proteinuria (7910) Tobacco use disorder (3051, 6490) History of tobacco use (V1582) Hypertension (401) Hypertensive heart disease (402) Chronic venous hypertension (4593) Unspecified renal disease in pregnancy without mention of hypertension (404) Chronic kidney disease (585) Hypertensive kidney disease (403) Hypertensive heart and chronic kidney disease (404) Renal failure not elsewhere classified (586) Infections of genitourinary tract in pregnancy (6466) Urinary tract infection (5990) Pernal history of trophoblastic disease (V131) Supervision of high-risk pregnancy with history of trophoblastic disease (V231) Thrombophilia (28981) History of premature delivery (V1321) Hemorrhage in early pregnancy (640) Congenital abnormalities of the uterus including those complicating pregnancy, childbirth, or the puerperium (6540, 7522, 7523) Multiple gestations (651) Fetal growth restriction (764) Asthma (493) Obstructive sleep apnea (32723) Other cardiovascular diseases complicating pregnancy and childbirth or the puerperium (6486) Sickle cell disease (28260) Thyroid disease (240, 241, 242, 243, 244, 245, 246) Inadequate prenatal care (V237) Periodontal disease (523) Preeclampsia/eclampsia (6424, 6425, 6426, 6427) Dataset 2: The International Classification of Diseases, 10th Revision, Clinical Modification: The International Classification of Diseases, 9th Revision, Clinical Modification: Obesity (E66, O9921, O9981, O9984, Z683, Z684, Z713, Z9884) Pregnancy resulting from assisted reproductive technology (O0981) Cocaine dependence (F14, T405) Amphetamine dependence (F15, F19, P044, T4362) Gestational diabetes (O244, P700) Pre-existing diabetes (E10, E11, O240, O241, O243, O248, O249) Anxiety (F064, F41) Anemia NOS (D51) Iron deficiency anemia (D50) Other anemia (D64, D59, D489, D53, O990) Depression (F32, F341, F33, F0631, Z139, Z1331, Z1332) Primigravidas at the extremes of maternal age (O095, O096) Hemorrhagic disorders due to intrinsic circulating antibodies (D683) Systemic lupus erythematosus (M32) Lupus erythematosus (L93, D6862) Autoimmune disease not elsewhere classified (D89) Pure hypercholesterolemia (E780) Unspecified vitamin D deficiency (E55) Proteinuria (D511, N06, O121, O122, R80) Current smoker (F172) Hypertension (G932, I10, I14, I15, I272, I674, I973, O10, O13, O16, R030) Hypertensive heart disease (I11) Chronic venous hypertension (I873) Unspecified renal disease in pregnancy without mention of hypertension (O2683, O9089) Chronic kidney disease (D631, E0822, E0922, E1922, E1122, E1322, N18) Hypertensive kidney disease (I12) Hypertensive heart and chronic kidney disease (I13) Renal failure not elsewhere classified (N19) Infections of genitourinary tract in pregnancy (O23, O861, O862, O868) Urinary tract infection (O0338, O0388, O0488, O0788, O0883, N136, N390, N99521, N99531) Pernal history of trophoblastic disease (Z8759, O01) Supervision of high-risk pregnancy with history of throphoblastic disease (O091) Thrombophilia (D685, D686) History of premature delivery (Z8751) Hemorrhage in early pregnancy (O20) Congenital abnormalities of the uterus including those complicating pregnancy, childbirth, or the puerperium (O34, O340) Multiple gestations (O30) Fetal growth restriction (O093) Other cardiovascular diseases complicating pregnancy and childbirth or the puerperium (O9943) Sickle cell disease (D57) Thyroid disease (E00, E01, E02, E03, E04, E05, E06, E07) Inadequate prenatal care (O093) Periodontal disease (E08630, E09630, E10630, E11630, E13630, K05, K06, K08129) Preeclampsia/eclampsia (O14, O15)Dataset 3: Acute renal failure Asthma Autoimmune diseases Bacterial vaginosis Chronic kidney disease CNS abnormality (spina bifida, congenital hydrocephalus, multiple diagnostic codes, microcephaly, other congenital illness) Chlamydia Chronic hypertension Cocaine Condylomata Congenital syphilis Depression Diabetes (unspecified prior diabetes, type I, type II, gestational diabetes, none) Gestational hypertension Gonococcal infection Group B streptococcus Heart failure Hemorrhagic disorder Hepatitis B infection Maternal herpes infection or history of herpes Personal history of trophoblastic disease History of infertility History of premature delivery High-risk pregnancy with history of trophoblastic disease Hyperemesis gravidarum Periodontal disease Previous cesarean Primigravida Proteinuria Repeat cesarean Sickle cell anemia with crisis Internal injuries of thorax, abdomen, and pelvis Thrombocytopenia (other, disseminated intravascular coagulation, gestational, none) Thrombophilia Kidney disease (lupus nephritis, pyelonephritis, glomerulonephritis, transplant, nephrotic syndrome, nephrolithiasis, multiple diagnostic codes, other, none) Anemia without hemoglobinopathy (folate deficiency anemia, unspecified anemia, B2 deficiency anemia, iron deficiency anemia, none) Collagen vascular disease (multiple diagnostic codes, rheumatoid arthritis, lupus, none) Hemoglobinopathy (hemoglobin-SC, alpha thalassemia, beta thalassemia, hemoglobin-SS, hemoglobin-Sthal, sickle cell trait, none) Maternal liver, gall bladder, or pancreatic illness (hepatitis A, liver transplant, pancreatitis, other, hepatitis B, cholelithiasis, hepatitis C, none) Structural heart disease (artificial valves, myocarditis/cardiomyopathy, rheumatic heart disease, other, valve disorder, congenital heart disease, none) Marijuana use Maternal neuromuscular disease (cerebral palsy, myotonic dystrophy, myasthenia gravis, multiple sclerosis, none) Operations on heart and pericardium Opioid abuse Other substance abuse (hallucinogens, sedatives/hypnotics/anxiolytics, stimulants, anti-depressants/other psychoactive, alcohol, multiple diagnostic codes, other, none) Total number of pregnancies Deliveries prior to admission Total abortions MAP Previous incidents of high blood pressure |
Prediction model development of late-onset preeclampsia using machine learning-based methods [31] | Systolic blood pressure Serum blood urea nitrogen Serum creatinine Platelet counts Potassium White blood cell Calcium Spot urine protein to creatinine ratio Aspartate transaminase Magnesium TCO2 Alanine transaminase Urine albumin to creatinine ratio |
Neural networks to estimate the risk for preeclampsia occurrence [23] | MAP Uterine pulsatility index Serum PAPP-A Ethnicity Weight Height Smoking Alcohol consumption Previous PE Method of conception: Spontaneous Ovulation drug In vitro fertilization Medical condition Drugs taken Gestational age in days Crown rump length Mother of the pregnant woman’s history of PE |
Appendix B
Study | Dataset | Size | Balance | Data Splitting | Quality | Class Imbalance | Performance |
---|---|---|---|---|---|---|---|
[13] | Test set | 417 | Imbalanced: 7.9% PE | - | NA | NA | Non-standardized reporting used, but is interpreted to be TP and FP. Does not highlight metric trade-offs but interpret their results. |
[14] | Training set | 1634 | Imbalanced: 16.5% PE | Leave one out CV | Missing data were replaced with the mode (categorical) and average (numerical). | NA | The performance is based on CV. Used standardized performance metrics. Does not interpret the results of the different metrics. |
[15] | Training set | 1125 | Imbalanced: 7.8% PE | 5-fold CV | Missing data were replaced with the mean (continuous variables) and assumed false (binary variables) | NA | The performance is based on CV. Using standardized metrics. Does not interpret the results of the different metrics. |
[16] | Training set | 98,241 | NA | 5-fold CV | NA | NA | - |
Validation set | 22,511 | NA | NA | NA | - | ||
Test set | 7705 | Imbalanced: 5.9% PE | - | NA | NA | Used standardized performance metrics. Does not interpret the results of the different metrics. | |
[17] | Training set | 1272 | Imbalanced: 18% PE | 70%/30% random split performed 20 times | NA | NA | - |
Validation set | 546 | Imbalanced: 26% PE | NA | Created a weighted average to be used for imbalanced dataset. | The performance is based on the average of the 20 iterations and reported as macro average and weighted average. Used standardized performance metrics. Does not highlight metric trade-offs or interpret the results of the different metrics used. | ||
[18] | Training set | 9945 | Imbalanced: 1.3% PE | 10-fold CV. | Standardized the data and removed patients with missing data. | Use the synthetic minority over-sampling technique. Leaving out features with no occurrence within the PE group. - | - |
Validation set | 1105 | The performance is based on CV. Used standardized metrics. Interpret the performance metrics. | |||||
[19] | Training set | NA | Imbalanced: 5.1% PE | Split by time, where the first part was used to train using a nested CV (5-fold CV in both an outer and inner loop), second part used for validation set. | Used complete data. | The weight assignment of 1:19 ratio was used. | - |
Temporal validation set | NA | - | Used standardized performance metrics. Does not interpret the results of the different metrics. | ||||
[20] | Training set | 20,975 | Imbalanced: 14.6% PE | 10-fold CV and 10-time bootstrapped external validation. | Normalized continuous variables | Using geographical and temporal randomization to avoid ethnic and seasonal effect on preeclampsia. Naïve random sampling was used to oversample the minority outcome with replacement. | - |
Geographical split—external validation set | 1322 | Imbalanced: 11% PE | - | - | - | Used standardized performance metrics. Does not highlight metric trade-offs or interpret the results of the different metrics. | |
Temporal split—external validation set | 904 | Imbalanced: 13.2% PE | - | - | - | Used standardized performance metrics. Does not highlight metric trade-offs or interpret the results of the different metrics. | |
[21] | Training set | 6793 | Imbalanced: 1.7% PE | NA | NA | NA | - |
Test set | 36 | Balanced: 44% PE | NA | - | - | - | |
Validation set | 9 | Balanced: 56% PE | - | - | - | Non-standardized reporting used. Does not highlight metric trade-offs but interpret their results. | |
[22] | Training set | 360,943 | Imbalanced: 4% PE | 10-fold CV repeated 5 times. | Continuous variables were normalized. Multiple Imputation technique using Bayesian ridge regression was used to estimate missing values. Used a 20% drop-out rate in training. | Used weighted cross-entropy loss and alpha-balanced focal loss function, respectively, on a deep neural network | Performance is based on CV. Used standardized performance metrics. Does not interpret the results of the different metrics. |
Training set | 84,632 | Imbalanced: 5.6% PE | 10-fold CV repeated 5 times. | Continuous variables were normalized. Multiple Imputation technique using Bayesian ridge regression was used to estimate missing values. Used a 20% drop-out rate in training. | Used weighted cross-entropy loss and alpha-balanced focal loss function, respectively, on a deep neural network | Performance is based on CV. Used standardized performance metrics. Does not interpret the results of the different metrics. | |
Training set | 31,431 | Imbalanced: 8.7% PE | 10-fold CV repeated 5 times. | Normalized numerical and continuous variables. Features missing in >20% were removed and local outlier factor was used to remove outliers. Multiple Imputation technique using Bayesian ridge regression was used to estimate missing values. Used a 20% drop-out rate in training. | Used weighted cross-entropy loss and alpha-balanced focal loss function, respectively, on a deep neural network | Performance is based on CV. Used standardized performance metrics. Does not interpret the results of the different metrics. | |
[23] | Training set | 6793 | Imbalanced: 1.4% PE | NA | NA | NA | - |
Test set | 36 | Balanced: 44% PE | NA | - | - | - | |
Validation set | 9 | Balanced: 56% PE | - | - | - | Non-standardized reporting used. Does not highlight metric trade-offs but interpret their results. | |
[24] | Training set | 1068 | Imbalanced: 4.1% PE | Used fifty iterations to identify a stable model | Biomarker values were normalized to multiples of the median. Excluded women taking aspirin. | NA | - |
Validation set | 914 | - | - | NA | - | ||
Test set | 1068 | - | - | - | Used standardized performance metrics. Does not highlight metric trade-offs or interpret the results of the different metrics. | ||
[25] | External validation set | 10,110 | Imbalanced: 2.3% PE | - | Categorical values were not normalized but used one-hot encoding. Scaling of the PlGF values were performed to obtain similar average and variance between the PlGF analyzing machines. | - | Used standardized performance metrics. Does not highlight metric trade-offs or interpret the results of the different metrics. |
[27] | Training set | 70 | Balanced: 50% PE | 5-fold CV | NA | NA | - |
Validation set | 163 | - | - | Used standardized performance metrics. Does not interpret the results of the different metrics. | |||
[28] | Training set | 5245 | Imbalanced: 10.7% PE | 4-fold CV repeated 5 times | Missing categorical values was made into a new category. Missing numeric values were replaced with the mean | NA | The performance is based on CV. Used standardized performance metrics. Does not highlight metric trade-offs but interpret the results of the metrics. |
[29] | Training set | 25,709 | Imbalanced: 6.4% PE | 5-fold CV | Features with missing values were removed. | Used the different variants of random under sampling and Gaussian mixture model to find the best solution for class imbalance. | - |
External validation set | 1760 | Imbalanced: 9% PE | - | - | - | Used standardized performance metrics. Does not interpret the results of the different metrics. | |
[30] | Training set | 62,562 | Imbalanced: 4.4% PE | 10-fold CV and bootstrap | Used single-chained imputation for missing values replacing with the mean | NA | The performance is based on CV. Used standardized performance metrics. Does not highlight metric trade-offs or interpret the results of the different metrics. |
[31] | Training set | 7704 | Imbalanced: 4.7% PE | Random split | Used multiple imputation | NA | - |
Validation set | 3302 | - | - | - | Used standardized performance metrics. Does not highlight metric trade-offs or interpret the results of the different metrics. |
References
- Duley, L. The Global Impact of Pre-eclampsia and Eclampsia. Semin. Perinatol. 2009, 33, 130–137. [Google Scholar] [CrossRef] [PubMed]
- Magee, L.A.; Nicolaides, K.H.; von Dadelszen, P. Preeclampsia. N. Engl. J. Med. 2022, 386, 1817–1832. [Google Scholar] [CrossRef] [PubMed]
- Van Doorn, R.; Mukhtarova, N.; Flyke, I.P.; Lasarev, M.; Kim, K.; Hennekens, C.H.; Hoppe, K.K. Dose of aspirin to prevent preterm preeclampsia in women with moderate or high-risk factors: A systematic review and meta-analysis. PLoS ONE 2021, 16, e0247782. [Google Scholar] [CrossRef] [PubMed]
- O’Gorman, N.; Wright, D.; Syngelaki, A.; Akolekar, R.; Wright, A.; Poon, L.C.; Nicolaides, K.H. Competing risks model in screening for preeclampsia by maternal factors and biomarkers at 11–13 weeks gestation. Am. J. Obstet. Gynecol. 2016, 214, e1–e103. [Google Scholar] [CrossRef] [PubMed]
- Hackelöer, M.; Schmidt, L.; Verlohren, S. New advances in prediction and surveillance of preeclampsia: Role of machine learning approaches and remote monitoring. Arch. Gynecol. Obstet. 2023, 308, 1663–1677. [Google Scholar] [CrossRef]
- Wright, D.; Gallo, D.M.; Gil Pugliese, S.; Casanova, C.; Nicolaides, K.H. Contingent screening for preterm pre-eclampsia. Ultrasound Obstet. Gynecol. 2016, 47, 554–559. [Google Scholar] [CrossRef]
- Ranjbar, A.; Montazeri, F.; Ghamsari, S.R.; Mehrnoush, V.; Roozbeh, N.; Darsareh, F. Machine learning models for predicting preeclampsia: A systematic review. BMC Pregnancy Childbirth 2024, 24, 6. [Google Scholar] [CrossRef]
- Brunelli, V.B.; Prefumo, F. Quality of first trimester risk prediction models for pre-eclampsia: A systematic review. BJOG 2015, 122, 904–914. [Google Scholar] [CrossRef]
- Bertini, A.; Salas, R.; Chabert, S.; Sobrevia, L.; Pardo, F. Using Machine Learning to Predict Complications in Pregnancy: A Systematic Review. Front. Bioeng. Biotechnol. 2022, 9, 780389. [Google Scholar] [CrossRef]
- Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. Syst. Rev. 2021, 10, 89. [Google Scholar] [CrossRef]
- Ao, C.; Zhou, W.; Gao, L.; Dong, B.; Yu, L. Prediction of antioxidant proteins using hybrid feature representation method and random forest. Genomics 2020, 112, 4666–4674. [Google Scholar] [CrossRef] [PubMed]
- Moons, K.G.; Wolff, R.F.; Riley, R.D.; Whiting, P.F.; Westwood, M.; Collins, G.S.; Reitsma, J.B.; Kleijnen, J.; Mallett, S. PROBAST: A tool to assess risk of bias and applicability of prediction model studies: Explanation and elaboration. Annals of Internal Medicine. Am. Coll. Physicians 2019, 170, W1–W33. [Google Scholar]
- Velikova, M.; Lucas, P.J.F.; Spaanderman, M. A Predictive Bayesian Network Model for Home Management of Preeclampsia. In Artificial Intelligence in Medicine AIME 2011 Lecture Notes in Computer Science; Peleg, M., Lavrač, N., Combi, C., Eds.; Springer: Berlin/Heidelberg, Germany, 2011; Volume 6747, pp. 179–183. [Google Scholar]
- Martínez-Velasco, A.; Martínez-Villaseñor, L.; Miralles-Pechuán, L. Machine learning approach for pre-eclampsia risk factors association. In Proceedings of the 4th EAI International Conference on Smart Objects and Technologies for Social Good, Bologna, Italy, 28–30 November 2018; Association for Computing Machinery: New York, NY, USA, 2018; pp. 232–237. [Google Scholar]
- Kovacheva, V.P.; Eberhard, B.W.; Cohen, R.Y.; Maher, M.; Saxena, R.; Gray, K.J. Preeclampsia Prediction Using Machine Learning and Polygenic Risk Scores from Clinical and Genetic Risk Factors in Early and Late Pregnancies. Hypertension 2024, 81, 264–272. [Google Scholar] [CrossRef] [PubMed]
- Eberhard, B.W.; Cohen, R.Y.; Rigoni, J.; Bates, D.W.; Gray, K.J.; Kovacheva, V.P. An Interpretable Longitudinal Preeclampsia Risk Prediction Using Machine Learning. medRxiv [Internet]. 2023 Aug 16. Available online: http://www.ncbi.nlm.nih.gov/pubmed/37645797 (accessed on 22 February 2024).
- Li, Z.; Xu, Q.; Sun, G.; Jia, R.; Yang, L.; Liu, G.; Hao, D.; Zhang, S.; Yang, Y.; Li, X.; et al. Dynamic gestational week prediction model for pre-eclampsia based on ID3 algorithm. Front. Physiol. 2022, 13, 1035726. [Google Scholar] [CrossRef]
- Liu, M.; Yang, X.; Chen, G.; Ding, Y.; Shi, M.; Sun, L.; Huang, Z.; Liu, J.; Liu, T.; Yan, R.; et al. Development of a prediction model on preeclampsia using machine learning-based method: A retrospective cohort study in China. Front. Physiol. 2022, 13, 896969. [Google Scholar] [CrossRef]
- Li, Y.X.; Shen, X.P.; Yang, C.; Cao, Z.Z.; Du, R.; Wang, J.P.; Wang, M. Novel electronic health records applied for prediction of pre-eclampsia: Machine-learning algorithms. Pregnancy Hypertens. 2021, 26, 102–109. [Google Scholar] [CrossRef]
- Sufriyana, H.; Wu, Y.W.; Su, E.C.Y. Artificial intelligence-assisted prediction of preeclampsia: Development and external validation of a nationwide health insurance dataset of the BPJS Kesehatan in Indonesia. EBioMedicine 2020, 54, 102710. [Google Scholar] [CrossRef]
- Neocleous, C.; Nicolaides, K.; Neokleous, K.; Schizas, C. Ethnicity as a Factor for the Estimation of the Risk for Preeclampsia: A Neural Network Approach. In Proceedings of the Artificial Intelligence: Theories, Models and Applications: 6th Hellenic Conference on AI, SETN 2010, Athens, Greece, 4–7 May 2010; LNAI; Springer: Berlin/Heidelberg, Germany, 2010; Volume 6040. [Google Scholar]
- Bennett, R.; Mulla, Z.D.; Parikh, P.; Hauspurg, A.; Razzaghi, T. An imbalance-aware deep neural network for early prediction of preeclampsia. PLoS ONE 2022, 17, e0266042. [Google Scholar] [CrossRef]
- Neocleous, C.K.; Anastasopoulos, P.; Nikolaides, K.H.; Schizas, C.N.; Neokleous, K.C. Neural networks to estimate the risk for preeclampsia occurrence. In Proceedings of the IEEE International Joint Conference on Neural Networks, Atlanta, GA, USA, 14–19 June 2009; pp. 2221–2224. [Google Scholar]
- Torres-Torres, J.; Villafan-Bernal, J.R.; Martinez-Portilla, R.J.; Hidalgo-Carrera, J.A.; Estrada-Gutierrez, G.; Adalid-Martinez-Cisneros, R.; Rojas-Zepeda, L.; Acevedo-Gallegos, S.; Camarena-Cabrera, D.M.; Cruz-Martínez, M.Y.; et al. Performance of a machine learning approach for the prediction of pre-eclampsia in a middle-income country. Ultrasound Obstet. Gynecol. 2024, 63, 350–357. [Google Scholar] [CrossRef]
- Gil, M.M.; Cuenca-Gómez, D.; Rolle, V.; Pertegal, M.; Díaz, C.; Revello, R.; Adiego, B.; Mendoza, M.; Molina, F.S.; Santacruz, B.; et al. Validation of machine-learning model for first-trimester prediction of pre-eclampsia using cohort from PREVAL study. Ultrasound Obstet. Gynecol. 2024, 63, 68–74. [Google Scholar] [CrossRef]
- Ansbacher-Feldman, Z.; Syngelaki, A.; Meiri, H.; Cirkin, R.; Nicolaides, K.H.; Louzoun, Y. Machine-learning-based prediction of pre-eclampsia using first-trimester maternal characteristics and biomarkers. Ultrasound Obstet. Gynecol. 2022, 60, 739–745. [Google Scholar] [CrossRef] [PubMed]
- Melinte-Popescu, A.S.; Vasilache, I.A.; Socolov, D.; Melinte-Popescu, M. Predictive Performance of Machine Learning-Based Methods for the Prediction of Preeclampsia—A Prospective Study. J. Clin. Med. 2023, 12, 418. [Google Scholar] [CrossRef] [PubMed]
- Marić, I.; Tsur, A.; Aghaeepour, N.; Montanari, A.; Stevenson, D.K.; Shaw, G.M.; Winn, V.D. Early prediction of preeclampsia via machine learning. Am. J. Obstet. Gynecol. MFM 2020, 2, 100100. [Google Scholar] [CrossRef] [PubMed]
- Wang, L.; Ma, Y.; Bi, W.; Meng, C.; Liang, X.; Wu, H.; Zhang, C.; Wang, X.; Lv, H.; Li, Y. An early screening model for preeclampsia: Utilizing zero-cost maternal predictors exclusively. Hypertens. Res. 2024, 47, 1051–1062. [Google Scholar] [CrossRef]
- Sandström, A.; Snowden, J.M.; Höijer, J.; Bottai, M.; Wikström, A.K. Clinical risk assessment in early pregnancy for preeclampsia in nulliparous women: A population based cohort study. PLoS ONE. 2019, 14, e0225716. [Google Scholar] [CrossRef]
- Jhee, J.H.; Lee, S.; Park, Y.; Lee, S.E.; Kim, Y.A.; Kang, S.-W.; Kwon, J.-Y.; Park, J.T. Prediction model development of late-onset preeclampsia using machine learning-based methods. PLoS ONE 2019, 14, e0221202. [Google Scholar] [CrossRef]
- Zhou, Y.; Tu, F.; Sha, K.; Ding, J.; Chen, H. A Survey on Data Quality Dimensions and Tools for Machine Learning. arXiv 2024, arXiv:2406.19614. [Google Scholar]
- James, G.; Witten, D.; Hastie, T.; Tibshirani, R. Statistical Learning. In An Introduction to Statistical Learning with Applications in R, 2nd ed.; Springer: New York, NY, USA, 2023; pp. 15–58. [Google Scholar]
- Hardt, M.; Blum, A. The Ladder: A Reliable Leaderboard for Machine Learning Competitions. In Proceedings of the 32nd International Conference on Machine Learning, Lille, France, 6–11 July 2015; pp. 1006–1014. [Google Scholar]
- Kabongo, S.; D’Souza, J.; Auer, S. ORKG-Leaderboards: A systematic workflow for mining leaderboards as a knowledge graph. Int. J. Digit. Libr. 2024, 25, 41–54. [Google Scholar] [CrossRef]
- James, G.; Witten, D.; Hastie, T.; Tibshirani, R. Linear Regression. In An Introduction to Statistical Learning with Applications in R, 2nd ed.; Springer: New York, NY, USA, 2023; pp. 59–128. [Google Scholar]
- Lazdam, M.; de la Horra, A.; Diesch, J.; Kenworthy, Y.; Davis, E.; Lewandowski, A.J.; Szmigielski, C.; Shore, A.; Mackillop, L.; Kharbanda, R.; et al. Unique Blood Pressure Characteristics in Mother and Offspring after Early Onset Preeclampsia. Hypertension 2012, 60, 1338–1345. [Google Scholar] [CrossRef]
- Macdonald-Wallis, C.; Lawlor, D.A.; Fraser, A.; May, M.; Nelson, S.M.; Tilling, K. Preeclampsia Blood Pressure Change in Normotensive, Gestational Hypertensive, Preeclamptic, and Essential Hypertensive Pregnancies. Hypertension 2012, 59, 1241–1248. [Google Scholar] [CrossRef]
- Sculley, D.; Holt, G.; Golovin, D.; Davydov, E.; Phillips, T.; Ebner, D.; Chaudhary, V.; Young, M.; Crespo, J.-F.; Dennison, D. Hidden Technical Debt in Machine Learning Systems. Adv. Neural Inf. Process Syst. 2015, 2, 2503–2511. [Google Scholar]
- Baylor, D.; Breck, E.; Cheng, H.-T.; Fiedel, N.; Foo, C.Y.; Haque, Z.; Haykal, S.; Ispir, M.; Jain, V.; Koc, L. TFX: A TensorFlow-based production-scale machine learning platform. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, Halifax, NS, Canada, 13–17 August 2017; pp. 1387–1395. [Google Scholar]
- Cheng, Y. Kubeflow: A Machine Learning Platform on Kubernetes. In Proceedings of the 21st International Middleware Conference, Delft, The Netherlands, 7–11 December 2020. [Google Scholar]
- Villarmizar, M. Evaluating the Impact of Cloud Resource Auto-scaling in Containerized Microservices. In Proceedings of the IEEE 7th International Conference on Cloud Computing Technology and Science, Vancouver, BC, Canada, 30 November–3 December 2015. [Google Scholar]
- Breck, E.; Cai, S.; Nielsen, E.; Salib, M.; Sculley, D. The ML Test Score: A Rubric for ML Production Readiness and Technical Debt Reduction. In Proceedings of the IEEE International Conference on Big Data (BIGDATA), Boston, MA, USA, 11–14 December 2017; pp. 1123–1132. [Google Scholar]
- Chen, Z. Continuous Integration and Continuous Deployment for Machine Learning. IEEE Trans Emerg Top Comput. 2021. [Google Scholar]
- Zaharia, M.; Chen, A.; Davidson, A.; Ghodsi, A.; Hong, S.A.; Konwinski, A.; Murching, S.; Nykodym, T.; Ogilvie, P.; Parkhe, M.; et al. Accelerating the Machine Learning Lifecycle with MLflow. IEEE Data Eng. Bull. 2018, 41, 39–45. [Google Scholar]
- Goodfellow, I.J.; Shlens, J.; Szegedy, C. Explaining and Harnessing Adversarial Examples. arXiv 2015, arXiv:14126572. [Google Scholar]
- Voigt, P.; von dem Bussche, A. The EU General Data Protection Regulation (GDPR): A Practical Guide [Internet]; Springer International Publishing: Cham, Switzerland, 2017; Available online: http://link.springer.com/10.1007/978-3-319-57959-7 (accessed on 19 September 2024).
- Villamizar, M. Microservices in the Cloud: Load Balancing Patterns and Best Practices. IEEE Cloud Comput. 2015, 2, 50–59. [Google Scholar]
- Amazon Web Services. AWS Auto Scaling. Available online: https://aws.amazon.com/autoscaling/ (accessed on 19 September 2024).
- Abadi, M.; Barham, P.; Chen, J.; Chen, Z.; Davis, A.; Dean, J.; Devin, M.; Ghemawat, S.; Irving, G.; Isard, M.; et al. TensorFlow: A System for Large-Scale Machine Learning. In Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation. USENIX Association, Savannah, GA, USA, 2–4 November 2016; p. 786. [Google Scholar]
- Hummer, W.; Muthusamy, V.; Rausch, T.; Dube, P.; El Maghraoui, K.; Murthi, A. ModelOps: Cloud-based lifecycle management for reliable and trusted AI. In Proceedings of the 2019 IEEE International Conference on Cloud Engineering, IC2E 2019, Prague, Czech Republic, 24–27 June 2019; Institute of Electrical and Electronics Engineers Inc.: Piscataway, NJ, USA, 2019; pp. 113–120. [Google Scholar]
- Oprescu, A.; Miró-Amarante, G.; García-Díaz, L.; Rey, V.; Chimenea-Toscano, A.; Martínez-Martínez, R.; Romero-Ternero, M. Towards a data collection methodology for Responsible Artificial Intelligence in health: A prospective and qualitative study in pregnancy. Inf. Fusion 2022, 83–84, 53–78. [Google Scholar] [CrossRef]
- High-Level Expert Group on Artificial Intelligence. Ethics guidelines for trustworthy AI [Internet]. European Commission. 2019. Available online: https://digital-strategy.ec.europa.eu/en/library/ethics-guidelines-trustworthy-ai (accessed on 28 August 2024).
- Oprescu, A.M.; Miró-Amarante, G.; García-Díaz, L.; Beltrán, L.M.; Rey, V.E.; Romero-Ternero, M. Artificial intelligence in pregnancy: A scoping review. IEEE Access 2020, 8, 181450–181484. [Google Scholar] [CrossRef]
- Clement, T.; Kemmerzell, N.; Abdelaal, M.; Amberg, M. XAIR: A Systematic Metareview of Explainable AI (XAI) Aligned to the Software Development Process. Mach. Learn. Knowl. Extr. 2023, 5, 78–108. [Google Scholar] [CrossRef]
- Mohammad Amini, M.; Jesus, M.; Fanaei Sheikholeslami, D.; Alves, P.; Hassanzadeh Benam, A.; Hariri, F. Artificial Intelligence Ethics and Challenges in Healthcare Applications: A Comprehensive Review in the Context of the European GDPR Mandate. Mach. Learn. Knowl. Extr. 2023, 5, 1023–1035. [Google Scholar] [CrossRef]
- Chiaburu, T.; Haußer, F.; Bießmann, F. Uncertainty in XAI: Human Perception and Modeling Approaches. Mach. Learn. Knowl. Extr. 2024, 6, 1170–1192. [Google Scholar] [CrossRef]
Author (Reference) | Study Type (Country: Year) | Type of Dataset: Participants (PE %) | ML Models Tested | Best Performing ML Model | Top Five Key Features | Intended Deployment: Prediction Time | ||
---|---|---|---|---|---|---|---|---|
ML Model: PE Subgroup (Prediction Time) | AUC | Recall (%) | ||||||
Velikova M. et al. [13] | Retrospective research (The Netherlands: 2011) | Test set: 417 (7.9% PE) | Temporal Bayesian network model | Temporal Bayesian Network Model: All PE (gestational week 12) | - | - | Not specified | Two times: Gestational weeks 12 and 16 (intended to be multiple times). |
Temporal Bayesian Network Model: All PE (gestational week 16) | - | - | ||||||
Martínez-Velasco A. et al. [14] | Retrospective cohort (Italy: 2018) | Training and validation set: 1634 (16.46% PE) | RF AdaBoost CT Stoch. GBoost Glmnet MAR-Splines Linear discriminant analysis Bayesian GLM NN with feature extraction SVM radial kernel SVM linear kernel KNN Single C5.0 Tree Boosted logistic regression C4.5-like Trees | RF: All PE (not specified) | - | 68% | RF variable importance: 1. Gestation weeks completed 2. Poverty 3. Water retention/edema 4. Toxemia 5. Highest educational degree | One time: not specified when |
Kovacheva VP. et al. [15] | Retrospective study (United States: 2024) | Training and validation set: 1125 (7.8% PE) | Logistic regression XGBoost | XGBoost without the hypertension genetic risk score: All PE (before gestational week 14) | 0.74 | 97% | SHAP (<week 14): 1. History of PE 2. Mean diastolic BP (<14 weeks) 3. Mean systolic BP (first prenatal visit) 4. Maternal age 5. BMI SHAP (before birth): 1. Maximum systolic BP during pregnancy 2. Mean systolic BP (week 34—birth) 3. Mean diastolic BP (week 34—birth) 4. History of chronic kidney disease 5. Maximum uric acid during pregnancy | Two times: one model for first prenatal visit and one model for before the delivery admission (not specified further) |
XGBoost without the hypertension genetic risk score: All PE (before birth) | 0.91 | 97% | ||||||
Eberhard BW. et al. [16] * | Retrospective cohort (United States: 2023) | Training set: 98,241 Validation set: 22,511 Total: 120,752 (5.7% PE) External validation set: 7705 (5.9% PE) | XGBoost Deep NN Elastic net RF Linear regression | XGBoost: External validation set: All PE (gestation week 14) | 0.66 | 33% | SHAP (gestational week 14): 1. Chronic and gestational hypertension 2. Interpregnancy interval 3. Medical history 4. Diastolic and systolic BP 5. Maternal age SHAP (before admission) 1. Diastolic and systolic BP 2. Maternal age 3. Laboratory results 4. Chronic and gestational hypertension 5. Insurance | Multiple times: week 14, 20, 24, 28, 32, 36, 39, and on admission. They made a model for each time point. |
XGBoost: External validation set: All PE (gestation week 20) | 0.66 | 35% | ||||||
XGBoost: External validation set: All PE (gestation week 24/28/32/36/39/on admission) | 0.67/0.69/0.71 /0.76/0.86/0.9 | 37%/40%/44% /49%/66%/75% | ||||||
Li Z. et al. [17] | Case-control retrospective (China: 2023) | Training set: 1272 (18% PE) Validation set: 546 (26% PE) Total: 1818 (20.4% PE) | Iterative dichotomiser algorithm | Iterative Dichotomiser algorithm Macro average: All PE (not specified) | 73% | Not specified | Multiple times: At prenatal visits at different gestational weeks (not specified further) | |
Iterative dichotomiser algorithm Weighted average: All PE (not specified) | - | 89% | ||||||
Liu M. et al. [18] | Cohort Retrospective study (China: 2022) | Training set: 9945 Validation set: 1105 Total: 11,050 (1.3% PE) | Deep artificial NN DT Logistic regression RF SVM linear kernel | RF: All PE (not specified) | 0.86 | 42% | Not specified | One time: first prenatal visit (not specified further) |
Li Y-x. et al. [19] | Retrospective cohort study (China: 2021) | Total: 3759 (5.08% PE) | RF SVM linear versus radial kernel XGBoost logistic regression | XGBoost with all features (temporal validation): All PE (not specified) | 0.96 | 79% | XGBoost feature importance: 1. Fasting plasma glucose 2. Mean BP 3. BMI 4. Maternal abdominal circumference 5. Serum uric acid | One time: early second trimester (not specified further) |
XGBoost with simple model (temporal validation): All PE (not specified) | 0.84 | - | ||||||
Sufriyana H. et al. [20] | Retrospective case-control study (Indonesia: 2020) | Training and internal validation set: 20,975 (14.5% PE) External validation with geographic split: 1322 (11% PE) External validation with temporal split: 904 (13.2% PE) Total: 23,201 (14.3% PE) | Logistic regression DT Artificial NN RF SVM Ensemble algorithm | RF with geographical split (external validation): All PE (not specified) | 0.76 | - | Not specified | Not specified |
RF with temporal split (external validation): All PE (not specified) | 0.70 | - | ||||||
RF with geographical split (external validation): All PE (subgroup 9–<12 months—approximation from study figure) | 0.88 | - | ||||||
RF with geographical split (external validation): All PE (subgroup 9–<12 months—approximation from study figure) | 0.86 | - | ||||||
Neocleous CK et al. [21] | Prospective study (England: 2010) | Training set: 6793 (1.7% PE) Validation set: 36 (44% PE) Test set: 9 (56% PE) Total: 6838 (1.99% PE) | NN | NN: All PE (not specified) | - | - | Not specified | Not specified |
Bennett R. et al. [22] | Retrospective study (United States: 2022) | Training and test set 1: 360,943 (3.98% PE) Training and test set 2: 84,632 (5.58% PE) Training and test set 3: 31,431 (8.73% PE) | Deep NN CSDNNWCE CSDNNFL Logistic regression Weighted logistic regression SVM with linear kernel Weighted SVM with linear kernel SVM with radial basis function Weighted SVM with radial basis function | Dataset 1: CSDNNFL: All PE (not specified) | 0.66 | 62% | Chi-square feature selection: Set 1: 1. Hypertension 2. Obesity 3. Pre-existing diabetes 4. Gestational diabetes 5. Multiple gestations Set 2: 1. Obesity 2. Pre-existing diabetes 3. Multiple gestations 4. Proteinuria 5. Native American Set 3: 1. Kidney disease 2. Chronic hypertension 3. Diabetes 4. CNS abnormality 5. Previous incidents of high BP (spikes before week 14) | One time: not specified |
Dataset 2: CSDNNFL: All PE (not specified) | 0.64 | 57% | ||||||
Dataset 3: CSDNNWCE: All PE (not specified) | 0.76 | 67% | ||||||
Neocleous K.C. et al. [23] | Prospective study (England: 2009) | Training set: 6793 (1.4% PE) Validation set: 36 (44% PE) Test set: 9 (56% PE) Total: 6838 (1.7% PE) | Multiple linear regression Multiple nonlinear regression Feedforward neural network | NN: All PE (not specified) | - | - | Not specified | One time: at gestational week 11+0 to 13+6 |
Torres-Torres J. et al. [24] | Prospective cohort study (Mexico: 2023) | Training set: 1068 Validation set: 914 Test set: 1068 Total: 3050 (4.1% PE) | Elastic net | Elastic Net with all features: All PE (not specified) | 0.78 | - | Regularization coefficient: 1. PlGF 2. MAP 3. UtA-PI 4. BMI 5. APS | One time: first trimester (not specified further) |
Elastic net with all features: Early-onset (<34 gestation weeks) (not specified) | 0.96 | - | ||||||
Elastic net with all features: Pre-term PE (<37 gestation weeks) (not specified) | 0.90 | - | ||||||
Gil M.M et al. [25] | Validation using prospective cohort data (Spain: 2024) | Training set: 30,352 Validation set: 10,000 Test set: 20,352 External validation set (PREVAL): 10,110 (2.3% PE) | Feed-forward NN with two hidden layers compared to FMF | NN with all features except PAPP-A: All PE (not specified) | 0.85 | 56% | Not specified by Gil et al. According to the developer of the ML model Ansbacher-Feldman et al. [26] using SHAP: 1. MAP 2. UtA-PI 3. PlGF 4. Racial origin 5. Parous, no previous PE | One time: first prenatal visit specified by Ansbacher-Feldman et al. [26]. (not specified further) |
NN with all features except PAPP-A: Early-onset PE (<34 gestation weeks) (not specified) | 0.92 | 84% | ||||||
NN with all features except PAPP-A: Pre-term PE (<37 gestation weeks) (not specified) | 0.91 | 78% | ||||||
Melinte-Popescu A-S et al. [27] | Prospective case-control study (Romania: 2023) | Training set: 70 Validation set: 163 Total: 233 (50% PE) | DT Naïve Bayes SVM with linear kernel RF | Naïve Bayes: All PE (not specified) | 0.98 | 96% | Not specified | One time: first prenatal visit (not specified further) |
DT: Early-onset (<34 gestation weeks) (not specified) | 0.95 | 75% | ||||||
RF: Late-onset PE (>34 gestation weeks) (not specified) | 0.84 | 93% | ||||||
DT: Moderate PE (not specified) (not specified) | 0.80 | 92% | ||||||
RF: Severe PE (when certain criteria are present) (not specified) | 0.76 | 33% | ||||||
Marić I. et al. [28] | Retrospective cohort study (United States: 2020) | Total: 5245 (10.7% PE) | Elastic net Gradient boosting Multiple logistic regression | Elastic net: All PE (not specified) | 0.79 | 45% | Coefficient impact in Elastic Net: All PE: 1. Hypertension 2. History of PE 3. insulin 4. Mean systolic BP 5. Race unknown Early-onset (<34 gestation weeks): 1. Hypertension 2. Number of babies 3. History of PE 4. Protein 3+ 5. Anemia | One time: week 16 of gestation |
Elastic net: Early-onset (<34 gestation weeks) (not specified) | 0.89 | 72% | ||||||
Wang L. et al. [29] | Retrospective cohort study (China: 2024) | Training and internal validation: 25,709 (6.36% PE) External validation set: 1760 (8.97% PE) | AdaBoost RF Multi-layer perceptron Gradient boosting DT Gaussian naïve Bayes XGBoost Logistic regression SVM Category boosting Light gradient Booste4d machine | AdaBoost: All PE (not specified) | 0.80 | 52% | SHAP: 1. Chronic hypertension 2. Pre-gestational BMI 3. Scarred uterus 4. Age 5. Chronic renal disease | One time: routine first prenatal visit (11+0–13+6 gestational weeks) |
AdaBoost: Ealy-onset PE (<34 gestational weeks) (not specified) | 0.82 | 58% | ||||||
AdaBoost: Preterm-PE (<37 gestational weeks) (not specified) | 0.82 | 53% | ||||||
Sandström A. et al. [30] | Retrospective cohort study (Sweden: 2019) | Total: 62,562 (4.4% PE) | RF Backward selection model on multivariable logistic regression Multivariable regression model using FMF variables | Multivariable regression model: Early-onset (<34 gestation weeks) (not specified) | 0.68 | 31% | Not specified | One time: first prenatal visit (not specified further) |
Multivariable regression model: Preterm PE (<37 gestation weeks) (not specified) | 0.68 | 29% | ||||||
Multivariable regression model: Term PE (≥37 gestation weeks) (not specified) | 0.67 | 28% | ||||||
Jhee J.H. et al. [31] | Retrospective (Korea: 2019) | Training set: 7704 Validation set: 3302 Total: 11,006 (4.7% PE) | Logistic regression DT Naïve Bayes classification SVM RF Stoch. GBoost | Stoch. GBoost: Late-onset PE (>34 gestation week) (not specified) | - | 60% | Mean decrease Gini 1. Systolic BP 2. Serum blood urea nitrogen 3. Serum creatinine 4. Platelet count 5. Serum potassium | Not specified |
Study | Risk of Bias | Applicability | Overall | ||||||
---|---|---|---|---|---|---|---|---|---|
Participants | Predictors | Outcome | Analysis | Participants | Predictors | Outcome | Risk of Bias | Applicability | |
[13] | ? | +/− | +/− | ? | + | + | + | ? | ? |
[14] | + | +/− | + | ? | + | + | + | ? | + |
[15] | + | +/− | + | ? | + | + | + | ? | + |
[16] | + | +/− | + | + | + | + | + | +/− | + |
[17] | + | +/− | + | ? | + | + | + | ? | + |
[18] | + | +/− | + | + | + | + | + | +/− | + |
[19] | + | +/− | + | + | + | + | + | +/− | + |
[20] | + | +/− | + | + | + | + | + | +/− | + |
[21] | + | +/− | +/− | ? | + | + | + | ? | + |
[22] | + | +/− | + | + | + | + | + | +/− | + |
[23] | + | +/− | + | ? | + | + | + | ? | + |
[24] | + | + | + | +/− | + | + | + | +/− | + |
[25] | + | + | + | ? | + | + | + | ? | + |
[27] | + | + | + | + | + | + | + | + | + |
[28] | +/− | + | + | + | + | + | + | +/− | + |
[29] | + | + | + | +/− | + | + | + | +/− | + |
[30] | + | + | + | + | + | + | + | + | + |
[31] | + | +/− | + | +/− | + | + | + | +/− | + |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Pedersen, L.; Mazur-Milecka, M.; Ruminski, J.; Wagner, S. A Review on Machine Learning Deployment Patterns and Key Features in the Prediction of Preeclampsia. Mach. Learn. Knowl. Extr. 2024, 6, 2515-2569. https://doi.org/10.3390/make6040123
Pedersen L, Mazur-Milecka M, Ruminski J, Wagner S. A Review on Machine Learning Deployment Patterns and Key Features in the Prediction of Preeclampsia. Machine Learning and Knowledge Extraction. 2024; 6(4):2515-2569. https://doi.org/10.3390/make6040123
Chicago/Turabian StylePedersen, Louise, Magdalena Mazur-Milecka, Jacek Ruminski, and Stefan Wagner. 2024. "A Review on Machine Learning Deployment Patterns and Key Features in the Prediction of Preeclampsia" Machine Learning and Knowledge Extraction 6, no. 4: 2515-2569. https://doi.org/10.3390/make6040123
APA StylePedersen, L., Mazur-Milecka, M., Ruminski, J., & Wagner, S. (2024). A Review on Machine Learning Deployment Patterns and Key Features in the Prediction of Preeclampsia. Machine Learning and Knowledge Extraction, 6(4), 2515-2569. https://doi.org/10.3390/make6040123