Machine Learning Insight: Unveiling Overlooked Risk Factors for Postoperative Complications in Gastric Cancer

Lee, Sejin; Oh, Hyo-Jung; Yoo, Hosuon; Kim, Chan-Young

doi:10.3390/cancers17071225

Open AccessArticle

Machine Learning Insight: Unveiling Overlooked Risk Factors for Postoperative Complications in Gastric Cancer

¹

Department of Surgery, Jeonbuk National University Hospital, 20 Geonji-ro, Deokjin-gu, Jeonju 54907, Republic of Korea

²

Research Institute of Clinical Medicine of Jeonbuk National University-Biomedical Research Institute of Jeonbuk National University Hospital, Jeonju 54907, Republic of Korea

³

Department of Library & Information Science, Jeonbuk National University, Jeonju 54896, Republic of Korea

⁴

Research Division for Data Analysis, Korea Institute of Science and Technology Information (KISTI), Daegu 41515, Republic of Korea

^*

Author to whom correspondence should be addressed.

Cancers 2025, 17(7), 1225; https://doi.org/10.3390/cancers17071225

Submission received: 28 February 2025 / Revised: 30 March 2025 / Accepted: 2 April 2025 / Published: 4 April 2025

(This article belongs to the Special Issue Clinical Studies in Gastrointestinal Malignancies)

Download

Browse Figures

Versions Notes

Simple Summary

This study aimed to improve the prediction of postoperative complications after gastrectomy for gastric cancer using machine learning (ML) models. Data from 865 patients who underwent surgery between 2018 and 2022 were analyzed, incorporating 85 variables, including clinical, laboratory, intraoperative, and pathological data. ML models were trained and validated (80/20 split) with steps for missing data handling, variable selection, and hyperparameter tuning. Using backward elimination and a moderate missing data strategy, the highest model performance was achieved (AUC 0.744). ML identified 15 key risk factors, with operation time, preoperative albumin, and mean corpuscular hemoglobin levels being the most predictive. Random Forest and XGBoost outperformed traditional linear regression. These findings suggest that ML, combined with refined feature selection, may enhance risk stratification for postoperative complications in gastric cancer surgery.

Abstract

Background: Since postoperative complications after gastrectomy for gastric cancer are associated with poor clinical outcomes, it is important to predict and prepare for the occurrence of complications preoperatively. Conventional models for predicting complications have limitations, prompting interest in machine learning algorithms. Machine learning models have a superior ability to identify complex interactions among variables and nonlinear relationships, potentially revealing new risk factors. This study aimed to explore previously overlooked risk factors for postoperative complications and compare machine learning models with linear regression. Materials and Methods: We retrospectively reviewed data from 865 patients who underwent gastrectomy for gastric cancer from 2018 to 2022. A total of 85 variables, including demographics, clinical features, laboratory values, intraoperative parameters, and pathologic results, were used to conduct the machine learning model. The dataset was partitioned into 80% for training and 20% for validation. To identify the most accurate prediction model, missing data handling, variable selection, and hyperparameter tuning were performed. Results: Machine learning models performed notably well when using the backward elimination method and a moderate missing data strategy, achieving the highest area under the curve values (0.744). A total of 15 variables associated with postoperative complications were identified using a machine learning algorithm. Operation time was the most impactful variable, followed closely by pre-operative levels of albumin and mean corpuscular hemoglobin. Machine learning models, especially Random Forest and XGBoost, outperformed linear regression. Conclusions: Machine learning, coupled with advanced variable selection techniques, showed promise in enhancing risk prediction of postoperative complications for gastric cancer surgery.

Keywords:

gastrectomy; prediction of complication; machine learning algorithms

1. Introduction

Gastric cancer is a prevalent malignancy, significantly contributing to global cancer-related morbidity and mortality [1]. Surgical interventions are standard treatment options for this aggressive disease. However, postoperative morbidity rates have been reported to reach up to 46%, negatively impacting health outcomes and imposing a significant burden on healthcare resources [2]. The precise prediction of these postoperative complications is pivotal, leading to the development of many conventional models [3,4,5,6].

Recently, machine learning algorithms have been increasingly and widely applied across various domains of medicine, such as hybrid models for white blood cell classification and the prediction of maternal health risks [7,8]. In terms of predicting postoperative outcomes, machine learning has demonstrated superior performance, often outshining traditional statistical techniques [9,10,11,12]. Traditional statistical methods generally presuppose linear relationships and assign unchanging weights to risk factors. Such assumptions can inadvertently neglect complex interactions. Conversely, machine learning algorithms are adept at discerning, nonlinear associations and intricate dependencies, providing a more comprehensive and dynamic interpretation of data [13,14,15,16]. Despite the growing reliance on machine learning for predictions, many researchers still tend to focus on well-acknowledged variables, thereby possibly neglecting unexplored yet significant factors [16,17,18,19,20,21].

A truly comprehensive data-driven approach would ideally involve collecting large datasets from multiple medical centers [20,22]. However, given the constraints of time and cost, such an approach can be challenging. Recognizing these challenges, we initiated a preliminary study at one institution to uncover potential overlooked risk factors that could be revealed through machine learning models.

The primary goal of this study was to investigate the role of previously overlooked yet impactful variables in machine learning models. Furthermore, we aimed to determine if machine learning models, when trained in this context, could surpass predictions made through linear regression in varied situations. Our approach differs from prior research by employing a wide array of variables and applying systematic feature selection techniques to highlight clinically meaningful predictors. While most existing studies using machine learning have focused on well-established variables or imaging data [23,24], our study sheds light on routinely available preoperative parameters, such as CBC-related markers, that have rarely been emphasized. Insights from this study could serve as a foundation for future extensive multi-institutional data collection and refined research directions.

2. Material and Methods

2.1. Patient Selection

This study, approved by the local ethics committee (approval number: CUH 2023-07-010), retrospectively investigated the cohort of 936 patients who underwent gastric cancer surgery from January 2018 to December 2022 in single center. After the exclusion of 71 patients due to procedures like bypass surgery, wedge resection, and exploratory surgery, 865 patients remained for the machine learning analysis (Figure 1). All the surgical procedures were conducted according to the Korean gastric cancer treatment guidelines [25]. The Strengthening the Reporting of Cohort, Cross-sectional, and Case–Control Studies in Surgery (STROCSS) research guidelines were used to adhere to research standards [26]. This research also was retrospectively registered in ResearchRegistry.com.

2.2. Data Collection and Completeness

The dataset focused on data acquired pre-operation and intra-operation. However, postoperative pathological results that mirrored preoperative conditions were also incorporated. The collected 83 variables were categorized into preoperative, intraoperative, and postoperative pathologic variables. Preoperative variables comprised demographic and clinical data points, including blood and biochemical markers such as white blood cell count, hemoglobin, and carbohydrate antigen 19-9, among others. Intraoperative variables consisted of details about the surgery, such as the duration of operation, approach type, and tumor gross type. Postoperative pathologic variables included the pathologic TNM status in line with AJCC/UICC 8th edition [27] and tumor differentiation grade (Supplementary Table S1). Postoperative complications were defined as any adverse event that required additional pharmacologic, interventional, or surgical management within 30 days after surgery. All postoperative complications were graded according to the Clavien–Dindo classification at grade 1 or higher [28].

Variables are presented as numbers (percentages).

Of the 83 variables, 32 showcased missing values. Criteria for their inclusion or exclusion for analysis were based on the extent of absent data:

Over 30 missing values—Neutrophil percentage, Lymphocyte percentage, and 21 others, totaling 23 variables
Between 10 and 30 missing values—Total protein, Carcinoembryonic antigen, and Carbohydrate antigen 19-9, totaling 3 variables
Less than 10 missing values—Red blood cell distribution, Plateletcrit, Mean platelet volume (MPV), Platelet distribution width (PDW), Alkaline phosphatase, and Phosphorus, totaling 6 variables.

2.3. Machine Learning Methodology

To counteract data imbalance, we incorporated three sampling techniques: base, under-sampling, and over-sampling. Model efficacy was assessed using metrics such as accuracy, precision, recall, and area under the curve (AUC). The dataset was partitioned into 80% for training and 20% for validation. For enhanced evaluation precision, random resampling was performed 100 times.

We adopted three strategies to handle the 32 variables with missing data. Depending on the strategy, the resultant data varied in terms of the number of patients and variables retained.

Complete Case Strategy: All variables with any missing values were omitted, leaving us with data from all 865 patients but only 53 variables.
Moderate Missing Data Strategy: We removed variables with more than 10 missing values and then excluded patients with any missing data. This process left us with data from 847 patients and 59 variables.
Liberal Missing Data Strategy: We discarded variables with over 30 missing values and then excluded patients with any missing data. As a result, we retained the most variables, 62 in total, but were left with data from 799 patients.

Considering the risk of overfitting in machine learning, we implemented the backward elimination method for variable pruning. Techniques like Random Forest also assisted in gauging feature significance. Post variable selection, we fine-tuned our models using the Grid Search method for hyperparameter optimization. XGBoost and Random Forest were benchmarked against linear regression models. We performed 100 samplings for a thorough comparison.

2.4. Tools and Software

Comparative analyses concerning complications were executed using Pearson’s chi-square two-sided test and the t-test, facilitated by IBM SPSS 29. Statistical analyses were performed using Python (Version 3.8.10, Python Software Foundation), and the machine learning procedure was performed using the Python Sklearn package (Scikit-learn 1.2.0 for Random Forest, Multiple Linear Regression), XGBoost 1.7.5 for XGBoost).

3. Results

3.1. Postoperative Complications

A total of 222 patients had complications after gastric cancer surgery (25.7%). The rate of grade IIIa or higher complications was 8.9% (77 patients). Detailed incidence and type of postoperative complications are presented in Table 1.

3.2. Optimizing AUC in Model Development

In predictive modeling, the metric of AUC serves as an insightful measure of a model’s capability to distinguish between outcomes. Initially, the “Complete strategy” for missing data, when paired with “base” sampling, yields an AUC of 0.709. However, with “under-sampling” and “over-sampling”, the AUC fluctuates marginally to 0.707 and 0.712, respectively. The “Moderate strategy” exhibits a consistent AUC, ranging from 0.715 to 0.718 across different sampling techniques. The “Liberal strategy” shows a similar trend, with its AUC values hovering around 0.712.

During the variable selection process, the “Backward elimination” strategy, particularly when combined with the “Moderate strategy”, proved to be highly effective. This approach achieved AUC values exceeding 0.74, regardless of the sampling method used, indicating its consistent performance in various scenarios. The combination of “Hyperparameter tuning” with the “moderate strategy with backward elimination” also revealed impressive results. Using the “base” sampling yielded an AUC of 0.739, but with the “over-sampling” technique, this number increased slightly to 0.744, the highest recorded for this dataset (Table 2).

3.3. Interpretable Variables Selection

During AUC optimization in model development, variable selection emerged as a key factor in improving model performance. Through the backward elimination method, our study identified 15 pivotal variables: gender, radical resection, method of anastomosis, extent of resection, combined operation, type of co-operated organ, duration of operation, mean corpuscular hemoglobin (MCH), platelet count, MPV, PDW, potassium, total bilirubin, albumin, and estimated glomerular filtration rate (eGFR). These variables highlight their importance within the model, providing insights into their influence on outcomes (Figure 2). Notably, the duration of operation was the most impactful variable, followed closely by pre-operative levels of albumin and MCH. The method of anastomosis and platelet count were also influential. Among the selected variables, nine were pre-operative factors: gender, MCH, platelet count, MPV, PDW, potassium, total bilirubin, albumin, and eGFR. The remaining six variables, including radical resection, method of anastomosis, extent of resection, combined operation, type of co-operated organ, and duration of operation, were identified during surgery.

This graph illustrates the 15 critical variables pinpointed through the backward elimination method during AUC optimization. Each variable emphasizes its significance and impact on the model’s performance. Notably, the ‘Duration of operation’ emerged as the most influential, followed by pre-operative measurements of ‘albumin’ and ‘MCH’. Of these variables, nine were ascertained pre-operatively, and the remaining six were recognized during surgery.

The Gini Index is a measure of impurity. The probability of a randomly selected object from the target variable’s ith category being misclassified into the target variable’s jth category is P(i)P(j). Here, P(i) represents the probability that an object in each node belongs to the target variable’s ith category. Summing up these misclassification probabilities, we obtain

G = \sum_{i - 1}^{c} \times \sum_{i + j} P (i) P (j)

which serves as an estimate of the misclassification probability under the given classification rule. In this context, c refers to the number of categories in the target variable.

Abbreviations: AUC; area under the curve, MCH; mean corpuscular hemoglobin, MPV; mean platelet volume, PDW; platelet distribution width, eGFR; estimated glomerular filtration rate.

3.4. Performance Comparison of Machine Learning Models with Conventional Multiple Linear Regression

The analysis compares the performance of three models, namely Random Forest (RF), XGBoost, and Multiple Linear Regression (MLR), using different variable strategies. Specifically, all variables under the ‘moderate strategy’ were considered, followed by backward elimination (Supplementary Tables S2 and S3). Using all variables post-cleanup, XGBoost outperformed the others in the test data with an AUC of 0.723 during over-sampling. RF and MLR followed closely, with their peak AUCs reaching 0.699 (Figure 3A). Upon variable selection, the test data revealed RF achieving the highest AUC at 0.744. XGBoost and MLR trailed with AUCs of 0.734 and 0.730, respectively (Figure 3B). While MLR serves as a foundational modeling approach, the ensemble methods, namely RF and XGBoost, exhibited enhanced performance in this study. Their enhanced performance can be credited to their ability to detect complex nonlinear relationships and combat overfitting through multiple decision trees or gradient-boosted stages. This performance advantage became even more evident when integrating efficient variable selection techniques, such as backward elimination.

4. Discussion

The evolution of risk prediction tools in surgery, from the American Society of Anesthesiologists classification in the 1940s to more contemporary tools, underscores the quest for better pre-operative patient assessment [4,5,6,29,30]. Despite the depth and breadth of tools available, the reliability of many remains questionable due to their inherent subjectivity and moderate predictive capacity [30].

This study aimed to utilize the capabilities of machine learning, taking advantage of its ability to identify complex data patterns that might be missed by traditional regression analyses. Where conventional statistical methods are bound by linear and additive assumptions and can potentially neglect key predictors, machine learning excels in detecting nonlinear relationships, providing crucial insights to improve data analysis [21,31].

A noteworthy discovery in our study was the significance of certain parameters in the Complete Blood Count (CBC) in predicting postoperative complications. In the field of gastric cancer, it may come as a surprise to many clinicians that detailed components of CBC can be so influential. Despite often being sidelined in routine assessments, parameters like MCH, MPV, and RDW were identified as key features in our machine learning models. When viewed through the lens of the literature, platelet count, MPV, RDW, and lymphocyte are inflammatory markers [32,33,34]. MPV, often linked to inflammatory processes across cardiovascular, respiratory, and neoplastic diseases, fluctuates based on the nature of the inflammation or disease [35,36,37]. The elevated MPV, often seen in cases of nutritional deficiencies or drug and alcohol usage, is strongly associated with increased mortality [38]. Studies have highlighted the significance of RDW and mean corpuscular volume (MCV) in patient outcomes, especially during surgical procedures. Abnormal RDW and MCV values can predict transfusion risks and postoperative outcomes in cardiac surgeries [39]. The relationship between RDW and mortality extends to older adults, regardless of their health conditions [40].

One intriguing observation for clinicians is that out of the 15 prognostic indicators identified, 9 can be readily ascertained preoperatively using basic demographic information and simple blood tests. Only six variables are derived intraoperatively. Surprisingly, the pathological results did not hold significant weight in the machine learning model. This suggests that if at-risk patient groups can be primarily identified through preoperative data, adjustments during surgery based on intraoperative variables might offer a strategy to mitigate postoperative complications.

From a technical standpoint in machine learning modeling, the technique lies not only in prediction accuracy but equally in adept feature selection. Striking a balance between a model laden with too many variables (which might cause overfitting) and one with too few (risking omission of crucial predictors) becomes critical. This delicate equilibrium, evident in our exploration of CBC parameters, highlights the nuanced finesse required in model creation. Incorporating tools like the Random Forest algorithm, known for its prowess in identifying patient characteristics, further bolstered our research. This algorithm, with its growing prominence in medical research, allows for a detailed dissection of risk factors, providing clearer insight into postoperative outcomes.

There are several limitations to this study, as outlined below:

Given that our research is preliminary, the significance of variables derived from our modeling cannot be fully ascertained with confidence.
As a result, validation through multi-institutional big data, particularly from high-volume tertiary centers [41], is imperative to reaffirm our findings.
The data are sourced from a single institution, and although comprehensive, they might not be representative of a broader patient population.
While we employed three strategies to handle missing values in the quest for an optimal predictive model, this approach may have inadvertently excluded variables that could be deemed significant based on clinical experience.
While we included essential model comparisons and variable selection strategies, more comprehensive ablation analyses—such as comparing the impact of different feature subsets or evaluating confusion matrices—were not fully explored. These analyses are crucial for evaluating model robustness and methodological innovation. To address this, a follow-up study is currently underway. This upcoming research will compare feature sets selected by AI-based backward elimination with those chosen by experienced surgeons using clinical expertise. We expect this future work to provide deeper insight into the clinical applicability and interpretability of machine learning models in surgical outcomes.

5. Conclusions

Our study underscores the transformative potential of machine learning in the realm of gastric surgery risk prediction. By unveiling previously underemphasized risk factors and emphasizing the nuanced value of CBC parameters, we pave the way for a new frontier in preoperative assessment. As the digital age progresses, so does the promise of leveraging advanced technology in refining our medical approaches. We believe our findings set the stage for expansive research, further harnessing the power of machine learning to enhance patient care in surgical settings.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/cancers17071225/s1, Table S1: List of Included Variables: Categorical and Numerical; Table S2: Performance Evaluation of Random Forest, XGBoost, and Linear Regression Models Across Different Sampling Techniques without variables selection; Table S3: Performance Evaluation of Random Forest, XGBoost, and Linear Regression Models Across Different Sampling Techniques with variables selection.

Author Contributions

Conceptualization, C.-Y.K. and S.L.; methodology, C.-Y.K., H.-J.O. and S.L.; software, H.-J.O. and H.Y.; validation, H.-J.O. and H.Y.; formal analysis, S.L.; investigation, C.-Y.K. and S.L.; resources, C.-Y.K.; data curation, S.L.; writing—original draft preparation, C.-Y.K. and S.L.; writing—review and editing, C.-Y.K. and H.-J.O.; visualization, H.-J.O. and H.Y.; supervision, C.-Y.K. and H.-J.O.; project administration, C.-Y.K.; funding acquisition, S.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Fund of the Biomedical Research Institute, Jeonbuk National University Hospital (CUH 2022-0025). The funding sources had no role in the design and conduct of the study, collection of data, analysis and interpretation, preparation, review, or approval of the manuscript, or decision to submit the manuscript for publication.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki. This study was approved by the Jeonbuk National University’s Institutional Review Board (approval number: CUH 2023-07-010, on 11 July 2023).

Informed Consent Statement

The informed consent for the use of patient data was waived due to the retrospective nature of the study.

Data Availability Statement

The data presented in this study are available upon reasonable request from the corresponding author.

Conflicts of Interest

The authors declare that they have no competing interests.

References

Sung, H.; Ferlay, J.; Siegel, R.L.; Laversanne, M.; Soerjomataram, I.; Jemal, A.; Bray, F. Global cancer statistics 2020: Globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2021, 71, 209–249. [Google Scholar] [CrossRef] [PubMed]
Hamilton, T.D.; Mahar, A.L.; Haas, B.; Beyfuss, K.; Law, C.H.L.; Karanicolas, P.J.; Coburn, N.G.; Hallet, J. The impact of advanced age on short-term outcomes following gastric cancer resection: An acs-nsqip analysis. Gastric Cancer 2018, 21, 710–719. [Google Scholar] [CrossRef] [PubMed]
Hyder, J.A.; Reznor, G.; Wakeam, E.; Nguyen, L.L.; Lipsitz, S.R.; Havens, J.M. Risk prediction accuracy differs for emergency versus elective cases in the acs-nsqip. Ann. Surg. 2016, 264, 959–965. [Google Scholar] [CrossRef] [PubMed]
Protopapa, K.L.; Simpson, J.C.; Smith, N.C.; Moonesinghe, S.R. Development and validation of the surgical outcome risk tool (sort). Br. J. Surg. 2014, 101, 1774–1783. [Google Scholar] [CrossRef]
Copeland, G.P.; Jones, D.; Walters, M. Possum: A scoring system for surgical audit. Br. J. Surg. 1991, 78, 355–360. [Google Scholar] [CrossRef]
Beal, E.W.; Saunders, N.D.; Kearney, J.F.; Lyon, E.; Wei, L.; Squires, M.H.; Jin, L.X.; Worhunsky, D.J.; Votanopoulos, K.I.; Ejaz, A.; et al. Accuracy of the acs nsqip online risk calculator depends on how you look at it: Results from the united states gastric cancer collaborative. Am. Surg. 2018, 84, 358–364. [Google Scholar] [CrossRef]
Cengil, E.; Çınar, A.; Yıldırım, M. A hybrid approach for efficient multi-classification of white blood cells based on transfer learning techniques and traditional machine learning methods. Concurrency Comput. Pract. Exp. 2022, 34, e6789. [Google Scholar] [CrossRef]
Mutlu, H.B.; Durmaz, F.; Yücel, N.; Cengil, E.; Yıldırım, M. Prediction of maternal health risk with traditional machine learning methods. Naturengs 2023, 4, 18–25. [Google Scholar] [CrossRef]
FitzHenry, F.; Murff, H.J.; Matheny, M.E.; Gentry, N.; Fielstein, E.M.; Brown, S.H.; Reeves, R.M.; Aronsky, D.; Elkin, P.L.; Messina, V.P.; et al. Exploring the frontier of electronic health record surveillance: The case of postoperative complications. Med. Care 2013, 51, 509–516. [Google Scholar] [CrossRef]
Hofer, I.S.; Lee, C.; Gabel, E.; Baldi, P.; Cannesson, M. Development and validation of a deep neural network model to predict postoperative mortality, acute kidney injury, and reintubation using a single feature set. NPJ Digit. Med. 2020, 3, 58. [Google Scholar] [CrossRef]
Weller, G.B.; Lovely, J.; Larson, D.W.; Earnshaw, B.A.; Huebner, M. Leveraging electronic health records for predictive modeling of post-surgical complications. Stat. Methods Med. Res. 2018, 27, 3271–3285. [Google Scholar] [CrossRef] [PubMed]
Fritz, B.A.; Cui, Z.; Zhang, M.; He, Y.; Chen, Y.; Kronzer, A.; Ben Abdallah, A.; King, C.R.; Avidan, M.S. Deep-learning model for predicting 30-day postoperative mortality. Br. J. Anaesth. 2019, 123, 688–695. [Google Scholar] [CrossRef] [PubMed]
Waljee, A.K.; Higgins, P.D. Machine learning in medicine: A primer for physicians. Am. J. Gastroenterol. 2010, 105, 1224–1226. [Google Scholar] [CrossRef] [PubMed]
Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2009. [Google Scholar]
Sohil, F.; Sohali, M.U.; Shabbir, J. An introduction to statistical learning with applications in r. Stat. Theory Relat. Fields 2022, 6, 87. [Google Scholar] [CrossRef]
Goldstein, B.A.; Navar, A.M.; Carter, R.E. Moving beyond regression techniques in cardiovascular risk prediction: Applying machine learning to address analytic challenges. Eur. Heart J. 2017, 38, 1805–1814. [Google Scholar] [CrossRef]
Chen, J.H.; Asch, S.M. Machine learning and prediction in medicine—Beyond the peak of inflated expectations. N. Engl. J. Med. 2017, 376, 2507–2509. [Google Scholar] [CrossRef]
Velmahos, C.S.; Paschalidis, A.; Paranjape, C.N. The not-so-distant future or just hype? Utilizing machine learning to predict 30-day post-operative complications in laparoscopic colectomy patients. Am. Surg. 2023, 89, 5648–5654. [Google Scholar] [CrossRef]
Steyerberg, E.W.; Vickers, A.J.; Cook, N.R.; Gerds, T.; Gonen, M.; Obuchowski, N.; Pencina, M.J.; Kattan, M.W. Assessing the performance of prediction models a framework for traditional and novel measures. Epidemiology 2010, 21, 128–138. [Google Scholar] [CrossRef]
Obermeyer, Z.; Emanuel, E.J. Predicting the future—Big data, machine learning, and clinical medicine. N. Engl. J. Med. 2016, 375, 1216–1219. [Google Scholar] [CrossRef]
Litjens, G.; Kooi, T.; Bejnordi, B.E.; Setio, A.A.A.; Ciompi, F.; Ghafoorian, M.; van der Laak, J.; van Ginneken, B.; Sanchez, C.I. A survey on deep learning in medical image analysis. Med. Image Anal. 2017, 42, 60–88. [Google Scholar] [CrossRef]
Rendon, E.; Alejo, R.; Castorena, C.; Isidro-Ortega, F.J.; Granda-Gutierrez, E.E. Data sampling methods to deal with the big data multi-class imbalance problem. Appl. Sci. 2020, 10, 1276. [Google Scholar] [CrossRef]
Jung, J.O.; Pisula, J.I.; Bozek, K.; Popp, F.; Biebl, M.; Raab, H.R.; Schiffmann, L. Prediction of postoperative complications after oesophagectomy using machine-learning methods. Br. J. Surg. 2023, 110, 1362–1370. [Google Scholar]
Zhou, C.M.; Wang, Y.; Yang, J.J.; Zhu, Y. Predicting postoperative gastric cancer prognosis based on inflammatory factors and machine learning technology. BMC Med. Inform. Decis. Mak. 2023, 23, 144. [Google Scholar]
Guideline Committee of the Korean Gastric Cancer Association (KGCA), Development Working Group & Review Panel. Korean practice guideline for gastric cancer 2018: An evidence-based, multi-disciplinary approach. J. Gastric Cancer 2019, 19, 1–48. [Google Scholar]
Mathew, G.; Agha, R.; Albrecht, J.; Goel, P.; Mukherjee, I.; Pai, P.; D’Cruz, A.K.; Nixon, I.J.; Roberto, K.; Enam, S.A.; et al. Strocss 2021: Strengthening the reporting of cohort, cross-sectional and case-control studies in surgery. Int. J. Surg. 2021, 96, 106165. [Google Scholar] [CrossRef]
Amin, M.B.; Greene, F.L.; Edge, S.B.; Compton, C.C.; Gershenwald, J.E.; Brookland, R.K.; Meyer, L.; Gress, D.M.; Byrd, D.R.; Winchester, D.P. The eighth edition ajcc cancer staging manual: Continuing to build a bridge from a population-based to a more “personalized” approach to cancer staging. CA Cancer J. Clin. 2017, 67, 93–99. [Google Scholar]
Clavien, P.A.; Sanabria, J.R.; Strasberg, S.M. Proposed classification of complications of surgery with examples of utility in cholecystectomy. Surgery 1992, 111, 518–526. [Google Scholar]
Bihorac, A.; Ozrazgat-Baslanti, T.; Ebadi, A.; Motaei, A.; Madkour, M.; Pardalos, P.M.; Lipori, G.; Hogan, W.R.; Efron, P.A.; Moore, F.; et al. Mysurgeryrisk: Development and validation of a machine-learning risk algorithm for major complications and death after surgery. Ann. Surg. 2019, 269, 652–662. [Google Scholar]
Horvath, B.; Kloesel, B.; Todd, M.M.; Cole, D.J.; Prielipp, R.C. The evolution, current value, and future of the american society of anesthesiologists physical status classification system. Anesthesiology 2021, 135, 904–919. [Google Scholar] [CrossRef]
Deo, R.C. Machine learning in medicine. Circulation 2015, 132, 1920–1930. [Google Scholar]
Klinger, M.H.; Jelkmann, W. Role of blood platelets in infection and inflammation. J. Interferon Cytokine Res. 2002, 22, 913–922. [Google Scholar] [CrossRef] [PubMed]
Fowler, A.J.; Agha, R.A. Neutrophil/lymphocyte ratio is related to the severity of coronary artery disease and clinical outcome in patients undergoing angiography—The growing versatility of nlr. Atherosclerosis 2013, 228, 44–45. [Google Scholar] [CrossRef] [PubMed]
Guthrie, G.J.; Charles, K.A.; Roxburgh, C.S.; Horgan, P.G.; McMillan, D.C.; Clarke, S.J. The systemic inflammation-based neutrophil-lymphocyte ratio: Experience in patients with cancer. Crit. Rev. Oncol. Hematol. 2013, 88, 218–230. [Google Scholar] [CrossRef] [PubMed]
Gunaldi, M.; Erdem, D.; Goksu, S.; Gunduz, S.; Okuturlar, Y.; Tiken, E.; Aksoy, H.; Yildirim, M. Platelet distribution width as a predictor of metastasis in gastric cancer patients. J. Gastrointest. Cancer 2017, 48, 341–346. [Google Scholar] [CrossRef]
Korniluk, A.; Koper-Lenkiewicz, O.M.; Kaminska, J.; Kemona, H.; Dymicka-Piekarska, V. Mean platelet volume (mpv): New perspectives for an old marker in the course and prognosis of inflammatory conditions. Mediat. Inflamm. 2019, 2019, 9213074. [Google Scholar] [CrossRef]
Gasparyan, A.Y.; Ayvazyan, L.; Mikhailidis, D.P.; Kitas, G.D. Mean platelet volume: A link between thrombosis and inflammation? Curr. Pharm. Des. 2011, 17, 47–58. [Google Scholar] [CrossRef]
Chae, D.; Kim, N.Y.; Kim, H.J.; Kim, T.L.; Kang, S.J.; Kim, S.Y. A risk scoring system integrating postoperative factors for predicting early mortality after major non-cardiac surgery. Clin. Transl. Sci. 2022, 15, 2230–2240. [Google Scholar] [CrossRef]
Dai, L.; Mick, S.L.; McCrae, K.R.; Houghtaling, P.L.; Blackstone, E.H.; Koch, C.G. Prognostic value of preoperative red cell distribution width: Fine-tuning by mean corpuscular volume. Ann. Thorac. Surg. 2019, 108, 1830–1838. [Google Scholar] [CrossRef]
Patel, K.V.; Semba, R.D.; Ferrucci, L.; Newman, A.B.; Fried, L.P.; Wallace, R.B.; Bandinelli, S.; Phillips, C.S.; Yu, B.; Connelly, S.; et al. Red cell distribution width and mortality in older adults: A meta-analysis. J. Gerontol. A Biol. Sci. Med. Sci. 2010, 65, 258–265. [Google Scholar] [CrossRef]
Marano, L.; Verre, L.; Carbone, L.; Poto, G.E.; Marrelli, D.; Roviello, F. Current trends in volume and surgical outcomes in gastric cancer. J. Clin. Med. 2023, 12, 4238. [Google Scholar] [CrossRef]

Figure 1. Study flowchart.

Figure 2. Identification of key variables via AUC optimization.

Figure 3. RF and XGBoost Models vs. MLR Performance Analysis: (A). Initial model comparison post-missing value treatment. (B). Performance post-variable selection using backward elimination. RF—Random Forest, MLR—Multiple Linear Regression.

Table 1. Postoperative complications.

Variables	Value (N = 865)
Overall complication	222 (25.7)
Surgical complication
Anastomotic leakage	30 (3.5)
Anastomotic stricture	3 (0.3)
Intra-abdominal bleeding	4 (0.5)
Intra-luminal bleeding	4 (0.5)
Fluid collection	9 (1.0)
Intra-abdominal abscess	2 (0.2)
Pancreatitis	6 (0.7)
Gastric stasis	19 (2.2)
Ileus	14 (1.6)
Intestinal obstruction	6 (0.7)
Fever	19 (2.2)
Wound	19 (2.2)
Other surgical complication	7 (0.8)
Medical complication
Pulmonary	45 (5.2)
Renal	5 (0.6)
Urinary	8 (0.9)
Gastrointestinal	9 (1.0)
Hepatic	4 (0.5)
Other medical complication	9 (1.0)
Clavien–Dindo grade
I	23 (2.7)
II	122 (14.1)
IIIa	42 (4.9)
IIIb	19 (2.2)
IV	4 (0.5)
V	12 (1.4)

Values in parentheses are percentages.

Table 2. Comparative analysis of missing data treatment and sampling on model performance metrics.

Step of Analysis	Missing Data Tx	Sampling	Train Accuracy	Test Accuracy	Test AUC	Test Precision	Test Recall
Missing data strategy	Complete strategy	Base	0.808 (0.783–0.824)	0.752 (0.717–0.780)	0.709 (0.593–0.838)	0.586 (0.000–1.000)	0.098 (0.000–0.182)
		Under	0.830 (0.792–0.871)	0.668 (0.572–0.751)	0.707 (0.597–0.803)	0.400 (0.313–0.508)	0.609 (0.364–0.864)
		Over	0.815 (0.775–0.841)	0.712 (0.642–0.786)	0.712 (0.558–0.797)	0.447 (0.318–0.590)	0.530 (0.318–0.773)
	Moderate strategy	Base	0.808 (0.793–0.829)	0.751 (0.712–0.788)	0.718 (0.601–0.789)	0.620 (0.000–1.000)	0.100 (0.000–0.205)
		Under	0.841 (0.793–0.891)	0.671 (0.582–0.735)	0.715 (0.604–0.796)	0.411 (0.319–0.493)	0.618 (0.455–0.818)
		Over	0.827 (0.792–0.853)	0.711 (0.629–0.788)	0.718 (0.589–0.797)	0.452 (0.327–0.611)	0.536 (0.364–0.750)
	Liberal strategy	Base	0.810 (0.789–0.826)	0.758 (0.725–0.800)	0.712 (0.630–0.825)	0.687 (0.000–1.000)	0.110 (0.000–0.244)
		Under	0.853 (0.807–0.883)	0.679 (0.606–0.769)	0.709 (0.611–0.824)	0.415 (0.333–0.550)	0.605 (0.415–0.805)
		Over	0.840 (0.792–0.870)	0.727 (0.650–0.850)	0.712 (0.623–0.852)	0.472 (0.340–0.707)	0.509 (0.293–0.756)
Backward elimination	Only moderate strategy	Base	0.799 (0.780–0.799)	0.747 (0.706–0.782)	0.741 (0.665–0.820)	0.562 (0.000–1.000)	0.099 (0.000–0.227)
		Under	0.807 (0.767–0.848)	0.675 (0.600–0.735)	0.738 (0.666–0.822)	0.424 (0.346–0.493)	0.694 (0.523–0.864)
		Over	0.799 (0.759–0.828)	0.710 (0.629–0.800)	0.741 (0.645–0.815)	0.459 (0.344–0.600)	0.649 (0.477–0.818)
Hyper parameter tuning	Only moderate strategy with backward elimination	Base	0.761 (0.747–0.783)	0.740 (0.718–0.759)	0.739 (0.663–0.814)	0.377 (0.000–1.000)	0.017 (0.000–0.091)
		Under	0.773 (0.741–0.813)	0.687 (0.629–0.759)	0.741 (0.659–0.841)	0.435 (0.368–0.526)	0.683(0.500–0.864)
		Over	0.767 (0.721–0.801)	0.711 (0.624–0.776)	0.744 (0.646–0.808)	0.461 (0.353–0.552)	0.657 (0.500–0.818)

The above result table summarizes the averages, maximums, and minimums of each metric after conducting 100 rounds of sampling. Abbreviations: AUC—area under the curve.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lee, S.; Oh, H.-J.; Yoo, H.; Kim, C.-Y. Machine Learning Insight: Unveiling Overlooked Risk Factors for Postoperative Complications in Gastric Cancer. Cancers 2025, 17, 1225. https://doi.org/10.3390/cancers17071225

AMA Style

Lee S, Oh H-J, Yoo H, Kim C-Y. Machine Learning Insight: Unveiling Overlooked Risk Factors for Postoperative Complications in Gastric Cancer. Cancers. 2025; 17(7):1225. https://doi.org/10.3390/cancers17071225

Chicago/Turabian Style

Lee, Sejin, Hyo-Jung Oh, Hosuon Yoo, and Chan-Young Kim. 2025. "Machine Learning Insight: Unveiling Overlooked Risk Factors for Postoperative Complications in Gastric Cancer" Cancers 17, no. 7: 1225. https://doi.org/10.3390/cancers17071225

APA Style

Lee, S., Oh, H.-J., Yoo, H., & Kim, C.-Y. (2025). Machine Learning Insight: Unveiling Overlooked Risk Factors for Postoperative Complications in Gastric Cancer. Cancers, 17(7), 1225. https://doi.org/10.3390/cancers17071225

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Machine Learning Insight: Unveiling Overlooked Risk Factors for Postoperative Complications in Gastric Cancer

Simple Summary

Abstract

1. Introduction

2. Material and Methods

2.1. Patient Selection

2.2. Data Collection and Completeness

2.3. Machine Learning Methodology

2.4. Tools and Software

3. Results

3.1. Postoperative Complications

3.2. Optimizing AUC in Model Development

3.3. Interpretable Variables Selection

3.4. Performance Comparison of Machine Learning Models with Conventional Multiple Linear Regression

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI