Article

Explainable Model Using Shapley Additive Explanations Approach on Wound Infection after Wide Soft Tissue Sarcoma Resection: “Big Data” Analysis Based on Health Insurance Review and Assessment Service Hub

1
Department of Orthopedic Surgery, Anam Hospital, Korea University College of Medicine, 73 Goryeodae-ro, Seongbuk-gu, Seoul 02841, Republic of Korea
2
Anam Hospital Bloodless Medicine Center, Korea University College of Medicine, Seoul 02841, Republic of Korea
3
School of Mechanical Engineering, Korea University College of Medicine, 73 Goryeodae-ro, Seongbuk-gu, Seoul 02841, Republic of Korea
4
AI Center, Anam Hospital, Korea University College of Medicine, 73 Goryeodae-ro, Seongbuk-gu, Seoul 02841, Republic of Korea
5
Department of Obstetrics and Gynecology, Anam Hospital, Korea University College of Medicine, Seoul 02841, Republic of Korea
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Medicina 2024, 60(2), 327; https://doi.org/10.3390/medicina60020327
Submission received: 21 December 2023 / Revised: 4 February 2024 / Accepted: 12 February 2024 / Published: 14 February 2024
(This article belongs to the Section Oncology)

Abstract

Background and Objectives: Soft tissue sarcomas represent a heterogeneous group of malignant mesenchymal tumors. Despite their low prevalence, soft tissue sarcomas present clinical challenges for orthopedic surgeons owing to their aggressive nature and perioperative wound infections. However, the low prevalence of soft tissue sarcomas has hindered the availability of large-scale studies. This study aimed to analyze wound infections after wide resection in patients with soft tissue sarcomas by employing big data analytics from the Hub of the Health Insurance Review and Assessment Service (HIRA). Materials and Methods: Patients who underwent wide excision of soft tissue sarcomas between 2010 and 2021 were included. Data were collected from the HIRA database, which holds information on approximately 50 million individuals in the Republic of Korea. The data collected included demographic information, diagnoses, prescribed medications, and surgical procedures. Random forest was used to analyze the major associated determinants. A total of 10,906 observations with complete data were divided into training and validation sets in an 80:20 ratio (8773 vs. 2193 cases). Random forest permutation importance was employed to identify the major predictors of infection, and Shapley Additive Explanations (SHAP) values were derived to analyze the directions of associations with predictors. Results: A total of 10,969 patients who underwent wide excision of soft tissue sarcomas were included. Among the study population, 886 (8.08%) patients had post-operative infections requiring surgery. The overall transfusion rate for wide excision was 20.67% (2267 patients). Risk factors among the comorbidities of each patient with wound infection were analyzed and dependence plots of individual features were visualized. The transfusion dependence plot reveals a distinctive pattern, with SHAP values displaying a negative trend for individuals without blood transfusions and a positive trend for those who received blood transfusions, emphasizing the substantial impact of blood transfusions on the likelihood of wound infection. Conclusions: Using the machine learning random forest model and SHAP values, perioperative transfusion, male sex, old age, and low socioeconomic status (SES) were identified as important features of wound infection in soft tissue sarcoma patients.

1. Introduction

Soft tissue sarcomas represent a heterogeneous group of rare malignant tumors arising from mesenchymal tissues. Despite their low prevalence, soft tissue sarcomas present a significant clinical challenge for orthopedic surgeons owing to their aggressive nature, propensity for local recurrence, and potential for distant metastasis. The management of soft tissue sarcomas involves a multidisciplinary approach, with surgical resection as the primary treatment modality. Innovations in radiotherapy and systemic treatment as adjuvant therapies are likely to further improve the quality of local control while decreasing the complications and burden of treatment. However, surgery remains indicated in soft tissue sarcoma and is still the best means of control. The success of such interventions is intricately related to the wound-healing process, which plays a pivotal role in preventing complications, enhancing recovery, and improving overall patient outcomes. Postoperative wound-related complications continue to contribute significantly to morbidity, encompassing wound dehiscence, cellulitis, abscess, seromas, hematomas, and wound necrosis. Factors such as diabetes, smoking, obesity, tumor diameter, and preoperative radiotherapy have been identified as independent predictors for major wound complications. However, existing research is confined to single-institute cohorts, and a comprehensive nationwide study on postoperative wound complications in soft tissue sarcoma patients is lacking [1,2].
Perioperative wound infections present a formidable challenge for clinicians managing patients with sarcoma, as they can lead to prolonged hospital stays, impaired wound healing, compromised oncological outcomes, and increased healthcare costs [3,4]. One factor that warrants further investigation is perioperative blood transfusion [5]. Although the biological mechanisms linking blood transfusion to wound infections are not entirely understood, potential immunomodulatory effects and alterations to immune responses have been proposed as contributing factors.
The low prevalence of soft tissue sarcomas has hindered the availability of large-scale studies focused on wound healing in this specific context. It is challenging for a single institution to obtain sufficient data to conduct robust investigations. Consequently, our understanding of the wound healing mechanisms and factors influencing outcomes in patients with soft tissue sarcomas remains limited. Given the low prevalence of sarcomas, leveraging the vast repository of information stored in the “big data” Hub of the Health Insurance Review and Assessment Service (HIRA) presents a unique and compelling opportunity to conduct a nationwide analysis of data for sarcoma patients. The HIRA Service Hub is an invaluable resource and data source for diverse healthcare institutions nationwide. By harnessing the power of big data, we can expect to overcome the limitations posed by the rarity of this malignancy and obtain a more comprehensive understanding of wound-healing patterns in sarcoma patients.
When analyzing big data, the sample size can be very large, so traditional statistical methods flag even minor differences as statistically significant. Understanding the complex relationships and contributions of individual features to model predictions is paramount for making informed decisions when dealing with big data. To overcome these limitations, new statistical methods such as machine learning techniques are gaining popularity [6]. Shapley Additive Explanations (SHAP) is a framework used in machine learning to understand feature importance and to quantify the contributions of variables to the dependent variable [7]. This study approached big data using random forest feature importance and the SHAP framework to analyze the contribution of each feature and to attribute feature importance in an interpretable way.
This study aimed to conduct an in-depth analysis of wound infections in patients with soft tissue sarcomas who underwent wide surgical resection. By employing big data analytics from the HIRA Service Hub, our study sought to provide valuable evidence that can pave the way for improved perioperative management strategies, reduced post-operative complications, and favorable treatment outcomes for individuals with this rare and challenging malignancy. This is the first study to identify and analyze variables correlated with wound infection after performing wide resections involving soft tissue sarcomas using a nationwide Korean database. Moreover, the study is the first to analyze feature importance and provide explanations to improve clinical understanding in soft tissue sarcoma using the SHAP approach.

2. Materials and Methods

2.1. Ethics Approval

This study was approved by the Institutional Review Board of the Tertiary Referral Medical Center, and the need for informed consent was waived owing to the retrospective nature of the study.

2.2. Study Populations and Data Source

Data were collected from the HIRA database, which contains information regarding approximately 50 million individuals in the Republic of Korea. The data collected included demographic information, diagnoses, prescribed medications, surgical procedures, and prescription records. Each participant was identified by the unique Korean Resident Registration Number assigned in the Republic of Korea at birth. Data duplication or omission was impossible. All data were anonymized.
The study population consisted of patients who underwent wide excision of soft tissue sarcomas between 2010 and 2021. Patients diagnosed with malignant soft tissue tumors were identified by the C49 code of the International Statistical Classification of Diseases and Related Health Problems, 10th Revision (ICD-10). For patients who underwent wide resection of soft tissue sarcoma, the corresponding Anatomical Therapeutic Chemical (ATC) codes were N0151, NA281, NA282, NA283, NA284, and N0232. After confirming wide resection surgery, patients who underwent infection-related procedures within 3 months after wide resection of soft tissue sarcomas were identified. Patients who received a transfusion during admission for surgery were identified based on ATC codes. Perioperative transfusion was defined as a transfusion performed within 3 months before or after the date of wide resection. All disease data, medication histories, and medical procedures were screened using ICD-10 and ATC codes from the Healthcare Common Procedure Coding System of the HIRA.

2.3. Data Analyses

Random forest was used to predict wound infection after wide resection of soft tissue sarcomas and to analyze the major associated determinants, including transfusion [8,9]. A decision tree is composed of intermediate nodes (tests of a predictor), branches (the values of the predictor as outcomes of the test), and terminal nodes (values of the dependent variable). A random forest is a collection of such trees built by “bootstrap aggregation”: decision trees are constructed from random samples drawn with replacement (bootstrapping), and the trees take a majority vote on the dependent variable (aggregation) [8]. A total of 10,906 observations with complete data were divided into training and validation sets in an 80:20 ratio (8773 vs. 2193 cases). The standard for validating the trained models was the area under the receiver operating characteristic curve (AUC), that is, the area under the plot of sensitivity vs. 1 − specificity, which summarizes sensitivity as the classification threshold varies and 1 − specificity ranges from 0 to 1. Random forest permutation importance was employed to identify the major predictors of infection, and SHAP values were derived to analyze the directions of associations with predictors. The random forest permutation importance of a predictor measures the overall decrease in accuracy when the data on that predictor are permuted. It is aggregated (as an average or sum) over all trees in the random forest, with a range between zero and the number of trees. The SHAP value of a predictor for a participant is calculated as the difference between what the random forest predicts for the probability of wound infection with and without the predictor. In a hypothetical example in which the SHAP values of transfusion for wound infection range from −0.02 to 0.10, some participants have SHAP values as low as −0.02, and other participants have values as high as 0.10. That is, including the predictor (transfusion) in the random forest decreases or increases the predicted probability of the dependent variable (wound infection) by an amount between −0.02 and 0.10. In this example, the SHAP values are skewed toward their maximum; hence, a positive association exists between transfusion and wound infection in general. Finally, R-Studio 1.3.959 (R-Studio Inc.: Boston, MA, USA) was employed for the analysis between 1 January 2023 and 31 May 2023.
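The permutation-importance idea described above can be illustrated with a minimal Python sketch. It is not the authors' R analysis: the toy data, the rule-based `predict` function standing in for a trained random forest, and all names are hypothetical. The sketch shuffles one feature column at a time and reports the mean drop in accuracy.

```python
import random

def accuracy(predict, rows, labels):
    """Fraction of rows whose predicted label matches the true label."""
    hits = sum(1 for row, y in zip(rows, labels) if predict(row) == y)
    return hits / len(rows)

def permutation_importance(predict, rows, labels, feature, n_repeats=20, seed=0):
    """Mean drop in accuracy after shuffling the values of one feature."""
    rng = random.Random(seed)
    baseline = accuracy(predict, rows, labels)
    drops = []
    for _ in range(n_repeats):
        column = [row[feature] for row in rows]
        rng.shuffle(column)
        # Rebuild the rows with the shuffled column, leaving other features intact.
        shuffled = [dict(row, **{feature: v}) for row, v in zip(rows, column)]
        drops.append(baseline - accuracy(predict, shuffled, labels))
    return sum(drops) / n_repeats

# Hypothetical toy data: the outcome depends only on "transfusion".
rows = [{"transfusion": i % 2, "noise": (i // 2) % 2} for i in range(40)]
labels = [row["transfusion"] for row in rows]

def predict(row):
    # Stand-in for a trained model; it uses only the "transfusion" feature.
    return row["transfusion"]

imp_transfusion = permutation_importance(predict, rows, labels, "transfusion")
imp_noise = permutation_importance(predict, rows, labels, "noise")
print(imp_transfusion, imp_noise)
```

In this toy setting, shuffling the feature the model relies on lowers accuracy, whereas shuffling an unused feature leaves it unchanged, which is why the latter receives an importance of zero.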

3. Results

A total of 10,969 patients who underwent wide excision of soft tissue sarcomas were included. Among the study population, 886 (8.08%) patients had post-operative infections requiring surgery. The overall transfusion rate for wide excision was 20.67% (2267 patients). Risk factors among the comorbidities of each patient with wound infection were analyzed (Table 1).
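As a quick arithmetic check, the reported rates follow directly from the counts given above. The short Python sketch below recomputes them; the variable names are illustrative only.

```python
total_patients = 10969  # patients who underwent wide excision (from the text)
infections = 886        # post-operative infections requiring surgery
transfused = 2267       # patients who received a transfusion

infection_rate = round(100 * infections / total_patients, 2)
transfusion_rate = round(100 * transfused / total_patients, 2)
print(infection_rate, transfusion_rate)  # 8.08 20.67
```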

SHAP Values

The relative contribution of each comorbidity to wound infection after wide resection of soft tissue sarcoma was compared. Mean SHAP values were calculated to explain and compare the effects of these features. The summary plot displays SHAP values on the horizontal (X) axis, which represent the contribution of the feature value to the output. A SHAP value < 0 represents a negative contribution, whereas a value > 0 indicates a positive contribution to the output. A positive contribution indicates that the feature value pushed the prediction toward wound infection. The vertical axis on the left displays features arranged from top to bottom according to their importance. The vertical axis on the right illustrates the values of the features in color: red for high and blue for low. The summary plot demonstrates that transfusion had a highly positive impact on wound infection (Figure 1).
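The ordering of features in a summary plot can be sketched as follows. This is a minimal Python illustration that assumes features are ranked by their mean absolute SHAP value, which is a common convention but is not confirmed by the text as the authors' exact procedure. The per-patient SHAP values are hypothetical and are not taken from the study.

```python
def rank_features(shap_matrix):
    """Order features by mean absolute SHAP value, largest first."""
    means = {
        name: sum(abs(v) for v in values) / len(values)
        for name, values in shap_matrix.items()
    }
    return sorted(means.items(), key=lambda item: item[1], reverse=True)

# Hypothetical per-patient SHAP values (illustrative only).
shap_matrix = {
    "transfusion": [0.10, -0.02, 0.08, -0.02],
    "age": [0.03, -0.01, 0.02, -0.02],
    "diabetes": [0.01, 0.00, -0.01, 0.00],
}
ranking = rank_features(shap_matrix)
print([name for name, _ in ranking])
```

Taking absolute values before averaging keeps positive and negative contributions from cancelling, so a feature that strongly raises the prediction for some patients and lowers it for others still ranks as important.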
To explain each feature in detail, dependence plots of individual features were visualized.
In the transfusion dependence plot, SHAP values on the Y-axis exhibited a negative trend among individuals who did not undergo blood transfusions (X-axis value 0.0), whereas a positive trend was observed among those who received blood transfusions (X-axis value 1.0). This divergence in SHAP values strongly suggests the significance of blood transfusions as a contributing factor to the likelihood of wound infection. Moreover, advanced age (denoted by red dots in the plot) corresponded to high SHAP values. This implies that older individuals exhibit an increased susceptibility to wound infections in this context (Figure 2).
In the sex dependence plot, the 0 and 1 values on the horizontal axis depict males and females, respectively. The plot indicates that males have higher wound infection rates than females (Figure 3).
In the age dependence plot, the SHAP values tended to increase slightly with age. This indicates that the incidence of wound infection increases with age. The right Y-axis displays the correlation with transfusions. According to this plot, a large number of people < 20 years of age received a transfusion, and as age increased, the SHAP values for wound infections also increased. In addition, when comparing SHAP values at the same age, an individual who received a transfusion, displayed as a red dot, had a larger SHAP value (Figure 4).
In the socioeconomic status dependence plot, the lower the socioeconomic status (SES), the greater the SHAP value, indicating a high wound infection rate (Figure 5).

4. Discussion

This study demonstrated that perioperative transfusion, male sex, old age, and low SES increased the risk of post-operative wound infection based on SHAP values. This nationwide cohort study established a predictive model for post-operative wound infections involving soft tissue sarcomas. The AUC of the prediction model using random forest was 0.6422. The model therefore has some classification ability, but its performance is poor.
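To put the reported AUC in context, the following minimal Python sketch computes the AUC as the probability that a randomly chosen infected case receives a higher predicted score than a randomly chosen non-infected case, with ties counted as one half. The scores and labels are hypothetical and the function name is illustrative; this is not the authors' code.

```python
def auc(scores, labels):
    """AUC as the probability that a positive case outranks a negative one."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a win
    return wins / (len(pos) * len(neg))

# Hypothetical predicted probabilities and true outcomes.
scores = [0.9, 0.4, 0.35, 0.8, 0.2, 0.1]
labels = [1, 1, 0, 0, 0, 0]
print(auc(scores, labels))  # 0.875
```

Under this definition, 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, so a value of 0.6422 lies much closer to chance than to perfect discrimination.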
The relationship between wound outcomes in soft tissue sarcomas and transfusions remains an intriguing area of investigation in the field of oncology. Although studies have explored the potential impact of transfusions on post-operative wound healing and overall prognosis in cancer patients, any specific link to soft tissue sarcoma outcomes has not yet been fully elucidated. Transfusions, particularly red blood cell transfusions, are often administered to mitigate anemia in cancer patients undergoing surgery or aggressive treatment. However, certain studies have suggested that excessive transfusions may be associated with an increased risk of complications, including surgical site infections, which can be a critical determinant of wound outcomes in patients with soft tissue sarcomas [5]. This study demonstrated that perioperative transfusion was a variable with a high predictive value for post-operative wound infection. The association between transfusion and post-operative wound infection cannot be taken as proof of a cause-and-effect relationship. Nevertheless, because the SHAP value for wound infection was higher for the perioperative transfusion variable than for any other single variable, wound infection is expected to be controlled more effectively by reducing the risk of transfusion before and after surgery. Further research is needed to unravel the precise nature of this relationship, considering the diverse histological subtypes and therapeutic approaches for soft tissue sarcomas. A deeper understanding of how transfusion practices affect wound healing and overall prognosis in this context could result in more tailored treatment strategies for these patients.
Male sex had the second-highest feature importance for wound infection after transfusion. Multiple clinical investigations have indicated sex-based variations in the occurrence of sepsis and its consequences [10]. In a bacteremia epidemiological study, sepsis was more prevalent among males than females [11,12]. Previous studies have reported disadvantages for the male sex due to hormonal differences and a higher prevalence of malnutrition and comorbidities [13,14,15,16]. Various hypotheses derived from previous studies support our results.
In the age dependence plot, the SHAP value tended to increase as age increased. As people age, they are more likely to have multiple underlying diseases that can cause problems with wound healing and increase the probability of wound infection [17]. As shown in Figure 1, liver disease, diabetes mellitus, and other medical issues had a positive impact on the wound infection model output. Although the impact of each underlying disease on wound infection was analyzed in the present study, the effect of an increasing number of underlying diseases on wound infection was not.
In terms of SES, the SHAP value was high in patients with low SES, implying that such patients have a high chance of wound infection. The current literature surrounding SES notes that while no universal definition of SES is available, multiple individual SES factors are linked to a heightened risk of compromised wound healing [18,19]. This phenomenon may be attributed to the likelihood of patients with low SES having a high prevalence of concurrent comorbidities, such as tobacco use, obesity, and diabetes [20]. Furthermore, individuals with lower SES may encounter barriers to accessing essential medical equipment and timely healthcare interventions compared to those with higher SES [21].
In the context of wound infections, factors such as age, sex, and SES exhibit greater importance than specific underlying medical conditions. Notably, perioperative transfusion has a higher feature importance for wound infection than these sociodemographic factors.
Soft tissue sarcomas represent a heterogeneous group of rare malignancies characterized by diverse histological subtypes and anatomical locations. Magnetic Resonance Imaging (MRI) plays a pivotal role in the assessment of soft tissue sarcoma, providing essential imaging features that contribute to the evaluation of treatment strategies, surgical planning, and the prediction of patients’ prognosis [22]. Researchers and clinicians have conducted various studies to assess prognoses in soft tissue sarcoma patients [2,3]. Nonetheless, despite the considerable efforts made to study soft tissue sarcoma wound outcomes, it is crucial to acknowledge that the interpretation of these analyses may be encumbered by a significant limitation: the low prevalence of sarcoma itself. Soft tissue sarcomas are relatively rare compared with more common malignancies, so the sample sizes in many single-institute and multiple-institute studies are relatively small. This inherent rarity poses a challenge in achieving statistical power and generalizability, potentially leading to results that may not accurately reflect the true diversity of this heterogeneous group of cancers. Furthermore, the scarcity of data concerning rare subtypes and specific clinical scenarios of soft tissue sarcomas can further obscure the precision of prognostic models and predictions. This characteristic of sarcomas promotes the use of large databases and population-based nationwide big data registries [23,24,25].
In big data analysis, understanding the complex relationships and contributions of individual features to model predictions is paramount for making informed decisions. As traditional statistical methods encounter limitations in handling large-scale and high-dimensional data, the SHAP framework has emerged as a powerful tool to unravel the intricate dynamics hidden within vast datasets. By leveraging cooperative game theory principles, SHAP provides a comprehensive and interpretable approach to attribute feature importance, even amid the complexities of big data. Predictive modeling uses data to train machine/deep learning models, enabling the anticipation of future outcomes based on unseen data. In the realm of healthcare, predictive modeling assumes a pivotal role, offering insights into diverse areas such as forecasting disease progression, identifying patients susceptible to specific conditions, and optimizing treatment plans. The intricacy of healthcare data and the substantial volume involved pose challenges in healthcare predictive modeling. Machine learning algorithms emerge as potent tools for constructing high-accuracy predictive models, especially in the analysis of clinical and biological data. In the medical domain, a paramount consideration is trustworthiness, reflecting the model’s validity and reliability. The trustworthiness of Artificial Intelligence (AI) is intrinsically linked to the interpretability of the model, addressing the critical question of how individuals can trust AI-generated information when the outcome lacks interpretability. Addressing the interpretability concern involves various strategies, including the development of simplified models through techniques like knowledge distillation. These models preserve high performance while enhancing interpretability. Another approach employs SHAP values, which delineate the impact of each feature on a model’s prediction, facilitating a clearer understanding of how different features contribute to the overall result [7,8,9,25,26].
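The cooperative game theory principle behind SHAP can be sketched with an exact Shapley computation on a toy two-feature example. In the minimal Python sketch below, each feature is treated as a "player" and its Shapley value is its marginal contribution to the model output, averaged over all orderings in which the features could be added. The payoff numbers and function names are hypothetical; practical SHAP implementations use approximations or tree-specific algorithms rather than this brute-force enumeration.

```python
from itertools import permutations

def shapley_values(value, players):
    """Exact Shapley values: average marginal contribution over all orderings."""
    totals = {p: 0.0 for p in players}
    orderings = list(permutations(players))
    for order in orderings:
        coalition = frozenset()
        for p in order:
            with_p = coalition | {p}
            totals[p] += value(with_p) - value(coalition)
            coalition = with_p
    return {p: t / len(orderings) for p, t in totals.items()}

# Hypothetical model output (predicted infection probability) for each
# subset of known features; these numbers are illustrative only.
payoff = {
    frozenset(): 0.08,
    frozenset({"transfusion"}): 0.16,
    frozenset({"age"}): 0.10,
    frozenset({"transfusion", "age"}): 0.20,
}
phi = shapley_values(lambda s: payoff[frozenset(s)], ["transfusion", "age"])
print(phi)
```

In this toy case the two values come to about 0.09 for transfusion and 0.03 for age, and they sum to the difference between the full prediction (0.20) and the baseline (0.08). This additivity is the property that makes the per-feature contributions interpretable.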

Limitations

This study had certain limitations. First, this was a retrospective cohort study, and inherent potential bias cannot be ruled out. However, previous studies have validated the accuracy of the HIRA coding system as 70–90%, indicating an acceptable level of accuracy for analyses. Second, comorbidities were identified based on codes; if a diagnosis code was not entered, the corresponding data may have been excluded. Third, the data quality was limited. Nationwide data were obtained through the HIRA, but variables previously identified to be associated with wound infection, such as body weight and operative time, could not be obtained. Social histories, such as smoking, could not be obtained, and no continuous data, such as laboratory results, were available. Finally, Korea is an ethnically homogeneous country with little racial variation. Homogeneity can reduce potential biases related to racial or ethnic disparities in terms of access to healthcare, treatments, and outcomes. Researchers could also focus on investigating other potential sources of bias, resulting in more accurate and unbiased results. Although homogeneity can offer advantages in some research contexts, it may also limit the ability to generalize the findings to more diverse populations globally.
Despite the above limitations, this is the first retrospective cohort study of sarcomas conducted using nationwide data. Additionally, this study sought to present a predictive model for post-operative wound infection in soft tissue sarcomas using a random forest model and SHAP values. Explainable models of this kind are expected to become mainstream in predicting and analyzing big data.

5. Conclusions

The limited prevalence of soft tissue sarcomas has posed challenges for conducting extensive studies. This study aimed to conduct an in-depth analysis of wound infections in patients with soft tissue sarcomas by employing big data analytics from the nationwide database. Employing the machine learning random forest model and SHAP values, this study identified perioperative transfusion, male sex, advanced age, and low socioeconomic status (SES) as pivotal factors influencing wound infections in individuals with soft tissue sarcomas.
The investigation gains significance in the context of a global blood shortage following the coronavirus pandemic, elevating the importance of comprehending the necessity and potential complications associated with perioperative transfusions. While infection has previously been suggested as a transfusion-related complication, the lack of substantial data on postoperative wound infections in soft tissue sarcoma underscores the valuable contribution of this study to the existing knowledge in the field.
As for further exploration, future studies could concentrate on gathering and analyzing data related to the tumor itself and its treatment. Factors such as size, location, histologic type, chemotherapy, and radiotherapy, considered crucial features in wound infections according to the nationwide database analyzed with SHAP values, could be investigated in detail using hospital-based data.

Author Contributions

Conceptualization: J.-H.C., K.-S.L., K.-H.A. and W.Y.J. Data curation: Y.C., K.-S.L. and K.-H.A. Formal analysis: Y.C. and K.-S.L. Investigation: Y.C. and K.-S.L. Methodology: K.-S.L., K.-H.A. and W.Y.J. Validation: K.-S.L. Visualization: Y.C. Writing—original draft preparation: J.-H.C. and K.-S.L. Revising and editing original manuscript draft: J.-H.C., K.-H.A. and W.Y.J. All authors were involved in writing, reviewing, discussing, and agreeing to the final submitted version of this paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by a grant from the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health and Welfare, Republic of Korea (Grant number: HI22C1463 and HR22C1302), and technically by 4P Lab, Co., Ltd. for data analysis.

Institutional Review Board Statement

This study was approved by the Institutional Review Board of the Tertiary Referral Medical Center (ethics committee name: Korea University Anam Hospital Institutional Review Board, approval code: 2022AN0378, approval date: 8 August 2022), and the need for informed consent was waived owing to the retrospective nature of the study.

Informed Consent Statement

Patient consent was waived due to the retrospective nature of the study.

Data Availability Statement

The data used in this study are available from the Big Data Hub of the Health Insurance Review and Assessment (HIRA) Service. However, the data are available only to institutions under license and are therefore not publicly available. Access to the study data is available from the authors for researchers who meet the criteria for access to confidential data, with the permission of the Big Data Hub of the HIRA Service.

Conflicts of Interest

The authors declare no competing interests, personal relationships, or religious or political beliefs that might otherwise influence their objectivity.

References

  1. Moore, J.; Isler, M.; Barry, J.; Mottard, S. Major wound complication risk factors following soft tissue sarcoma resection. Eur. J. Surg. Oncol. 2014, 40, 1671–1676.
  2. Lahat, G.; Dhuka, A.R.; Lahat, S.; Lazar, A.J.; Lewis, V.O.; Lin, P.P.; Feig, B.; Cormier, J.N.; Hunt, K.K.; Pisters, P.W.T.; et al. Complete Soft Tissue Sarcoma Resection is a Viable Treatment Option for Select Elderly Patients. Ann. Surg. Oncol. 2009, 16, 2579–2586.
  3. Bensaid, S.; Contejean, A.; Morand, P.; Enser, M.; Eyrolle, L.; Charlier, C.; Kernéis, S.; Anract, P.; Biau, D.; Canouï, E. Surgical site infection after pelvic bone and soft tissue sarcoma resection: Risk factors, microbiology, and impact of extended postoperative antibiotic prophylaxis. J. Surg. Oncol. 2023, 128, 344–349.
  4. Severyns, M.; Briand, S.; Waast, D.; Touchais, S.; Hamel, A.; Gouin, F. Postoperative infections after limb-sparing surgery for primary bone tumors of the pelvis: Incidence, characterization and functional impact. Surg. Oncol. 2017, 26, 171–177.
  5. Vamvakas, E.C. Possible mechanisms of allogeneic blood transfusion-associated postoperative infection. Transfus. Med. Rev. 2002, 16, 144–160.
  6. Amann, J.; Blasimme, A.; Vayena, E.; Frey, D.; Madai, V.I.; Consortium, P.Q. Explainability for artificial intelligence in healthcare: A multidisciplinary perspective. BMC Med. Inform. Decis. Mak. 2020, 20, 310.
  7. Roth, A.E. The Shapley Value: Essays in Honor of Lloyd S. Shapley; Cambridge University Press: Cambridge, UK, 1988.
  8. Lee, K.-S.; Ham, B.-J. Machine learning on early diagnosis of depression. Psychiatry Investig. 2022, 19, 597.
  9. Lee, K.-S.; Kim, E.S. Explainable Artificial Intelligence in the Early Diagnosis of Gastrointestinal Disease. Diagnostics 2022, 12, 2740.
  10. Offner, P.J.; Moore, E.E.; Biffl, W.L. Male gender is a risk factor for major infections after surgery. Arch. Surg. 1999, 134, 935–940.
  11. Lipska, M.A.; Bissett, I.P.; Parry, B.R.; Merrie, A.E. Anastomotic leakage after lower gastrointestinal anastomosis: Men are at a higher risk. ANZ J. Surg. 2006, 76, 579–585.
  12. Dekker, J.W.; Liefers, G.J.; de Mol van Otterloo, J.C.; Putter, H.; Tollenaar, R.A. Predicting the risk of anastomotic leakage in left-sided colorectal surgery using a colon leakage score. J. Surg. Res. 2011, 166, e27–e34.
  13. Yang, L.-l.; Xiao, Z.-l.; An, P.-j.; Yan, H.-j.; Li, Q. Association between pressure ulcers and the risk of postoperative infections in male adults with spinal cord injury. Br. J. Neurosurg. 2023, 37, 254–257.
  14. Coleman, S.; Gorecki, C.; Nelson, E.A.; Closs, S.J.; Defloor, T.; Halfens, R.; Farrin, A.; Brown, J.; Schoonhoven, L.; Nixon, J. Patient risk factors for pressure ulcer development: Systematic review. Int. J. Nurs. Stud. 2013, 50, 974–1003.
  15. Villarroel, R.M.; Formiga, F.; Alert, P.D.; Sangrà, R.A. Prevalence of malnutrition in Spanish elders: Systematic review. Med. Clin. 2012, 139, 502–508.
  16. Deren, M.E.; Huleatt, J.; Winkler, M.F.; Rubin, L.E.; Salzler, M.J.; Behrens, S.B. Assessment and Treatment of Malnutrition in Orthopaedic Surgery. JBJS Rev. 2014, 2, e1.
  17. Extermann, M. Measurement and impact of comorbidity in older cancer patients. Crit. Rev. Oncol./Hematol. 2000, 35, 181–200.
  18. Edelman, L.S. Social and economic factors associated with the risk of burn injury. Burns 2007, 33, 958–965.
  19. Bakshi, S.C.; Fobare, A.; Benarroch-Gampel, J.; Teodorescu, V.; Rajani, R.R. Lower socioeconomic status is associated with groin wound complications after revascularization for peripheral artery disease. Ann. Vasc. Surg. 2020, 62, 76–82.
  20. Everson, S.A.; Maty, S.C.; Lynch, J.W.; Kaplan, G.A. Epidemiologic evidence for the relation between socioeconomic status and depression, obesity, and diabetes. J. Psychosom. Res. 2002, 53, 891–895.
  21. Kelley, E.; Moy, E.; Stryer, D.; Burstin, H.; Clancy, C. The national healthcare quality and disparities reports: An overview. Med. Care 2005, 43, I3–I8.
  22. Sedaghat, S.; Schmitz, F.; Meschede, J.; Sedaghat, M. Systematic analysis of post-treatment soft-tissue edema and seroma on MRI in 177 sarcoma patients. Surg. Oncol. 2020, 35, 218–223.
  23. Lyu, H.G.; Haider, A.H.; Landman, A.B.; Raut, C.P. The opportunities and shortcomings of using big data and national databases for sarcoma research. Cancer 2019, 125, 2926–2934.
  24. Lawrenz, J.M.; Johnson, S.R.; Hajdu, K.S.; Chi, A.; Bendfeldt, G.A.; Kang, H.; Halpern, J.L.; Holt, G.E.; Schwartz, H.S. Is the number of national database research studies in musculoskeletal sarcoma increasing, and are these studies reliable? Clin. Orthop. Relat. Res. 2023, 481, 491–508. [Google Scholar] [CrossRef]
  25. Nohara, Y.; Matsumoto, K.; Soejima, H.; Nakashima, N. Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Comput. Methods Programs Biomed. 2022, 214, 106584. [Google Scholar] [CrossRef]
  26. Nosrati, H.; Nosrati, M. Artificial Intelligence in Regenerative Medicine: Applications and Implications. Biomimetics 2023, 8, 442. [Google Scholar] [CrossRef]
Figure 1. Summary plot of SHAP value. SHAP: Shapley Additive Explanations.
Figure 2. Dependence plot of transfusion.
Figure 3. Dependence plot of sex.
Figure 4. Dependence plot of age.
Figure 5. Dependence plot of socioeconomic status.
Table 1. Variables and demographic characteristics of the study population.
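| Variable | Total (n = 10,969), n | Total, per | Infection (n = 886), n | Infection, per | No-Infection (n = 10,083), n | No-Infection, per | p-Value |
|---|---|---|---|---|---|---|---|
| Age (years) | | 55.95 | | 58.63 | | 55.71 | p < 0.05 |
| Transfusion | 2267 | 20.7% | 323 | 36.5% | 1944 | 19.3% | p < 0.05 |
| Socioeconomic Status | 130,648 | 11.91 | 10,209 | 11.52 | 120,439 | 11.94 | |
| Female | 4946 | 45.1% | 322 | 36.3% | 4624 | 45.9% | p < 0.05 |
| Liver Disease | 3586 | 32.7% | 351 | 39.6% | 3235 | 32.1% | p < 0.05 |
| Iron | 805 | 7.3% | 103 | 11.6% | 702 | 7.0% | p < 0.05 |
| Diabetes Mellitus | 3051 | 27.8% | 309 | 34.9% | 2742 | 27.2% | p < 0.05 |
| Peripheral Vascular Disease | 3097 | 28.2% | 239 | 27.0% | 2858 | 28.3% | |
| Hypertension | 4443 | 40.5% | 419 | 47.3% | 4024 | 39.9% | p < 0.05 |
| Antithrombotic | 7837 | 71.4% | 703 | 79.3% | 7134 | 70.8% | p < 0.05 |
| Anemia | 2385 | 21.7% | 215 | 24.3% | 2170 | 21.5% | |
| COPD | 1896 | 17.3% | 165 | 18.6% | 1731 | 17.2% | |
| Cardiovascular Disease | 955 | 8.7% | 78 | 8.8% | 877 | 8.7% | |
| Congestive Heart Failure | 587 | 5.4% | 56 | 6.3% | 531 | 5.3% | |
| Peptic Ulcer Disease | 1123 | 10.2% | 93 | 10.5% | 1030 | 10.2% | |
| Dementia | 694 | 6.3% | 68 | 7.7% | 626 | 6.2% | |
| Myocardial Infarction | 547 | 5.0% | 61 | 6.9% | 486 | 4.8% | p < 0.05 |
| Tranexamic Acid | 501 | 4.6% | 51 | 5.8% | 450 | 4.5% | |
| Hypothyroidism | 626 | 5.7% | 52 | 5.9% | 574 | 5.7% | |
| Thrombocytopenia | 279 | 2.5% | 31 | 3.5% | 248 | 2.5% | |
| Chronic Kidney Disease | 257 | 2.3% | 24 | 2.7% | 233 | 2.3% | |
| Leukemia | 70 | 0.6% | 8 | 0.9% | 62 | 0.6% | |
| Thyrotoxicosis Hyperthyroidism | 202 | 1.8% | 12 | 1.4% | 190 | 1.9% | |
| Hemiplegia | 109 | 1.0% | 9 | 1.0% | 100 | 1.0% | |
| Solid Tumor | 10,790 | 98.4% | 878 | 99.1% | 9912 | 98.3% | |
| Lymphoma | 156 | 1.4% | 7 | 0.8% | 149 | 1.5% | |
| Connective Tissue Disease | 111 | 1.0% | 6 | 0.7% | 105 | 1.0% | |
| AIDS | 20 | 0.2% | 2 | 0.2% | 18 | 0.2% | |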
COPD: Chronic Obstructive Pulmonary Disease; AIDS: Acquired Immunodeficiency Syndrome.

Share and Cite

MDPI and ACS Style

Choi, J.-H.; Choi, Y.; Lee, K.-S.; Ahn, K.-H.; Jang, W.Y. Explainable Model Using Shapley Additive Explanations Approach on Wound Infection after Wide Soft Tissue Sarcoma Resection: “Big Data” Analysis Based on Health Insurance Review and Assessment Service Hub. Medicina 2024, 60, 327. https://doi.org/10.3390/medicina60020327