Next Article in Journal
Transcatheter Aortic Valve Implantation and Replacement: The Latest Advances and Prospects
Previous Article in Journal
Application Results of an Extracorporeal Therapy Protocol in Cardiorespiratory Arrest: A Historical Cohort Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predicting Major Preoperative Risk Factors for Retears After Arthroscopic Rotator Cuff Repair Using Machine Learning Algorithms

Department of Orthopedic Surgery, Seoul St. Mary’s Hospital, The Catholic University of Korea, Banpo-Daero 222, Secho-gu, Seoul 06591, Republic of Korea
*
Author to whom correspondence should be addressed.
J. Clin. Med. 2025, 14(6), 1843; https://doi.org/10.3390/jcm14061843
Submission received: 5 February 2025 / Revised: 4 March 2025 / Accepted: 7 March 2025 / Published: 9 March 2025
(This article belongs to the Section Orthopedics)

Abstract

:
Background/Objectives: This study aimed to identify the risk factors for retears after arthroscopic rotator cuff repair (ARCR) and to establish a hierarchy of their importance using machine learning. Methods: This study analyzed 788 primary ARCR cases performed by a single senior surgeon from January 2016 to December 2022. The condition of the repaired supraspinatus was assessed via magnetic resonance imaging (MRI) or sonography within 2 years after surgery. In total, 27 preoperative demographic, objective, and subjective clinical variables were analyzed using five well-established models: Extreme Gradient Boosting (XGBoost), Random Forest (RF), Support Vector Machine (SVM), Neural Network (NN), and logistic regression (LR). The models were trained on an 8:2 split training and test set, with three-fold validation. The primary metric for evaluating model performance was the area under the receiver operating characteristic curve (AUC). The top five influential features were extracted from the best-performing models. Univariate and multivariate LRs were performed independently as a reference. Results: The overall retear rate was 11.9%. The two best-performing prediction models were RF (validation AUC = 0.9790) and XGBoost (validation AUC = 0.9785). Both models consistently identified the tear size in the medial–lateral (ML) and anterior–posterior (AP) dimensions, full-thickness tears, and BMI among the top five risk factors. XGBoost uniquely included female sex, while RF highlighted the visual analogue scale (VAS) pain score. While conventional univariate regression indicated multiple significant factors associated with retears (age, full-thickness tear, AP and ML tear size, biceps conditions, fatty infiltration of three rotator cuff muscles, and atrophy of supraspinatus), multivariate analysis demonstrated that only age and the ML tear size are significant factors. Conclusions: Machine learning models demonstrated enhanced predictive accuracy compared to traditional LR in predicting retears, and the importance of risk factors was derived. Tear size, full-thickness tears, BMI, female sex, and VAS pain score emerged as the most influential risk factors.

1. Introduction

Arthroscopic rotator cuff repair (ARCR) is a well-established treatment for rotator cuff tears. Despite its effectiveness, the occurrence of tendon retears post repair remains a significant clinical challenge. Numerous studies have sought to identify and predict the risk factors for retears following ARCR [1,2,3,4,5,6,7]. The reported risk factors vary and include age, tear size, critical shoulder angle, and fatty infiltration (FI) of the tendon. However, findings regarding these factors have been inconsistent, with some studies reporting contradictory outcomes [3,6,7]. This indicates that the issue is complex and warrants further exploration.
The majority of research on retear risk factors after ARCR has focused on objective measures, often relegating subjective factors to the margins. For example, Kwon et al. [3] proposed a scoring system to assess rotator cuff healing post ARCR that included six objective factors but did not incorporate patient-reported outcome measurements (PROMs). This reflects a broader trend in research that prioritizes objective indicators over subjective ones, such as the visual analogue scale (VAS) pain score or the American Shoulder and Elbow Surgeon (ASES) score. While objective factors are undoubtedly important, the dismissal of subjective factors overlooks their potential significance. Numerous studies have identified a significant correlation between a patient’s subjective pain experiences and adverse prognosis, suggesting that integrating subjective factors with objective metrics could provide a more comprehensive understanding of retear risk factors [8,9,10].
Alongside these clinical investigations, the introduction of machine learning techniques has presented new methodologies for analyzing retear risk factors [1,11,12,13]. Harada et al. [1] used a decision tree algorithm to analyze data from 286 patients. Although this sample size may seem small for machine learning applications, the study demonstrated the potential of machine learning in predicting retears. Subsequent research has not only leveraged machine learning algorithms for prediction but also for identifying and prioritizing risk factors, using tools such as SHapley Additive exPlanation (SHAP) values and models such as Extreme Gradient Boosting (XGBoost) and Random Forest (RF), which have shown high predictive accuracies [11,14,15,16]. These efforts have been documented across multiple publications [11,17,18]. However, this innovative approach to exploring risk factors has yet to be applied specifically to retears after ARCR.
Therefore, this study aims to identify the risk factors influencing retears following ARCR using multiple machine learning algorithms and to rank and reveal the relative importance of these risk factors using high-accuracy machine learning models. Our approach includes both objective parameters and PROMs, providing a holistic view of predictive factors for retears. We hypothesize that (1) machine learning algorithms will not only predict retears after ARCR with high precision but also facilitate a ranking of risk factors and (2) that, while objective factors will emerge as significant predictors, some subjective factors should also be considered important risk factors.

2. Materials and Methods

Study approval was obtained from the institutional review board of our hospital. A total of 1042 adult patients underwent ARCR at our hospital from January 2016 to December 2022. Included in this study were patients who had full- or partial-thickness tears of the supraspinatus (SSP) tendon with or without combined tears of other rotator cuff tendons and underwent arthroscopic repair, a minimum follow-up period of 1 year, and radiological records available to evaluate the integrity of the repaired tendon.
We excluded cases with additional allograft augmentation or tendon transfer procedures, neuromuscular diseases, a partial repair of footprint coverage of 80% or less, infections, or revision surgeries. Figure 1 shows the number of patients included in and excluded from this study.
Tendon integrity was assessed via magnetic resonance imaging (MRI) at 2 months and 1 year after the surgery and with sonography at 3 and 6 months after the surgery. An annual follow-up MRI at our institution was recommended after 1 year. For patients encountering financial difficulties, sonography was used instead of MRI. We defined a retear as occurring within 2 years after surgery, following the findings of Iannotti et al. [19] and Miller et al. [20] who reported that retears typically occur within the early postoperative period, from 3 months to 26 weeks after surgery.

2.1. Operative Technique and Rehabilitation

All surgical procedures were performed by a single senior surgeon (YSK) with the patient in the lateral decubitus position under general anesthesia. A capsulectomy was conducted if the patient demonstrated a limited preoperative range of motion (ROM) or findings of adhesive capsulitis intraoperatively. Various techniques were used for the repair of the torn tendon, including single- and double-row techniques. In some instances, the long head of the biceps tendon (LHBT) was used for the repair of the torn tendon, either by rerouting the biceps or using a biceps augmentation technique [21]. Subscapularis (SSC) repair, biceps tenotomy or tenodesis, and acromioplasty were performed as necessary.
Postoperatively, patients were immobilized for 4–6 weeks using an abduction brace. Passive ROM exercises commenced after the immobilization period of 4–6 weeks. Active ROM and muscle strengthening exercises began 8 weeks following surgery. For patients who underwent capsulectomy, passive ROM exercises were initiated 1 day after surgery, and active ROM and muscle strengthening exercises commenced 8 weeks after surgery.

2.2. Variable Selection and Clinical Assessment

All preoperatively measured variables that could be retrieved through electronic medical record (EMR) review were considered for inclusion in this study. Variables that were missing in >10% of patients were excluded from consideration.
The size of the SSP tear was measured following intraoperative debridement. Tear retraction was measured as the distance from the medial margin of the torn tendon to its original footprint. The anterior to posterior size of the tear was determined by measuring the longest distance from the anterior to the posterior margin of the torn tendon. The condition of the SSC muscle was assessed according to the Lafosse classification [22]. Fatty infiltrations of the SSP, ISP, and SSC muscles were evaluated on preoperative MRI (Verio, 3-T, Siemens, Munich, Germany) in accordance with the Goutallier classification [23]. Muscle atrophy of the SSP, ISP, and SSC was assessed using the tangent sign on preoperative MRI [24]. Biceps lesions were classified into five groups: 0: intact; 1: tendinopathy; 2: partial tear < 50%; 3: partial tear > 50%; and 4: complete tear. Retears were assessed by MRI or sonography (HD11xe Ultrasound System; Philips, Amsterdam, The Netherlands), with the integrity of the tendon evaluated with MRI using the classification proposed by Sugaya et al. [25], where types IV and V were categorized as retears. Sonography assessments were carried out by orthopedic clinical fellows, considering a hypoechoic or anechoic defect extending from the bursal to the articular surface as indicative of a retear.
For subjective variables, preoperatively recorded validated PROMs were used. These included the VAS for pain, which ranges from 0 to 10; the ASES score; the Constant–Murley (CM) score; and the Korean Shoulder Scoring System (KS) score. The preoperative active ROM, including forward flexion, external rotation in the side arm position, external rotation in the abduction position, and internal rotation, was quantified using a handheld goniometer. These measurements were consistently performed by a single senior surgeon, with internal rotation documented as the highest vertebral level the thumb could reach along the spine when extended upward. Demographic factors, such as age, sex, smoking status, body mass index (BMI), the presence of diabetes mellitus (DM), and whether the dominant or non-dominant arm was involved, were also included as feature variables in the model.

2.3. Model Development and Feature Importance

To predict retears, we trained five distinct machine learning models: Extreme Gradient Boosting (XGBoost), Random Forest (RF), Support Vector Machine (SVM), Neural Network (NN), and logistic regression (LR). These models were selected due to their demonstrated effectiveness in forecasting medical outcomes [11,12,17,18]. Logistic regression was used as a baseline for comparative analysis against the more advanced models, given its conventional use in risk prediction. For LR analysis, both univariate and multivariate analyses were conducted, yielding odds ratios (ORs) and p-values for each factor.
The dataset was split into training and test sets at an 8:2 ratio. The dataset was initially imbalanced with a retear rate of 11.9%, so it was balanced using the synthetic minority oversampling technique (SMOTE), applied solely to the training set [26,27]. To ensure transparency and confirm that the SMOTE did not substantially alter the original data distributions, we have included Supplementary Materials presenting pairplots comparing variable distributions before and after applying the SMOTE. Within the training set, three-fold cross-validation was performed, along with a grid search to fine-tune the hyperparameters of the models. The integrity of the variable distributions post oversampling was verified to ensure there was no distortion.
Standardization was applied to all continuous variables to normalize scale variability, while nonordinal categorical variables were converted into dummy variables. The performance of the final models was evaluated using data from the test set, with performance metrics including the area under the receiver operating characteristic curve (AUC), predictive accuracy, sensitivity, and specificity. The receiver operating characteristic (ROC) curve was plotted to compute the AUC and Youden’s J Index [28].
To assess variable importance in the XGBoost model, SHAP values were calculated, as outlined in previous research [15,29]. In the RF model, an intrinsic feature importance metric was used for ranking features [16]. Our machine learning models were developed using Python Version 3.9.4 and the open-source library scikit-learn 1.1.0.

2.4. Statistical Analysis

Baseline characteristics for patients with and without retears are presented as the mean and standard deviation (SD) for continuous variables and counts for categorical variables. To determine differences between the two groups, continuous variables were analyzed using the t-test, and categorical variables were analyzed with Fisher’s exact test. The outcomes from the univariate and multivariate LR analyses are reported as ORs with 95% confidence intervals (CIs). A p-value of less than 0.05 was considered statistically significant. Statistical analyses were conducted using R version 4.2.1.

3. Results

3.1. Demographic Features

The final dataset included 788 patients, categorized into non-retear and retear groups. Table 1 provides a detailed distribution of demographic features. Retears were observed in 11.9% of cases (94/788). The average age was significantly higher in the retear group (p < 0.01). The tear profile of the retear group demonstrated a higher incidence of full-thickness tears and larger tear sizes in both the anterior–posterior (AP) and medial–lateral (ML) dimensions compared to the non-retear group (p < 0.05). There were significant differences in the FI proportions between the two groups, although the majority of Goutallier stages for all three muscles in both groups were 0 and I. No significant differences were observed in the muscle atrophy status between the groups. Similarly, clinical outcomes, including ROM and clinical scores, showed no significant differences.

3.2. Performance of Machine Learning Algorithms

The performance of the five machine learning algorithm models within the training and validation cohorts, based on the AUC through three-fold cross-validation, is depicted in Figure 2. The AUC for the RF model was 0.9790 (SD = 0.0051); for the LR model, it was 0.8974 (SD = 0.0235); for the SVM model, it was 0.6379 (SD = 0.0145); for the NN model, it was 0.9354 (SD = 0.0037); and for the XGBoost model, it was 0.9785 (SD = 0.0011). Details on the performance of the models on the independent test set are provided in Table 2. The XGBoost and RF algorithms were the top-performing models across both the cross-validation process and the test set evaluation.

3.3. Feature Importance in the XGBoost and Random Forest Model

Figure 3 illustrates the feature importance plots derived from the XGBoost and RF models, which were the top-performing models. The five most important features identified by XGBoost included the tear size in the ML and AP dimensions, female sex, full-thickness tears, and BMI. The RF model showed similar feature importances to XGBoost but included the VAS pain score instead of female sex. The tear size in the ML dimension was the most predictive feature for the presence of a retear in both models, with the AP dimension tear size as the next most important factor. Full-thickness tears and BMI were also among the top five most important factors in both analyses.

3.4. Univariate and Multivariate Logistic Regression Analysis

Univariate LR analysis identified age, the presence of a full-thickness tear, the tear size in both the AP and ML dimensions, the condition of the biceps tendon, the FI of the SSP, SSC, and ISP muscles, and SSP atrophy as variables significantly associated with retear after ARCR (p < 0.05). However, multivariate analysis demonstrated that only age (OR = 1.040, 95%CI [1.006–1.075], p = 0.02) and the ML tear size (OR = 1.580, 95%CI [1.052–2.392], p = 0.029) were significantly related to the occurrence of a retear. The detailed results of both univariate and multivariate analyses are provided in Table 3.

4. Discussion

We developed and validated multiple widely used machine learning algorithms to predict the occurrence of retears after ARCR. Among five machine learning models, the RF and XGBoost models were the top-performing models, surpassing the classical LR model. The size of the tear in both the AP and ML dimensions, BMI, full-thickness tears, female sex, and VAS pain scores were identified as significant in terms of feature importance for the prediction of retear occurrence. The most influential features were remarkably similar across both models. Traditional multivariate LR analysis, however, highlighted significant ORs for age and tear size in the medial-to-lateral dimension only.
The top five risk factors identified by the two highest-performing models closely align with findings from previous studies. For instance, Manop et al. [6] conducted a multivariate LR analysis on risk factors for ARCR failure and identified the AP tear size, retraction, BMI, and high work activity as significant predictors. The dimension of the tear, whether ML or AP, has been frequently cited as a primary factor in retear occurrence [2,3,6,7]. In addition, a high BMI has been extensively associated with poorer healing outcomes after ARCR, as evidenced by several studies, including meta-analyses [7,30,31]. Our findings, which highlight the tear size, full-thickness tears, BMI, female sex, and the VAS pain score as influential risk factors, are in partial agreement with previous studies [6,7]. In particular, BMI and female sex are supported by recent studies, which have similarly identified these factors as influential in rotator cuff healing, thereby reinforcing the clinical relevance of our model [32,33,34].
One novel aspect of our study was the concurrent comparison of subjective and objective factors. The results demonstrated that the feature importances of subjective factors were not included in the top five ranking except for the VAS pain score, which ranked fourth in the RF model. PROM values face challenges in being included in risk factor surveys due to reasons such as their highly subjective nature or low response rates [10]. However, PROMs should also be considered valuable measurements to evaluate prognosis because they are directly related to the patient’s subjective satisfaction. Although only the RF model highlighted VAS pain as a highly important factor, high levels of preoperative pain may contribute to poorer retear outcomes, potentially explaining the lower postoperative PROM results documented in other research [8,9,10].
Recognizing high-risk factors in clinical practice could help surgeons better identify patients who might be at increased risk. For instance, patients with larger tear sizes, patients with an elevated BMI, patients with higher preoperative pain scores, or female patients may benefit from targeted preoperative counseling regarding their heightened retear risk. These patients could also receive individualized postoperative management strategies, such as more rigorous follow-up imaging or tailored rehabilitation protocols. Thus, the integration of these insights into clinical practice could foster a more personalized approach to patient care and potentially improve clinical outcomes. Future studies should aim to develop and validate practical tools, such as individualized risk prediction models, to further enhance clinical decision-making.
Our study did not prioritize frequently suggested risk factors for rotator cuff healing, such as FI, DM, or smoking. This highlights the complexity of risk factor analysis and the impact of methodological approaches on outcomes. While FI was significant in univariate analysis, machine learning revealed a different hierarchy of importance. Real-world scenarios involve multifactorial interactions where dominant factors may overshadow others in predictive models. Therefore, our findings advocate for a multifactorial and methodologically diverse approach to understanding retear risk after ARCR. Traditional risk factor analysis in orthopedic research has relied on univariate and multivariate LR models, which may not fully capture complex relationships.
The machine learning models identify BMI and female sex as key predictors, but the multivariate logistic regression only finds age and the ML tear size to be significant. The apparent discrepancy arises because machine learning algorithms can detect nonlinear interactions and subtle effects, which may not be identified by traditional multivariate logistic regression. Thus, while age and the ML tear size were statistically significant in the logistic model, BMI and female sex may still exert clinically meaningful influences captured by the machine learning approach. Our study used various machine learning algorithms for a more complex understanding. SHAP values and RF feature importance have provided transparent insights into the decision-making processes of predictive models [11,12,17,18]. This methodological shift aligns with the growing call for advanced analytical techniques in medical research, as highlighted by Beam et al. [35] in their discussion on the potential of machine learning in healthcare. Similar to findings by Rajkomar et al. [36], our study demonstrates that machine learning algorithms can surpass conventional statistical methods in various medical fields by enhancing predictive accuracy and providing a deeper understanding of risk factor dynamics.
Our study has some limitations. First, all procedures were conducted by a single senior surgeon with over two decades of experience, using a variety of repair techniques. This may limit the generalizability of the results. Future studies could use our risk factors to assess the outcomes from multiple surgeons across different centers. Second, not all measurable features were included, such as the critical shoulder angle or acromiohumeral distance, which could have influenced our findings. Further studies should incorporate these parameters to provide a more comprehensive assessment of risk factors. Additionally, retears were assessed using MRI or ultrasound at different postoperative intervals, potentially introducing variability into the accuracy or consistency of retear diagnosis. Also, despite high cross-validation AUCs (~0.98), the drop to ~0.67 on the test set likely reflects the inherent challenge of overfitting in complex models applied to retrospective data. This limitation is common in studies with imbalanced outcomes, and our primary goal was to explore risk factor importance rather than achieve perfect predictive accuracy. Another limitation is the lack of an external validation set. Although our study relied on three-fold cross-validation—an approach commonly used in similar retrospective studies—future research should consider external validation or bootstrapping to further verify these findings. Our study focused exclusively on preoperative risk factors and did not account for postoperative rehabilitation adherence, which could influence retear rates. Finally, the retrospective nature of this review may introduce certain inaccuracies, although the medical records were carefully examined and considered reliable. While our study provides valuable insights, the use of a single-center, retrospective dataset may limit generalizability and introduce bias, a limitation inherent to many such studies.

5. Conclusions

Machine learning algorithms can predict retears with greater accuracy than conventional LR. Through machine learning models, the tear size and BMI were identified as the most influential risk factors, while the VAS pain score and female sex should also be considered risk factors for retears after ARCR.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/jcm14061843/s1. S1.1. Pairplot displaying the distribution and relationships among variables in the original dataset; S1.2 Pairplot displaying the distribution and relationships among variables in the SMOTE-balanced dataset.

Author Contributions

Conceptualization, S.-H.C.; Methodology, S.-H.C.; Software, S.-H.C.; Validation, Y.-S.K.; Formal Analysis, S.-H.C.; Investigation, S.-H.C.; Resources, Y.-S.K.; Data Curation, S.-H.C. and Y.-S.K.; Writing—Original Draft Preparation, S.-H.C.; Writing—Review and Editing, Y.-S.K.; Visualization, S.-H.C.; Supervision, Y.-S.K.; Project Administration, Y.-S.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.

Institutional Review Board Statement

This study was conducted according to the guidelines of the Declaration of Helsinki and approved by the institutional review board of the Catholic University of Korea, Seoul St. Mary’s Hospital (KC22RASI0386).

Informed Consent Statement

Patient consent was waived due to the retrospective nature of this research.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

Each author certifies that neither he or she nor any member of his or her immediate family has funding or commercial associations (consultancies, stock ownership, equity interest, patent/licensing arrangements, etc.) that might pose a conflict of interest in connection with the submitted article.

References

  1. Harada, N.; Gotoh, M.; Ishitani, E.; Kakuma, T.; Yano, Y.; Tatara, D.; Kawakami, J.; Imai, T.; Karasuyama, M.; Kudoh, Y.; et al. Combination of risk factors affecting retear after arthroscopic rotator cuff repair: A decision tree analysis. J. Shoulder Elb. Surg. 2021, 30, 9–15. [Google Scholar] [CrossRef] [PubMed]
  2. Kim, I.B.; Jung, D.W.; Suh, K.T. Prediction of the Irreparability of Rotator Cuff Tears. Arthroscopy 2018, 34, 2076–2084. [Google Scholar] [CrossRef] [PubMed]
  3. Kwon, J.; Kim, S.H.; Lee, Y.H.; Kim, T.I.; Oh, J.H. The Rotator Cuff Healing Index: A New Scoring System to Predict Rotator Cuff Healing After Surgical Repair. Am. J. Sports Med. 2019, 47, 173–180. [Google Scholar] [CrossRef]
  4. Lee, Y.S.; Jeong, J.Y.; Park, C.D.; Kang, S.G.; Yoo, J.C. Evaluation of the Risk Factors for a Rotator Cuff Retear After Repair Surgery. Am. J. Sports Med. 2017, 45, 1755–1761. [Google Scholar] [CrossRef]
  5. Lobo-Escolar, L.; Ramazzini-Castro, R.; Codina-Grano, D.; Lobo, E.; Minguell-Monyart, J.; Ardevol, J. Risk factors for symptomatic retears after arthroscopic repair of full-thickness rotator cuff tears. J. Shoulder Elb. Surg. 2021, 30, 27–33. [Google Scholar] [CrossRef]
  6. Manop, P.; Apivatgaroon, A.; Puntu, W.; Chernchujit, B. Risk Factors for Rotator Cuff Repair Failure and Reliability of the Rotator Cuff Healing Index (RoHI) in Thai Patients: Comparison of the RoHI With a Modified Scoring System. Orthop. J. Sports Med. 2023, 11, 23259671231179449. [Google Scholar] [CrossRef]
  7. Zhao, J.; Luo, M.; Pan, J.; Liang, G.; Feng, W.; Zeng, L.; Yang, W.; Liu, J. Risk factors affecting rotator cuff retear after arthroscopic repair: A meta-analysis and systematic review. J. Shoulder Elb. Surg. 2021, 30, 2660–2670. [Google Scholar] [CrossRef]
  8. O’Connor, J.P.; Holden, P.; Gagnier, J.J. Systematic review: Preoperative psychological factors and total hip arthroplasty outcomes. J. Orthop. Surg. Res. 2022, 17, 457. [Google Scholar] [CrossRef] [PubMed]
  9. Jochimsen, K.N.; Noehren, B.; Mattacola, C.G.; Di Stasi, S.; Duncan, S.T.; Jacobs, C. Preoperative Psychosocial Factors and Short-term Pain and Functional Recovery After Hip Arthroscopy for Femoroacetabular Impingement Syndrome. J. Athl. Train. 2021, 56, 1064–1071. [Google Scholar] [CrossRef]
  10. Rizvi, S.M.T.; Bishop, M.; Lam, P.H.; Murrell, G.A.C. Factors Predicting Frequency and Severity of Postoperative Pain After Arthroscopic Rotator Cuff Repair Surgery. Am. J. Sports Med. 2021, 49, 146–153. [Google Scholar] [CrossRef]
  11. Sun, J.; Sun, C.K.; Tang, Y.X.; Liu, T.C.; Lu, C.J. Application of SHAP for Explainable Machine Learning on Age-Based Subgrouping Mammography Questionnaire Data for Positive Mammography Prediction and Risk Factor Identification. Healthcare 2023, 11, 2000. [Google Scholar] [CrossRef] [PubMed]
  12. Schnabel, R.B.; Witt, H.; Walker, J.; Ludwig, M.; Geelhoed, B.; Kossack, N.; Schild, M.; Miller, R.; Kirchhof, P. Machine learning-based identification of risk-factor signatures for undiagnosed atrial fibrillation in primary prevention and post-stroke in clinical practice. Eur. Heart J. Qual. Care Clin. Outcomes 2022, 9, 16–23. [Google Scholar] [CrossRef]
  13. Lu, Y.; Ning, Y.; Li, Y.; Zhu, B.; Zhang, J.; Yang, Y.; Chen, W.; Yan, Z.; Chen, A.; Shen, B.; et al. Risk factor mining and prediction of urine protein progression in chronic kidney disease: A machine learning- based study. BMC Med. Inform. Decis. Mak. 2023, 23, 173. [Google Scholar] [CrossRef] [PubMed]
  14. Dunn, J.; Mingardi, L.; Zhuo, Y.D. Comparing interpretability and explainability for feature selection. arXiv 2021, arXiv:2105.05328. Available online: https://ui.adsabs.harvard.edu/abs/2021arXiv210505328D (accessed on 1 May 2021).
  15. Fryer, D.; Strümke, I.; Nguyen, H. Shapley values for feature selection: The good, the bad, and the axioms. IEEE Access 2021, 9, 144352–144360. [Google Scholar] [CrossRef]
  16. Kursa, M.B.; Rudnicki, W.R. The All Relevant Feature Selection Using Random Forest. arXiv 2011, arXiv:1106.5112. Available online: https://ui.adsabs.harvard.edu/abs/2011arXiv1106.5112K (accessed on 1 June 2011).
  17. Wang, M.; Sun, M.; Yu, Y.; Li, X.; Ren, Y.; Yin, D. Predictive value of machine learning algorithm of coronary artery calcium score and clinical factors for obstructive coronary artery disease in hypertensive patients. BMC Med. Inform. Decis. Mak. 2023, 23, 244. [Google Scholar] [CrossRef]
  18. Li, B.; Verma, R.; Beaton, D.; Tamim, H.; Hussain, M.A.; Hoballah, J.J.; Lee, D.S.; Wijeysundera, D.N.; de Mestral, C.; Mamdani, M.; et al. Predicting Major Adverse Cardiovascular Events Following Carotid Endarterectomy Using Machine Learning. J. Am. Heart Assoc. 2023, 12, e030508. [Google Scholar] [CrossRef]
  19. Iannotti, J.P.; Deutsch, A.; Green, A.; Rudicel, S.; Christensen, J.; Marraffino, S.; Rodeo, S. Time to failure after rotator cuff repair: A prospective imaging study. J. Bone Jt. Surg. Am. 2013, 95, 965–971. [Google Scholar] [CrossRef]
  20. Miller, B.S.; Downie, B.K.; Kohen, R.B.; Kijek, T.; Lesniak, B.; Jacobson, J.A.; Hughes, R.E.; Carpenter, J.E. When do rotator cuff repairs fail? Serial ultrasound examination after arthroscopic repair of large and massive rotator cuff tears. Am. J. Sports Med. 2011, 39, 2064–2070. [Google Scholar] [CrossRef]
  21. Kim, Y.S.; Lee, H.J.; Park, I.; Sung, G.Y.; Kim, D.J.; Kim, J.H. Arthroscopic In Situ Superior Capsular Reconstruction Using the Long Head of the Biceps Tendon. Arthrosc. Tech. 2018, 7, e97–e103. [Google Scholar] [CrossRef] [PubMed]
  22. Lafosse, L.; Jost, B.; Reiland, Y.; Audebert, S.; Toussaint, B.; Gobezie, R. Structural integrity and clinical outcomes after arthroscopic repair of isolated subscapularis tears. J. Bone Jt. Surg. Am. 2007, 89, 1184–1193. [Google Scholar] [CrossRef]
  23. Goutallier, D.; Postel, J.M.; Bernageau, J.; Lavau, L.; Voisin, M.C. Fatty muscle degeneration in cuff ruptures. Pre- and postoperative evaluation by CT scan. Clin. Orthop. Relat. Res. 1994, 304, 78–83. [Google Scholar] [CrossRef]
  24. Smid, P.; Hart, R.; Puskeiler, M. Tangent sign—a reliable predictor of risk for tendon re-rupture in rotator cuff repair. Acta Chir. Orthop. Traumatol. Cech. 2014, 81, 227–232. [Google Scholar] [CrossRef]
  25. Sugaya, H.; Maeda, K.; Matsuki, K.; Moriishi, J. Functional and structural outcome after arthroscopic full-thickness rotator cuff repair: Single-row versus dual-row fixation. Arthroscopy 2005, 21, 1307–1316. [Google Scholar] [CrossRef]
  26. Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
  27. Fernández, A.; Garcia, S.; Herrera, F.; Chawla, N.V. SMOTE for learning from imbalanced data: Progress and challenges, marking the 15-year anniversary. J. Artif. Intell. Res. 2018, 61, 863–905. [Google Scholar] [CrossRef]
  28. Ruopp, M.D.; Perkins, N.J.; Whitcomb, B.W.; Schisterman, E.F. Youden Index and optimal cut-point estimated from observations affected by a lower limit of detection. Biom. J. J. Math. Methods Biosci. 2008, 50, 419–430. [Google Scholar] [CrossRef]
  29. Marcílio, W.E.; Eler, D.M. From explanations to feature selection: Assessing SHAP values as feature selection mechanism. In Proceedings of the 2020 33rd SIBGRAPI conference on Graphics, Patterns and Images (SIBGRAPI), Porto de Galinhas, Brazil, 7–10 November 2020; IEEE: New York, NY, USA, 2020; pp. 340–347. [Google Scholar]
  30. Chung, S.W.; Oh, J.H.; Gong, H.S.; Kim, J.Y.; Kim, S.H. Factors affecting rotator cuff healing after arthroscopic repair: Osteoporosis as one of the independent risk factors. Am. J. Sports Med. 2011, 39, 2099–2107. [Google Scholar] [CrossRef]
  31. Kim, Y.K.; Jung, K.H.; Kim, J.W.; Kim, U.S.; Hwang, D.H. Factors affecting rotator cuff integrity after arthroscopic repair for medium-sized or larger cuff tears: A retrospective cohort study. J. Shoulder Elb. Surg. 2018, 27, 1012–1020. [Google Scholar] [CrossRef]
  32. Cao, Z.; Li, Q.; Li, Y.; Wu, J. The association of metabolic syndrome with rotator cuff tendinopathy: A two-sample Mendelian randomization study. Diabetol. Metab. Syndr. 2023, 15, 211. [Google Scholar] [CrossRef] [PubMed]
  33. Gumina, S.; Kim, H.; Jung, Y.; Song, H.S. Rotator cuff degeneration and healing after rotator cuff repair. Clin. Shoulder Elb. 2023, 26, 323–329. [Google Scholar] [CrossRef] [PubMed]
  34. Zhao, J.; Luo, M.; Liang, G.; Pan, J.; Han, Y.; Zeng, L.; Yang, W.; Liu, J. What Factors Are Associated with Symptomatic Rotator Cuff Tears: A Meta-analysis. Clin. Orthop. Relat. Res. 2022, 480, 96–105. [Google Scholar] [CrossRef] [PubMed]
  35. Beam, A.L.; Kohane, I.S. Big data and machine learning in health care. Jama 2018, 319, 1317–1318. [Google Scholar] [CrossRef]
  36. Rajkomar, A.; Dean, J.; Kohane, I. Machine learning in medicine. New Engl. J. Med. 2019, 380, 1347–1358. [Google Scholar] [CrossRef]
Figure 1. Flowchart of the number of included and excluded patients.
Figure 1. Flowchart of the number of included and excluded patients.
Jcm 14 01843 g001
Figure 2. Receiver operating characteristic (ROC) curve of the three-fold cross-validation results.
Figure 2. Receiver operating characteristic (ROC) curve of the three-fold cross-validation results.
Jcm 14 01843 g002
Figure 3. Feature importance derived from the XGBoost-SHAP and Random Forest (RF) models. SHAP (SHapley Additive exPlanation) values quantify the contribution of each feature to the model’s prediction by measuring how each feature affects the prediction when interacting with other features. RF importance scores reflect the significance of each feature based on how much it reduces impurity across decision trees. BMI, body mass index; DM, diabetes mellitus; AP, anterior–posterior; ML, medial–lateral; FI, fatty infiltration; abd., abduction; VAS, visual analogue scale; FTRCT, full-thickness rotator cuff tear; ASES, American Shoulder and Elbow Surgeon; CM, Constant–Murley; KS, Korean Shoulder.
Figure 3. Feature importance derived from the XGBoost-SHAP and Random Forest (RF) models. SHAP (SHapley Additive exPlanation) values quantify the contribution of each feature to the model’s prediction by measuring how each feature affects the prediction when interacting with other features. RF importance scores reflect the significance of each feature based on how much it reduces impurity across decision trees. BMI, body mass index; DM, diabetes mellitus; AP, anterior–posterior; ML, medial–lateral; FI, fatty infiltration; abd., abduction; VAS, visual analogue scale; FTRCT, full-thickness rotator cuff tear; ASES, American Shoulder and Elbow Surgeon; CM, Constant–Murley; KS, Korean Shoulder.
Jcm 14 01843 g003
Table 1. Patient population (mean ± SD or n).
Table 1. Patient population (mean ± SD or n).
Non-Retear (n = 694)Retear (n = 94)p-Value
Sex, male–female253:44164:300.424
Age, years61.7 ± 8.464.9 ± 7.40.001
BMI,24.6 ± 3.224.1 ± 2.90.115
DM, yes–no95:59917:770.270
Smoking, yes–no54:64011:830.228
Dominant side, yes–no488:20672:220.227
Acromion Type, 1:2:394:511:8916:61:170.055
Full-thickness tear, +:− 442:25284:10<0.001
AP tear size, cm1.2 ± 0.71.7 ± 0.9<0.001
ML tear size, cm 1.3 ± 0.91.9 ± 1.0<0.001
Subscapularis tear,
I:II:III:IV:V
440:133:91:21:955:26:9:3:10.454
Biceps condition,
0:1:2:3:4
341:152:91:101:936:17:17:24:00.029
Supraspinatus FI,
0:I:II:III:IV
137:373:134:37:1310:42:29:12:10.002
Subscapularis FI,
0:I:II:III:IV
452:230:12:0:047:44:2:0:10.004
Infraspinatus FI,
0:I:II:III:IV
375:248:63:7:136:46:11:0:10.015
Supraspinatus Tangent,
1:0
511:18335:590.070
Subscapularis Tangent,
1:0
676:182:921.000
Infraspinatus Tangent,
1:0
680:143:910.493
Forward flexion140.9 ± 15.3142.8 ± 12.80.189
External rotation (side)82.3 ± 13.383.4 ± 12.60.411
External rotation (abd.)84.0 ± 12.784.8 ± 10.70.473
Internal rotation9.2 ± 3.49.7 ± 3.20.157
Adduction23.3 ± 4.824.0 ± 4.40.179
VAS for pain4.2 ± 2.03.8 ± 2.40.206
ASES score57.8 ± 18.958.7 ± 22.40.710
CM score69.1 ± 16.568.8 ± 16.10.873
KS score65.9 ± 15.964.3 ± 16.10.368
BMI, body mass index; DM, diabetes mellitus; AP, anterior–posterior; ML, medial–lateral; FI, fatty infiltration; abd., abduction; VAS, visual analogue scale; ASES, American Shoulder and Elbow Surgeon; CM, Constant–Murley; KS, Korean Shoulder.
Table 2. Model performances using test set in five models.
Table 2. Model performances using test set in five models.
ModelAUC-ROCAccuracySensitivitySpecificity
RF0.6530.5760.7370.554
LR0.5700.4300.8420.374
SVM0.5970.3800.9470.302
NN0.6410.4870.8950.432
XGBoost 0.6760.5060.8420.460
Table 3. Univariate and multivariate logistic regression analysis for retears.
Table 3. Univariate and multivariate logistic regression analysis for retears.
VariablesUnivariate AnalysisMultivariate Analysis
OR (95%CI)p-ValueOR (95%CI)p-Value
Sex (M)0.817 (0.510–1.284)0.390.614 (0.330–1.099)0.11
Age1.051 (1.023–1.081)<0.0011.040 (1.006–1.075)0.02
BMI0.947 (0.879–1.016)0.1400.958 (0.883–1.035)0.289
DM1.392 (0.767–2.405)0.2541.240 (0.649–2.274)0.5
Smoking1.571 (0.753–3.017)0.1982.192 (0.886–5.231)0.081
Dominant side1.382 (0.847–2.335)0.2091.318 (0.759–2.361)0.339
Acromion Type
Type 1-[Reference] -[Reference]
Type 20.701 (0.396–1.306)0.2410.514 (0.268–1.022)0.05
Type 31.122 (0.533–2.373)0.7610.673 (0.287–1.569)0.357
FTRCT4.789 (2.559–9.985)<0.0012.106 (0.972–4.898)0.069
AP size2.345 (1.794–3.081)<0.0011.476 (0.926–2.348)0.1
ML size2.122 (1.690–2.678)<0.0011.580 (1.052–2.392)0.029
SSC Lafosse1.008 (0.790–1.260)0.9450.920 (0.700–1.185)0.532
BC1.265 (1.060–1.506)0.0091.148 (0.938–1.400)0.176
SSC FI1.787 (1.234–2.579)0.0021.258 (0.714–2.210)0.425
SSP FI1.487 (1.181–1.868)<0.0010.996 (0.645–1.544)0.985
ISP FI1.406 (1.060–1.805)0.0160.864 (0.508–1.454)0.585
SSC Tangent0.816 (0.128–2.891)0.7880.425 (0.520–2.194)0.354
SSP Tangent1.656 (1.047–2.588)0.0280.683 (0.363–1.257)0.227
ISP Tangent1.601 (0.364–5.021)0.4661.127 (0.193–5.450)0.885
Forward flexion1.010 (0.994–1.029)0.251.008 (0.974–1.046)0.673
ER (side)1.007 (0.991–1.028)0.4311.008 (0.959–1.054)0.748
ER (abduction)1.006 (0.989–1.028)0.5280.980 (0.946–1.032)0.347
IR1.048 (0.981–1.128)0.1771.022 (0.919–1.142)0.701
Adduction1.031 (0.984–1.082)0.2051.019 (0.955–1.077)0.491
VAS pain0.925 (0.830–1.028)0.1520.849 (0.668–1.080)0.181
ASES score1.002 (0.991–1.014)0.6720.986 (0.956–1.016)0.374
CM score0.999 (0.986–1.012)0.8731.032 (0.998–1.069)0.073
KS score0.994 (0.981–1.007)0.3610.972 (0.967–1.008)0.125
BMI, body mass index; DM, diabetes mellitus; AP, anterior–posterior; ML, medial–lateral; FI, fatty infiltration; abd., abduction; VAS, visual analogue scale; FTRCT, full-thickness rotator cuff tear; ASES, American Shoulder and Elbow Surgeon; CM, Constant–Murley; KS, Korean Shoulder; SSP, supraspinatus; SSC, subscapularis; ISP, infraspinatus.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cho, S.-H.; Kim, Y.-S. Predicting Major Preoperative Risk Factors for Retears After Arthroscopic Rotator Cuff Repair Using Machine Learning Algorithms. J. Clin. Med. 2025, 14, 1843. https://doi.org/10.3390/jcm14061843

AMA Style

Cho S-H, Kim Y-S. Predicting Major Preoperative Risk Factors for Retears After Arthroscopic Rotator Cuff Repair Using Machine Learning Algorithms. Journal of Clinical Medicine. 2025; 14(6):1843. https://doi.org/10.3390/jcm14061843

Chicago/Turabian Style

Cho, Sung-Hyun, and Yang-Soo Kim. 2025. "Predicting Major Preoperative Risk Factors for Retears After Arthroscopic Rotator Cuff Repair Using Machine Learning Algorithms" Journal of Clinical Medicine 14, no. 6: 1843. https://doi.org/10.3390/jcm14061843

APA Style

Cho, S.-H., & Kim, Y.-S. (2025). Predicting Major Preoperative Risk Factors for Retears After Arthroscopic Rotator Cuff Repair Using Machine Learning Algorithms. Journal of Clinical Medicine, 14(6), 1843. https://doi.org/10.3390/jcm14061843

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop