Next Article in Journal
Blood Transfusion and Survival of Children, Adolescent, and Young Adult Patients with Osteosarcoma: A Multicenter Retrospective Cohort Study
Previous Article in Journal
Updates in the Management of Richter Transformation
Previous Article in Special Issue
The Role of Radiotherapy to the Primary Site in Oropharyngeal Cancer with Limited Metastases—An Analysis of a Hospital-Based Registry
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction of Persistent Tumor Status in Nasopharyngeal Carcinoma Post-Radiotherapy-Related Treatment: A Machine Learning Approach

1
Department of Radiation Oncology, Chung Shan Medical University Hospital, Taichung 40201, Taiwan
2
School of Medicine, Chung Shan Medical University, Taichung 40201, Taiwan
3
Department of Medical Imaging, Chung Shan Medical University Hospital, Taichung 40201, Taiwan
4
Department of Nuclear Medicine, Chung Shan Medical University Hospital, Taichung 40201, Taiwan
5
Department of Otolaryngology, Chung Shan Medical University Hospital, Taichung 40201, Taiwan
6
Artificial Intelligence Center, Chung Shan Medical University Hospital, Taichung 40201, Taiwan
7
Department of Medical Informatics, Chung Shan Medical University, Taichung 40201, Taiwan
*
Author to whom correspondence should be addressed.
Cancers 2025, 17(1), 96; https://doi.org/10.3390/cancers17010096
Submission received: 29 November 2024 / Revised: 20 December 2024 / Accepted: 27 December 2024 / Published: 31 December 2024
(This article belongs to the Special Issue Advances in Radiation Therapy for Head and Neck Cancer)

Simple Summary

Radiotherapy is the primary and only curative treatment for nasopharyngeal carcinoma (NPC). The adoption of intensity-modulated radiotherapy (IMRT) alone or combined with optimized chemotherapeutic strategies has improved survival rates while reducing treatment-related toxicities. However, the persistent tumor status, including residual tumor presence and early recurrence, is the predominant cause of treatment failure, associated with poorer survival outcomes. This study extracted the radiomic features from pretreatment PET images, which were used to construct a prediction model to identify patients with NPC at high risk of having persistent tumors after treatment. The model trained by the AdaBoost algorithm showed significant diagnostic performance in predicting treatment failure. This model exhibited the feasibility of assisting clinicians in identifying high-risk patients before treatment, allowing for more personalized treatment plans and improving patient outcomes.

Abstract

Background/Objectives: The duration of the response to radiotherapy-related treatment is a critical prognostic indicator for patients with nasopharyngeal carcinoma (NPC). Persistent tumor status, including residual tumor presence and early recurrence, is associated with poorer survival outcomes. To address this, we developed a prediction model to identify patients at a high risk of persistent tumor status prior to initiating treatment. Methods: This retrospective study included 104 patients with NPC receiving radiotherapy-related treatment who had completed a 3-year follow-up period; 29 were classified into the persistent tumor status group and 75 into the disease-free group. Radiomic features were extracted from pretreatment positron emission tomography (PET) images and used to construct a prediction model by employing machine learning algorithms. The model’s diagnostic performance was assessed using the area under the receiver operating characteristic curve (AUC), whereas SHapley Additive exPlanations (SHAP) analysis was conducted to determine the contribution of individual features to the model. Results: The prediction model developed using the AdaBoost algorithm and validated through five-fold cross-validation achieved the highest AUC of 0.934. Its sensitivity, specificity, positive predictive value, negative predictive value, and accuracy were 89.66%, 86.67%, 72.22%, 95.59%, and 87.5%, respectively. SHAP analysis revealed that the feature of high dependence low metabolic uptake emphasis50 had the greatest impact on model predictions. Furthermore, patients classified as disease-free exhibited markedly higher overall survival rates compared with those with persistent tumor status. Conclusions: In conclusion, the proposed prediction model efficiently identified patients with NPC at a high risk of persistent tumor status by using radiomic features extracted from pretreatment PET images.

1. Introduction

Nasopharyngeal carcinoma (NPC) is a form of head and neck cancer originating in the epithelial tissue of the nasopharynx. Relative to other malignancies, NPC is uncommon [1,2]. According to the International Agency for Research on Cancer, an estimated 120,416 new cases of NPC were diagnosed in 2022, with these diagnoses representing only 0.6% of all cancer diagnoses globally in that year [2]. NPC exhibits a distinct geographical distribution [1,2], with the highest incidence and mortality rates reported in East Asia, followed by in Southeast Asia and South–Central Asia. A consistent male predominance in both incidence and mortality is observed across all regions worldwide.
Radiotherapy is the primary and only curative treatment for NPC [3]. The widespread adoption of intensity-modulated radiotherapy (IMRT) combined with optimized chemotherapeutic strategies, including induction, concurrent, and adjuvant strategies, has improved survival rates while reducing treatment-related toxicities [4]. However, despite these advances, 20–30% of survivors experience recurrent disease at the primary or local site, with this being a predominant cause of treatment failure [5,6]. The temporal risk of NPC recurrence sharply peaks at 24 months post-treatment, with 50–60% of recurrences occurring within this critical period [7,8,9,10]. Early recurrence is a key prognostic indicator for NPC that is strongly associated with poorer survival outcomes [4,7,8,9,10,11]. Managing recurrent NPC requires a multidisciplinary approach that incorporates personalized therapeutic strategies tailored to individual clinical profiles [12]. However, balancing the potential for salvage treatment with the risk of severe adverse effects related to reirradiation remains a considerable challenge [6]. Therefore, timely identification and intervention are crucial to improving management and prognosis for NPC. Notably, several reliable artificial intelligence applications have been developed that can assist with treatment decision-making, providing a promising avenue for enhancing outcomes in NPC.
The integration of radiomic features with machine learning has demonstrated considerable potential in predicting treatment response and survival outcomes for patients with NPC [13]. Studies have applied magnetic resonance imaging (MRI)-based radiomics before treatment to predict eligibility for adaptive radiation therapy [14], assess responses to neoadjuvants [15] or induction chemotherapy (ICT) [16], and compare responses across different chemotherapy regimens [17]. These radiomics have also been employed to investigate radioresistance through the analysis of recurrence patterns in patients treated with IMRT [18] and used in combination with machine learning to predict local and distant failure [19]. The predictive value of multiparametric MRI-based radiomics was also explored for predicting ICT response and survival outcomes [20]. Additionally, the prognostic system integrating MRI features and clinical data study was developed and validated to improve risk stratification and treatment decisions for locoregionally advanced NPC patients receiving concurrent chemoradiotherapy (CCRT) [21]. PET/computed tomography (CT)-based radiomics combined with machine learning was used to differentiate between local recurrence and inflammation in post-treatment patients [22]. Deep learning–based contrast CT and PET radiomics were reported to have capabilities for risk stratification, personalized ICT planning, and predicting disease-free survival [23]. Furthermore, a study employed positron emission tomography/computed tomography (PET/CT)-based deep learning signatures to stratify patients with recurrent NPC into high- and low-risk groups to enhance survival outcome predictions [24]. Radiomics and deep learning models [25,26,27,28] have also been applied extensively in this context to further improve prediction accuracy. The nomogram based on CT radiomics and clinical factors established the risk prediction of local recurrence in NPC patients after IMRT [29]. Research indicates that tumor response can be predicted by detecting anatomical changes in NPC during treatment [30]. Furthermore, a systematic review and meta-analysis evaluated the prognostic value of MRI-based radiomics in predicting progression-free survival in patients with NPC. Although the findings indicated robust prognostic performance, they highlighted a need for consistent and robust study protocols to strengthen reliability and clinical applicability [31].
In this study, we introduce a machine learning approach to predict persistent tumor status in NPC patients undergoing radiotherapy-related (RT-related) treatments. This approach leverages radiomic features extracted from pretreatment PET images to construct an explainable prediction model. The patient cohort is not limited to a specific stage but includes all patients receiving radiotherapy, whether as a standalone treatment or combined with various chemotherapy plans. The innovations of this study include capturing the biological characteristics of tumors through radiomic features, applying machine learning techniques to construct a model for predicting treatment response and analyzing the predictions to explore the correlation between radiomic features and treatment outcomes. Our contributions to the literature include providing a new predictive tool for clinicians to assess therapeutic effects and potentially improve personalized treatment strategies for NPC patients.
The structure of this paper is organized as follows: the Section 1 provides background information on NPC and the importance of the early prediction of treatment response. The Section 2 details the study design, patient population, patient treatment and follow-up methods, imaging processing techniques, and machine learning algorithms used to develop the prediction model. The Section 3 presents the diagnostic performance of radiomic features and the model and analyzes the impact of radiomic features on predictions. The Section 4 interprets the findings, highlights the clinical implications, and addresses the limitations of the study. Finally, the Section 5 summarizes the key contributions and suggests directions for future research.

2. Materials and Methods

The protocol of this retrospective study was approved by the Institutional Review Board of Chung Shan Medical University Hospital, and the requirement to obtain informed consent was waived (no. CS2-22194). Additionally, this article was self-assessed using the CheckList for EvaluAtion of Radiomics research (CLEAR) [32] (Table S2).

2.1. Study Design and Patient Population

The study design, involving the clinical evaluation process and the construction of a predictive model, is illustrated in Figure 1. This study retrospectively collected clinical data from all patients given a diagnosis of NPC at our hospital. The diagnostic evaluation tools included MRI, PET/CT, and Epstein–Barr virus (EBV) tests, with final diagnoses confirmed through biopsy. Patients were eligible for inclusion if they underwent RT-related treatments and pretreatment clinical staging with PET/CT and if complete follow-up data were available for at least 3 years post-treatment. Patients were classified as “disease-free” if no residual or recurrent tumor was detected during the follow-up period; otherwise, they were categorized as having “persistent tumor status”.
Pretreatment PET/CT images were annotated for each patient by using the maximum standardized uptake value (SUVmax). Two metabolic tumor volumes, metabolic tumor volume (MTV)50 and MTV60, were delineated on the basis of thresholds calculated from SUVmax * 50% and SUVmax * 60%, respectively. From these volumes, the radiomic features (RFs) RF50 and RF60 and differential radiomic features, that is, RFDiff, were derived. A predictive model was constructed using various machine learning methods to investigate the relationship between radiomic features and persistent tumor status. The performance of each predictive model was evaluated through k-fold cross-validation to validate the robustness and to aggregate the validation results across folds to identify the optimal model. This cross-validation approach provides insights into the model’s performance on unseen data and demonstrates its generalizability. Finally, the radiomic features with the most notable predictive value were identified to elucidate the decision-making process of the most effective model.

2.2. Patient Inclusion Criteria

Figure 2 presents a flowchart detailing the patient selection process. Between January 2012 and February 2022, 220 patients with NPC received RT-related treatments at our hospital. A two-stage screening process was employed to select eligible participants for the study.
In the first stage, 86 patients were excluded because of the following reasons: thirty-eight patients either did not undergo a PET/CT examination for clinical staging or had damaged PET/CT image backups. Thirty-four patients had a history of NPC, were given a diagnosis of NPC and synchronous cancer, or had previously undergone local treatment for head and neck cancer or systemic treatment for other cancers. Additionally, 12 patients underwent therapies that did not conform to the 6 standard treatment protocols defined in this study. Furthermore, two patients were lost to follow-up. At the end of this stage, 134 patients with complete diagnostic and treatment data were deemed eligible for inclusion in further screening.
In the second stage, an additional 30 patients were excluded for the following reasons: nineteen disease-free patients had follow-up periods of <3 years. Seven patients experienced over-segmentation during the delineation of MTVs. Four disease-free patients died from causes unrelated to cancer during the follow-up period. Ultimately, 104 patients who met all inclusion criteria were included in this study.

2.3. Pretreatment PET/CT for Clinical Staging

All PET/CT examinations were performed using either a GE Discovery MI PET/CT scanner (GE Healthcare, Milwaukee, WI, USA) or a Philips Gemini GXL PET/CT scanner (Philips Healthcare, Cleveland, OH, USA). Prior to the examination, the patients were instructed to fast for at least 6 h, and their blood glucose levels were examined to ensure that they were <200 mg/dL. Approximately 60 min after an intravenous injection of 175–370 MBq (5–10 mCi) of fluorodeoxyglucose (FDG), with the amount determined by the patient’s body weight, a PET/CT scan was conducted. The scan covered the area from the vertex of the skull to the mid-thigh, in accordance with established guidelines for tumor imaging with FDG PET/CT [33]. CT images were obtained without the use of oral or intravenous contrast agents. Each CT scan began with a scout view at 10 mA and 120 kVp, followed by a helical CT scan with a 0.5 s rotation time at 120 kVp and 15–180 mA (auto mA), with a 3.5 mm section thickness. Upon completion of the CT imaging, the scanner was returned to the landmark position and the FDG PET scan was acquired in three-dimensional acquisition mode at 2 min per bed position. The FDG PET/CT images were reconstructed using a standard iterative algorithm (ordered-subset expectation maximization), and the CT data were used for attenuation correction.

2.4. RT-Related Treatment

Patients received RT delivered using either a 6 or a 10 MV linear accelerator by using the IMRT or volumetric modulated arc therapy technique. The treatment regimen comprised conventional fractionated RT involving doses of 1.8–2 Gy per fraction, with five daily fractions per week. All patients were treated in a supine position. For primary tumor management, a dose of 45–50 Gy was administered, followed by a tumor boost with reduced fields. The planned total dose for the primary tumor was 70–74 Gy, whereas the dose for positive neck lymph nodes ranged from 60 to 70 Gy. In addition to RT, concurrent cisplatin chemotherapy was administered at a dose of 30 mg/m2 weekly, starting from week 1 and continuing for 6 consecutive weeks during the course of RT.

2.5. Follow-Up

Post-treatment follow-up protocols for patients with NPC included MRI scans conducted at 3-month intervals during the first 2 years post-treatment, with the follow-up intervals extending to 6 months thereafter. Plasma EBV DNA tests were conducted every 3 months, and PET/CT scans were performed annually. Further biopsies were arranged when necessary. At 1 month after completion of RT, high-risk patients with NPC were prescribed oral UFT (tegafur–uracil) for 1 year. Disease-free status was defined as the absence of positive findings from any of the four follow-up tools (i.e., MRI, PET/CT, EBV DNA test, and biopsy) during the follow-up period. Conversely, the presence of positive findings from any of these tools during follow-up was considered to indicate persistent tumor status.

2.6. Extraction of Radiomic Features and Differential Radiomic Features

Image analysis was conducted using in-house developed software, with radiomic features calculated using open-source Pyradiomics [34]. The computing environment had the following specifications: an Intel Core i9 processor, 96 GB of RAM, and an NVIDIA GeForce RTX 3090 GPU. The software used included Python 3.9.12, numpy 1.26.4, pandas 1.5.2, scikit-learn 1.1.1, scipy 1.11.4, lightgbm 4.3.0, xgboost 2.0.3, and shap 0.46.0. Each patient’s pretreatment PET/CT image was adopted for analysis. The process began with the user positioning a reference point within the NPC on the PET/CT images. This reference point was considered the center of a volume of interest (VOI), with dimensions of 7 cm in width, height, and depth. A computer algorithm was then employed to automatically detect all local maxima of SUVs within the VOI. The user then identified the local maximum within the spatial extent of the NPC, corresponding to the SUVmax.
On the basis of the SUVmax, two thresholds were defined, namely SUVmax × 0.5 and SUVmax × 0.6. A computerized region-growing method was employed, and these thresholds generated two MTVs, MTV50 and MTV60. The radiomic features were categorized into three distinct groups. For the first category, first-order features, descriptive statistics were used to characterize the distribution of SUV values within the MTV. Our data analysis led to the classification of conventional PET features commonly used in clinical practice as a separate subgroup. This distinction enables a comparison between traditional clinically validated features and radiomic features. The second category, shape features, encompassed metrics that describe the morphological characteristics of the MTV. That is, 5 conventional PET features, 16 first-order features, and 15 shape features were calculated for each MTV.
SUVs within the MTV were quantified into discrete SUV levels, with the number of discrete SUV levels being 32. A set of probability-based features was calculated based on the probability of occurrence of each discretized SUV, and these features were classified into the first category. The spatial arrangement of the discretized SUVs was described using texture features, which were assigned to the third category. Five matrices were used to depict this arrangement, namely the gray-level (GL) co-occurrence matrix (GLCM), GL dependence matrix (GLDM), GL run length matrix (GLRLM), GL size zone matrix (GLSZM), and neighboring gray-tone difference matrix (NGTDM). The definitions of these texture features were derived from GL image analysis. When applied to PET images, the discretized metabolic uptake (MU) is treated as analogous to the GL. For clarity, we used MU instead of GL in subsequent analyses. Thus, the five matrices were renamed as MUCM, MUDM, MURLM, MUSZM, and NMUDM. Specifically, 24, 14, 16, 16, and 5 features were extracted from MUCM, MUDM, MURLM, MUSZM, and NMUDM, respectively. That is, for each MTV, a total of 111 radiomic features were calculated. Finally, each differential radiomic feature was defined as the absolute value of the difference between corresponding features calculated from MTV50 and MTV60, yielding a total of 332 features. Because SUVmax is identical for both MTVs, no differential radiomic features were derived for SUVmax.

2.7. Predictive Modeling Using Machine Learning Techniques

To explore the relationship between radiomic features, differential radiomic features, and post-treatment tumor status, we employed seven machine learning methods, which were AdaBoost, LightGBM, XGBoost, extra trees, random forest, logistic regression, and decision trees [35]. The objective was to develop a prediction model capable of accurately forecasting persistent tumor status in patients with NPC on the basis of these radiomic features.
For each machine learning model, recursive feature elimination (RFE) was adopted as a feature selection method to identify the radiomic features that contributed the most to the prediction. Initially, all features were included in the training of the prediction model, and the diagnostic performance was assessed as a baseline. RFE was then employed to recursively eliminate the least important features and rebuild the model with the remaining features. This process continued iteratively until the optimal subset of features was identified. RFE enhanced the model’s performance by eliminating redundant or less informative features, thereby enhancing computational efficiency and model interpretability.
The included patients were randomly divided into five independent datasets to train and validate the models developed using each machine learning method through five-fold cross-validation. In each round of cross-validation, four datasets (80% of the data) were used to train the machine learning model, whereas the remaining dataset (20% of the data) was used for validation. This process was repeated five times, with each dataset being used once as the validation set. After all five rounds were completed, the five validation results were aggregated to assess the overall predictive performance of each machine learning method. The predictive performance of the models developed using the seven machine learning methods was compared, and the method demonstrating the best performance was selected. The primary purpose of this step was to identify the best-performing algorithm by comparing the predictive performance of the models developed using the seven machine learning methods. The method demonstrating the best performance was then selected for further analysis.
SHapley Additive exPlanations (SHAP) values were employed to evaluate the importance of radiomic features by quantifying the contribution of each feature to the final model’s predictions [36]. Positive SHAP values indicate a bias toward predicting persistent tumor status, whereas negative SHAP values indicate a bias toward predicting disease-free status. The absolute SHAP values were ranked to assess the impact of individual features on the model’s predictive performance, which led to the identification of the most influential features for distinguishing between persistent tumor status and disease-free status. Analysis of the relationship between SHAP values and feature intensities provided valuable insights into the connection between radiomic features and post-treatment tumor status and highlighted biomarkers that could be clinically relevant for predicting treatment response in patients with NPC.

2.8. Statistical Analysis

The area under the receiver operating characteristic curve (AUC) was used to evaluate the predictive performances of radiomic features and the prediction models. Persistent tumor status was defined as a positive event in this study. An AUC value of >0.5 indicated that higher feature values were associated with an increased likelihood of persistent tumor status, whereas an AUC value of <0.5 indicated that higher feature values were more likely to correspond to disease-free status. Chi-square tests for independence and independent samples t tests were employed to assess the significant differences between clinical factors and post-treatment tumor status groups with a significance threshold set at p < 0.05. Survival analysis was performed using the Kaplan–Meier method. All statistical analyses were conducted using SPSS version 29.0 (IBM Corp, Armonk, NY, USA).

3. Results

3.1. Patient Characteristics

This study included 104 patients with NPC, with the patients divided into two groups on the basis of their post-treatment tumor status; 29 patients had persistent tumor status and 75 had disease-free status. The chi-square test revealed no significant differences in age (p = 0.830) and sex (p = 0.254) between the two groups (Table 1).
Histological and clinical staging revealed significant differences between the groups. Specifically, the difference in histological types (differentiated nonkeratinizing carcinoma vs. undifferentiated carcinoma) was significant (p = 0.037). Moreover, clinical distant metastasis stages (cM0 and cM1) were significantly associated with tumor status (p = 0.005), as was overall clinical staging (p = 0.036).
Regarding treatment modalities, no significant differences were observed in their impact on post-treatment tumor status. The primary treatment modalities (RT, CCRT, and ICT + CCRT) did not significantly influence post-treatment tumor status (p = 0.055). Additionally, the inclusion of maintenance therapy (p = 0.073) and the duration of maintenance therapy (≥1 year vs. <1 year, including those without maintenance therapy) did not significantly differ between the disease-free and persistent tumor status groups (p = 0.512).

3.2. Diagnostic Performance of Radiomic Features for Post-treatment Tumor Status

The SUVmax, a conventional and widely used PET feature, had an AUC of 0.614 (±0.064), with a p value of 0.073, indicating weak diagnostic ability. Other conventional PET features, including MTV, TLGmax, and TLGmean calculated from MTV50 and MTV60, also exhibited nonsignificant diagnostic performance (p > 0.05). However, differential features of TLGmax and TLGmean demonstrated significant potential, with p values of 0.036 and 0.023, respectively (Table S1).
Within the first-order feature group, the differential features of energy, entropy, interquartile range, and total energy exhibited significant diagnostic performance (p = 0.002, 0.048, 0.036, and 0.002, respectively). In the shape feature group, differential features such as flatness, maximum 2D diameter row, and maximum 3D diameter exhibited a significant ability to distinguish post-treatment statuses (p = 0.026, 0.041, and 0.005, respectively).
Texture features derived from various matrices provided additional diagnostic insights. Significant diagnostic performance was observed for the maximal correlation coefficient in the MUCM, high dependence low MU emphasis in the MUDM, and large area low MU emphasis in the MUSZM, calculated from MTV50 and their corresponding differential features (all p < 0.05). Moreover, the differential features of MU nonuniformity normalized, low MU zone emphasis and small area low MU emphasis in the MUSZM exhibited significant diagnostic ability (p = 0.007, 0.035, and 0.004).
These findings highlight the diagnostic potential of radiomic features, particularly differential radiomic features, in effectively differentiating between disease-free and persistent tumor status.

3.3. Diagnostic Performance of AI Models in Predicting Persistent Tumor Status

Figure 3 presents the performance of the AI models in predicting persistent tumor status. The probabilities of persistent tumor status derived from the five validation sets were integrated and analyzed using receiver operating characteristic (ROC) analysis. Among the models, AdaBoost and LightGBM demonstrated superior predictive abilities, with AUC values of 0.934 and 0.896, respectively, although the difference did not reach significance (p = 0.43). However, LightGBM significantly outperformed the third-ranked model, XGBoost (AUC = 0.827, p = 0.004). Although AdaBoost and LightGBM had comparable performance, AdaBoost was selected as the final predictive model because of its higher AUC. ROC analysis was conducted to establish optimal thresholds for classifying patients into persistent tumor status and disease-free groups for each model. Table 2 presents the predictive performance of the seven models. The sensitivity, specificity, positive predicted value, negative predicted value, and accuracy of AdaBoost were 89.66%, 86.67%, 72.22%, 95.59%, and 87.5%, respectively.

3.4. Survival Analysis of Disease-Free and Persistent Tumor Status

Based on the AdaBoost model’s predictions, patients with NPC were classified into disease-free and persistent tumor status groups. As presented in Figure 4, the survival curves revealed a significant difference in overall survival between the two groups. Patients classified as disease-free demonstrated a markedly higher overall survival rate than did those with persistent tumor status. Moreover, the median overall survival for the disease-free group was significantly longer than that for the persistent tumor status group. Statistical analysis conducted using the log-rank test further confirmed that the difference in survival between the two groups was highly significant (p < 0.001).

3.5. Impact of Radiomic Features on Final Model in Predicting Persistent Tumor Status

The AdaBoost method was adopted to train the final predictive model by leveraging the radiomic features and hyperparameters identified during the cross-validation step and including the entire patient dataset. This approach ensured the model was trained using the maximum available data for enhanced interpretability. The objective of this step was not to further validate the model’s predictive performance but to gain insights into how the model interprets the data. A global interpretation of the impact of radiomic features and their mean absolute SHAP values are presented in Figure 5. Higher SHAP values indicated a more significant contribution of a feature to predicting persistent tumor status. The high dependence low MU emphasis50 exerted the greatest influence on the model’s prediction outcomes. Other notable features included surface area50, maximal correlation coefficientDiff, inverse varianceDiff, and interquartile range60. These features ranked as the top five were pivotal in determining the likelihood of a tumor being classified as persistent.
Figure 6 plots the relationship between individual radiomic features and the prediction of post-treatment tumor status. Each dot represents a particular prediction, with the coordinate indicating feature values (x-axis) and the likelihood of persistent tumor status (y-axis). For the high dependence low MU emphasis50 feature, higher values were associated with a greater likelihood of predicting persistent tumor status (Figure 6a). A positive correlation was also noted for surface area50, with increasing values corresponding to a higher probability of persistent tumor status (Figure 6b). The third and fifth significant features, maximal correlation coefficientDiff (Figure 6c), and interquartile range60 (Figure 6e) exhibited comparable patterns. Higher values for these features were also associated with an increased likelihood of persistent tumor status.
Inverse varianceDiff was the fourth most influential feature, and it exhibited complex patterns of influence on the model’s predictions (Figure 6d). Low values of inverse varianceDiff were distributed across both ends of the SHAP value spectrum, indicating that such values could increase the likelihood of predicting either persistent tumor status or disease-free status. Conversely, higher values of inverse varianceDiff were predominantly clustered around a SHAP value of approximately −0.01, indicating a slight tendency toward predicting disease-free status.

4. Discussion

This study demonstrated the potential of using radiomic features extracted from pretreatment PET images to effectively predict the treatment response in patients with NPC undergoing RT-related therapy, offering a promising avenue for personalized treatment strategies. The identified key features provided critical insights into the metabolic heterogeneity within the MTVs, shape characteristics of the MTVs, and texture differences between MTV50 and MTV60. These findings highlight the strong association between radiomic features and the biological properties of NPC, revealing their significant correlations with treatment outcomes. Moreover, the application of the AdaBoost algorithm enabled the construction of a robust predictive model by accurately capturing these complex relationships.
The interpretability of a prediction model is crucial to its clinical applicability. Therefore, SHAP analysis was conducted to identify the radiomic features that most contributed to the differentiation between persistent tumor status and disease-free status. The findings regarding the most influential feature, high dependence low MU emphasis50, indicated that regions with large, homogeneous, and low metabolic uptakes within MTV50 were strongly associated with a higher likelihood of persistent tumor status. The findings regarding the second most influential feature, surface area50, indicated that a larger surface area of MTV50 corresponded to an increased probability of persistent tumor status. The third key feature, maximal correlation coefficientDiff, which measures the complexity of texture, can be used to calculate the absolute difference between MTV50 and MTV60. A higher value for this feature was positively correlated with persistent tumor status. Interquartile range60 was the fifth most influential feature. The findings related to this feature indicated that a broader range of SUVs between the 25th and 75th percentiles within MTV60 was associated with a lower probability of being disease-free. Conversely, inverse varianceDiff, which was the fourth most influential feature, is a measure of the difference in local homogeneity between MTV50 and MTV60.
These radiomic features effectively captured the biological characteristics of the tumor, serving as indicators of poor treatment response. Within the MTV, the presence of homogeneous, low-metabolism areas may indicate regions of hypoxia or necrosis [37,38]. Hypoxic tumor cells are often more resistant to radiation and chemotherapy because of their lower proliferation rates and altered metabolic pathways, which reduce their susceptibility to treatment-induced damage [39,40]. Additionally, a wide range of SUVs, a key indicator of high metabolic heterogeneity, indicates the presence of tumor regions with varying levels of glucose uptake and metabolic activity [37]. This metabolic heterogeneity is indicative of diverse tumor cell populations exhibiting distinct biological behaviors and treatment sensitivities. Furthermore, MTV with a larger surface area reflects a more complex and irregular geometry characteristic of a highly invasive tumor. Such tumors often have more substantial interactions with surrounding tissues. This invasiveness facilitates tumor spread and metastasis, leading to poorer responses to localized treatments.
Although FDG PET is a widely used tool for pretreatment staging, the complexity of glucose metabolism in cancer cells, influenced by the tumor microenvironment, poses challenges in terms of accurately depicting the heterogeneity of the tumor microenvironment according to FDG uptake levels and identifying prognostic factors. Therefore, radiotracers must be developed that target specific prognostic factors. For instance, hypoxia is a critical prognostic factor for the failure of RT and chemotherapy. Fluoromisonidazole (FMISO) is a specialized radiotracer designed to detect hypoxia [41]. However, despite holding considerable potential, such targeted tracers are less commonly used than FDG is. This may be because performing two PET scans with these tracers may lead to increased costs and higher patient radiation exposure. To address these challenges, researchers have explored the correlation between FDG and FMISO, and they have reported a moderate relationship between these tracers [41].
Our proposed predictive model can considerably enhance the clinical application of FDG PET for evaluating the treatment response of NPC. By identifying significant radiomic features related to metabolic activity and tumor volume shape, the model highlights the critical role of biological factors such as hypoxia and metabolic heterogeneity in influencing treatment outcomes. Furthermore, the treatment of locally advanced recurrent NPC remains a challenge because reirradiation after a high-dose RT course often results in severe late toxicity, potentially negating its therapeutic benefits. Emerging evidence suggests that hyperfractionated intensity-modulated RT can significantly reduce the incidence of such complications and improve overall survival rates [42]. By enabling the stratification of patients on the basis of predicted response, the model facilitates the development of personalized treatment plans tailored to each patient’s tumor characteristics. Being able to identify high-risk patients likely to respond poorly to standard treatment can allow clinicians to implement more intensive monitoring or consider more aggressive treatment options.
Despite its promising results, this study has several limitations that must be acknowledged. First, the study used patient data sourced from a single hospital, resulting in a relatively small sample size, which may limit the generalizability of the findings. To enhance the robustness and applicability of the predictive model, larger multicenter studies involving more diverse patient groups must be conducted. Second, the predictive model was developed using radiomic features extracted solely from pretreatment PET images. Future research should consider incorporating additional imaging modalities and clinical data to enhance the model’s accuracy. Addressing these limitations in future research is critical for ensuring that the predictive model can be effectively translated into routine clinical practice.

5. Conclusions

This study demonstrated the feasibility of using a machine learning approach to predict tumor persistence status in patients with NPC undergoing RT-related therapy. The significant radiomic features used to predict treatment response may reflect the biological characteristics of the tumor, including hypoxia, the heterogeneity of FDG uptake, and MTV shape. These findings suggest that radiomic features can serve as valuable biomarkers for predicting treatment outcomes in NPC patients. The prediction model developed in this study showed promising diagnostic performance, with high sensitivity, specificity, and accuracy. However, additional studies and clinical trials are required to validate this prediction model and to further explore the underlying biological mechanisms. Future research should also consider incorporating larger multicenter datasets and additional imaging modalities to enhance the model’s robustness and generalizability. Ultimately, integrating such predictive models into clinical practice could improve personalized treatment strategies and patient outcomes in NPC.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/cancers17010096/s1, Table S1: Diagnostic performance of radiomic features; Table S2: CheckList for EvaluAtion of Radiomics.

Author Contributions

Conceptualization, H.-C.T., C.-Y.S., C.-Y.C. and W.-C.S.; data curation, Y.-H.L., P.-F.K. and X.-P.L.; formal analysis, Y.-H.L. and T.-J.S.; methodology, D.-Y.Y. and W.-C.S.; project administration, W.-C.S.; resources, W.-C.S. and P.-F.K.; writing—W.-C.S., H.-C.T., C.-Y.S. and Y.-H.L.; writing—review and editing, W.-C.S. and Y.-H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

This study was approved by the Institutional Review Board of Chung Shan Medical University Hospital (date: 13 February 2023/no.CS2-22194).

Informed Consent Statement

Patient consent was waived due to the retrospective nature of the study, and the analysis used anonymous clinical data.

Data Availability Statement

The datasets generated during and/or analyzed in this study are available upon reasonable request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Zhang, Y.; Rumgay, H.; Li, M.; Cao, S.; Chen, W. Nasopharyngeal Cancer Incidence and Mortality in 185 Countries in 2020 and the Projected Burden in 2040: Population-Based Global Epidemiological Profiling. JMIR Public Health Surveill 2023, 9, e49968. [Google Scholar] [CrossRef]
  2. Bray, F.; Laversanne, M.; Sung, H.; Ferlay, J.; Siegel, R.L.; Soerjomataram, I.; Jemal, A. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2024, 74, 229–263. [Google Scholar] [CrossRef]
  3. Li, Y.; Huang, Z.; Zeng, X.; Pan, Y.; Wu, L.; Wang, J.; Chen, R.; Xie, Y.; Lai, J.; Lin, D.; et al. Early recurrence as a pivotal event in nasopharyngeal carcinoma: Identifying predictors and key molecular signals for survivors. Head Face Med. 2024, 20, 55. [Google Scholar] [CrossRef] [PubMed]
  4. Chen, Y.P.; Chan, A.T.C.; Le, Q.T.; Blanchard, P.; Sun, Y.; Ma, J. Nasopharyngeal carcinoma. Lancet 2019, 394, 64–80. [Google Scholar] [CrossRef]
  5. Bossi, P.; Chan, A.T.; Licitra, L.; Trama, A.; Orlandi, E.; Hui, E.P.; Halámková, J.; Mattheis, S.; Baujat, B.; Hardillo, J.; et al. Nasopharyngeal carcinoma: ESMO-EURACAN Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann. Oncol. 2021, 32, 452–465. [Google Scholar] [CrossRef] [PubMed]
  6. Juarez-Vignon Whaley, J.J.; Afkhami, M.; Onyshchenko, M.; Massarelli, E.; Sampath, S.; Amini, A.; Bell, D.; Villaflor, V.M. Recurrent/Metastatic Nasopharyngeal Carcinoma Treatment from Present to Future: Where Are We and Where Are We Heading? Curr. Treat Options. Oncol. 2023, 24, 1138–1166. [Google Scholar] [CrossRef] [PubMed]
  7. Chen, Y.H.; Luo, S.D.; Wu, S.C.; Wu, C.N.; Chiu, T.J.; Wang, Y.M.; Yang, Y.H.; Chen, W.C. Clinical Characteristics and Predictive Outcomes of Recurrent Nasopharyngeal Carcinoma-A Lingering Pitfall of the Long Latency. Cancers 2022, 14, 3795. [Google Scholar] [CrossRef] [PubMed]
  8. Li, F.; Chen, F.P.; Chen, Y.P.; Chen, Y.; He, X.J.; Huang, X.D.; Zheng, Z.Q.; Zheng, W.H.; Liu, X.; Sun, Y.; et al. Clinical Characteristics and Prognostic Factors of Early and Late Recurrence After Definitive Radiotherapy for Nasopharyngeal Carcinoma. Front. Oncol. 2020, 10, 1469. [Google Scholar] [CrossRef] [PubMed]
  9. Liu, X.; Tang, L.L.; Du, X.J.; Li, W.F.; Chen, L.; Zhou, G.Q.; Guo, R.; Liu, Q.; Sun, Y.; Ma, J. Changes in Disease Failure Risk of Nasopharyngeal Carcinoma over Time: Analysis of 749 Patients with Long-Term Follow-Up. J. Cancer 2017, 8, 455–459. [Google Scholar] [CrossRef]
  10. Siak, P.Y.; Heng, W.S.; Teoh, S.S.H.; Lwin, Y.Y.; Cheah, S.C. Precision medicine in nasopharyngeal carcinoma: Comprehensive review of past, present, and future prospect. J. Transl. Med. 2023, 21, 786. [Google Scholar] [CrossRef]
  11. Chee, J.; Ting, Y.; Ong, Y.K.; Chao, S.S.; Loh, K.S.; Lim, C.M. Relapse status as a prognostic factor in patients receiving salvage surgery for recurrent or residual nasopharyngeal cancer after definitive treatment. Head Neck 2016, 38, 1393–1400. [Google Scholar] [CrossRef] [PubMed]
  12. Zhang, L.L.; Zheng, W.H.; Zhu, W.J.; Deng, Q.L.; Peng, J.L.; Li, Y.Y.; Sun, Y.; Lin, L. Prognostic models for early and late tumor progression prediction in nasopharyngeal carcinoma: An analysis of 8292 endemic cases. Cancer Med. 2023, 12, 5384–5396. [Google Scholar] [CrossRef]
  13. Suryani, L.; Lee, H.P.Y.; Teo, W.K.; Chin, Z.K.; Loh, K.S.; Tay, J.K. Precision Medicine for Nasopharyngeal Cancer—A Review of Current Prognostic Strategies. Cancers 2024, 16, 918. [Google Scholar] [CrossRef]
  14. Yu, T.T.; Lam, S.K.; To, L.H.; Tse, K.Y.; Cheng, N.Y.; Fan, Y.N.; Lo, C.L.; Or, K.W.; Chan, M.L.; Hui, K.C.; et al. Pretreatment Prediction of Adaptive Radiation Therapy Eligibility Using MRI-Based Radiomics for Advanced Nasopharyngeal Carcinoma Patients. Front. Oncol. 2019, 9, 1050. [Google Scholar] [CrossRef] [PubMed]
  15. Yongfeng, P.; Chuner, J.; Lei, W.; Fengqin, Y.; Zhimin, Y.; Zhenfu, F.; Haitao, J.; Yangming, J.; Fangzheng, W. The Usefulness of Pretreatment MR-Based Radiomics on Early Response of Neoadjuvant Chemotherapy in Patients with Locally Advanced Nasopharyngeal Carcinoma. Oncol. Res. 2021, 28, 605–613. [Google Scholar] [CrossRef]
  16. Wang, G.; He, L.; Yuan, C.; Huang, Y.; Liu, Z.; Liang, C. Pretreatment MR imaging radiomics signatures for response prediction to induction chemotherapy in patients with nasopharyngeal carcinoma. Eur. J. Radiol. 2018, 98, 100–106. [Google Scholar] [CrossRef]
  17. Zhang, L.; Ye, Z.; Ruan, L.; Jiang, M. Pretreatment MRI-Derived Radiomics May Evaluate the Response of Different Induction Chemotherapy Regimens in Locally advanced Nasopharyngeal Carcinoma. Acad. Radiol. 2020, 27, 1655–1664. [Google Scholar] [CrossRef]
  18. Li, S.; Wang, K.; Hou, Z.; Yang, J.; Ren, W.; Gao, S.; Meng, F.; Wu, P.; Liu, B.; Liu, J.; et al. Use of Radiomics Combined with Machine Learning Method in the Recurrence Patterns After Intensity-Modulated Radiotherapy for Nasopharyngeal Carcinoma: A Preliminary Study. Front. Oncol. 2018, 8, 648. [Google Scholar] [CrossRef] [PubMed]
  19. Zhang, B.; He, X.; Ouyang, F.; Gu, D.; Dong, Y.; Zhang, L.; Mo, X.; Huang, W.; Tian, J.; Zhang, S. Radiomic machine-learning classifiers for prognostic biomarkers of advanced nasopharyngeal carcinoma. Cancer Lett. 2017, 403, 21–27. [Google Scholar] [CrossRef]
  20. Zhang, B.; Tian, J.; Dong, D.; Gu, D.; Dong, Y.; Zhang, L.; Lian, Z.; Liu, J.; Luo, X.; Pei, S.; et al. Radiomics Features of Multiparametric MRI as Novel Prognostic Factors in Advanced Nasopharyngeal Carcinoma. Clin. Cancer Res. 2017, 23, 4259–4269. [Google Scholar] [CrossRef] [PubMed]
  21. Qiang, M.; Li, C.; Sun, Y.; Sun, Y.; Ke, L.; Xie, C.; Zhang, T.; Zou, Y.; Qiu, W.; Gao, M.; et al. A Prognostic Predictive System Based on Deep Learning for Locoregionally Advanced Nasopharyngeal Carcinoma. J. Natl. Cancer Inst. 2021, 113, 606–615. [Google Scholar] [CrossRef] [PubMed]
  22. Du, D.; Feng, H.; Lv, W.; Ashrafinia, S.; Yuan, Q.; Wang, Q.; Yang, W.; Feng, Q.; Chen, W.; Rahmim, A.; et al. Machine Learning Methods for Optimal Radiomics-Based Differentiation Between Recurrence and Inflammation: Application to Nasopharyngeal Carcinoma Post-therapy PET/CT Images. Mol. Imaging Biol. 2020, 22, 730–738. [Google Scholar] [CrossRef] [PubMed]
  23. Peng, H.; Dong, D.; Fang, M.J.; Li, L.; Tang, L.L.; Chen, L.; Li, W.F.; Mao, Y.P.; Fan, W.; Liu, L.Z.; et al. Prognostic Value of Deep Learning PET/CT-Based Radiomics: Potential Role for Future Individual Induction Chemotherapy in Advanced Nasopharyngeal Carcinoma. Clin. Cancer Res. 2019, 25, 4271–4279. [Google Scholar] [CrossRef] [PubMed]
  24. Zhao, X.; Liang, Y.J.; Zhang, X.; Wen, D.X.; Fan, W.; Tang, L.Q.; Dong, D.; Tian, J.; Mai, H.Q. Deep learning signatures reveal multiscale intratumor heterogeneity associated with biological functions and survival in recurrent nasopharyngeal carcinoma. Eur. J. Nucl. Med. Mol. Imaging 2022, 49, 2972–2982. [Google Scholar] [CrossRef] [PubMed]
  25. Gu, B.; Meng, M.; Xu, M.; Feng, D.D.; Bi, L.; Kim, J.; Song, S. Multi-task deep learning-based radiomic nomogram for prognostic prediction in locoregionally advanced nasopharyngeal carcinoma. Eur. J. Nucl. Med. Mol. Imaging 2023, 50, 3996–4009. [Google Scholar] [CrossRef]
  26. Jing, B.; Zhang, T.; Wang, Z.; Jin, Y.; Liu, K.; Qiu, W.; Ke, L.; Sun, Y.; He, C.; Hou, D.; et al. A deep survival analysis method based on ranking. Artif. Intell. Med. 2019, 98, 1–9. [Google Scholar] [CrossRef] [PubMed]
  27. Zhao, L.; Gong, J.; Xi, Y.; Xu, M.; Li, C.; Kang, X.; Yin, Y.; Qin, W.; Yin, H.; Shi, M. MRI-based radiomics nomogram may predict the response to induction chemotherapy and survival in locally advanced nasopharyngeal carcinoma. Eur. Radiol. 2020, 30, 537–546. [Google Scholar] [CrossRef] [PubMed]
  28. Zhong, L.Z.; Fang, X.L.; Dong, D.; Peng, H.; Fang, M.J.; Huang, C.L.; He, B.X.; Lin, L.; Ma, J.; Tang, L.L.; et al. A deep learning MR-based radiomic nomogram may predict survival for nasopharyngeal carcinoma patients with stage T3N1M0. Radiother. Oncol. 2020, 151, 1–9. [Google Scholar] [CrossRef]
  29. Zhu, C.; Huang, H.; Liu, X.; Chen, H.; Jiang, H.; Liao, C.; Pang, Q.; Dang, J.; Liu, P.; Lu, H. A Clinical-Radiomics Nomogram Based on Computed Tomography for Predicting Risk of Local Recurrence After Radiotherapy in Nasopharyngeal Carcinoma. Front. Oncol. 2021, 11, 637687. [Google Scholar] [CrossRef]
  30. Yang, B.; Liu, Y.; Wei, R.; Men, K.; Dai, J. Deep learning method for predicting weekly anatomical changes in patients with nasopharyngeal carcinoma during radiotherapy. Med. Phys. 2024, 51, 7998–8009. [Google Scholar] [CrossRef] [PubMed]
  31. Lee, S.; Choi, Y.; Seo, M.K.; Jang, J.; Shin, N.Y.; Ahn, K.J.; Kim, B.S. Magnetic Resonance Imaging-Based Radiomics for the Prediction of Progression-Free Survival in Patients with Nasopharyngeal Carcinoma: A Systematic Review and Meta-Analysis. Cancers 2022, 14, 653. [Google Scholar] [CrossRef] [PubMed]
  32. Kocak, B.; Baessler, B.; Bakas, S.; Cuocolo, R.; Fedorov, A.; Maier-Hein, L.; Mercaldo, N.; Müller, H.; Orlhac, F.; Pinto dos Santos, D.; et al. CheckList for EvaluAtion of Radiomics research (CLEAR): A step-by-step reporting guideline for authors and reviewers endorsed by ESR and EuSoMII. Insights Into Imaging 2023, 14, 75. [Google Scholar] [CrossRef] [PubMed]
  33. Boellaard, R.; Delgado-Bolton, R.; Oyen, W.J.; Giammarile, F.; Tatsch, K.; Eschner, W.; Verzijlbergen, F.J.; Barrington, S.F.; Pike, L.C.; Weber, W.A.; et al. FDG PET/CT: EANM procedure guidelines for tumour imaging: Version 2.0. Eur. J. Nucl. Med. Mol. Imaging 2015, 42, 328–354. [Google Scholar] [CrossRef] [PubMed]
  34. van Griethuysen, J.J.M.; Fedorov, A.; Parmar, C.; Hosny, A.; Aucoin, N.; Narayan, V.; Beets-Tan, R.G.H.; Fillion-Robin, J.C.; Pieper, S.; Aerts, H. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 2017, 77, e104–e107. [Google Scholar] [CrossRef]
  35. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  36. Lundberg, S.M.; Erion, G.; Chen, H.; DeGrave, A.; Prutkin, J.M.; Nair, B.; Katz, R.; Himmelfarb, J.; Bansal, N.; Lee, S.I. From Local Explanations to Global Understanding with Explainable AI for Trees. Nat. Mach. Intell. 2020, 2, 56–67. [Google Scholar] [CrossRef] [PubMed]
  37. Peppicelli, S.; Andreucci, E.; Ruzzolini, J.; Bianchini, F.; Calorini, L. FDG uptake in cancer: A continuing debate. Theranostics 2020, 10, 2944–2948. [Google Scholar] [CrossRef]
  38. Vander Heiden, M.G.; Cantley, L.C.; Thompson, C.B. Understanding the Warburg effect: The metabolic requirements of cell proliferation. Science 2009, 324, 1029–1033. [Google Scholar] [CrossRef]
  39. Gray, L.H.; Conger, A.D.; Ebert, M.; Hornsey, S.; Scott, O.C. The concentration of oxygen dissolved in tissues at the time of irradiation as a factor in radiotherapy. Br. J. Radiol. 1953, 26, 638–648. [Google Scholar] [CrossRef] [PubMed]
  40. Nordsmark, M.; Bentzen, S.M.; Rudat, V.; Brizel, D.; Lartigau, E.; Stadler, P.; Becker, A.; Adam, M.; Molls, M.; Dunst, J.; et al. Prognostic value of tumor oxygenation in 397 head and neck tumors after primary radiation therapy. An international multi-center study. Radiother. Oncol. 2005, 77, 18–24. [Google Scholar] [CrossRef] [PubMed]
  41. Nishikawa, Y.; Yasuda, K.; Okamoto, S.; Ito, Y.M.; Onimaru, R.; Shiga, T.; Tsuchiya, K.; Watanabe, S.; Takeuchi, W.; Kuge, Y.; et al. Local relapse of nasopharyngeal cancer and Voxel-based analysis of FMISO uptake using PET with semiconductor detectors. Radiat. Oncol. 2017, 12, 148. [Google Scholar] [CrossRef]
  42. You, R.; Liu, Y.P.; Xie, Y.L.; Lin, C.; Duan, C.Y.; Chen, D.P.; Pan, Y.; Qi, B.; Zou, X.; Guo, L.; et al. Hyperfractionation compared with standard fractionation in intensity-modulated radiotherapy for patients with locally advanced recurrent nasopharyngeal carcinoma: A multicentre, randomised, open-label, phase 3 trial. Lancet 2023, 401, 917–927. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Overview of the study design illustrating the development of a machine learning model for predicting persistent tumor status in patients with NPC undergoing radiotherapy-related treatments.
Figure 1. Overview of the study design illustrating the development of a machine learning model for predicting persistent tumor status in patients with NPC undergoing radiotherapy-related treatments.
Cancers 17 00096 g001
Figure 2. Flowchart of patient selection.
Figure 2. Flowchart of patient selection.
Cancers 17 00096 g002
Figure 3. Diagnostic performance of AI models.
Figure 3. Diagnostic performance of AI models.
Cancers 17 00096 g003
Figure 4. Overall survival curve.
Figure 4. Overall survival curve.
Cancers 17 00096 g004
Figure 5. Mean SHAP value analysis of radiomic features.
Figure 5. Mean SHAP value analysis of radiomic features.
Cancers 17 00096 g005
Figure 6. SHAP dependence plot for the top 5 radiomic features contributing to the AdaBoost model. (a) High dependence low metabolic uptake level emphasis50, (b) surface area50, (c) maximal correlation coefficientDiff, (d) inverse varianceDiff, and (e) interquartile range60.
Figure 6. SHAP dependence plot for the top 5 radiomic features contributing to the AdaBoost model. (a) High dependence low metabolic uptake level emphasis50, (b) surface area50, (c) maximal correlation coefficientDiff, (d) inverse varianceDiff, and (e) interquartile range60.
Cancers 17 00096 g006
Table 1. Patient characteristics (n = 104).
Table 1. Patient characteristics (n = 104).
Clinical FactorPost-Treatment Tumor Statusp-Value
Disease-Free (n = 75)Persistent Tumor Status (n = 29)
Age (mean ± SD)50.7 ± 11.451.2 ± 11.10.830
Sex 0.254
Male51 (68.0%)23 (79.3%)
Female24 (32.0%)6 (20.7%)
Histological type 0.037
Differentiated carcinoma5 (6.7%)6 (20.7%)
Undifferentiated carcinoma70 (93.3%)23 (79.3%)
Clinical distant metastasis stage 0.005
cM071 (94.7%)22 (75.9%)
cM14 (5.3%)7 (24.1%)
Clinical stage 0.036
I11 (14.7%)1 (3.4%)
II19 (25.3%)2 (6.9%)
III25 (33.3%)14 (48.3%)
IV20 (26.7%)12 (41.4%)
Treatment modalities 0.055
RT12 (16.0%)1 (3.5%)
CCRT27 (36.0%)7 (24.1%)
ICT + CCRT36 (48.0%)21 (72.4%)
RT-related treatment with MT 0.073
RT alone12 (16.0%)1 (3.5%)
CCRT9 (12.0%)0 (0.0%)
CCRT + MT18 (24.0%)7 (24.1%)
ICT + CCRT7 (9.3%)5 (17.2%)
ICT + CCRT + MT29 (38.7%)16 (55.2%)
Maintenance therapy (MT) 0.512
≥1 years39 (52.0%)13 (48.3%)
<1 years or without MT36 (48.0%)16 (51.7%)
Table 2. Diagnostic performance of constructed predictive models.
Table 2. Diagnostic performance of constructed predictive models.
Predictive ModelDiagnostic Performance Indices
Sensitivity (95% CI)Specificity (95% CI)PPV (95% CI)NPV (95% CI)Accuracy (95% CI)
adaBoost89.66% (78.57–100)86.67% (78.97–94.36)72.22% (57.59–86.85)95.59% (90.71–100)87.5% (81.14–93.86)
lightGBM82.76% (69.01–96.51)84% (75.7–92.3)66.67% (51.27–82.07)92.65% (86.44–98.85)83.65% (76.55–90.76)
XGBoost100% (100–100)53.33% (42.04–64.62)45.31% (33.12–57.51)100% (100–100)66.35% (57.26–75.43)
extraTrees93.1% (83.88–100)57.33% (46.14–68.53)45.76% (33.05–58.48)95.56% (89.53–100)67.31% (58.29–76.32)
randomForest82.76% (69.01–96.51)62.67% (51.72–73.61)46.15% (32.6–59.7)90.38% (82.37–98.4)68.27% (59.32–77.21)
Logistic Regression82.76% (69.01–96.51)58.67% (47.52–69.81)43.64% (30.53–56.74)89.8% (81.32–98.27)65.38% (56.24–74.53)
Decision Tree48.28% (30.09–66.46)84% (75.7–92.3)53.85% (34.68–73.01)80.77% (72.02–89.52)74.04% (65.61–82.46)
Abbreviations: PPV, positive predictive value; NPV, negative predictive value.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tseng, H.-C.; Shen, C.-Y.; Kao, P.-F.; Chuang, C.-Y.; Yan, D.-Y.; Liao, Y.-H.; Lu, X.-P.; Sheu, T.-J.; Shen, W.-C. Prediction of Persistent Tumor Status in Nasopharyngeal Carcinoma Post-Radiotherapy-Related Treatment: A Machine Learning Approach. Cancers 2025, 17, 96. https://doi.org/10.3390/cancers17010096

AMA Style

Tseng H-C, Shen C-Y, Kao P-F, Chuang C-Y, Yan D-Y, Liao Y-H, Lu X-P, Sheu T-J, Shen W-C. Prediction of Persistent Tumor Status in Nasopharyngeal Carcinoma Post-Radiotherapy-Related Treatment: A Machine Learning Approach. Cancers. 2025; 17(1):96. https://doi.org/10.3390/cancers17010096

Chicago/Turabian Style

Tseng, Hsien-Chun, Chao-Yu Shen, Pan-Fu Kao, Chun-Yi Chuang, Da-Yi Yan, Yi-Han Liao, Xuan-Ping Lu, Ting-Jung Sheu, and Wei-Chih Shen. 2025. "Prediction of Persistent Tumor Status in Nasopharyngeal Carcinoma Post-Radiotherapy-Related Treatment: A Machine Learning Approach" Cancers 17, no. 1: 96. https://doi.org/10.3390/cancers17010096

APA Style

Tseng, H.-C., Shen, C.-Y., Kao, P.-F., Chuang, C.-Y., Yan, D.-Y., Liao, Y.-H., Lu, X.-P., Sheu, T.-J., & Shen, W.-C. (2025). Prediction of Persistent Tumor Status in Nasopharyngeal Carcinoma Post-Radiotherapy-Related Treatment: A Machine Learning Approach. Cancers, 17(1), 96. https://doi.org/10.3390/cancers17010096

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop