Next Article in Journal
A Comparison of Surgeon Estimated Times and Actual Operative Times in Pediatric Dental Rehabilitation under General Anesthesia. A Retrospective Study
Previous Article in Journal
Exercise and Weight Management: The Role of Leptin—A Systematic Review and Update of Clinical Data from 2000–2022
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Risk Stratification for Herpes Simplex Virus Pneumonia Using Elastic Net Penalized Cox Proportional Hazard Algorithm with Enhanced Explainability

1
Department of Medicine, Brigham and Women’s Hospital, Boston, MA 02115, USA
2
Department of Medicine, Harvard Medical School, Boston, MA 02115, USA
3
Syu Kang Sport Clinic, Taipei 112053, Taiwan
4
Department of Computer Science, National Yang Ming Chiao Tung University, Hsinchu 30010, Taiwan
5
Computational Health Informatics Program, Boston Children’s Hospital, Boston, MA 02115, USA
6
Department of Medicine, Kaiser Permanente Santa Clara Medical Center, Santa Clara, CA 95051, USA
7
Cardiovascular Institute, AdventHealth Orlando, Orlando, FL 32803, USA
8
Department of Thoracic Medicine, Chang Gung Memorial Hospital, Taipei 333, Taiwan
9
Department of Laboratory Medicine, Linkou Chang Gung Memorial Hospital, Taipei 333, Taiwan
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
J. Clin. Med. 2023, 12(13), 4489; https://doi.org/10.3390/jcm12134489
Submission received: 4 April 2023 / Revised: 5 June 2023 / Accepted: 13 June 2023 / Published: 5 July 2023
(This article belongs to the Section Pulmonology)

Abstract

:
Herpes simplex virus (HSV) pneumonia is a serious and often fatal respiratory tract infection that occurs in immunocompromised individuals. The early detection of accurate risk stratification is essential in identifying patients who are at high risk of mortality and may benefit from more aggressive treatment. In this study, we developed and validated a risk stratification model for HSV bronchopneumonia using an elastic net penalized Cox proportional hazard algorithm. We analyzed data from a cohort of 104 critically ill patients with HSV bronchopneumonia identified in Chang Gung Memorial Hospital, Linkou, Taiwan: one of the largest tertiary medical centers in the world. A total of 109 predictors, both clinical and laboratory, were identified in this process to develop a risk stratification model that could accurately predict mortality in patients with HSV bronchopneumonia. This model was able to differentiate the risk of death and predict mortality in patients with HSV bronchopneumonia compared to the APACHE II score in the early stage of ICU admissions. Both hazard ratio coefficient and selection frequency were used as the metrics to enhance the explainability of the informative predictors. Our findings suggest that the elastic net penalized Cox proportional hazard algorithm is a promising tool for risk stratification in patients with HSV bronchopneumonia and could be useful in identifying those at high risk of mortality.

1. Introduction

HSV is a double-stranded DNA virus that is capable of infecting a variety of host tissues, including the central nervous system, respiratory tract systems, and the mucocutaneous system [1]. Transmission typically occurs through direct contact between oral-to-oral, oral-to-genital, genital-to-genital, or infected oral secretions [2]. Herpes infections are very common worldwide. In the United States, approximately 50–80% of adults have contracted the Herpes simplex virus (HSV) infection at least once during their lifetime. HSV infection remains largely asymptomatic, and only 20–25% of the infection develop clinical symptoms. The most common clinical manifestation of primary HSV infection is usually fever, malaise, headaches, sore throat, multiple herpetic gingivostomatitis, and localized lymphadenopathy [3,4]. HSV is a rather versatile organism that could infect immunocompetent or immunocompromised hosts. The infection of the upper respiratory tract is common and often self-limited, whereas the infection of the lower respiratory tract is more serious, causing bronchopneumonia mainly in immunocompromised or critically ill patients [4].
Due to difficulties in isolating HSV from the lower respiratory tract culture, the first characterization in the literature regarding HSV bronchopneumonia was guided solely by disease manifestations, with evidence of pulmonary parenchymal invasion and an unexplained clinical rapid decline [5]. In the present day, the universally accepted diagnostic testing of HSV bronchopneumonia is the polymerase chain reaction (PCR) test, which is performed using broncho-alveolar lavage (BAL) specimens from the lower respiratory tract [6]. Unfortunately, HSV bronchopneumonia is associated with a higher incidence of adult respiratory distress syndrome (ARDS) by up to 55% in comparison to 30% of all other organisms. Because of its extensive damage to the respiratory tract system, prolonged ventilator support use and increased mortality rate (approximately 45–60%) are often reported [5,7]. However, there is currently no evidence suggesting effective treatment for HSV bronchopneumonia. The benefit of the utilization of antiviral therapy, acyclovir remains unclear—some studies have shown benefits while others report no significant clinical improvement [5,7,8,9].
Given the fact that there is no reliable treatment for HSV bronchopneumonia, efforts have been made to identify its risk factors and correlations with mortality rates. Previous studies have identified the risk factors associated with increased mortality in patients diagnosed with HSV bronchopneumonia, including age, acute respiratory distress syndrome (ARDS), APACHE II score, and other comorbidities [10,11,12,13,14]. However, no previous studies have been able to provide the estimated contribution of each risk factor to the mortality rate of HSV bronchopneumonia. Additionally, most of these studies have used the univariable analysis method for its analysis, as it is unable to reflect the real clinical scenario, which usually involves multiple different factors in play. Therefore, the need to develop a more accurate, precise, and comprehensive method for HSV bronchopneumonia risk stratification is imperative and essential. Moreover, this method is vital in guiding clinical management due to its ability to identify patients at high risk, initiate proper or more aggressive treatment early on, and reduce mortality in these patients. Given the rapidly advancing machine learning algorithm, it has the power to analyze multivariable data in many medical fields, including obesity prediction [15], survival prediction for head-and-neck cancer patients [16], daily function prediction for brain stroke patients [17], and antibiotic resistance prediction [18], and is an excellent tool to use when investigating implicit patterns behind data. Thus, this study used machine learning and artificial intelligence to perform a comprehensive cross-group comparison and provide an accurate estimation of the mortality rate of HSV bronchopneumonia using the identified risk factors.

2. Materials and Methods

2.1. Study Design

We aimed to develop a predictive model that could provide risk stratification for patients with HSV bronchopneumonia. To be a clinically useful tool in a real-world setting, this model should avoid various biases in the training process. We used Elastic net penalized Cox (ENP Cox) proportional hazard regression with an internal feature penalty function to select informative features in the model training process. Additionally, we employed repeated nested cross-validation to evaluate the model’s robustness. To provide a clear explainability of the model, we estimated feature importance using both hazard ratio (HR) coefficients and selection frequency. In summary, we developed and validated an unbiased and understandable predictive model that could estimate the prognosis of HSV bronchopneumonia patients. The schematic illustration of this study design can be found in Figure 1.

2.2. Patients Eligibility

In this study, 119 adult patients (>18 years old) who had HSV detected from lower airway specimens were included (from January 2015 to January 2019). Demographic and clinical information, such as age, gender, smoking history, underlying diseases, laboratory test results, and history of steroid or immunosuppressive drug use, were recorded. Positive PCR or culture data were used to define HSV detection. The APACHE II score was used to assess the severity of patients. ARDS was defined based on the Berlin definition [19]. These HSV bronchopneumonia patients were divided into two groups: an alive group and a non-survival group based on the survival status in ICU admissions. The retrospective study was approved by the Institution Review Board (IRB) of CGMH (No. 202001331B0), and informed consent was waived on the basis of the retrospective study design. More detailed definitions for the information that was collected in this study can be found in a previously published study [20]. Fifteen patients were excluded due to missing data.

2.3. Model Development and Evaluation

The ENP Cox proportional hazard algorithm was used for training and validating a prognosis predictive model for HSV patients. The elastic-net mixing parameter (i.e., alpha) was set at 0.2. The prognosis-predictive model was trained and validated on the basis of the comprehensive features, including the clinical features and the lab tests (Table 1). The repeated nested cross-validation (NCV) method (five-fold) five times was used to robustly evaluate the models. The regulation parameter (i.e., lambda) was tuned in the inner layer of NCV. The performance of the models was evaluated in the outer layer of NCV. Given the settings of a five-fold CV repeated five times, there were 25 measurements in total. The survival index of the HSV patients was calculated in each iteration, and the median survival index was used as the cut-off to categorize the patients into high-risk and low-risk groups. The models were trained and tested using R software (version 3.5.2) with the glmnet package [16]. To evaluate the discriminative abilities of the ML model and APACHE II, the Harrell concordance index (C index) was used as the metric [21].

2.4. Features Importance

The importance of these features was estimated by two metrics. The first metric was the HR coefficients of the elastic net-Cox proportional hazard models. The second metric was the frequency of being selected in the models. Based on the design of model development and evaluation, 25 models were generated. The number of times they were selected for the features was used as the other metric to evaluate the feature’s importance.

2.5. Statistical Analysis

The data were presented as the mean ± standard deviation. Statistical significance between these two groups of continuous variables and categorical variables was determined using the t-test and Fisher’s exact test, respectively. The survival distributions of high-risk and low-risk groups were compared using a log-rank test. The threshold of the p-value was set as 0.5, where a p-value less than 0.05 was considered statistically significant. All data were analyzed using R, version 3.5.2 (R foundation for statistical computing).

3. Results

3.1. Baseline Clinical Characteristics of Subjects

Among all the 104 patients, 38 (36.5%) of them were survivors, and 66 (63.5%) were non-survivors. The patients included 76 (73%) males and 28 (27%) females. The mean age was 66.58 ± 13.16 in alive patients and 66.7 ± 15.5 in non-survival patients. The body mass index was 22.93 ± 4.4 in survivors and 23.08 ± 4.67 in non-survivors. There were 74 (71.2%) smokers or ex-smokers among the patients and 30 (28.8%) non-smokers. The mean APACHE II score was higher in non-survival patients (30.86 ± 5.7 vs. 25 ± 8.23, p < 0.001). More survivors had diabetes mellitus (DM) (15 (39.5%) vs. 3 (4.5%), p < 0.001), but fewer alive patients had an acute kidney injury (AKI) (13 (34.2%) vs. 52 (78.8%), p < 0.001). In terms of ventilation settings, the mean of the fraction of inspired oxygen (FiO2) was higher in non-survivors (54.55 ± 20.01% vs. 44.47 ± 13.35%, p = 0.007), and the mean of the A-a gradient was also higher in non-survivors (230.33 ± 133.89 vs. 178.96 ± 106.36, p = 0.046). However, tidal volume, positive end-expiratory pressure (PEEP) and positive inspiratory pressure (PIP) showed no significant differences between the two populations. Most of the non-survivors developed ARDS (8 (21.1%) vs. 46 (69.7%)) during the same admission (Table 1).

3.2. Risk Stratification Based on Different Indicators

A comparison between the survival probability of high-risk patients and low-risk patients are presented in Figure 2A,B by using two different discrimination methods. In Figure 2A, the survival probability (in percentage) of high-risk (red) and low-risk (blue) patients were analyzed using the ENP Cox model during their days in the ICU, with correspondence of a 95% confidence interval painted in the shaded area. The survival probability decreased over time, indicating that the two groups’ mortality risk increased over time. However, the high-risk group had a significantly lower survival probability (p = 0.003) than the low-risk group, which started at the very beginning of their ICU stay.
More significantly, while comparing the 95% confidence intervals (shaded region) of the survival curves between Figure 2A,B, it demonstrated that there was less overlapping in the shaded area when using the ENP Cox model (Figure 2A) than the APACHE II score (Figure 2B), with a more notable separation between the high-risk and low-risk groups in the early phase of their ICU stay (<50 days of ICU stay) when using the ENP Cox model. No significance was found between high-risk and low-risk patients, and constant overlapping throughout the shaded regions for both curves were noted when the APACHE II score was used for analysis. In addition, the median survival time for the high-risk group was 24 days, and for the low-risk group was 46 days in Figure 2A. In Figure 2B, the median survival time for both high- and low-risk groups were noted to be similar, both around 37 days. Impressively, the ENP Cox model demonstrated a higher classification performance than APACHE II when identifying high-risk patients (mean (SD) C index, 0.624 (0.093) vs. 0.603 (0.093); p < 0.001).

3.3. Risk Factor Identification

The elastic net Cox regression model was able to estimate the importance of a specific feature contributing to the survival curve using HR and selection frequency, as shown in Figure 3 and Table 2. The selection frequencies were presented when a feature was selected more than 50% of the time. FiO2 was the most informative feature and was noted with a 100% selection frequency before a subsequently atypical lymphocyte percentage (96%) and APACHE II (92%) in Figure 3. HRs were also noted in Figure 3 and Table 2. Nonetheless, despite a significantly high selection frequency for FiO2, the HR of FiO2 was 1.01 (0.01). Other notable data findings included the HR of steroids, which was 0.81 (<1.0), and the HR of combined bacteria, which was 1.14 (>1.0).

4. Discussion

The significance of critically ill patients with HSV bronchopneumonia who survived advanced treatments has recently gained the attention of ICU physicians. The population of patients with this condition is growing and is relatively new [22], making their precise stratification an unmet need. In our study, we aimed to develop a risk stratification model that could identify HSV bronchopneumonia patients with a high risk of mortality. To create a model that is unbiased and clinically useful, we needed to avoid overfitting and provide explainability. We used the ENP Cox proportional hazards regression with an internal feature penalty function to select informative features in the training process (Figure 1). Additionally, we employed repeated nested cross-validation to evaluate the models in an unbiased manner (Figure 1). Our results showed that this model could effectively discriminate between high-risk and low-risk patients. According to the repeated nested cross-validation, the selection frequency in multiple iterations of the cross-validation also provided valuable information regarding feature importance, in addition to the HR coefficient. This model could predict the risk of HSV bronchopneumonia in patients and provide a key for better management and treatment clinically.
A total of 109 clinical features were collected for the patients in our study. In the univariable analysis, several features showed significant differences between the alive and non-survival groups (Table 1). Not surprisingly, the APACHE II score, a widely used ICU score, was significantly higher in the non-survival group than in the alive group. Similar trends were found for the percentages of ARDS, FiO2, and AKI. These findings indicated that poor lung and renal functions were associated with a higher mortality rate of HSV bronchopneumonia. The percentage of DM was also significantly different between the alive and non-survival groups. However, DM was associated with survival rather than mortality. This finding was somewhat counterintuitive, as DM is typically considered a risk factor for many diseases [23]. In an observational prospective multicenter study, DM was associated with a lower rate of developing ARDS in critically ill patients [24]. The protective effect of DM in preventing ARDS development remained even when factors such as blood sugar levels or anti-glycemic agents were adjusted [24]. DM was also found to be associated with a lower mortality rate in sepsis patients [25]. Although its underlying mechanism is not well understood, the protective effect of DM may be due to DM-associated hyperglycemia or an altered metabolism that is not seen in non-DM patients.
As shown above, the ENP Cox model can effectively distinguish high-risk patients from low-risk patients (Figure 2A). The mortality risk over time for the two groups (i.e., high-risk and low-risk) showed significant differences. The same trend was also found when we used the APACHE II score alone as the predictor to discriminate between the high-risk and low-risk groups (Figure 2B). Both the ENP Cox model with a full set of features and the APACHE II score showed an ability to discriminate between high-risk and low-risk patients. However, the survival curves in the two discrimination methods demonstrated different patterns. Briefly, there was less overlap in the 95% confidence intervals of the survival curves when using the ENP Cox model (Figure 2A) for classification than when using the APACHE II score alone (Figure 2B). The separation of the survival curves was more obvious in the early phase of the disease course (i.e., <50 days of ICU stay). When the ENP Cox model was used to classify the patients, the median survival time for the high-risk and low-risk groups was 24 days and 46 days, respectively. The median survival time of the low-risk group was around two times that of the high-risk group. By contrast, the median survival time for both the high-risk and low-risk groups was around 37 days (Figure 2B). These results indicate that the ENP Cox model is a more adequate clinical tool than using the APACHE II score alone in detecting high-risk HSV bronchopneumonia patients, especially in the early stages of the disease course.
The explainability of a predictive model is important due to its predictive performance. We adopted both conventional methods (i.e., univariable analysis) and ENP Cox algorithm embedded methods (i.e., penalized methods) when estimating feature importance. The discordance between these two methods was obvious, where some risk factors, such as DM, ARDS, and AKI in univariable analysis (in Table 1), were not ranked as top informative features in the ENP Cox model (Figure 3). By contrast, informative features of the ENP Cox model, such as the atypical lymphocyte percentage, lymphocyte counts, and patient height, were not regarded as predictors for discriminating between the alive and non-survival groups. To better fit clinical use in a real-world setting, the feature importance derived from the ENP Cox algorithm and repeated nested cross-validation would likely be more adequate in prediction and future use. The first reason for this is that weak features in the ENP Cox algorithm were penalized and removed in the competition of features, thus eliminating a confounding bias in the model learning process. Moreover, the overfitting bias caused by information leakage could be avoided by using nested cross-validation. Based on the central limit theorem, a repeated process of sampling and cross-validation could provide additional benefits when estimating true statistics. These advantages are particularly important for estimating the true characteristics of the real world using small datasets.
In addition to the traditional HR coefficient, we also used selection frequency as another indicator of feature importance. All the HR coefficients of the informative features were around one (see Table 2), indicating that there was no predominantly strong predictor in the ENP Cox model. Additionally, the HR coefficients between the informative features were similar, making it difficult to determine their relative importance. By contrast, the selection frequency of features in the ENP Cox model could serve as an alternative metric to estimate feature importance. For example, when FIO2 was selected in all model training iterations (100%) as an informative feature (Figure 3), the HR coefficient of FIO2 was not significantly higher compared to the other informative features (Table 2). Therefore, combining both metrics could be a more comprehensive way to estimate the impact of features.
The top five informative features in the ENP Cox model were FIO2, atypical lymphocytes, APACHEII, height, and lymphocytes. All of these features are risk factors for mortality in HSV pneumonia. FIO2 is the level of oxygenation that is used to maintain adequate oxygen saturation. When higher FIO2 is needed, the inhaled oxygen is difficult to transfer from the alveoli into the blood, thus indicating more severe lung destruction. Higher mortality associated with higher FIO2 is accordingly reasonable. Reactive lymphocytosis, as well as atypical lymphocytosis, can be potential prognostic markers for HSV pneumonia. Previous studies have demonstrated the diagnostic and prognostic value of lymphocytosis in various pulmonary conditions, such as hypersensitivity pneumonitis and lymphocytic interstitial pneumonia [26]. However, the significance of blood lymphocytosis in the prognosis of HSV pneumonia remains relatively unexplored. Body height was also found to be a risk factor for HSV pneumonia in the ENP Cox model. Body height has been associated with poor prognosis in various lung diseases, including COVID pneumonia [27], chronic obstructive pulmonary disease [28], and non-small cell lung cancer [29]. The association between body height and pneumonia prognosis could be understood in the context of the overall health status of individuals. The distribution of the pectoralis muscle per height square reduces with a taller body height and can impact the severity and recovery from pneumonia [27].
A risk stratification model for HSV bronchopneumonia that can predict the duration of hospitalization, complications during hospitalization, recovery duration, and final outcomes would be clinically useful. In comparison to the well-established APACHE II Score model, the results show that our model for HSV bronchopneumonia could better predict the duration of hospitalization, complications during hospitalization, recovery duration, and final outcomes. This risk stratification model is revolutionary and significantly more useful clinically. Not only would this model be better able to guide clinical management and treatment plans, but clinicians would also be able to utilize the results from this model and direct the goals of care discussion amongst patients and their family members. However, based on the dataset we used in this study, survival time is the only outcome measurement we have. Thus, the ML model can only predict survival time according to the dataset. The extrapolation of the ML model to other outcomes is inadequate in this model. Therefore, predictions on other clinically meaningful outcomes are worthy of further investigation in specified ML models.

5. Conclusions

We developed a risk stratification model to identify patients with HSV bronchopneumonia who are at high risk of mortality. This model adopted the ENP Cox algorithm with an internal feature penalty function to select informative features in the training process and repeated nested cross-validation to evaluate the models in an unbiased manner. The results showed that this model could effectively discriminate between high-risk and low-risk HSV bronchopneumonia patients. This model could be useful in predicting patient outcomes and improving clinical management and the treatment of patients with HSV bronchopneumonia.

Author Contributions

Conceptualization: Y.-C.W., W.-Y.L. and H.-Y.W. Methodology: Y.-J.T. and H.-Y.W. Formal analysis: Y.-C.W., W.-Y.L., Y.-J.T. and H.-Y.W. Data curation: Y.-C.H. and H.-Y.W. Writing—original draft preparation: Y.-C.W., W.-Y.L. and H.-Y.W. Writing—review and editing: Y.-J.T., Y.F., W.L. and H.-Y.W. Supervision: Y.-J.T. and H.-Y.W. Project administration: Y.-C.H. and H.-Y.W. Funding acquisition, H.-Y.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Chang Gung Memorial Hospital, grant number (CMRPG3M0851, CMRPG3L1011, CMRPG3N0641), and the Ministry of Science and Technology, Taiwan, grant number (111-2320-B-182A-002-MY2).

Institutional Review Board Statement

The retrospective study was approved by the Institution Review Board (IRB) of CGMH (No. 202001331B0, date of approval: 18 August 2020).

Informed Consent Statement

Patient consent was waived on the basis of the retrospective study design.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to privacy.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Horowitz, R.; Aierstuck, S.; Williams, E.A.; Melby, B. Herpes simplex virus infection in a university health population: Clinical manifestations, epidemiology, and implications. J. Am. Coll. Health 2010, 59, 69–74. [Google Scholar] [CrossRef] [PubMed]
  2. Ramchandani, M.; Kong, M.; Tronstein, E.; Selke, S.; Mikhaylova, A.; Magaret, A.; Huang, M.L.; Johnston, C.; Corey, L.; Wald, A. Herpes Simplex Virus Type 1 Shedding in Tears and Nasal and Oral Mucosa of Healthy Adults. Sex. Transm. Dis. 2016, 43, 756–760. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Arduino, P.G.; Porter, S.R. Herpes Simplex Virus Type 1 infection: Overview on relevant clinico-pathological features. J. Oral Pathol. Med. 2008, 37, 107–121. [Google Scholar] [CrossRef] [PubMed]
  4. Aisenberg, G.M.; Torres, H.A.; Tarrand, J.; Safdar, A.; Bodey, G.; Chemaly, R.F. Herpes simplex virus lower respiratory tract infection in patients with solid tumors. Cancer 2009, 115, 199–206. [Google Scholar] [CrossRef] [PubMed]
  5. Tuxen, D.V.; Cade, J.F.; McDonald, M.I.; Buchanan, M.R.; Clark, R.J.; Pain, M.C. Herpes simplex virus from the lower respiratory tract in adult respiratory distress syndrome. Am. Rev. Respir. Dis. 1982, 126, 416–419. [Google Scholar]
  6. Luyt, C.E.; Combes, A.; Nieszkowska, A.; Trouillet, J.L.; Chastre, J. Viral infections in the ICU. Curr. Opin. Crit. Care 2008, 14, 605–608. [Google Scholar] [CrossRef]
  7. Luyt, C.E.; Combes, A.; Deback, C.; Aubriot-Lorton, M.H.; Nieszkowska, A.; Trouillet, J.L.; Capron, F.; Agut, H.; Gibert, C.; Chastre, J. Herpes simplex virus lung infection in patients undergoing prolonged mechanical ventilation. Am. J. Respir. Crit. Care Med. 2007, 175, 935–942. [Google Scholar] [CrossRef]
  8. Schuierer, L.; Gebhard, M.; Ruf, H.G.; Jaschinski, U.; Berghaus, T.M.; Wittmann, M.; Braun, G.; Busch, D.H.; Hoffmann, R. Impact of acyclovir use on survival of patients with ventilator-associated pneumonia and high load herpes simplex virus replication. Crit. Care 2020, 24, 12. [Google Scholar] [CrossRef] [Green Version]
  9. Luyt, C.E.; Forel, J.M.; Hajage, D.; Jaber, S.; Cayot-Constantin, S.; Rimmele, T.; Coupez, E.; Lu, Q.; Diallo, M.H.; Penot-Ragon, C.; et al. Acyclovir for Mechanically Ventilated Patients with Herpes Simplex Virus Oropharyngeal Reactivation: A Randomized Clinical Trial. JAMA Intern. Med. 2020, 180, 263–272. [Google Scholar] [CrossRef]
  10. Gursel, G.; Demirtas, S. Value of APACHE II, SOFA and CPIS scores in predicting prognosis in patients with ventilator-associated pneumonia. Respiration 2006, 73, 503–508. [Google Scholar] [CrossRef]
  11. Kollef, K.E.; Reichley, R.M.; Micek, S.T.; Kollef, M.H. The modified APACHE II score outperforms Curb65 pneumonia severity score as a predictor of 30-day mortality in patients with methicillin-resistant Staphylococcus aureus pneumonia. Chest 2008, 133, 363–369. [Google Scholar] [CrossRef] [PubMed]
  12. Mirsaeidi, M.; Peyrani, P.; Ramirez, J.A.; Improving Medicine through Pathway Assessment of Critical Therapy of Hospital-Acquired Pneumonia I. Predicting mortality in patients with ventilator-associated pneumonia: The APACHE II score versus the new IBMP-10 score. Clin. Infect. Dis. 2009, 49, 72–77. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Naeini, A.E.; Abbasi, S.; Haghighipour, S.; Shirani, K. Comparing the APACHE II score and IBM-10 score for predicting mortality in patients with ventilator-associated pneumonia. Adv. Biomed. Res. 2015, 4, 47. [Google Scholar]
  14. Richards, G.; Levy, H.; Laterre, P.F.; Feldman, C.; Woodward, B.; Bates, B.M.; Qualy, R.L. CURB-65, PSI, and APACHE II to assess mortality risk in patients with severe sepsis and community acquired pneumonia in PROWESS. J. Intensive Care Med. 2011, 26, 34–40. [Google Scholar] [CrossRef]
  15. Wang, H.Y.; Chang, S.C.; Lin, W.Y.; Chen, C.H.; Chiang, S.H.; Huang, K.Y.; Chu, B.Y.; Lu, J.J.; Lee, T.Y. Machine Learning-Based Method for Obesity Risk Evaluation Using Single-Nucleotide Polymorphisms Derived from Next-Generation Sequencing. J. Comput. Biol. 2018, 25, 1347–1360. [Google Scholar] [CrossRef] [PubMed]
  16. Tseng, Y.J.; Wang, H.Y.; Lin, T.W.; Lu, J.J.; Hsieh, C.H.; Liao, C.T. Development of a Machine Learning Model for Survival Risk Stratification of Patients With Advanced Oral Cancer. JAMA Netw. Open 2020, 3, e2011768. [Google Scholar] [CrossRef] [PubMed]
  17. Lin, W.Y.; Chen, C.H.; Tseng, Y.J.; Tsai, Y.T.; Chang, C.Y.; Wang, H.Y.; Chen, C.K. Predicting post-stroke activities of daily living through a machine learning-based approach on initiating rehabilitation. Int. J. Med. Inform. 2018, 111, 159–164. [Google Scholar] [CrossRef]
  18. Wang, Z.; Wang, H.Y.; Chung, C.R.; Horng, J.T.; Lu, J.J.; Lee, T.Y. Large-scale mass spectrometry data combined with demographics analysis rapidly predicts methicillin resistance in Staphylococcus aureus. Brief. Bioinform. 2021, 22, bbaa293. [Google Scholar] [CrossRef] [PubMed]
  19. Force, A.D.T.; Ranieri, V.M.; Rubenfeld, G.D.; Thompson, B.T.; Ferguson, N.D.; Caldwell, E.; Fan, E.; Camporota, L.; Slutsky, A.S. Acute respiratory distress syndrome: The Berlin Definition. JAMA 2012, 307, 2526–2533. [Google Scholar]
  20. Chang, W.J.; Wang, H.Y.; Huang, Y.C.; Lin, C.Y.; Leu, S.W.; Hsieh, M.J.; Huang, C.C. Outcomes of Herpes Simplex Virus Pneumonitis in Critically Ill Patients. Viruses 2022, 14, 205. [Google Scholar] [CrossRef]
  21. Harrell, F.E.; Lee, K.L.; Mark, D.B. Multivariable Prognostic Models: Issues in Developing Models, Evaluating Assumptions and Adequacy, and Measuring and Reducing Errors. Stat. Med. 1996, 15, 361–387. [Google Scholar] [CrossRef]
  22. Sacks, D.; Baxter, B.; Campbell, B.C.V.; Carpenter, J.S.; Cognard, C.; Dippel, D.; Eesa, M.; Fischer, U.; Hausegger, K.; Hirsch, J.A.; et al. Multisociety Consensus Quality Improvement Revised Consensus Statement for Endovascular Therapy of Acute Ischemic Stroke. Int. J. Stroke 2018, 13, 612–632. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Li, S.; Wang, J.; Zhang, B.; Li, X.; Liu, Y. Diabetes Mellitus and Cause-Specific Mortality: A Population-Based Study. Diabetes Metab. J. 2019, 43, 319–341. [Google Scholar] [CrossRef] [PubMed]
  24. Yu, S.; Christiani, D.C.; Thompson, B.T.; Bajwa, E.K.; Gong, M.N. Role of diabetes in the development of acute respiratory distress syndrome. Crit. Care Med. 2013, 41, 2720–2732. [Google Scholar] [CrossRef] [PubMed]
  25. Graham, B.B.; Keniston, A.; Gajic, O.; Trillo Alvarez, C.A.; Medvedev, S.; Douglas, I.S. Diabetes mellitus does not adversely affect outcomes from a critical illness. Crit. Care Med. 2010, 38, 16–24. [Google Scholar] [CrossRef]
  26. Jellinge, M.E.; Hansen, F.; Coia, J.E.; Song, Z. Herpes Simplex Virus Type 1 Pneumonia—A Review. J. Intensive Care Med. 2021, 36, 1398–1402. [Google Scholar] [CrossRef]
  27. Ufuk, F.; Demirci, M.; Sagtas, E.; Akbudak, I.H.; Ugurlu, E.; Sari, T. The prognostic value of pneumonia severity score and pectoralis muscle Area on chest CT in adult COVID-19 patients. Eur. J. Radiol. 2020, 131, 109271. [Google Scholar] [CrossRef]
  28. Bak, S.H.; Kwon, S.O.; Han, S.S.; Kim, W.J. Computed tomography-derived area and density of pectoralis muscle associated disease severity and longitudinal changes in chronic obstructive pulmonary disease: A case control study. Respir. Res. 2019, 20, 226. [Google Scholar] [CrossRef] [Green Version]
  29. Cury, S.S.; de Moraes, D.; Freire, P.P.; de Oliveira, G.; Marques, D.V.P.; Fernandez, G.J.; Dal-Pai-Silva, M.; Hasimoto, E.N.; Dos Reis, P.P.; Rogatto, S.R.; et al. Tumor Transcriptome Reveals High Expression of IL-8 in Non-Small Cell Lung Cancer Patients with Low Pectoralis Muscle Area and Reduced Survival. Cancers 2019, 11, 1251. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Scheme of the study. We adopted three steps, including data preparation, repetitive nested cross-validation, and the evaluation of model outputs. In data preparation, critically ill patients (ICU admission, n = 119) with HSV bronchopneumonia were enrolled in initial patient screening. Cases with missing data were excluded, and eligible cases (n = 104) were used for the following analyses. In model training and validation, ENP Cox proportional hazards regression algorithm-based models were trained and validated by the repetitive nested cross-validation method (Dark Gray color represents samples selected by the machine learning process, while Light Gray color represents samples that were not selected). A built-in property of the ENP Cox proportional hazards regression algorithm could be used for feature selection. Moreover, unbiased estimation of model performance was possible through the repetitive nested cross-validation method. In the outputs, the model performance on risk stratification for HSV bronchopneumonia patients was revealed. Additionally, selection frequency and the hazard ratio coefficient of the features were both used to enhance the explainability of the informative features in the predictive model.
Figure 1. Scheme of the study. We adopted three steps, including data preparation, repetitive nested cross-validation, and the evaluation of model outputs. In data preparation, critically ill patients (ICU admission, n = 119) with HSV bronchopneumonia were enrolled in initial patient screening. Cases with missing data were excluded, and eligible cases (n = 104) were used for the following analyses. In model training and validation, ENP Cox proportional hazards regression algorithm-based models were trained and validated by the repetitive nested cross-validation method (Dark Gray color represents samples selected by the machine learning process, while Light Gray color represents samples that were not selected). A built-in property of the ENP Cox proportional hazards regression algorithm could be used for feature selection. Moreover, unbiased estimation of model performance was possible through the repetitive nested cross-validation method. In the outputs, the model performance on risk stratification for HSV bronchopneumonia patients was revealed. Additionally, selection frequency and the hazard ratio coefficient of the features were both used to enhance the explainability of the informative features in the predictive model.
Jcm 12 04489 g001
Figure 2. Risk stratification based on different indicators. (A). Survival curves of high-risk (red) and low-risk (green) patients using an elastic net Cox regression model. The high-risk group could be discriminated against from the low-risk group (p = 0.003). Of note, the discrimination between the high-risk and low-risk groups is obvious in the relatively early phase (<50th day), where the two survival curves do not overlap with each other. (B). Survival curves of high-risk (red) and low-risk (green) patients using an APACHE II score. The survival curves of the high-risk and low-risk groups are statistically distinguishable (p = 0.044). However, 95% confidence intervals of the survival curves overlap with each other throughout the whole phase. LOS in ICU: length of stay in intensive care unit.
Figure 2. Risk stratification based on different indicators. (A). Survival curves of high-risk (red) and low-risk (green) patients using an elastic net Cox regression model. The high-risk group could be discriminated against from the low-risk group (p = 0.003). Of note, the discrimination between the high-risk and low-risk groups is obvious in the relatively early phase (<50th day), where the two survival curves do not overlap with each other. (B). Survival curves of high-risk (red) and low-risk (green) patients using an APACHE II score. The survival curves of the high-risk and low-risk groups are statistically distinguishable (p = 0.044). However, 95% confidence intervals of the survival curves overlap with each other throughout the whole phase. LOS in ICU: length of stay in intensive care unit.
Jcm 12 04489 g002aJcm 12 04489 g002b
Figure 3. Feature importance of elastic net Cox regression model.
Figure 3. Feature importance of elastic net Cox regression model.
Jcm 12 04489 g003
Table 1. Patient characteristics of alive and non-survival groups.
Table 1. Patient characteristics of alive and non-survival groups.
Feature NameLevelAlive (n = 38)Non-Survival (n = 66)p
Gender (%)F11 (28.9)17 (25.8)0.902
M27 (71.1)49 (74.2)
Age (mean (SD)) 66.58 (13.16)66.70 (15.50)0.968
Height (mean (SD)) 162.26 (5.90)162.45 (6.30)0.882
BW (mean (SD)) 60.25 (11.42)60.90 (12.43)0.792
BMI (mean (SD)) 22.93 (4.40)23.08 (4.67)0.876
Smoking (%)NO12 (31.6)18 (27.3)0.809
YES26 (68.4)48 (72.7)
Oral Ulcer (%)NO18 (47.4)27 (40.9)0.664
YES20 (52.6)39 (59.1)
APACHE II (mean (SD)) 25.00 (8.23)30.86 (5.70)<0.001
CRP (mean (SD)) 116.74 (82.02)122.11 (108.60)0.792
WBC (mean (SD)) 10,828.95 (5247.49)12,342.42 (8524.54)0.324
Lymphocyte percentage (mean (SD)) 8.21 (5.43)8.78 (14.10)0.812
Lymphocyte count (mean (SD)) 817.43 (614.34)618.85 (587.21)0.106
Atypical lymphocyte percentage (mean (SD)) 0.31 (0.66)1.21 (4.64)0.24
HSV alone (%)NO28 (73.7)57 (86.4)0.178
YES10 (26.3)9 (13.6)
Bacteria combined (%)NO16 (42.1)24 (36.4)0.711
YES22 (57.9)42 (63.6)
Fungus combined (%)NO32 (84.2)48 (72.7)0.273
YES6 (15.8)18 (27.3)
Aspergillosis (%)NO37 (97.4)58 (87.9)0.195
YES1 (2.6)8 (12.1)
Combine Virus (%)NO30 (78.9)42 (63.6)0.159
YES8 (21.1)24 (36.4)
PJP (%)NO35 (92.1)53 (80.3)0.185
YES3 (7.9)13 (19.7)
Mycobacterium spp. (%)NO38 (100.0)64 (97.0)0.732
YES0 (0.0)2 (3.0)
Organ failure (%)NO32 (84.2)43 (65.2)0.063
YES6 (15.8)23 (34.8)
Diabetes mellitus (%)NO23 (60.5)63 (95.5)<0.001
YES15 (39.5)3 (4.5)
Immunocompromise (%)NO22 (57.9)27 (40.9)0.142
YES16 (42.1)39 (59.1)
Sepsis (%)NO23 (60.5)53 (80.3)0.05
YES15 (39.5)13 (19.7)
Cardiovascular Crisis (%)NO35 (92.1)65 (98.5)0.271
YES3 (7.9)1 (1.5)
CXR.GGO (%)NO33 (86.8)59 (89.4)0.941
YES5 (13.2)7 (10.6)
CXR.Interstitial (%)NO28 (73.7)45 (68.2)0.713
YES10 (26.3)21 (31.8)
CXR.Consolidation (%)NO15 (39.5)31 (47.0)0.592
YES23 (60.5)35 (53.0)
CT.GGO (%)NO37 (97.4)63 (95.5)1
YES1 (2.6)3 (4.5)
CT.Interstitial (%)NO33 (86.8)57 (86.4)1
YES5 (13.2)9 (13.6)
CT.Consolidation (%)NO26 (68.4)50 (75.8)0.56
YES12 (31.6)16 (24.2)
Bronchoscopy (%)NO12 (31.6)20 (30.3)1
YES26 (68.4)46 (69.7)
ARDS (%)NO30 (78.9)20 (30.3)<0.001
YES8 (21.1)46 (69.7)
AKI (%)NO25 (65.8)14 (21.2)<0.001
YES13 (34.2)52 (78.8)
Steroids (%)NO19 (50.0)26 (39.4)0.398
YES19 (50.0)40 (60.6)
Treatment (%)NO30 (78.9)49 (74.2)0.762
YES8 (21.1)17 (25.8)
Treat enough (%)NO32 (84.2)50 (75.8)0.443
YES6 (15.8)16 (24.2)
PEEP (mean (SD)) 9.32 (1.76)9.36 (1.76)0.894
delta P (mean (SD)) 16.29 (5.02)18.61 (6.08)0.049
PIP (mean (SD)) 25.87 (5.79)27.82 (6.46)0.127
TV (mean (SD)) 486.13 (88.36)468.61 (104.29)0.386
FIO2 (mean (SD)) 44.47 (13.35)54.55 (20.01)0.007
A-a gradient (mean (SD)) 178.96 (106.36)230.33 (133.89)0.046
Table 2. Coefficients of the important features in the elastic net Cox regression model.
Table 2. Coefficients of the important features in the elastic net Cox regression model.
Feature TypeFeature NameSelection Frequency (%)HR (SD)
Risky factorFIO21001.01 (0.01)
Risky factorAtypical lymphocyte percentage961.04 (0.03)
Risky factorAPACHEII921.02 (0.02)
Risky factorHeight681.03 (0.05)
Risky factorLymphocyte percentage641.01 (0.01)
Risky factorAge561.01 (0.01)
Protective factorSteroids560.81 (0.14)
Risky factorBacteria combined521.14 (0.21)
HR: hazard ratio coefficient. SD: standard deviation.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, Y.-C.; Lin, W.-Y.; Tseng, Y.-J.; Fu, Y.; Li, W.; Huang, Y.-C.; Wang, H.-Y. Risk Stratification for Herpes Simplex Virus Pneumonia Using Elastic Net Penalized Cox Proportional Hazard Algorithm with Enhanced Explainability. J. Clin. Med. 2023, 12, 4489. https://doi.org/10.3390/jcm12134489

AMA Style

Wang Y-C, Lin W-Y, Tseng Y-J, Fu Y, Li W, Huang Y-C, Wang H-Y. Risk Stratification for Herpes Simplex Virus Pneumonia Using Elastic Net Penalized Cox Proportional Hazard Algorithm with Enhanced Explainability. Journal of Clinical Medicine. 2023; 12(13):4489. https://doi.org/10.3390/jcm12134489

Chicago/Turabian Style

Wang, Yu-Chiang, Wan-Ying Lin, Yi-Ju Tseng, Yiwen Fu, Weijia Li, Yu-Chen Huang, and Hsin-Yao Wang. 2023. "Risk Stratification for Herpes Simplex Virus Pneumonia Using Elastic Net Penalized Cox Proportional Hazard Algorithm with Enhanced Explainability" Journal of Clinical Medicine 12, no. 13: 4489. https://doi.org/10.3390/jcm12134489

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop