Next Article in Journal
Development of a General Health Score Based on 12 Objective Metabolic and Lifestyle Items: The Lifestyle and Well-Being Index
Next Article in Special Issue
SARIMA Model Forecasting Performance of the COVID-19 Daily Statistics in Thailand during the Omicron Variant Epidemic
Previous Article in Journal
Risk of SARS-CoV-2 Infection Breakthrough among the Non-Vaccinated and Vaccinated Population in Italy: A Real-World Evidence Study Based on Big Data
Previous Article in Special Issue
Explaining Cannabis Use by Adolescents: A Comparative Assessment of Fuzzy Set Qualitative Comparative Analysis and Ordered Logistic Regression
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predicting the Mortality of ICU Patients by Topic Model with Machine-Learning Techniques

1
Department of Business Management, National Taipei University of Technology, Taipei 106, Taiwan
2
College of Management, National Taipei University of Technology, Taipei 106, Taiwan
3
Department of Obstetrics and Gynecology, Taipei Medical University Hospital, Taipei 110, Taiwan
4
College of Medicine, Taipei Medical University, Taipei 110, Taiwan
*
Author to whom correspondence should be addressed.
Healthcare 2022, 10(6), 1087; https://doi.org/10.3390/healthcare10061087
Submission received: 9 May 2022 / Revised: 7 June 2022 / Accepted: 8 June 2022 / Published: 11 June 2022
(This article belongs to the Special Issue Health Informatics: The Foundations of Public Health)

Abstract

:
Predicting clinical patients’ vital signs is a leading critical issue in intensive care units (ICUs) related studies. Early prediction of the mortality of ICU patients can reduce the overall mortality and cost of complication treatment. Some studies have predicted mortality based on electronic health record (EHR) data by using machine learning models. However, the semi-structured data (i.e., patients’ diagnosis data and inspection reports) is rarely used in these models. This study utilized data from the Medical Information Mart for Intensive Care III. We used a Latent Dirichlet Allocation (LDA) model to classify text in the semi-structured data of some particular topics and established and compared the classification and regression trees (CART), logistic regression (LR), multivariate adaptive regression splines (MARS), random forest (RF), and gradient boosting (GB). A total of 46,520 ICU Patients were included, with 11.5% mortality in the Medical Information Mart for Intensive Care III group. Our results revealed that the semi-structured data (diagnosis data and inspection reports) of ICU patients contain useful information that can assist clinical doctors in making critical clinical decisions. In addition, in our comparison of five machine learning models (CART, LR, MARS, RF, and GB), the GB model showed the best performance with the highest area under the receiver operating characteristic curve (AUROC) (0.9280), specificity (93.16%), and sensitivity (83.25%). The RF, LR, and MARS models showed better performance (AUROC are 0.9096, 0.8987, and 0.8935, respectively) than the CART (0.8511). The GB model showed better performance than other machine learning models (CART, LR, MARS, and RF) in predicting the mortality of patients in the intensive care unit. The analysis results could be used to develop a clinically useful decision support system.

1. Introduction

The spread of the COVID-19 pandemic and the increasing number of infected patients are challenging global medical units, especially intensive care units (ICUs). Hospitals need to make reasonable and accurate decisions, such as how they allocate their equipment and labor, making comprehensive assessments of information and resources available at ICU. The World Health Organization advocated that hospitals regularly monitor the specific clinical variables of hospitalized patients with COVID-19 and, when feasible, analyze the variables by using medical technology [1]. However, because patients’ illnesses are rapidly changing, making quick and accurate decisions without sufficient up-to-date information is challenging for clinicians [2]. Electronic health records (EHR) are personal health electronic records that include medical records, electrocardiograms, and medical images. Researchers can analyze their archived medical information to help clinicians make critical clinical decisions [3]. Archiving personal health records electronically not only elevates hospitals’ management and service levels but also provides medical researchers with more resources that enable them to conduct related research, such as developing and verifying prediction models [4,5,6]. Using EHR data to make clinical predictions (e.g., predicting patients’ mortality, hospital stay, disease diagnoses, and onset time) is crucial in intensive care research. In other words, identifying how to effectively predict ICU patient mortality by using EHR data allows medical personnel to accurately assess the patients’ mortality risks, detect high-risk groups early, and implement interventions promptly, improving patient prognoses and enhancing care planning and resource allocation [7].
Although many studies have used EHR data, most of them have only used quantitative EHR data [8,9,10,11]. In fact, 80% of EHR data comprises semi-structured data such as patients’ physiological conditions (free-text notes and clinician progress notes) at the time of their visits [12]. To deal with huge data volumes in the form of unstructured text has become one of the main challenges for healthcare analytics. In this respect, the application of natural language processing (NLP) has received increasing attention in the medical field to bring more benefits to health organizations in a wide range of applications. The power of NLP lies in extracting information from unstructured textual data in order to form and explore new facts or hypotheses [13]. The potentials of NLP techniques, such as Latent Dirichlet Allocation (LDA) and Bidirectional Encoder Representations for Transformers (BERT), have been constantly discussed in the healthcare literature (e.g., [14,15,16,17,18]).
To maximize the use of the semi-structured EHR data, this study used the Latent Dirichlet Allocation (LDA) to build topic models. Latent Dirichlet Allocation (LDA) is a topic generation model and uses Bayes’ rule approach to treat all text modeling as a mixture of topics and vocabularies. The word “mixture” here refers to a set of elements (i.e., topics or vocabularies) that have certain probabilities of being selected. A body of a document will incorporate multiple themes, and the topics will be fluid in nature. Each document can be represented by a vector of topic probabilities, and each topic can be represented by a vector of word probabilities [19]. Many recent studies have used LDA topic modeling [20,21,22,23]. In this study, we first applied LDA to classify text in the semi-structured data to some particular topics. Subsequently, we employed five machine learning approaches, classification and regression trees (CART), logistic regression (LR), multivariate adaptive regression splines (MARS), random forest (RF), and gradient boosting (GB), to predict ICU patient mortality. From these methods, the MARS approach was selected as the benchmark for model comparison because MARS is a mature learning technology that has the advantages of high learning efficiency and strong generalizability [24,25], and CART/RF, which were developed by Breiman [26], has been widely used [27,28,29,30,31]. Regarding the advantages and disadvantages of certain machine learning methods, please refer to [32,33,34,35] for more detailed explanations.
According to the comparison results, the topics generated by LDA do contain useful information that can considerably affect the prediction accuracy of constructed models. Moreover, in our comparison of five machine learning models (CART, LR, MARS, RF, and GB), the GB model showed the best performance with the highest area under the receiver operating characteristic curve, recall, accuracy, and F1-statistic.
This study contributed to variable generation and mortality prediction through the following three aspects. First, LDA was applied to analyze the semi-structured data (i.e., patients’ diagnosis data and inspection reports) and generate some particular topics variables. The results can be used as a reference when selecting the appropriate predictors for mortality prediction. Second, the significance of the generated variables can be further analyzed by machine learning approaches, such as gradient boosting, to understand the effect of input variables at different situations on the construction of the mortality prediction model. Third, with the topic variables generated using the LDA approach, the machine learning model provides a higher AUROC value, signifying that models built using the semi-structured data more accurately predicted whether the ICU patients would die. All these results may enhance healthcare personnel’s predictions of patients’ mortality, providing patients, their families, and healthcare personnel with more information for clinical decision-making.

2. Materials and Methods

The flow chart of the proposed approach is illustrated in Figure 1. The data used in this study consists of structured EHR data and clinical notes data. To obtain a meaningful dataset, a list of query and data preprocessing were executed. Subsequently, the preprocessed data were input into the five machine learning models for mortality prediction. Finally, the prediction performance was evaluated by five different metrics. A detailed description of the main research procedures is presented as follows:

2.1. The Medical Information Mart for Intensive Car—III Dataset

The data used in this work were obtained from the Medical Information Mart for Intensive Care (MIMIC III) clinical database. MIMIC-III contained the comprehensive clinical data of patients hospitalized at the Beth Israel Deaconess Medical Center (BIDMC) in Boston, Massachusetts [36]. MIMIC-III data contained different ICU data from 2001 to 2012, where the data included patients’ vital signs, medications, data measured in labs, and observation records. Table 1 provides a breakdown of the adult population by care unit. According to the table, 49,785 hospitalization-related data were collected from 38,597 adult patients in the MIMIC-III database. Among the patients, 55.9% were males, and the median age was 65.8. The median length of an ICU stay is 2.1 days, and the median length of a hospital stay is 6.9 days.
This study received approval from the institutional review boards of the BIDMC and the MIT (Cambridge, MA, USA) to use the MIMIC-III database to perform analyses. The MIMIC-III dataset analyzed was obtained from the MIT Laboratory for Computational Physiology and a research team with which it collaborated. The dataset website URL was MIMIC-III Clinical Database [36]. Available online: https://physionet.org/content/mimiciii/1.4/ (accessed on 1 June 2021).

2.2. Latent Dirichlet Allocation

Latent Dirichlet Allocation (LDA) is a topic modeling algorithm for discovering the underlying topics in corpora in an unsupervised manner. Proposed by Blei et al. [19], LDA is a typical “bag of words” model that treats each text as a vocabulary frequency vector and as a collection of multiple sets of vocabularies. In addition, each group of vocabularies represents a topic, and text topics are extracted without considering the order of and relevance between the vocabularies [37,38]. Normally, an LDA builds its topic generation model through the following steps: (1) a topic is selected from the various topics in a text; (2) a vocabulary is chosen from the list of vocabularies corresponding to the topic selected; and (3) the process is repeated until all of the vocabulary in the text has been selected. Because each text consists of multiple topics that contain multiple key vocabularies, identical vocabularies may be found in different topics. Assuming that M is the number of text, K is the number of topics, Zmn is the number of times that vocabulary n appears in the different topics of Text m (where Zmn has a multinomial distribution); θm is the probability that each of the k number of topics occurs in Text m, where Dirichlet distribution (which has a hyperparameter α) is used as a priori distribution; Wmn is the nth vocabulary in Text m and has a multinomial distribution; and Φk is the probability that each vocabulary in the kth topic occurs where Dirichlet distribution (which has a hyperparameter β) is used as a priori distribution, then an overall LDA framework depicted in Figure 2 can be obtained.
We adopted the approximation algorithm proposed by [39] to filter out duplicate or incorrect notes found in the NOTEVENTS table in the MIMIC-III database, deleted unrecognizable sentences, and reserved only sentences composed of letters for text marking. Next, we used the LDA method to generate basic “topics” viewed as input variables to construct models for predicting patient mortality. Additionally, this study referenced the Grid Search method proposed by Teng et al. [40] to determine the optimal number of topics and the final LDA model to be used. According to the analysis results, 10 topics produced the optimal prediction results. Appendix A and Appendix B list the 10 topics and their corresponding keywords generated by applying the LDA method in this study.

2.3. Data Preprocessing

2.3.1. Data Extraction

To ensure the generalizability of the analysis results, this study analyzed all patients as opposed to patients with specific diseases. Additionally, to enable one to compare the results of this study with those of relevant studies, this study set adult ICU patients older than 16 years of age and who were admitted to ICUs for the first time as its participants. The analysis data were mostly data of said patients 12 and 24 h after they were admitted to ICUs [2,41,42,43]. Figure 3 shows the detailed process of data extraction.
Table 2 provides the demographic information of the selected patient cohort after data preprocessing in our study. Among the patients, 24,252 had an ICU stay of 12 h, while 27,809 had an ICU stay of 24 h. The average age of the patients was 63 years and 56% were male. More than 70% of the patients were white, and over 80% of the patients were admitted to the ICUs because of emergencies. As many as close to 40% of patients were admitted to Medical ICUs. The patients stayed at the hospitals and ICUs for an average of 8.9 and 4.2 days, respectively.

2.3.2. Variable Selection

To determine the variables to be used, we referenced relevant studies [7,41,42,44] and manually selected 16 quantitative variables based on their clinical importance in the domain from admission, chartevents, labevents, and output events data tables in the MIMIC-III dataset. The variables were Glasgow Coma Scale, heart rate, systolic blood pressure, temperature, FiO2, urine output, PO2, blood urea nitrogen, white blood cell count, potassium level, sodium level, serum bicarbonate level, bilirubin, admission type, patient’s sex, and age. Subsequently, this study adopted the data preprocessing method introduced by Guo et al. [7] to initiate a three-stage missing value processing. First, patients missing a value more than 30% were eliminated. Second, predictors missing a value more than 40% were eliminated. Third, the statistics for which the missing data rate was greater than 20% under these indicators were eliminated. The mean interpolation to interpolate the remaining missing value was then used. Then, the Information Gain Technique (Entropy) [44] was used to evaluate the importance of these 16 variables. Finally, our variable selection was based on the highest ranked attributes that scored 0.01 or more. Appendix C shows the significance rank of these features where the White Blood Cells Court is ranked the highest in the list while gender is ranked the least.
In addition to selecting the abovementioned quantitative variables, this study also extracted topic modeling variables from NOTEVENTS data. NOTEVENTS referred to clinical notes taken by doctors, nurses, imaging professionals, nutritionists, and physical therapists on patients. In the MIMIC-III database, the NOTEVENTS file contained 2,083,180 pieces of data, of which roughly 56% were data recorded by doctors or nurses and 39% were echo reports, ECG reports, and radiology reports.

2.3.3. Dealing with Imbalanced Dataset

Table 3 presents the descriptive statistics of the data used in this study. The table shows that patients’ survival-to-death ratios are significantly imbalanced for both 12 and 24 h after hospital admission. Because imbalanced datasets frequently result in inaccurate model prediction results [45], researchers often balance data by adding minority samples or deleting majority samples [46]. In this study, because the sample size of ICU patients who died was much smaller than that of ICU patients who survived, we used the synthetic minority oversampling technique (SMOTE) to increase the sample size of ICU patients who died to achieve balanced results [46]. The SMOTE technique is a type of oversampling method that has been widely used in machine learning with imbalance data [47,48]. The SMOTE technique randomly generates new samples of the minority class from the nearest neighbor of the line connecting samples of the minority class. These new samples are generated based on the features of the original dataset so that they become similar to the original instances of the minority class [49].
In our study, we applied the SMOTE techniques with different percentages for different cases. As a result, several new training datasets were generated (Table 4). Take the dataset of patients 12 h after hospital admission as an example. SMOTE (900%) increased the sample with class “died” from 2384 instances to 21,456 instances. This made an incremental increase in the minority class from 9.83% in the original dataset to 49.52% in the SMOTE with 900% dataset.

2.3.4. Model Validation

To validate the model’s performance after training, we used the K-fold cross-validation method [50] in this study. Using the k-fold method, we first divide the dataset into k parts, and each part will have instances of the same size. The training process is applied on all parts except one part for testing. This process is iterative and is repeated by the specified K number, where each part has the chance to be tested once. The final performance measure will be the average of all the tests’ performance of all parts. The advantage of this approach is that all instances of the entire dataset are trained and tested, so that lower variance occurs in the ensemble estimator. This ensures that the true rate estimator’s predictions are more accurate and less biased; however, this approach is computationally expensive and validation takes a long time to complete. In our study, we employed 10-fold cross-validation to construct models, which has been used in several health care and medical related studies [51,52].

2.4. Mortality Prediction

We assessed the effects of combining ICU patients’ structured data (vital signs and laboratory test results) and semi-structured data (diagnosis data and inspection reports) 12 and 24 h after they had been hospitalized on their mortality predictions for different periods, where death and alive were defined as “1” and “0”, respectively. ICU patient mortality-related definitions are summarized as follows:
  • In-hospital mortality: Whether the ICU patient died during hospitalization.
  • Short-term mortality: Whether the ICU patient died within 48 h or 72 h of hospital admission.
  • Long-term mortality: Whether the ICU patient died within 30 days or 1 year of hospital admission.

2.5. Machine Learning

To illustrate the effects of adding textual data such as clinical notes and pathology reports on patient mortality predictions, this study used five classic machine learning classification algorithms to construct models for predicting ICU patient mortality. These are classification and regression trees (CART), logistic regression (LR), multivariate adaptive regression splines (MARS), random forest (RF), and gradient boosting (GB). All data mining tasks of this research were performed using python programming. Detailed descriptions of the machine learning classification algorithms are organized as follows:
  • Classification and Regression Tree (CART)
CART is a decision tree algorithm that uses binary splitting to analyze gargantuan datasets. Through a recursive process, CART divides existing training samples into several known categories according to its predictor variables and their corresponding indicators. The training sample division process is subsequently set as a series of rules [53,54].
  • Logistic Regression (LR)
LR is a log probability model that can assess statistical interactions and control multivariate confidence intervals. It is most commonly used to check the risk relationships between diseases and exposures [55,56]. This study employed Python’s scikit-learn library to realize LR and selected the stochastic average gradient linear convergence algorithm as the hyperparameter setting optimization method. LR is a gradient descent method that is especially effective when the number of sample data is large.
  • Multivariate Adaptive Regression Splines (MARS)
MARS is a multivariate, nonparametric regression technique and a tool that accumulates several basis functions to explain nonlinear states [57]. Once objective variables are set and a set that contains selectable predictor variables is given, MARS can automate the entire model construction process, including separating meaningful and less appropriate variables, determining the interactions between predictor variables, dealing with the missing value problem by using variable clustering techniques, and avoiding overfitting by using numerous self-tests [38,58].
  • Random Forest (RF)
RF is an ensemble algorithm that uses decision trees as its basic classifiers [59]. It boasts the characteristic of providing accurate prediction results without having to thoroughly adjust model hyperparameters [60]. The only parameters that require thorough adjustments are the depths and number of decision trees. This study found that setting the maximum tree depth and number at 35 and 200, respectively, produced the optimal ICU patient mortality prediction results.
  • Gradient Boosting (GB)
GB is an ensemble learning algorithm that can be used to elevate the accuracy of different types of prediction models. It uses the negative gradient information of the loss function in the model to train models with unfavorable prediction accuracy and cumulatively integrates trained results into existing models [61,62]. This study used the scikit-learn library to realize GB, set the maximum number of iterations to 100, and trained hyperparameters by using the default values provided by the scikit-learn library.

2.6. Evaluation Metrics

To fully compare the effects of integrated structured and semi-structured data on ICU patient mortality predictions, this study selected five indicators (i.e., AUROC, specificity, sensitivity, precision, and F1-statistic) as the assessment tools for constructing models. Table 5 shows the confusion matrix. Detailed definitions of each assessment indicator are as follows:
Specificity = TNR = TN TN + FP
Sensitivity = TPR = TP TP + FN
Precision = PPV = TP TP + FP
F 1 - Statistic = 2 × Precision × Recall Precision + Recall
  • Specificity: The percentage of negative samples that were predicted to be negative.
  • Sensitivity: The percentage of positive samples that were predicted to be positive.
  • Precision: The percentage of samples that were predicted to be positive among samples that were categorized as being positive.
  • F1-Statistic: The harmonic mean between precision and sensitivity.
  • AUROC: The area under the receiver operating curve is primarily used to measure the classification threshold performance of classifiers. ROC is a curve consisting of points generated by the true positive rate (TPR) and false positive rate (FPR) of model. TPR signify the probabilities that models can correctly locate positive samples. Such probabilities are commonly referred to as recall rates and represent revenue. By contrast, FPR signify the probabilities that models incorrectly locate positive samples and represent losses. AUROC values range from 0 to 1, where the larger the value, the more superior the result.

3. Results

This study employed 10-fold cross-validation to construct models. The models constructed were using ICU patients 12 and 24 h after hospital admission to predict ICU patient mortality for different periods, including in-hospital mortality, short-term mortality, and long-term mortality. The results are presented in Table 6 and Table 7. Figure 4 and Figure 5 compare the AUROC predicted using the five machine learning methods.

3.1. Variable Importance

One advantage of using the GB method is that once prediction models have been built, the importance of their variables can be obtained by sorting the variable importance scores. In general, the importance score denotes the degree to which an input variable increases the value of the decision trees in the model; the more frequently that the variable is used in the decision tree, the higher its relative importance. For the GB method, the importance of an input variable is calculated using the degree to which the variable can increase the value of the decision tree at the decision tree split point multiplied by the number of samples (weights) at the node. Common decision tree value measurement methods include the Gini index, cross-entropy, and information gain (in this study, the Gini index was used to measure increases in decision tree values). For more information on how the GB method calculates the importance of input variables, please refer to Hastie et al. [63]. The variable importance obtained for the best GB model in ICU patients 12 and 24 h after hospital admission is presented in Table 8.
As presented in Table 8, when using models (constructed using the data of ICU patients 12 h after they have been admitted to hospitals) to predict in-hospital and short-term ICU patient mortality, blood urea nitrogen (x5) is a critical variable. By contrast, when predicting long-term ICU patient mortality (i.e., mortality within 1 year), serum bicarbonate level (x9), and intracranial hemorrhage (TOPICA3) are more critical. When predicting in-hospital, short-term, and long-term ICU patient mortality, age (x12) and the Glasgow Coma Scale (x1) are critical variables. As for using models (constructed using the data of ICU patients 24 h after they had been admitted to hospitals) to predict in-hospital ICU patient mortality, the Glasgow Coma Scale (x1), blood urea nitrogen (x5), and serum bicarbonate level (x9) are critical. When predicting short-term and long-term ICU patient mortality, potassium level (x7) and admission type (x10) are more critical. When predicting in-hospital, short-term, and long-term ICU patient mortality, age (x12) and hydroperitoneum (TOPICB1) are critical variables.

3.2. Prediction with Semi-Structure Data vs. Prediction w/o Semi-Structure Data

To illustrate the effects of semi-structured data on ICU patient mortality predictions, this study used the GB method to compare the ICU patient mortality prediction results obtained by models that used the structured and semi-structured data of patients 24 h after hospital admission and those obtained by models that used merely the structured data of patients 24 h after hospital admission. Table 9 shows the prediction results.
According to the table, models that used both the structured and semi-structured data of patients 24 h after hospital admission generated more accurate in-hospital, short-term, and long-term ICU patient mortality prediction results than those generated by models that used only the structured data of ICU patients. These results indicated that semi-structured data (i.e., clinical notes, which contained patients’ diagnosis data and inspection reports) contain useful information that can considerably affect the prediction accuracy of constructed models. Overall, models predicted short-term mortality more accurately than they did in-hospital and long-term mortality, and short-term, 48 h mortality predictions produced the highest AUROC values, signifying that models built using the structured and semi-structured data of ICU patients 24 h after hospital admission more accurately predicted whether the ICU patients would die 48 h after hospital admission.

4. Discussion

4.1. Principal Findings

The purpose of this study was to use a machine learning model to evaluate the impact of integrating the structured data (vital signs and laboratory test results) and semi-structured data (diagnosis data and inspection reports) collected by ICU patients during hospitalization on predicting whether ICU patients die (death = 1, survival = 0) in different periods. On the basis of the analysis results, this study presented the following findings:
  • A longer ICU patient hospital stay signified more accumulated medical records. The increased number of medical records elevated the ICU patient mortality prediction accuracy. This study collected and used the data of ICU patients 12 and 24 h after hospital admission to construct and analyze patient mortality prediction models. Related analysis results revealed that, in general, models built using 24 h data outperformed those built using 12 h data in terms of model prediction accuracy.
  • Overall, the prediction models predicted short-term mortality more accurately than they did long-term mortality and predicted short-term, 48 h mortality more accurately than they did all other periods. As the prediction time increased, the prediction accuracy substantially decreased. This signified that the prediction models are more suitable for short-term mortality predictions. Future studies should strengthen the long-term mortality prediction accuracy of these prediction models by increasing the duration of the data collection process of inpatients and including more factors of influence in the models.
  • Models constructed using the five machine learning classification algorithms produced an ICU patient short-term and long-term mortality prediction accuracy of over 70%. Moreover, GB outperformed the other machine learning algorithms for all periods. These results demonstrated the rapid development of machine learning algorithms and that they can provide immense help to clinical doctors when making clinical decisions.
  • Combining the structured and semi-structured data of ICU patients can strengthen the ICU patient mortality prediction accuracy for different periods. This confirmed that ICU patients’ clinical notes (e.g., diagnosis data and inspection reports) contain useful information that can help clinical doctors make crucial clinical decisions [64,65].
  • Analyses on ICU patients’ semi-structured data (e.g., clinical notes and pathology reports) performed with the LDA method revealed some critical information. According to Table 7, when patients’ hospitalization data are limited (i.e., only their 12 h data are available), those who wish to predict the patients’ short-term mortality should pay attention to variables such as the patents’ ages (x12), Glasgow Coma Scales (x1), and blood urea nitrogen (x5); and those who wish to predict the patients’ long-term mortality should pay attention to variables such as the patents’ ages (x12), Glasgow Coma Scales (x1), serum bicarbonate levels (x9), and intracranial hemorrhage (TOPICA3). By contrast, when patients’ hospitalization data are sufficient (i.e., their 24 h data are available), those who wish to predict the patients’ in-hospital mortality should pay attention to variables such as the patients’ ages (x12), hydroperitoneum (TOPICB1), Glasgow Coma Scales (x1), blood urea nitrogen (x5), and serum bicarbonate levels (x9); and those who wish to predict the patients’ short-term and long-term mortality should pay attention to variables such as the patients’ ages (x12), hydroperitoneum (TOPICB1), potassium levels (x7), and admission types (x10). In other words, when patients’ hospitalization data are limited, their Glasgow Coma Scales, age, blood urea nitrogen (bicarbonate), and intracranial hemorrhage will determine their prognoses. By contrast, when patients’ hospitalization data are sufficient, their Glasgow Coma Scales, age, blood urea nitrogen (bicarbonate), hydroperitoneum, and admission types will determine their prognoses. Overall, topics generated using the LDA method can extract patients’ critical medical characteristics. These medical characteristics can be used by doctors to offer personalized clinical advice according to the patients’ situations [43]. Additionally, compared with conventional methods of clustering or drawing associations with individual taxa, the LDA provides unique analytical advantages. For example, in addition to avoiding the effects of outlier samples, the LDA method can prevent the patients’ critical medical characteristics hidden in textual data from being overlooked [66].

4.2. Limitations

This study had a few limitations. First, because of the retrospective design, inherent biases were unavoidable. Because this study built prediction models by using the dynamic EHR data of ICU patients, the models are applicable only to patients in ICU environments or ICU-related environment. This is a common problem when constructing prediction models by using machine learning methods and dynamic EHR data [67].
Second, to collect data thoroughly and conveniently, we obtained comprehensive, dynamic patient data from a database where such data could be easily obtained. The MIMIC-III data used were obtained from BIDMC in Boston, MA, USA. Future studies should collect data from other regions and from other types of medical institutions before conducting comprehensive assessments. Moreover, because the data in this study were the medical data of ICU patients (in a large medical institution in Boston) found in the MIMIC-III database, the analysis results are not entirely applicable to ICU patients in small medical institutions. Future studies should simultaneously compare the data of ICU patients in rural medical indications and those of ICU patients in other general medical institutions to yield more comprehensive results.
Third, the medical data were only those of patients who had been admitted to ICUs for the first time and did not include those of patients who were readmitted. Because ICU patient readmissions often lead to excessive use of medical resources and higher financial risks for medical institutions, analyzing the morbidity and mortality of readmitted ICU patients will benefit patients and medical institutions more pronouncedly [68]. Future studies can collect the data of ICU patients who have been hospitalized multiple times, perform comprehensive assessments for different time series, or provide different types of analysis results to patients, health care personnel, and patients’ families to enable them to use such reference information when making related assessments.
Fourth, the LDA topic modeling method adopted in this study is an alternative nonparametric method that normally requires one to observe data complexity to determine the number of topics required. Such an analysis procedure often violates the objective analysis principle. In addition, standard LDA models frequently interpret data as disordered “bag of words” and remove them from analyses, resulting in wasteful use of information [69]. Follow-up studies that resolve these problems by conducting more comprehensive assessments and analyses will ensure more objective and complete study results.

5. Conclusions

In this study, in addition to using quantitative data (e.g., ICU patients’ vital signs and laboratory test results), we use an LDA method to model the semi-structured data (e.g., patients’ clinical notes and pathology reports) of ICU patients and discuss the effectiveness of combining LDA and five machine learning methods to predict ICU patient mortality. Our results revealed that the semi-structured data (diagnosis data and inspection reports) of ICU patients contain useful information that can assist clinical doctors in making critical clinical decisions. However, the prediction models built in this study are mainly used to predict ICU patient mortality, and follow-up studies are required to enable the models to make other clinical predictions, for instance, hospital stay, complication, and disease predictions.
Possible directions for follow-up studies are as follows: first, they may collect the structured and semi-structured data of patients in different departments (e.g., cardiology or nephrology departments) and with different diseases (e.g., sepsis) and perform more detailed classifications and analyses. Second, they may collect patient data from different medical departments, such as the outpatient departments, emergency departments, inpatient departments, and ICUs, and conduct more comprehensive model construction and assessments to strengthen the generalizability and applicability of the models. Third, they may collect and integrate different types of unstructured data, for instance, consultation processes in hospitals, patient demands, and messages left by patients on social media such as Facebook, Instagram, and Twitter when building models to increase model prediction accuracy. Moreover, the NLP research is gradually dominated by the use of some new transformer models (e.g BERT). Future studies may aim to combine other new topic modeling tools, such as BERT, to evaluate the ability of the proposed prediction scheme.

Author Contributions

Conceptualization, C.-C.C. and T.-N.C.; Data curation, C.-C.C. and T.-N.C.; Formal analysis, C.-C.C. and T.-N.C.; Methodology, C.-C.C. and T.-N.C.; Supervision, C.-C.C. and T.-N.C.; Writing—original draft, C.-C.C. and T.-N.C.; Writing—review and editing, C.-C.C., C.-M.W., T.-N.C., L.-J.K. and J.T.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to sincerely thank the editor and reviewers for their kind comments.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Topics for 12 h Dataset

VariableTopicKeywords
TopicA1Sepsissepsis, blood, delivery, matermal, admission, distress, stable, murmur, respiratory, active, monitor
TopicA2Pneumothorax and pleural effusionchest, placement, examination, portable, radiology, pneumothorax, medical, catheter, position, diagnosis, effusion
TopicA3Intracranial hemorrhagecontrast, hemorrhage, acute, frontal, radiology, intracranial, ventricle, brain, sinus, findings, subdural, comparison
TopicA4Medical assessmentassessment, present, medication, blood, pulse, respiratory, assess, acute, fluid, balance, action
TopicA5Hydroperitoneumcontrast, abdomen, pelvis, liver, fluid, abdominal, bowel, kidney, evidence, radiology, lesion, peritoneal
TopicA6Pleural effusionchest, effusion, pulmonary, pleural, examination, medical, radiology, pneumonia, portable, impression, comparison, opacity, acute
TopicA7Cervical spine traumafracture, contrast, spine, trauma, injury, radiology, cervical, chest, examination, evidence, tissue, impression, hematoma
TopicA8Coronary artery diseaseartery, carotid, aneurysm, identifier, numeric, stenosis, intermal, procedure, common, femoral, vertebral, cerebral, perform, catheter, distal
TopicA9Urinary retentionneuro, screen, monitor, receive, urine, order, admission, foley, transfer, follow, stable, man, urinary
TopicA10Endotracheal intubationintubate, respiratory, sedate, assessment, intubation, sound, propofol, airway, ventilation, endotracheal, secretion, breathing, sedation, suction, sputum, fentanyl, procedure, failure

Appendix B. Topics for 24 h Dataset

VariableTopicKeywords
TopicB1Hydroperitoneumcontrast, abdomen, liver, pelvis, fluid, abdominal, bowel, kidney, evidence, peritoneal, lesion, radiology
TopicB2Urinary retentionneuro, monitor, foley, urine, stable, follow, receive, urinary, order, drain, family, intact, man
TopicB3Newborn respiratory distressinfant, sepsis, stable, parent, blood, monitor, active, murmur, delivery, respiratory, distress, parents
TopicB4Coronary artery diseaseartery, carotid, aneurysm, identifier, numeric, stenosis, procedure, intermal, catheter, common, femoral, aortic, perform, cerebral, coronary, vertebral
TopicB5Pneumothorax and pleural effusionchest, effusion, examination, radiology, medical, pleural, portable, pulmonary, placement, impression, comparison, pneumothorax
TopicB6Nursing assessmentassessment, medication, action, fluid, rhythm, pulse, response, balance, present, respiratory, extremity
TopicB7Endotracheal intubationintubate, propofol, sedate, intubation, respiratory, ventilation, placement, airway, sound, secretion, endotracheal, suction, sedation, breathing, fentanyl
TopicB8Sepsisfailure, renal, fever, status, urine, hypotension, likely, mental, infection, acute, pneumonia, sepsis, lactate, blood
TopicB9Medical assessmentpresent, assessment, blood, pulse, chest, response, action, assess, deny, medication, system
TopicB10Spinal hematomascontrast, fracture, hemorrhage, spine, radiology, trauma, examination, acute, injury, hematoma, impression, image

Appendix C. Selected Quantitative Predictors with Corresponding Information Gain

VariableFeatureItem NameInformation GainItem IDTable
x1Glasgow Coma ScaleGCS Verbal
Verbal Response
GCS Motor
Motor Response
GCS Eyes
Eye Opening
0.4340223900
723
223901
454
220739
184
Chartevents
x2Heart RateHeart Rate0.3602211
220045
Chartevents
x3Systolic Blood PressureNoninvasive Systolic Blood Pressure0.2702455
220179
Chartevents
x4TemperatureTemperature Fahrenheit
Temperature Celsius
0.4681678
223761
676
223762
Chartevents
x5Blood Urea NitrogenBlood Urea Nitrogen0.217251006Labevents
x6White Blood Cells CourtWhite Blood Cells0.472551301
51300
Labevents
x7Potassium LevelPotassium0.228750971
50822
Labevents
x8Sodium LevelSodium0.248650983Labevents
x9Serum Bicarbonate LevelBicarbonate0.095450882Labevents
x10Admission TypeAdmission Type0.0184 Admissions
x11GenderGender0.0104 Admissions
x12AgeAge0.1555 Admissions

References

  1. Sharafoddini, A.; Dubin, J.A.; Lee, J. Identifying subpopulations of septic patients: A temporal data-driven approach. Comput. Biol. Med. 2021, 130, 104182. [Google Scholar] [CrossRef] [PubMed]
  2. Chen, W.; Long, G.; Yao, L.; Sheng, Q.Z. AMRNN: Attended multi-task recurrent neural networks for dynamic illness severity prediction. World Wide Web 2019, 23, 2753–2770. [Google Scholar] [CrossRef]
  3. Aldosari, B. Supportive care pathway functionalities of EHR system in a Saudi Arabian hospital. Comput. Biol. Med. 2017, 89, 190–196. [Google Scholar] [CrossRef] [PubMed]
  4. Magnin, B.; Mesrob, L.; Kinkingnehun, S.; Pelegrini-Issac, M.; Colliot, O.; Sarazin, M.; Dubois, B.; Lehericy, S.; Benali, H. Support vector machine-based classification of Alzheimer’s disease from whole-brain anatomical MRI. Neuroradiology 2009, 51, 73–83. [Google Scholar] [CrossRef] [PubMed]
  5. Asadi, H.; Dowling, R.; Yan, B.; Mitchell, P. Machine Learning for Outcome Prediction of Acute Ischemic Stroke Post Intra-Arterial Therapy. PLoS ONE 2014, 9, 11. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. An, S.; Malhotra, K.; Dilley, C.; Han-Burgess, E.; Valdez, J.N.; Robertson, J.; Clark, C.; BrandonWestover, M.; Sun, J.M. Predicting drug-resistant epilepsy—A machine learning approach based on administrative claims data. Epilepsy Behav. 2018, 89, 118–125. [Google Scholar] [CrossRef]
  7. Guo, C.H.; Lu, M.L.; Chen, J.F. An evaluation of time series summary statistics as features for clinical prediction tasks. BMC Med. Inform. Decis. Mak. 2020, 20, 48. [Google Scholar] [CrossRef] [PubMed]
  8. Kaji, D.A.; Zech, J.R.; Kim, J.S.; Cho, S.K.; Dangayach, N.S.; Costa, A.B.; Oermann, E.K. An attention based deep learning model of clinical events in the intensive care unit. PLoS ONE 2019, 14, e0211057. [Google Scholar] [CrossRef] [Green Version]
  9. Alsinglawi, B.; Alshari, O.; Alorjani, M.; Mubin, O.; Alnajjar, F.; Novoa, M.; Darwish, O. An explainable machine learning framework for lung cancer hospital length of stay prediction. Sci. Rep. 2022, 12, 607. [Google Scholar] [CrossRef]
  10. Le, S.; Allen, A.; Calvert, J.; Palevsky, P.M.; Braden, G.; Patel, S.; Pellegrini, E.; Green-Saxena, A.; Hoffman, J.; Das, R. Convolutional neural network model for intensive care unit acute kidney injury prediction. Kidney Int. Rep. 2021, 6, 1289–1298. [Google Scholar] [CrossRef]
  11. Hu, Z.; Du, D. A new analytical framework for missing data imputation and classification with uncertainty: Missing data imputation and heart failure readmission prediction. PLoS ONE 2020, 15, e0237724. [Google Scholar] [CrossRef] [PubMed]
  12. Hashir, M.; Sawhney, R. Towards unstructured mortality prediction with free-text clinical notes. J. Biomed. Inform. 2020, 108, 103489. [Google Scholar] [CrossRef] [PubMed]
  13. Hearst, M.A. Untangling text data mining. In Proceedings of the 37th Annual meeting of the Association for Computational Linguistics, College Park, MD, USA, 20–26 June 1999; pp. 3–10. [Google Scholar] [CrossRef] [Green Version]
  14. Jelodar, H.; Wang, Y.; Yuan, C.; Feng, X.; Jiang, X.; Li, Y.; Zhao, L. Latent Dirichlet allocation (LDA) and topic modeling: Models, applications, a survey. Multimed. Tools Appl. 2019, 78, 15169–15211. [Google Scholar] [CrossRef] [Green Version]
  15. Nwankwo, E.; Okolo, C.; Habonimana, C. Topic modeling approaches for understanding COVID-19 misinformation spread in sub-Saharan Africa. In Proceedings of the AI Social Good Workshop; Harvard University: Cambridge, MA, USA, 2020. [Google Scholar]
  16. Demner-Fushman, D.; Chapman, W.W.; McDonald, C.J. What can natural language processing do for clinical decision support? J. Biomed. Inform. 2009, 42, 760–772. [Google Scholar] [CrossRef] [Green Version]
  17. Jensen, P.B.; Jensen, L.J.; Brunak, S. Mining electronic health records: Towards better research applications and clinical care. Nat. Rev. Genet. 2012, 13, 395–405. [Google Scholar] [CrossRef]
  18. Spasic, I.; Uzuner, Ö.; Zhou, L. Emerging clinical applications of text analytics. Int. J. Med. Inform. 2020, 134, 103974. [Google Scholar] [CrossRef]
  19. Blei, D.M.; Ng, A.Y.; Jordan, M.I. Latent Dirichlet allocation. J. Mach. Learn. Res. 2003, 3, 993–1022. [Google Scholar] [CrossRef]
  20. Ekinci, E.; İlhan Omurca, S. Concept-LDA: Incorporating Babelfy into LDA for aspect extraction. J. Inf. Sci. 2020, 46, 406–418. [Google Scholar] [CrossRef]
  21. Qi, Y.; He, J. Application of LDA and word2vec to detect English off-topic composition. PLoS ONE 2022, 17, e0264552. [Google Scholar] [CrossRef]
  22. Chen, S.; Andrienko, N.; Andrienko, G.; Adilova, L.; Barlet, J.; Kindermann, J.; Nguyen, P.H.; Thonnard, O.; Turkay, C. LDA ensembles for interactive exploration and categorization of behaviors. IEEE Trans. Vis. Comput. Graph. 2019, 26, 2775–2792. [Google Scholar] [CrossRef] [Green Version]
  23. Zhao, J.; Huang, J.X.; Deng, H.; Chang, Y.; Xia, L. Are topics interesting or not? An LDA-based topic-graph probabilistic model for web search personalization. ACM Trans. Inf. Syst. (TOIS) 2021, 40, 1–24. [Google Scholar] [CrossRef]
  24. Kartal Koc, E.; Bozdogan, H. Model selection in multivariate adaptive regression splines (MARS) using information complexity as the fitness function. Mach. Learn. 2015, 101, 35–58. [Google Scholar] [CrossRef]
  25. Senthilkumar, D.; Paulraj, S. Diabetes disease diagnosis using multivariate adaptive regression splines. AGE 2013, 768, 52. [Google Scholar]
  26. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  27. Loke, K. Impact of financial ratios and technical analysis on stock price prediction using random forests. In Proceedings of the 2017 International Conference on Computer and Drone Applications (IConDA), Kuching, Malaysia, 9–11 November 2017; pp. 38–42. [Google Scholar] [CrossRef]
  28. Patel, J.; Shah, S.; Thakkar, P.; Kotecha, K. Predicting stock and stock price index movement using trend deterministic data preparation and machine learning techniques. Expert Syst. Appl. 2015, 42, 259–268. [Google Scholar] [CrossRef]
  29. Ballings, M.; Van den Poel, D.; Hespeels, N.; Gryp, R. Evaluating multiple classifiers for stock price direction prediction. Expert Syst. Appl. 2015, 42, 7046–7056. [Google Scholar] [CrossRef]
  30. Ładyżyński, P.; Żbikowski, K.; Grzegorzewski, P. Stock trading with random forests, trend detection tests and force index volume indicators. In International Conference on Artificial Intelligence and Soft Computing; Springer: Berlin/Heidelberg, Germany, 2013; pp. 441–452. [Google Scholar] [CrossRef]
  31. Booth, A.; Gerding, E.; McGroarty, F. Automated trading with performance weighted random forests and seasonality. Expert Syst. Appl. 2014, 41, 3651–3661. [Google Scholar] [CrossRef]
  32. Fatima, M.; Pasha, M. Survey of machine learning algorithms for disease diagnostic. J. Intell. Learn. Syst. Appl. 2017, 9, 16. [Google Scholar] [CrossRef] [Green Version]
  33. Uddin, S.; Khan, A.; Hossain, M.E.; Moni, M.A. Comparing different supervised machine learning algorithms for disease prediction. BMC Med. Inform. Decis. Mak. 2019, 19, 281. [Google Scholar] [CrossRef]
  34. Ray, S. A quick review of machine learning algorithms. In Proceedings of the 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), Faridabad, India, 14–16 February 2019; pp. 35–39. [Google Scholar] [CrossRef]
  35. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  36. Johnson, A.E.; Pollard, T.J.; Shen, L.; Lehman, L.W.; Feng, M.; Ghassemi, M.; Moody, B.; Szolovits, P.; Celi, L.A.; Mark, R.G. MIMIC-III, a freely accessible critical care database. Sci. Data 2016, 3, 160035. [Google Scholar] [CrossRef] [Green Version]
  37. El-Ganainy, N.O.; Balasingham, I.; Halvorsen, P.S.; Rosseland, L.A. A New Real Time Clinical Decision Support System Using Machine Learning for Critical Care Units. IEEE Access 2020, 8, 185676–185687. [Google Scholar] [CrossRef]
  38. Huang, Y.R.; Wang, R.; Huang, B.; Wei, B.; Zheng, S.L.; Chen, M. Sentiment Classification of Crowdsourcing Participants’ Reviews Text Based on LDA Topic Model. IEEE Access 2021, 9, 108131–108143. [Google Scholar] [CrossRef]
  39. Gabriel, R.A.; Kuo, T.-T.; McAuley, J.; Hsu, C.-N. Identifying and characterizing highly similar notes in big clinical note datasets. J. Biomed. Inform. 2018, 82, 63–69. [Google Scholar] [CrossRef]
  40. Teng, F.; Ma, Z.; Chen, J.; Xiao, M.; Huang, L.F. Automatic Medical Code Assignment via Deep Learning Approach for Intelligent Healthcare. IEEE J. Biomed. Health Inform. 2020, 24, 2506–2515. [Google Scholar] [CrossRef] [PubMed]
  41. Yu, R.X.; Zheng, Y.L.; Zhang, R.K.; Jiang, Y.Q.; Poon, C.C.Y. Using a Multi-Task Recurrent Neural Network With Attention Mechanisms to Predict Hospital Mortality of Patients. IEEE J. Biomed. Health Inform. 2020, 24, 486–492. [Google Scholar] [CrossRef] [PubMed]
  42. Purushotham, S.; Meng, C.Z.; Che, Z.P.; Liu, Y. Benchmarking deep learning models on large healthcare datasets. J. Biomed. Inform. 2018, 83, 112–134. [Google Scholar] [CrossRef] [PubMed]
  43. Gangavarapu, T.; Jayasimha, A.; Krishnan, G.S.; Kamath, S.S. Predicting ICD-9 code groups with fuzzy similarity based supervised multi-label classification of unstructured clinical nursing notes. Knowl.-Based Syst. 2020, 190, 105321. [Google Scholar] [CrossRef]
  44. Lin, Y.W.; Zhou, Y.Q.; Faghri, F.; Shawl, M.J.; Campbell, R.H. Analysis and prediction of unplanned intensive care unit readmission using recurrent neural networks with long shortterm memory. PLoS ONE 2019, 14, e0218942. [Google Scholar] [CrossRef] [Green Version]
  45. Kent, J.T. Information gain and a general measure of correlation. Biometrika 1983, 70, 163–173. [Google Scholar] [CrossRef]
  46. Layeghian Javan, S.; Sepehri, M.M.; Layeghian Javan, M.; Khatibi, T. An intelligent warning model for early prediction of cardiac arrest in sepsis patients. Comput. Methods Programs Biomed. 2019, 178, 47–58. [Google Scholar] [CrossRef]
  47. Loreto, M.; Lisboa, T.; Moreira, V.P. Early prediction of ICU readmissions using classification algorithms. Comput. Biol. Med. 2020, 118, 8. [Google Scholar] [CrossRef] [PubMed]
  48. Blagus, R.; Lusa, L. Joint use of over-and under-sampling techniques and cross-validation for the development and assessment of prediction models. BMC Bioinform. 2015, 16, 363. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
  50. Chawla, N.V. Data mining for imbalanced datasets: An overview. In Data Mining and Knowledge Discovery Handbook; Springer: Berlin/Heidelberg, Germany, 2009; pp. 875–886. [Google Scholar] [CrossRef] [Green Version]
  51. Bengio, Y.; Grandvalet, Y. No unbiased estimator of the variance of k-fold cross-validation. Adv. Neural Inf. Process. Syst. 2003, 16, 1089–1105. [Google Scholar]
  52. Liu, B.; Fang, L.; Liu, F.; Wang, X.; Chou, K.-C. iMiRNA-PseDPC: microRNA precursor identification with a pseudo distance-pair composition approach. J. Biomol. Struct. Dyn. 2016, 34, 223–235. [Google Scholar] [CrossRef]
  53. Liu, B.; Fang, L.; Liu, F.; Wang, X.; Chen, J.; Chou, K.-C. Identification of real microRNA precursors with a pseudo structure status composition approach. PLoS ONE 2015, 10, e0121501. [Google Scholar] [CrossRef] [Green Version]
  54. Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.G. Classification and Regression Trees; Wadsworth International Group: New York, NY, USA, 1984. [Google Scholar]
  55. Ghiasi, M.M.; Zendehboudi, S.; Mohsenipour, A.A. Decision tree -based diagnosis of coronary artery disease: CART model. Comput. Methods Programs Biomed. 2020, 192, 14. [Google Scholar] [CrossRef]
  56. Cuadrado-Godia, E.; Jamthikar, A.D.; Gupta, D.; Khanna, N.N.; Araki, T.; Maniruzzaman, M.; Saba, L.; Nicolaides, A.; Sharma, A.; Omerzu, T.; et al. Ranking of stroke and cardiovascular risk factors for an optimal risk calculator design: Logistic regression approach. Comput. Biol. Med. 2019, 108, 182–195. [Google Scholar] [CrossRef]
  57. Ergun, U.; Serhatioglu, S.; Hardalac, F.; Guler, I. Classification of carotid artery stenosis of patients with diabetes by neural network and logistic regression. Comput. Biol. Med. 2004, 34, 389–405. [Google Scholar] [CrossRef]
  58. Friedman, J.H. Multivariate Adaptive Regression Splines. Ann. Stat. 1991, 19, 1–67. [Google Scholar] [CrossRef]
  59. Steinberg, D.; Bernard, B.; Phillip, C.; Kerry, M. MARS User Guide, Salford Systems; Salford Systems: San Diego, CA, USA, 1999. [Google Scholar]
  60. Yigin, B.O.; Algin, O.; Saygili, G. Comparison of morphometric parameters in prediction of hydrocephalus using random forests. Comput. Biol. Med. 2020, 116, 103547. [Google Scholar] [CrossRef] [PubMed]
  61. Tetschke, F.; Schneider, U.; Schleussner, E.; Witte, O.W.; Hoyer, D. Assessment of fetal maturation age by heart rate variability measures using random forest methodology. Comput. Biol. Med. 2016, 70, 157–162. [Google Scholar] [CrossRef] [PubMed]
  62. Ramos-Gonzalez, J.; Lopez-Sanchez, D.; Castellanos-Garzon, J.A.; de Paz, J.F.; Corchado, J.M. A CBR framework with gradient boosting based feature selection for lung cancer subtype classification. Comput. Biol. Med. 2017, 86, 98–106. [Google Scholar] [CrossRef]
  63. Shi, H.T.; Wang, H.R.; Huang, Y.X.; Zhao, L.Q.; Qin, C.J.; Liu, C.L. A hierarchical method based on weighted extreme gradient boosting in ECG heartbeat classification. Comput. Methods Programs Biomed. 2019, 171, 1–10. [Google Scholar] [CrossRef]
  64. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction; Springer: New York, NY, USA, 2001. [Google Scholar]
  65. Arnaud, É.; Elbattah, M.; Gignon, M.; Dequen, G. Deep learning to predict hospitalization at triage: Integration of structured data and unstructured text. In Proceedings of the 2020 IEEE International Conference on Big Data (Big Data), Atlanta, GA, USA, 10–13 December 2020; pp. 4836–4841. [Google Scholar]
  66. Zhou, S.-M.; Lyons, R.A.; Rahman, M.A.; Holborow, A.; Brophy, S. Predicting Hospital Readmission for Campylobacteriosis from Electronic Health Records: A Machine Learning and Text Mining Perspective. J. Pers. Med. 2022, 12, 86. [Google Scholar] [CrossRef]
  67. Breuninger, T.A.; Wawro, N.; Breuninger, J.; Reitmeier, S.; Clavel, T.; Six-Merker, J.; Pestoni, G.; Rohrmann, S.; Rathmann, W.; Peters, A.; et al. Associations between habitual diet, metabolic disease, and the gut microbiota using latent Dirichlet allocation. Microbiome 2021, 9, 18. [Google Scholar] [CrossRef]
  68. Scherpf, M.; Grasser, F.; Malberg, H.; Zaunseder, S. Predicting sepsis with a recurrent neural network using the MIMIC III database. Comput. Biol. Med. 2019, 113, 103395. [Google Scholar] [CrossRef]
  69. Chen, J.H.; Goldstein, M.K.; Asch, S.M.; Mackey, L.; Altman, R.B. Predicting inpatient clinical order patterns with probabilistic topic models vs conventional order sets. J. Am. Med. Inform. Assoc. 2017, 24, 472–480. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Research scheme.
Figure 1. Research scheme.
Healthcare 10 01087 g001
Figure 2. LDA model framework.
Figure 2. LDA model framework.
Healthcare 10 01087 g002
Figure 3. The detailed process of data extraction.
Figure 3. The detailed process of data extraction.
Healthcare 10 01087 g003
Figure 4. The AUROCs of different classifiers (A) based on 12 h dataset (B) based on 24 h dataset.
Figure 4. The AUROCs of different classifiers (A) based on 12 h dataset (B) based on 24 h dataset.
Healthcare 10 01087 g004
Figure 5. AUROC of different classifiers based on 24 h dataset.
Figure 5. AUROC of different classifiers based on 24 h dataset.
Healthcare 10 01087 g005
Table 1. Details of the MIMIC-III patient population for patients aged 16 years and above.
Table 1. Details of the MIMIC-III patient population for patients aged 16 years and above.
Adult Patients Critical Care UnitTotal
Distinct patients38,597
Hospital admissions49,785
Distinct ICU stays53,423
 Coronary Care Unit (CCU)7726 (14.5%)
 Cardiac Surgery Recovery Unit (CSRU)9854 (18.4%)
 Medical Intensive Care Unit (MICU)21,087 (39.5%)
 Surgical Intensive Care Unit (SICU)8891 (16.6%)
 Trauma Surgical Intensive Care Unit (TSICU)5865 (11.1%)
Age, years, median [Q1–Q3]65.8 [52.8–77.8]
Gender, male27,983 (55.9%)
ICU length of stay, median days [Q1–Q3]2.1 [1.2–4.6]
Hospital length of stay, median days [Q1–Q3]6.9 [4.1–11.9]
ICU mortality4565 (8.5%)
Hospital mortality5748 (11.5%)
A mean of # is available for each hospital admission.
 Chartevents (330,712,483)6642.81
 Inputevents (21,136,926)424.56
 Outputevents (4,349,218)87.36
 Labevents (27,854,055)559.49
 Noteevents (2,083,180)41.84
ICU, intensive care unit.
Table 2. Selected Patient Demographic Information.
Table 2. Selected Patient Demographic Information.
12 h24 h
OverallDead at HospitalAlive at HospitalOverallDead at HospitalAlive at Hospital
General (%)
 Number24,252 (100%)2384 (9.83%)21,868 (90.17%)27,809 (100%)2559 (9.20%)25,250 (90.80%)
 Age [Q1–Q3]63.02 [50.96–78.16]70.76 [61.19–83.32]62.17 [50.07–77.27]63.06 [51.32–77.82]70.88 [61.43–83.32]62.26 [50.51–76.97]
 Gender (male)13,675 (56.38%)1267 (9.27%)12,408 (90.73%)15,805 (56.83%)1353 (8.56%)14,452 (91.44%)
Ethnicity (%)
 Asian598 (2.47%)56 (9.36%)542 (90.64%)680 (2.45%)62 (9.12%)618 (90.88%)
 Black1930 (7.96%)106 (5.49%)1824 (94.51%)2142 (7.70%)113 (5.28%)2029 (94.72%)
 Hispanic841 (3.47%)44 (5.23%)797 (94.77%)919 (3.30%)49 (5.33%)870 (94.67%)
 White17,262 (71.18%)1604 (9.29%)15,658 (90.71%)19,809 (71.23%)1733 (8.75%)18,076 (91.25%)
 Other3621 (14.93%)574 (15.85%)3047 (84.15%)4259 (15.32%)602 (14.13%)3657 (85.87%)
Admission Type (%)
 Urgent562 (2.32%)69 (12.28%)493 (87.72%)667 (2.40%)77 (11.54%)590 (88.46%)
 Emergency21,096 (86.99%)2284 (10.83%)18,812 (89.17%)22,890 (82.31%)2427 (10.60%)20,463 (89.40%)
 Elective2594 (10.70%)31 (1.20%)2563 (98.80%)4252 (15.29%)55 (1.29%)4197 (98.71%)
Site (%)
 MICU9654 (39.81%)1099 (11.38%)8555 (88.62%)10,309 (37.07%)1187 (11.51%)9122 (88.49%)
 SICU3942 (16.25%)476 (12.08%)3466 (87.92%)4543 (16.34%)501 (11.03%)4042 (88.97%)
 CCU3925 (16.18%)334 (8.51%)3591 (91.49%)4316 (15.52%)360 (8.34%)3956 (91.66%)
 CSRU2955 (12.18%)126 (4.26%)2829 (95.74%)4482 (16.12%)149 (3.32%)4333 (96.68%)
 TSICU3776 (15.57%)349 (9.24%)3427 (90.76%)4159 (14.96%)362 (8.70%)3797 (91.30%)
Outcomes
 Hospital LOS (days) [Q1–Q3]8.98 [3.79–10.66]9.16 [2.76–11.40]8.97 [3.86–10.58]8.95 [3.88–10.47]9.27 [2.77–11.49]8.92 [3.96–10.34]
 ICU LOS (days) [Q1–Q3]4.26 [1.37–4.57]6.64 [2.12–8.13]4.00 [1.24–3.97]4.15 [1.26–4.17]6.68 [2.08–8.12]3.89 [1.22–3.89]
 Hospital death (%)2384 (9.83%)--2559 (9.20%)--
MICU Denotes Medical ICU; SICU Denotes Surgical ICU; CCU Denotes Coronary Care Unit; CSRU Denotes Cardiac Surgery Recovery Unit; TSICU Denotes Trauma Surgical ICU.
Table 3. Demographic information of the selected patient cohort.
Table 3. Demographic information of the selected patient cohort.
In-Hospital MortalityShort-Term MortalityLong-Term Mortality
48 h72 h30 Days1 Year
12 hNumber of Survive21,86823,87323,59021,93221,839
Number of death238437966223202413
Mortality ratio9.83%1.56%2.73%9.57%9.95%
24 hNumber of Survive25,25027,40927,10325,32425,219
Number of death255940070624852590
Mortality ratio9.20%1.44%2.54%8.94%9.31%
Table 4. Number of instances increased by SMOTE technique.
Table 4. Number of instances increased by SMOTE technique.
Hours after Hospital AdmissionMortalityPercentage of SMOTE IncreaseClass “Survived”Class “Died”
12 hIn-Hospital900%21,86821,456
Short Term48 h6200%23,87323,498
72 h3500%23,59023,170
Long Term30 Days900%21,93220,880
1 Year900%21,83921,717
24 hIn-Hospital900%25,25023,031
Short Term48 h6800%27,40927,200
72 h3800%27,10326,828
Long Term30 Days1000%25,32424,850
1 Year900%25,21923,310
Table 5. Confusion Matrix.
Table 5. Confusion Matrix.
Prediction
PositiveNegative
ActualPositiveTrue Positive (TP)False Negative (FN)
NegativeFalse Positive (FP)True Negative (TN)
Table 6. Comparisons between different models constructed using 12 h dataset in terms of their prediction accuracy.
Table 6. Comparisons between different models constructed using 12 h dataset in terms of their prediction accuracy.
MetricMethodIn-Hospital MortalityShort-Term MortalityLong-Term Mortality
48 h72 h30 Days1 Year
AUROCCART0.81010.80330.80060.79250.8471
LR0.80290.86590.82220.82240.9082
MARS0.81240.85020.81950.81700.8716
RF0.84150.88670.84980.85430.8953
GB0.84890.88620.85420.85560.9171
SpecificityCART0.74160.84250.70710.75440.8181
LR0.73520.79210.74550.77430.8521
MARS0.56960.62430.58610.63440.7090
RF0.17350.24080.16380.19000.3796
GB0.75280.87120.75780.79070.9259
SensitivityCART0.74600.72110.78640.75530.8080
LR0.71260.81880.74570.71580.8000
MARS0.85990.88420.86650.82750.8480
RF0.99120.98420.99700.99330.9760
GB0.78100.72110.78640.76470.6720
PrecisionCART0.23230.11670.21790.26050.0788
LR0.22730.10410.24000.27560.0952
MARS0.17310.06360.17850.20590.0531
RF0.11170.03610.11010.12310.0294
GB0.24870.13910.25200.29500.1487
F1-StatisticCART0.35420.20090.34130.38740.1436
LR0.34470.18480.36320.39790.1702
MARS0.28820.11870.29600.32970.1000
RF0.20070.06960.19830.21910.0571
GB0.37730.23320.38170.42580.2435
CART, classification and regression trees; LR, logistic regression; MARS, multivariate adaptive regression splines; RF, random forest; and GB, gradient boosting.
Table 7. Comparisons between different models constructed using 24 h dataset in terms of their prediction accuracy.
Table 7. Comparisons between different models constructed using 24 h dataset in terms of their prediction accuracy.
MetricMethodIn-Hospital MortalityShort-Term MortalityLong-Term Mortality
48 h72 h30 Days1 Year
AUROCCART0.80490.82460.80640.81400.8511
LR0.83310.90140.84380.84340.8987
MARS0.80530.88430.82500.81020.8935
RF0.86230.92030.87050.87100.9096
GB0.86230.92490.87600.87360.9280
SpecificityCART0.76390.79270.75370.76580.8197
LR0.77070.83130.78120.76840.8772
MARS0.64430.61510.61150.60540.6765
RF0.25200.30110.20560.23010.4212
GB0.81840.85070.81230.78820.9316
SensitivityCART0.74020.80710.75780.76070.7983
LR0.72780.80900.75970.76830.7642
MARS0.80230.91370.85120.84090.8992
RF0.98990.98480.99320.98660.9580
GB0.74270.83250.76180.78210.7563
PrecisionCART0.24990.09050.23030.24250.0658
LR0.26580.11680.26730.26010.0955
MARS0.19330.05720.17560.17360.0423
RF0.12330.03480.10840.11220.0257
GB0.30300.12480.28310.26690.1495
F1-StatisticCART0.37360.16280.35320.36780.1216
LR0.38940.20410.39550.38870.1698
MARS0.31160.10770.29120.28780.0809
RF0.21930.06720.19550.20140.0500
GB0.43040.21710.41280.39800.2497
CART, classification and regression trees; LR, logistic regression; MARS, multivariate adaptive regression splines; RF, random forest; and GB, gradient boosting.
Table 8. The selected six important variables for 12 h and 24 h datasets by using GB.
Table 8. The selected six important variables for 12 h and 24 h datasets by using GB.
DatasetOrder of Variable ImportanceIn-Hospital MortalityShort-Term MortalityLong-Term Mortality
48 h72 h30 Days1 Year
12 h1x1x1x1x1x1
2x12x5x12x12x9
3x5x12x5x5TOPICA3
4x2TOPICA3x6x9x12
5x6x9TOPICA3x2x2
6x4x3x4x4x4
24 h1x1x7x7x7x7
2x5x10TOPICB1x12TOPICB1
3x12TOPICB1x12x6x10
4x9x12x10TOPICB1x12
5TOPICB1x8x3x10x3
6x2x1x5x3x1
Table 9. Comparisons (made using the GB method) of the prediction results generated by models based on 24 h dataset.
Table 9. Comparisons (made using the GB method) of the prediction results generated by models based on 24 h dataset.
DatasetMetricIn-Hospital MortalityShort-Term MortalityLong-Term Mortality
48 h72 h30 Days1 Year
With semi-structure dataAUROC0.86230.92490.87600.87360.9280
Specificity0.81840.85070.81230.78820.9316
Sensitivity0.74270.83250.76180.78210.7563
Precision0.30300.12480.28310.26690.1495
F1-Statistic0.43040.21710.41280.39800.2497
Without semi-structure dataAUROC0.85450.91410.86430.86830.9152
Specificity0.81130.82760.80460.79320.9215
Sensitivity0.73890.84260.75640.76870.7143
Precision0.29390.11110.27350.26820.1265
F1-Statistic0.42050.19630.40170.39760.2149
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Chiu, C.-C.; Wu, C.-M.; Chien, T.-N.; Kao, L.-J.; Qiu, J.T. Predicting the Mortality of ICU Patients by Topic Model with Machine-Learning Techniques. Healthcare 2022, 10, 1087. https://doi.org/10.3390/healthcare10061087

AMA Style

Chiu C-C, Wu C-M, Chien T-N, Kao L-J, Qiu JT. Predicting the Mortality of ICU Patients by Topic Model with Machine-Learning Techniques. Healthcare. 2022; 10(6):1087. https://doi.org/10.3390/healthcare10061087

Chicago/Turabian Style

Chiu, Chih-Chou, Chung-Min Wu, Te-Nien Chien, Ling-Jing Kao, and Jiantai Timothy Qiu. 2022. "Predicting the Mortality of ICU Patients by Topic Model with Machine-Learning Techniques" Healthcare 10, no. 6: 1087. https://doi.org/10.3390/healthcare10061087

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop