Next Article in Journal
Metered-Dose Inhaler Spacer with Integrated Spirometer for Home-Based Asthma Monitoring and Drug Uptake
Next Article in Special Issue
Exploring Pain Reduction through Physical Activity: A Case Study of Seven Fibromyalgia Patients
Previous Article in Journal
Coordinating Obstacle Avoidance of a Redundant Dual-Arm Nursing-Care Robot
Previous Article in Special Issue
Unveiling the Unpredictable in Parkinson’s Disease: Sensor-Based Monitoring of Dyskinesias and Freezing of Gait in Daily Life
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimizing Acute Coronary Syndrome Patient Treatment: Leveraging Gated Transformer Models for Precise Risk Prediction and Management

1
People’s Hospital of Ningxia Hui Autonomous Region, Ningxia Medical University, Yinchuan 750101, China
2
Tongxin County People’s Hospital, Wuzhong 751309, China
3
College of Biomedical Engineering and Instrument Science, Ministry of Education Key Laboratory of Biomedical Engineering, Zhejiang University, Hangzhou 310027, China
4
Centre of Intelligent Medical Technology and Equipment, Binjiang Institute of Zhejiang University, Hangzhou 310053, China
*
Author to whom correspondence should be addressed.
Bioengineering 2024, 11(6), 551; https://doi.org/10.3390/bioengineering11060551
Submission received: 26 April 2024 / Revised: 17 May 2024 / Accepted: 27 May 2024 / Published: 29 May 2024
(This article belongs to the Special Issue Intelligent Health Management, Nursing and Rehabilitation Technology)

Abstract

:
Background: Acute coronary syndrome (ACS) is a severe cardiovascular disease with globally rising incidence and mortality rates. Traditional risk assessment tools are widely used but are limited due to the complexity of the data. Methods: This study introduces a gated Transformer model utilizing machine learning to analyze electronic health records (EHRs) for an enhanced prediction of major adverse cardiovascular events (MACEs) in ACS patients. The model’s efficacy was evaluated using metrics such as area under the curve (AUC), precision–recall (PR), and F1-scores. Additionally, a patient management platform was developed to facilitate personalized treatment strategies. Results: Incorporating a gating mechanism substantially improved the Transformer model’s performance, especially in identifying true-positive cases. The TabTransformer+Gate model demonstrated an AUC of 0.836, a 14% increase in average precision (AP), and a 6.2% enhancement in accuracy, significantly outperforming other deep learning approaches. The patient management platform enabled healthcare professionals to effectively assess patient risks and tailor treatments, improving patient outcomes and quality of life. Conclusion: The integration of a gating mechanism within the Transformer model markedly increases the accuracy of MACE risk predictions in ACS patients, optimizes personalized treatment, and presents a novel approach for advancing clinical practice and research.

1. Introduction

Acute coronary syndrome (ACS) is identified as the most severe category within coronary heart diseases, characterized by a range of symptoms resulting from diminished coronary artery blood flow. This impairment can lead to the malfunction or death of heart muscle tissue [1]. The “2022 Heart Disease and Stroke Statistical Report” by the American Heart Association, drawing on 2019 data, revealed a mortality rate of 214.6 per 100,000 individuals in the United States due to cardiovascular diseases. Statistically, cardiovascular disease causes a death every 36.1 s, totaling 2396 deaths daily. Notably, for individuals aged 45 and above, the mortality rate one year post-first myocardial infarction is reported at 18% for males and 23% for females [2]. Concurrently, in China, there is a year-over-year increase in ACS incidence, with the patient count exceeding 2.5 million, thereby becoming the foremost cause of mortality among both urban and rural populations [3]. The “China Cardiovascular Health and Disease Report 2022” indicates that in 2020, cardiovascular diseases accounted for 48.00% and 45.86% of all deaths in rural and urban areas, respectively, highlighting them as the primary mortality cause. Annually, China records nearly one million new ACS cases. The abrupt onset, grave severity, and intricate progression of ACS frequently culminate in death or disability due to acute myocardial ischemia [4], thereby imposing a significant challenge on China’s public health and healthcare systems.
In the early management of ACS, accurately determining patients’ risk levels for MACEs, such as myocardial infarctions, strokes, or death, is crucial. This phase of risk assessment enables physicians to tailor treatment strategies effectively, thereby reducing ACS mortality rates. Widely adopted in clinical settings are the Thrombolytic Therapy for Myocardial Infarction (TIMI) [5,6], the Global Registry of Acute Coronary Events (GRACE) [7,8], and the CRUSADE* bleeding risk assessment [9] tools. These traditional scoring mechanisms, generally derived from longitudinal cohort studies, offer substantial reliability. They track participants over extended periods to collect objective data on adverse outcomes, providing a vital evidence base for understanding cardiovascular disease etiology and evaluating diagnostic and therapeutic interventions. However, their effectiveness is constrained by a limited array of considered risk factors and the stringent inclusion criteria of cohort studies, which may not fully represent the broader clinical context. Additionally, the lengthy duration required to conclude these studies introduces challenges in maintaining consistent patient data collection. Moreover, despite the extensive use of these risk assessment tools in practice, their limitations in patient management are increasingly apparent. Particularly notable is the gap in integrating a digital patient management system, which leads to data fragmentation, inadequate real-time monitoring, and obstacles in personalizing treatment regimens [10]. Traditional risk evaluation tools depend on static data, neglecting the potential for patients’ conditions to evolve [11]. This static methodology hampers the ability of healthcare providers to accurately gauge current risk levels, potentially delaying or compromising treatment decisions. Furthermore, the lack of digital support complicates the personalization of treatment, a necessity in the era of precision medicine, which demands tailoring strategies to individual patient profiles, including lifestyles and medical history [12]. Without access to comprehensive, actionable patient data, developing and implementing personalized treatment plans faces considerable hurdles.
In recent years, the advancement of hospital informatization has paved the way for the development of sophisticated machine learning models leveraging electronic health record (EHR) data. These models are designed to mirror the complexity of the clinical environment closely and integrate a broader spectrum of risk factors. Wu et al. [13] analyzed data from 536 cases at the Geisinger Clinical Center to predict heart failure occurrence within six months prior to a formal diagnosis, employing various methodologies, including logistic regression, support vector machines, and boosting, complemented by diverse feature selection techniques. Similarly, Syed et al. [14] introduced a similarity-based unsupervised algorithm for the risk stratification of patients facing major adverse cardiac events, demonstrating its efficacy in identifying individuals at double the risk of such events within a 90-day timeframe. Further, Churpek et al. [15] developed a discrete-time multinomial logistic regression model targeting cardiac arrest events, which, by utilizing vital signs, demographic data, and test results from EHRs, outperformed traditional models relying solely on vital signs in predictive accuracy. Dong et al. [16] innovated by integrating genetic algorithms with fuzzy association rules to create an intelligent system for assessing the risk of unstable angina, wherein fuzzy sets are utilized to model the imprecision of input data, facilitating the extraction of a set of fuzzy rules for a risk evaluation. Verma [17] proposed a novel hybrid algorithm for a computer-aided diagnosis that combines effective feature selection, particle swarm optimization search techniques, and K-means clustering algorithms for risk factor identification. Krittanawong et al. [18] explored the utilization of artificial intelligence in identifying cardiovascular disease risk factors, optimizing treatment approaches, and forecasting patient outcomes, suggesting that a holistic digital management and data analysis approach could offer patients more customized treatment plans, thus enhancing therapeutic outcomes and patient satisfaction. Muthukumarasamy et al. developed a cloud-based tool to enhance ACS management by focusing on the revascularization of non-culprit lesions. Popa-Fotea et al. [19] demonstrated that machine learning improves diagnostic accuracy and treatment effectiveness across diverse diseases [20]. Lastly, Johnson et al. [21] focused on the extensive application of artificial intelligence in cardiology, encompassing the automatic detection of arrhythmias, the risk assessment of myocardial infarction, and the early diagnosis of cardiovascular diseases, noting that the integration of multiple machine learning techniques, such as logistic regression, support vector machines, and deep learning, can significantly augment the accuracy of cardiovascular disease predictions and diagnoses.
In summary, unlike traditional models based on cohort studies, data-driven research methods using structured EHR can handle a large number of cases without strict inclusion or exclusion criteria, reflecting the real clinical environment and more conveniently incorporating new risk factors. However, predicting MACEs in ACS is a typical imbalanced learning problem. Identifying outcomes from minority class samples correctly and considering potential associations among samples and features are challenges that need to be addressed. This paper proposes a Transformer model based on a gating mechanism aimed at improving the prediction of MACEs in patients with ACS. First, we utilized a TabTransformer [22] based on the Transformer architecture to process structured tabular data, effectively capturing potential relationships between samples and understanding and predicting interactions among complex biomarkers related to the disease. We then introduced a gating mechanism to solve the imbalanced learning problem. The gating mechanism regulates the flow of information, enhancing the model’s learning from minority class features, thus improving the identification of minority class samples without sacrificing the overall performance. Finally, a management platform for ACS patients was established around the TabTransformer model with the gating mechanism. This platform is used for risk prediction of major cardiovascular adverse events and recommends personalized management and treatment strategies based on the specific conditions of the patients. Through this approach, we aim to improve the treatment outcomes for ACS patients, reduce the incidence of major adverse cardiovascular events, and ultimately enhance the quality of life for patients.

2. Materials and Methods

2.1. Data Preprocessing

Data Collection. This study analyzed a comprehensive dataset containing records of cardiovascular inpatients collected from January 2014 to December 2019 at a hospital in Ningxia. The dataset includes six categories of real data: basic information, medical history, laboratory test results, follow-up data, echocardiogram (ECHO) data, and angiography data. The dataset contains structured basic information, medical history, laboratory test results, echocardiogram data, and angiography data, while follow-up data only retained the data structure without data instances.
Data Processing and Analysis. The data underwent rigorous preprocessing to ensure consistency, accuracy, and integrity of the analysis. The following preprocessing methods were applied:
Numerical Data: Numerical data were standardized by subtracting the mean and dividing by the standard deviation [23], and data were normalized between 0 and 1 to mitigate biases caused by different scales and distributions.
Categorical Data: Categorical variables were converted into binary columns, with each category represented by an independent column, using 1 to indicate presence and 0 to indicate absence [24]. A unique integer was assigned to each category.
Time Series Data: Features such as year, month, day, day of the week, and holiday flags were extracted to capture cyclical changes over time [25]. Sliding windows were used to generate historical aggregates to capture dynamic changes in time series data.
Handling Missing Data: Missing values were filled using the mean, median, mode, or model predictions [26]. Rows or columns with missing values were deleted when there were minimal missing data.
Outlier Handling: Data points significantly different from others were excluded [27].
These preprocessing steps ensured that the neural network model would not be biased due to differences in scale, missing values, or outliers.

2.2. Table Data Modeling Based on Gated Controllers

The TabTransformer rethinks the application of the Transformer architecture to tabular data, offering an effective way to encode categorical features into meaningful representations. Unlike traditional processing methods, where categorical features are treated as independent entities, the TabTransformer introduces the concept of context embeddings, allowing for the model to capture dependencies between features, thereby deepening its understanding of the data. Based on this, as shown in Figure 1, our work enhances the TabTransformer architecture by incorporating a gating mechanism into the self-attention mechanism. The introduction of the gating mechanism aims to improve the model’s dynamic adjustment capability for feature importance, enabling the model to identify and utilize the interactions between key features more flexibly.
Algorithm 1 outlines the steps for the gated Transformer model designed for MACE risk prediction. The procedure begins with the initialization of the dataset, which contains EHR data from ACS patients, followed by preprocessing to extract relevant features. The model, featuring a series of Transformer layers augmented with a gating mechanism, processes each feature set. This gating mechanism dynamically adjusts the attention outputs by applying calculated gating coefficients, which allow for precise modulation of the information flow based on the features’ significance. The model undergoes iterative training until convergence, ensuring adaptability and robust generalization across varied datasets. This method enhances the model’s accuracy in predicting MACE risk, making it highly applicable for clinical settings.
Algorithm 1: Gated Transformer Model for MACE Risk Prediction
1:   Initialize dataset D containing EHR data of ACS patients
2:   Preprocess D to extract features F for each patient
3:   Initialize the gated Transformer model M
4:   Define gate structure within M that adjusts information flow
5:   while model M has not converged do
6:   for each batch B from F do
7:     for each feature set f in B do
8:       Encode f using input embeddings to receive E
9:       for each Transformer layer L in M do
10:         Apply self-attention to E to receive attention outputs A
11:         Apply gate G to A:
12:            Compute   gating   coefficients   G c = s i g m o i d ( W g A + b g )
13:             Adjust   A   using   G c   to   receive   gated   outputs   G A = G c A
14:          Pass   G A to the next layer or output layer
15:       end for
16:     end for
17:     Calculate predictions P for B using final outputs from M
18:     Compute loss L between P and actual outcomes of B
19:     Update weights of M, including gate parameters, to minimize L
20:   end for
21:   Validate M on a separate validation set and adjust if necessary
22:   end while
23:    Deploy   M   for   evaluation   on   a   testing   set   F t e s t
24:    for   each   patient   p   in   F t e s t  do
25:    Predict   risk   R p using M for p’s features
26:    Output   R p indicating MACE risk
27:   end for
Ensure: High-accuracy MACE risk prediction incorporating dynamic gating

2.2.1. Pre-Training Embedding

The pre-training of embeddings was carried out by pre-training the embedding layers on related tasks, allowing for the transfer of knowledge between different tasks and improving the model’s performance on the target task. A common method for pre-training is to use unsupervised learning, such as autoencoders, to learn rich embeddings that represent the features. Suppose there is an autoencoder whose encoder part maps the input feature x i to a hidden representation h i , and then, the decoder part tries to reconstruct the original input, outputting x ^ i . The operations of the encoder and decoder can be expressed by the following formulas:
h i = f W e x i + b e
x ^ i = g W d h i + b d
Here, f and g are activation functions; W e and W d are the weight matrices for the encoder and decoder, respectively; and b e and b d are their biases.
The objective of pre-training is to minimize the difference between the input feature x i and the reconstructed feature x ^ i , typically using a loss function, such as mean squared error (MSE):
L o s s = 1 N i = 1 N x i x ^ i 2
After pre-training is completed, the weights W e and biases b e from the encoder part can be used to initialize the embedding matrix for the corresponding features in the TabTransformer. This method of pre-trained embeddings can help the model capture richer feature representations early in training, thereby improving learning efficiency and performance on the final task.

2.2.2. Transformer Layer

The Transformer layer is the cornerstone of the TabTransformer architecture, facilitating the generation of contextual embeddings for tabular data. It consists of multiple self-attention mechanisms and position-wise feed-forward networks.
Self-Attention Mechanism: Each self-attention mechanism computes the attention scores between all pairs of positions in the input sequence, enabling the model to focus on different parts of the sequence for different tasks. The attention function is defined as follows:
A t t e n t i o n Q , K , V = s o f t m a x Q K T d k V
where Q , K , and V are the queries, keys, and values matrices, respectively, and d k is the dimensionality of the keys.
Position-wise Feed-Forward Networks: Following the self-attention mechanism, each position in the input sequence is processed by a feed-forward network, applied independently. This network consists of two linear transformations with a ReLU activation in between.
Residual Connection and Layer Normalization: Both the self-attention and feed-forward networks incorporate residual connections followed by layer normalization, enhancing the flow of gradients and stabilizing the training process.

2.2.3. Column Embedding

For each categorical feature, an embedding layer maps each unique category to a high-dimensional space where the relationships between categories can be learned and represented more effectively. These embeddings serve as the input to the Transformer layers, facilitating the generation of context embeddings. For each categorical feature c i , its embedding representation can be obtained by looking up the embedding matrix E , where each row of E represents the embedding vector of a category. If the index corresponding to the categorical feature c i is j , then its embedding vector e i can be obtained using the following formula: e i = E [ j ] . For a feature with N possible categories, the embedding matrix E will be an N × D matrix, where D is the embedding dimension. In this way, each categorical feature is transformed into a D -dimensional vector, which is then fed into the Transformer layers of the TabTransformer.

2.2.4. Gated Self-Attention Mechanism

For the output of the attention mechanism A , the gating mechanism is applied:
G = σ X W g + b g
where W g   is the gating weight matrix, b g   is the bias term, and σ   is the Sigmoid activation function that produces gating coefficients G   ranging from 0 to 1, used to regulate the flow of information. The final gated self-attention output can be represented as: A = G A , where   denotes element-wise multiplication. Through this method, the model can dynamically adjust the contribution of each feature.

2.2.5. Gated Feed-Forward Network

In the standard feed-forward network (FFN), the gating mechanism should actually be integrated into the information flow to regulate and optimize the passage of information. Introducing a gating coefficient G F in the FFN aims to modulate the output after the nonlinear activation function, ensuring that only important signals can pass through:
F = R e L U A F 1 + b 1
where A is the output from the previous layer (such as the gated self-attention layer) and F 1 and b 1 are the weights and biases of the first layer of the FFN.
Similar to the gated self-attention mechanism, the gated FFN also employs a gating mechanism to regulate the flow of information. The calculation of the gating coefficient is as follows:
G F = σ A F g + b g F
Here, F g   and b g F are the weights and biases used for calculating the gating coefficient, and σ is the activation function (usually sigmoid) that ensures the output values are between 0 and 1, representing the degree to which each feature is allowed to pass.
After calculating the gating coefficient G F , it is applied to the output of the FFN as follows:
F = F G F
This allows for the model to dynamically adjust the amount of information passing through the feed-forward network, enhancing the model’s expressiveness and generalization ability.

2.3. Design of Acute Coronary Syndrome Patient Management Platform

As shown in Figure 2, this was the overall framework structure of the system. Healthcare providers operate visually through a system interface in the form of a Web website. The healthcare provider logs into the website, selects patients online, imports and visualizes the EHR of patients, and calls the model prediction service to perform adverse event predictions for the patient online. The data are stored in a database, which includes the imported EHR of patients, extracted feature data, model data, predicted result data, and intermediate process data.
EHR are collected with the patient as the basic unit, indexed by a unique patient ID, which can automatically associate the information of the patient’s past hospital stays. A patient corresponds to one or multiple hospital admissions. The diagnosis and treatment actions and the examination results of each admission are imported based on the hospitalization ID. After the electronic health record is imported into the system, it can be browsed and analyzed. Medical staff can use the models already loaded into memory to predict adverse events for patients. For example, to predict the occurrence of major cardiovascular adverse events, the system will automatically extract corresponding key factors from the entered patient information and make predictions. Particularly, medical personnel can also choose surgical or pharmaceutical interventions to predict the effect of intervention treatments on the patient.
The system’s web page was written based on Vue 3.3.0, the backend service was written in Java 1.8 based on Spring Boot, and the prediction model was set up using Python 3.9 based on the Flask framework.

3. Results

3.1. Results of Data Processing

After rigorous processing of 8583 data entries, we obtained 8005 training data entries. As shown in the table, the data are divided into six categories, comprising 134 variables: 8 items of basic information, 31 items of medical history, 30 items of laboratory test results, 25 items of follow-up data, 20 items of ECHO data, and 20 items of angiography data. The percentages shown in Table 1 represent the proportion of each data type relative to the total number of variables.
These proportions demonstrate the comprehensive nature of the dataset, ensuring that all relevant aspects of a patient’s medical profile are included. Each feature was verified by clinical experts to ensure accuracy and relevance to the study.
All six categories of data underwent preprocessing as described in Section 2.1. This included the normalization of numerical data, the binary encoding of categorical data, the handling of missing values, and outlier detection. This preprocessing ensured that the data were consistently formatted and free from biases that could affect model performance. The structured data were then presented to the model in a tabular format, capturing all relevant details in a standardized manner. This includes both binary classification variables and continuous numerical variables.
A complete set of the feature variables used in this study is provided in Appendix A. This appendix includes the names, types, and descriptions of each feature.

3.2. Comparison of Classification Results

In this study, the gold standard for comparison was a clinical assessment provided by a cardiologist. These experts reviewed patient data, including medical history, laboratory test results, and imaging data, from the 134 indicators in Table A1 to determine the risk of MACEs based on their clinical judgment and experience. Predictions made by different machine learning models are then compared to these expert assessments to assess their accuracy and effectiveness. This comparison ensures that the model’s predictions are consistent with established clinical practice and standards.
When evaluating the performance of different machine learning models on classification tasks, the receiver operating characteristic (ROC) curve and area under the curve (AUC) are widely used metrics. Figure 3 shows the ROC curves of five models for predicting major adverse cardiovascular events, namely the TabTransformer+Gate, TabTransformer, TabNet [28], GatedAdditiveTreeEnsemble [29], FTTransformer [30], and CategoryEmbedding [31] model, with the training parameters for the TabTranformer+Gate and TabTransformer presented in Table 2.
By comparing the area under the curve, it can be clearly seen that the TabTransformer+Gate model shows significant performance advantages. This result can be attributed to the model’s enhanced capability in feature selection and representation through the integration of the gating mechanism. The gated network can dynamically adjust the contribution of different features, optimizing feature representation, thereby achieving higher accuracy in classification tasks. Furthermore, the gating mechanism may also help the model achieve better generalization across different data subsets, reducing the risk of overfitting.
Second, the TabTransformer model without the gate mechanism and the FTTransformer model also exhibited high AUC values. Although there is a slight gap compared to the TabNet model, this demonstrates the Transformer architecture’s good representational learning ability in handling tabular data, effectively capturing long-distance dependencies between features through the self-attention mechanism. The TabNet model, with its sparse attention mechanism for feature selection, showed competitiveness on tabular datasets. Nonetheless, there is still room for improvement in terms of feature selection accuracy and model complexity.
The AUC value of the GatedAdditiveTreeEnsemble model, although lower than the aforementioned models, also showed relatively good performance. This indicates that hybrid methods combining ensemble tree models and deep learning models can improve classification effectiveness to some extent, even if they may not perform as well as purely deep learning methods. The CategoryEmbedding model had the lowest AUC value, which might indicate certain limitations of this model on the current dataset. This could be related to the model’s inability to fully capture complex patterns or feature interactions in the data, pointing out the need for improvement in its representation learning capability.
Gating mechanisms are particularly well-suited to Transformer architectures due to their ability to dynamically adjust the importance of each feature during training. The self-attention mechanism in Transformers dynamically weighs different parts of the input sequence, and the gating mechanism enhances this by controlling the flow of information, ensuring the model focuses on the most relevant features. Additionally, hierarchical processing through multiple Transformer layers benefits from the gating mechanism’s regulation of information flow, enhancing the model’s ability to learn hierarchical representations. This combination is crucial for tabular data with complex, multi-dimensional feature relationships.
To validate the effectiveness of the gating mechanism, additional experiments compared the ROC curves of the TabTransformer+Gate, TabTransformer, FTTransformer+Gate, and FTTransformer models. The results, shown in Figure 4, indicate that the TabTransformer+Gate model demonstrates superior performance with an AUC of 0.838. The FTTransformer+Gate model also shows improvement over the standard FTTransformer, achieving an AUC of 0.836 compared to 0.826. These findings suggest that while the gating mechanism benefits both models, it is particularly effective for the TabTransformer architecture.
In summary, the experimental results showcase the potential of using Transformer-based models on tabular data while highlighting the superior performance of the TabTransformer+Gate model in classification tasks on tabular data.
When evaluating the performance of classification models, the precision–recall (PR) curve is an important complement to the ROC curve, especially in cases of imbalanced data distribution. The PR curve illustrates the relationship between precision and recall at different thresholds, intuitively showing the model’s ability to classify minority classes.
As can be seen from Figure 5, the TabTransformer+Gate model continues to perform best among all models, with an area under the PR curve of 0.79. This highlights the superiority of the gating mechanism in improving the precision of model predictions in the face of imbalanced datasets.
The area under the PR curve for the TabTransformer and FTTransformer is slightly lower than that of the TabTransformer+Gate and FTTransformer+Gate. However, this still indicates the effectiveness of the Transformer in handling imbalanced data.
The confusion matrix is a particularly helpful tool for evaluating the performance of classification models because it not only shows the number of samples correctly classified by the model but also reveals cases of misclassification. Figure 6a–d represent the confusion matrices for the TabTransformer, TabTransformer+Gate, FTTransformer, and FTTransformer+Gate models, respectively, indicating model performance.
The confusion matrices for the models are summarized as follows: The TabTransformer+Gate model (Figure 6a) correctly predicted 1094 true positives (TP) and 160 true negatives (TN). In comparison, the TabTransformer model (Figure 6b) predicted 1099 TPs and 140 TNs but misclassified 49 negatives as false positives (FP) and 164 positives as false negatives (FN). The FTTransformer+Gate model (Figure 6c) predicted 1090 TPs and 114 TNs, with 190 negatives misclassified as FPs and 58 positives as FNs. Lastly, the FTTransformer model (Figure 6d) predicted 1080 TPs and 118 TNs, misclassifying 185 negatives as FPs and 68 positives as FNs.
Compared to the TabTransformer, the TabTransformer+Gate model shows a slight improvement in correctly predicting positives and a noticeable increase in correctly predicting negatives, resulting in a more balanced performance. Similarly, the FTTransformer+Gate model improves in correctly predicting positives but slightly decreases in correctly predicting negatives, excelling in reducing false negatives at the cost of increased false positives. Overall, the inclusion of the gate mechanism enhances the models’ ability to correctly predict positives and reduce false negatives, though it slightly increases false positives. The choice of model with or without a gate depends on the specific need to balance positive and negative prediction accuracy, with the TabTransformer+Gate offering a more balanced performance and the FTTransformer+Gate prioritizing positive prediction accuracy.
To further validate the gating performance, we also used F1-score, precision, and recall to evaluate the prediction performance of five adverse events: ischemic events, bleeding events, infectious events, febrile events, and revascularization. The results are shown in Figure 7, Figure 8, Figure 9, Figure 10 and Figure 11.
The F1-score constitutes the primary metric utilized within this study for assessing the performance of both entity recognition and entity relationship identification models. Essentially, it is calculated by combining precision and recall values. A high F1-score indicates exemplary model performance. These measurements can be computed as follows:
P r e c i s i o n = T P T P + F P
R e c a l l = T P   T P + F N
F 1 s c o r e = 2 P r e c i s i o n R e c a l l P r e c i s i o n + R e c a l
where TP is the number of positive samples that are correctly labelled as positive samples, FP is the number of negative samples that are incorrectly labelled as positive samples, and FN indicates the number of positive samples that are mistakenly labelled as negative samples.
The TabTransformer+Gate model maintains high precision, recall, and F1-scores across various prediction scenarios, demonstrating exceptional performance. This reflects the model’s effectiveness in feature learning and decision boundary delineation. Its advantage stems from the introduction of the Gate mechanism, which provides the model with more flexible representation capabilities when dealing with complex data structures. This flexibility helps the model exhibit better adaptability and robustness in predicting different types of adverse events.
Compared to the TabTransformer+Gate, the TabTransformer, while utilizing a similar foundational architecture, lacks the Gate mechanism, which may be why it performs slightly less well in certain scenarios. Other models, such as the FTTransformer+Gate, FTTransformer, TabNet, GatedAdditiveTreeEnsemble, and CategoryEmbedding, can achieve relatively satisfactory precision, recall, and F1-scores in prediction tasks. However, their overall performance reveals potential limitations in model complexity, feature representation, and generalization capabilities.

3.3. Implementation of the Design of Acute Coronary Syndrome Patient Management Platform

The Acute Coronary Syndrome Patient Management System, through the collection and analysis of real-time data, achieves the provision of timely feedback and intervention suggestions for patients, as well as offering real-time information on patient management for healthcare providers, thus realizing comprehensive management of patients with acute coronary syndrome. The system mainly consists of two modules: electronic health record entry and adverse event prediction.
Electronic health records are entered based on the patient as the basic unit. The system uses the unique patient ID as the index condition, automatically associating the patient’s hospitalization information from multiple admissions. Each patient corresponds to one or several hospitalizations. As shown in Figure 12, information entry is conducted through six types of structured questionnaires. Within the blue box, healthcare providers can choose any category among basic information, medical history, laboratory test results, follow-up data, echocardiogram data, and angiography data to start entering patient information. If the patient ID already exists, the system will update with the newly entered data while retaining unchanged data to ensure the uniqueness of the information.
After importing electronic health records into the system, they can be viewed and analyzed. Healthcare providers can use the TabTransformer+Gated model, already loaded into the memory, to predict adverse events for patients, estimating the probability of short-term major adverse cardiovascular events occurring as well as the probability of such events occurring after interventions, such as surgical or medication treatments. As shown in Figure 13, the yellow box displays the context with the highest model weights during the prediction of adverse events. The red box shows the model’s predicted probability of the current patient experiencing an adverse event. The purple box shows the probability of an adverse event occurring after intervention treatments (such as surgical or medication interventions), highlighting the key risk factors that influence the outcome of intervention treatments.

4. Discussion

Significance of Research Results: Our study introduces an innovative approach to predicting MACEs in patients with ACS through the use of the TabTransformer model enhanced with gating mechanisms. Given the dynamic nature of ACS and its significant burden on global health, this study is crucial. Unlike traditional risk stratification models, our method provides a more detailed understanding of patient data. By incorporating gating mechanisms into the self-attention layers and feed-forward network layers, the TabTransformer+Gate model offers a more flexible and dynamic way to process and learn features from tabular data. It effectively learns and differentiates the importance of various features, which is particularly important for understanding the complex interactions and dependencies between features. This enhanced model architecture gives the TabTransformer+Gate superior performance and generalization capabilities when dealing with tabular data that contain complex relationships and diverse features.
Comparison with Previous Work: The study introduces the TabTransformer+Gate model to predict MACEs in patients with ACS. This model incorporates Transformer architecture enhanced with a gating mechanism, providing a significant improvement in processing and analyzing complex, high-dimensional datasets. This dynamic gating mechanism allows for robust predictions across varying datasets, showing superior generalization capabilities. It effectively captures complex interactions between features, similar to findings in other studies that utilized advanced machine learning models for medical data analysis [32]. Ultimately, the TabTransformer+Gate model offers more precise MACE risk assessments, aligning with enhancements in clinical decision-making and patient management strategies noted in comparative studies [33]. These advancements in AI applications in healthcare suggest a significant improvement over traditional risk assessment methods and other deep learning models.
By integrating the gating mechanism with the TabTransformer+Gate model, we not only improved the accuracy of MACE predictions but also provided a new perspective on the application of deep learning in the healthcare domain.
Clinical Significance: We developed a management platform specifically designed for ACS patients. Through the application of the TabTransformer+Gate model, medical professionals can more effectively identify patients at high risk, thereby providing timely and targeted intervention measures. This early identification and personalized treatment approach not only helps to reduce the risk of MACEs but also significantly improves treatment outcomes for patients while optimizing resource allocation within the healthcare system [34,35]. The platform can integrate various types of data, including real-time patient data, offering a more dynamic, responsive, and adaptable management approach for patient care. Through this platform, medical teams can monitor patients’ health status in real time and develop and adjust treatment plans based on model-predicted risk levels, ensuring that patients receive the most appropriate treatment and care. This integrated and systematic management approach not only improves treatment efficiency but also enhances patient satisfaction, providing a comprehensive and personalized treatment and management plan for ACS patients.
Limitations and Challenges: Although our TabTransformer+Gate model demonstrated excellent performance in predicting MACEs in patients with ACS, its effectiveness highly depends on the quality and comprehensiveness of the input data. This highlights the importance of accurate and thorough data collection in clinical settings [36]. Despite the model considering a wide range of variables, the continuous advancement in medical research means the model needs to be regularly updated and optimized to include the latest risk factors and clinical insights [37].
Future Directions: Future research will focus on enhancing the model’s predictive capability by integrating novel biomarkers and imaging data, which may further improve its accuracy and reliability in clinical applications [38]. Additionally, to ensure the model’s universality and effectiveness, a broader assessment of its applicability across different populations and healthcare settings is needed. Developing a user-friendly interface will be key to facilitating the adoption of this technology in daily clinical practice, allowing for healthcare providers to perform advanced risk predictions without needing to understand the model’s details [39]. Such an interface will also help promote communication and collaboration between interdisciplinary teams, further integrating the model into the clinical decision-making process.
Broader Impact: This study not only contributes to the field of digital health but also demonstrates the potential of machine learning technologies to bridge the gap between data science and clinical practice. By leveraging advanced data analysis techniques, our model offers new perspectives for predicting outcomes of ACS and other cardiovascular diseases, potentially transforming future disease management and treatment strategies. Moreover, the approach and principles of the model can be extended to other medical fields, providing a template for a wide range of disease predictions and health management [40]. With the development of technology and the increasing richness of medical data, we anticipate seeing further development.

5. Conclusions

This study successfully demonstrates the application of a Transformer model combined with a gating mechanism to predict MACEs in ACS patients. Integrating gating mechanisms into Transformer models provides a novel approach to understanding and analyzing structured tabular data in the medical field, providing insights beyond traditional approaches in terms of depth and accuracy. Depending on clinical priorities (e.g., maximizing true-positive identification versus ensuring high predictive accuracy), one can choose between gating and non-gating mechanisms. For situations where the lack of positive cases is critical, a gated model is preferable. For contexts that emphasize accurate positive predictions, ungated models may be more appropriate. Through rigorous data preprocessing, an innovative model design, and a comprehensive performance evaluation, our findings highlight some key achievements and implications for future research and clinical practice. Furthermore, developing a patient management platform that incorporates predictive models as a new means of treating ACS patients could significantly improve the personalization and precision of treatment. Such a platform can optimize patients’ treatment plans, enhance patient compliance and effectiveness, and simultaneously achieve the dynamic monitoring and management of health status, ultimately improving patients’ prognosis and quality of life.

Author Contributions

Conceptualization, Y.M.; methodology, Y.M. and Z.F.; software, Z.F. and W.M.; validation, Y.M. and N.D.; investigation, Y.M.; resources, Y.M.; data curation, Z.J.; writing—original draft preparation, Y.M.; writing—review and editing, N.D. and S.W.; visualization, Z.F.; supervision, Y.M.; project administration, S.W. and N.D. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Key Research and Development Program of Ningxia Hui Autonomous Region, China [grant number 2023BEG02070], and [grant 2021AB11007] from the Key Re-search and Development Program of Guangxi, China.

Institutional Review Board Statement

This study has been reviewed and approved by the Institutional Review Board of the Tongxin County People’s Hospital in Ningxia Hui Autonomous Region. All research activities comply with ethical standards and strictly adhere to both international and domestic guidelines and regulations concerning research involving human participants. The IRB approval number for this study is [2022-11].

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.

Acknowledgments

The authors express their sincere gratitude to all the voluntary subjects who participated in this study.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Appendix A provides a comprehensive overview of the feature variables utilized in this study. Each feature is meticulously documented with its name, type, and a brief description, ensuring clarity and transparency. This detailed catalog includes variables spanning six categories: basic information, medical history, laboratory test results, follow-up data, ECHO data, and angiography data. By offering this level of detail, Appendix A facilitates the reproducibility of the study and allows for other researchers to understand the variables’ significance and potential impact on the model’s performance. This thorough documentation supports further validation and application of our findings in various clinical and research settings.
Table A1. Detailed feature variable overview.
Table A1. Detailed feature variable overview.
Variable NameCategoryTypeDescription
GenderBasic informationCategoricalThe biological sex of the patient.
Birth DateBasic informationNumericalThe date of birth of the patient.
Age at OnsetBasic informationNumericalThe age at which the patient first experienced symptoms.
Body Mass Index (BMI)Basic informationNumericalThe body mass index of the patient.
Initial Heart RateBasic informationNumericalThe patient’s heart rate upon first admission.
Initial Systolic BPBasic informationNumericalThe patient’s systolic blood pressure upon first admission.
Initial Diastolic BPBasic informationNumericalThe patient’s diastolic blood pressure upon first admission.
Initial OxygenationBasic informationNumericalThe patient’s oxygenation level upon first admission.
History of DiabetesMedical historyCategoricalWhether the patient has a history of diabetes.
History of HypertensionMedical historyCategoricalWhether the patient has a history of hypertension.
History of StrokeMedical historyCategoricalWhether the patient has a history of cerebrovascular disease.
History of PVDMedical historyCategoricalWhether the patient has a history of peripheral vascular disease.
History of DyslipidemiaMedical historyCategoricalWhether the patient has a history of dyslipidemia.
History of CADMedical historyCategoricalWhether the patient has a history of coronary artery disease.
History of CABGMedical historyCategoricalWhether the patient has a history of coronary artery bypass graft surgery.
History of Heart FailureMedical historyCategoricalWhether the patient has a history of heart failure.
History of COPDMedical historyCategoricalWhether the patient has a history of chronic obstructive pulmonary disease.
History of CKDMedical historyCategoricalWhether the patient has a history of chronic kidney disease.
History of GI UlcerMedical historyCategoricalWhether the patient has a history of gastrointestinal ulcers.
Rheumatic DiseasesMedical historyCategoricalWhether the patient has rheumatic diseases.
Hematologic DiseasesMedical historyCategoricalWhether the patient has hematologic diseases.
History of Hepatitis BMedical historyCategoricalWhether the patient has a history of hepatitis B.
Smoking HistoryMedical historyCategoricalWhether the patient has a history of smoking.
Smoking IndexMedical historyNumericalThe smoking index of the patient.
Menstrual HistoryMedical historyCategoricalThe patient’s menstrual history.
Age at MenopauseMedical historyNumericalThe age at which the patient experienced menopause.
Alcohol Consumption HistoryMedical historyCategoricalWhether the patient has a history of alcohol consumption.
History of Chest PainMedical historyCategoricalWhether the patient has a history of chest pain.
Angina Episodes > 2 in 24 h Before AdmissionMedical historyCategoricalWhether the patient experienced more than two episodes of angina in the 24 h before admission.
Time to First Medical ContactMedical historyNumericalThe time from symptom onset to first medical contact.
Symptoms at OnsetMedical historyCategoricalThe symptoms experienced by the patient at onset.
Chest PainMedical historyCategoricalWhether the patient experienced chest pain.
Radiating PainMedical historyCategoricalWhether the patient experienced radiating pain.
Nausea and VomitingMedical historyCategoricalWhether the patient experienced nausea and vomiting.
Profuse SweatingMedical historyCategoricalWhether the patient experienced profuse sweating.
Shortness of BreathMedical historyCategoricalWhether the patient experienced shortness of breath.
Loss of ConsciousnessMedical historyCategoricalWhether the patient experienced loss of consciousness.
Duration of SymptomsMedical historyNumericalThe duration of the symptoms experienced by the patient.
ECG Completed within 10 Minutes of ArrivalMedical historyCategoricalWhether an ECG was completed within 10 min of the patient’s arrival.
High-Sensitivity CRPLaboratory testNumericalThe level of high-sensitivity C-reactive protein in the patient.
BNPLaboratory testNumericalThe level of brain natriuretic peptide in the patient.
cTnILaboratory testNumericalThe level of cardiac troponin I in the patient.
PCTLaboratory testNumericalThe level of procalcitonin in the patient.
IL-6Laboratory testNumericalThe level of interleukin-6 in the patient.
D-DimerLaboratory testNumericalThe level of D-dimer in the patient.
pHLaboratory testNumericalThe pH level of the patient’s blood.
AlbuminLaboratory testNumericalThe level of albumin in the patient.
ALTLaboratory testNumericalThe level of alanine aminotransferase in the patient.
WBC CountLaboratory testNumericalThe white blood cell count of the patient.
RDWLaboratory testNumericalThe red cell distribution width of the patient.
RBC CountLaboratory testNumericalThe red blood cell count of the patient.
Hematocrit (HCT)Laboratory testNumericalThe hematocrit level of the patient.
Creatinine (Cr)Laboratory testNumericalThe creatinine level of the patient.
Creatine Kinase (CK)Laboratory testNumericalThe creatine kinase level of the patient.
Potassium (K)Laboratory testNumericalThe potassium level of the patient.
Calcium (Ca)Laboratory testNumericalThe calcium level of the patient.
Total Cholesterol (TC)Laboratory testNumericalThe total cholesterol level of the patient.
LDL-CLaboratory testNumericalThe low-density lipoprotein cholesterol level of the patient.
HDL-CLaboratory testNumericalThe high-density lipoprotein cholesterol level of the patient.
Triglycerides (TG)Laboratory testNumericalThe triglyceride level of the patient.
GlucoseLaboratory testNumericalThe glucose level of the patient.
Fibrinogen (FIB)Laboratory testNumericalThe fibrinogen level of the patient.
Hemoglobin (Hb)Laboratory testNumericalThe hemoglobin level of the patient.
Platelet Count (PLT)Laboratory testNumericalThe platelet count of the patient.
Blood Urea Nitrogen (BUN)Laboratory testNumericalThe blood urea nitrogen level of the patient.
Uric Acid (UA)Laboratory testNumericalThe uric acid level of the patient.
Triiodothyronine (T3)Laboratory testNumericalThe triiodothyronine level of the patient.
Thyroxine (T4)Laboratory testNumericalThe thyroxine level of the patient.
Free Thyroxine (FT4)Laboratory testNumericalThe free thyroxine level of the patient.
Quality of LifeFollow-upCategoricalThe patient’s perceived quality of life.
Regular Follow-upFollow-upCategoricalWhether the patient has regular follow-ups.
Strict Smoking CessationFollow-upCategoricalWhether the patient strictly abstains from smoking.
Blood Pressure ControlFollow-upCategoricalWhether the patient’s blood pressure is controlled.
Blood Glucose ControlFollow-upCategoricalWhether the patient’s blood glucose is controlled.
DeceasedFollow-upCategoricalWhether the patient is deceased.
Cause of DeathFollow-upCategoricalThe cause of the patient’s death.
Recurrent MIFollow-upCategoricalWhether the patient has had a recurrent myocardial infarction.
Number of Recurrent MIsFollow-upNumericalThe number of recurrent myocardial infarctions the patient has had.
ArrhythmiaFollow-upCategoricalWhether the patient has experienced arrhythmia.
Time of Arrhythmia OnsetFollow-upNumericalThe time when the patient experienced arrhythmia.
Atrial FibrillationFollow-upCategoricalWhether the patient has atrial fibrillation.
AV BlockFollow-upCategoricalWhether the patient has atrioventricular block.
Bundle Branch BlockFollow-upCategoricalWhether the patient has bundle branch block.
Ventricular ArrhythmiaFollow-upCategoricalWhether the patient has ventricular arrhythmia.
Arrhythmia TreatmentFollow-upCategoricalThe treatment for the patient’s arrhythmia.
Heart FailureFollow-upCategoricalWhether the patient has heart failure.
Time of Heart Failure OnsetFollow-upNumericalThe time when the patient experienced heart failure.
StrokeFollow-upCategoricalWhether the patient has experienced a stroke.
Time of Stroke OnsetFollow-upNumericalThe time when the patient experienced a stroke.
Type of StrokeFollow-upCategoricalThe type of stroke the patient experienced.
BleedingFollow-upCategoricalWhether the patient has experienced bleeding.
Time of Bleeding OnsetFollow-upNumericalThe time when the patient experienced bleeding.
Bleeding ClassificationFollow-upCategoricalThe classification of the patient’s bleeding.
Current Oral MedicationsFollow-upCategoricalThe current oral medications the patient is taking.
Ascending Aorta DiameterECHONumericalThe diameter of the patient’s ascending aorta.
Aortic Root DiameterECHONumericalThe diameter of the patient’s aortic root.
Right Ventricular Outflow TractECHONumericalThe measurement of the right ventricular outflow tract.
Right Ventricular Anteroposterior DiameterECHONumericalThe anteroposterior diameter of the right ventricle.
Aortic DiameterECHONumericalThe diameter of the patient’s aorta.
Left Ventricular End-Diastolic DiameterECHONumericalThe end-diastolic diameter of the left ventricle.
Left Ventricular End-Systolic DiameterECHONumericalThe end-systolic diameter of the left ventricle.
Left Atrial Anteroposterior DiameterECHONumericalThe anteroposterior diameter of the left atrium.
Interventricular Septal End-Diastolic ThicknessECHONumericalThe end-diastolic thickness of the interventricular septum.
Right Atrial Vertical DiameterECHONumericalThe vertical diameter of the right atrium.
Right Atrial Horizontal DiameterECHONumericalThe horizontal diameter of the right atrium.
Left Ventricular Ejection Fraction (LVEF)ECHONumericalThe ejection fraction of the left ventricle.
Left Ventricular Shortening FractionECHONumericalThe shortening fraction of the left ventricle.
Aortic Valve RegurgitationECHOCategoricalWhether the patient has aortic valve regurgitation.
Aortic Valve CalcificationECHOCategoricalWhether the patient has aortic valve calcification.
Ventricular Wall ThinningECHOCategoricalWhether the patient has thinning of the ventricular wall.
Location of Wall Thinning: Anterior SeptumECHOCategoricalWhether the wall thinning is located at the anterior septum.
Location of Wall Thinning: SeptumECHOCategoricalWhether the wall thinning is located at the septum.
Location of Wall Thinning: Anterior WallECHOCategoricalWhether the wall thinning is located at the anterior wall.
Location of Wall Thinning: Lateral WallECHOCategoricalWhether the wall thinning is located at the lateral wall.
Admission Heart Rate (HR1)AngiographyNumericalThe patient’s heart rate upon admission.
Admission Systolic BP (SP1)AngiographyNumericalThe patient’s systolic blood pressure upon admission.
Admission Diastolic BP (DP1)AngiographyNumericalThe patient’s diastolic blood pressure upon admission.
Initial Oxygen Saturation (Spo21)AngiographyNumericalThe patient’s oxygen saturation level upon first measurement.
Use of Vasopressors During SurgeryAngiographyCategoricalWhether vasopressors were used during surgery.
Intraoperative ComplicationsAngiographyCategoricalThe complications that occurred during surgery.
Surgical ProcedureAngiographyCategoricalWhether the patient underwent surgery.
Intraoperative Contrast Agent DosageAngiographyNumericalThe dosage of contrast agent used during surgery.
Intraoperative Heparin DosageAngiographyNumericalThe dosage of heparin used during surgery.
Coronary DominanceAngiographyCategoricalThe dominance pattern of the coronary arteries.
Culprit Vessel CharacteristicsAngiographyCategoricalThe characteristics of the culprit vessel.
TIMI Flow Grade Pre- and Post-Intervention in Culprit VesselAngiographyCategoricalThe TIMI flow grade in the culprit vessel before and after intervention.
Coronary Lesion Location and Stenosis DegreeAngiographyCategoricalThe location and degree of stenosis of the coronary lesion.
Number and Type of Stents ImplantedAngiographyNumericalThe number and types of stents implanted.
Use of Thrombus Aspiration During SurgeryAngiographyCategoricalWhether thrombus aspiration was used during surgery.
Planned Second PCIAngiographyCategoricalWhether a second PCI is planned.
Medication Adherence at DischargeAngiographyCategoricalThe adherence to prescribed medications at discharge.
Use of Temporary Pacemaker, ECMO, IABP During SurgeryAngiographyCategoricalWhether a temporary pacemaker, ECMO, or IABP was used during surgery.
Management of Intraoperative ComplicationsAngiographyCategoricalThe management strategies for complications during surgery.
Use of Intravascular Imaging During Stent PlacementAngiographyCategoricalWhether intravascular imaging was used during stent placement.

References

  1. Amsterdam, E.A.; Wenger, N.K.; Brindis, R.G.; Casey, D.E.; Ganiats, T.G.; Holmes, D.R.; Jaffe, A.S.; Jneid, H.; Kelly, R.F.; Kontos, M.C.; et al. 2014 AHA/ACC Guideline for the Management of Patients with Non–ST-Elevation Acute Coronary Syndromes A Report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. J. Am. Coll. Cardiol. 2014, 64, e139–e228. [Google Scholar] [CrossRef] [PubMed]
  2. Tsao, C.W.; Aday, A.W.; Almarzooq, Z.I.; Alonso, A.; Beaton, A.Z.; Bittencourt, M.S.; Boehme, A.K.; Buxton, A.E.; Carson, A.P.; Commodore-Mensah, Y.; et al. Heart disease and stroke statistics—2022 up-date: A report from the American Heart Association. Circulation 2022, 145, e153–e639. [Google Scholar] [CrossRef] [PubMed]
  3. Emergency Physician Branch of the Chinese Medical Doctor Association; Emergency Medicine Expert Committee of the Center for Capacity Building and Continuing Education of the National Health Commission; Emergency and First Aid Branch of the China Association for the Promotion of International Healthcare Exchange. Guidelines for Rapid Diagnosis and Treatment of Acute Coronary Syndrome (2019). Chin. J. Emerg. Med. 2019, 29, 421–428. [Google Scholar]
  4. Zeng, W.; Sheng, H. Interpretation of key points of “China Cardiovascular Health and Disease Report 2022”. Chin. J. Cardiovasc. 2023, 28, 297–312. [Google Scholar]
  5. Antman, E.M.; McCabe, C.H.; Gurfinkel, E.P.; Turpie, A.G.; Bernink, P.J.; Salein, D.; Bayes De Luna, A.; Fox, K.; Lablanche, J.M.; Radley, D.; et al. Enoxaparin prevents death and cardiac ischemic events in unstable angina/non-Q-wave myocardial infarction. Results of the thrombolysis in myocardial infarction (TIMI) 11B trial. Circulation 1999, 100, 1593–1601. [Google Scholar] [CrossRef] [PubMed]
  6. Antman, E.M.; Cohen, M.; Bernink, P.J.; McCabe, C.H.; Horacek, T.; Papuchis, G.; Mautner, B.; Cor-balan, R.; Radley, D.; Braunwald, E. The TIMI risk score for unstable angina/non-ST elevation MI: A method for prognostication and therapeutic decision making. JAMA 2000, 284, 835–842. [Google Scholar] [CrossRef] [PubMed]
  7. GRACE Investigators. Rationale and design of the GRACE (Global Registry of Acute Coronary Events) Project: A multinational registry of patients hospitalized with acute coronary syndromes. Am. Heart J. 2001, 141, 190–199. [Google Scholar] [CrossRef] [PubMed]
  8. Granger, C.B.; Goldberg, R.J.; Dabbous, O.; Pieper, K.S.; Eagle, K.A.; Cannon, C.P.; Van De Werf, F.; Avezum, A.; Goodman, S.G.; Flather, M.D.; et al. Predictors of hospital mortality in the global registry of acute coronary events. Arch. Intern. Med. 2003, 163, 2345–2353. [Google Scholar] [CrossRef] [PubMed]
  9. Subherwal, S.; Bach, R.G.; Chen, A.Y.; Gage, B.F.; Rao, S.V.; Newby, L.K.; Wang, T.Y.; Gibler, W.B.; Ohman, E.M.; Roe, M.T.; et al. Baseline risk of major bleeding in non-ST-segment-elevation myocardial infarction: The CRUSADE (Can Rapid risk stratification of Unstable angina patients Suppress ADverse outcomes with Early implementation of the ACC/AHA Guidelines) Bleeding Score. Circulation 2009, 119, 1873–1882. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  10. Bhavnani, S.P. Digital health: Opportunities and challenges to develop the next-generation technology-enabled models of cardiovascular care. Methodist DeBakey Cardiovasc. J. 2020, 16, 296. [Google Scholar] [CrossRef]
  11. Shah, P.; Kendall, F.; Khozin, S.; Goosen, R.; Hu, J.; Laramie, J.; Ringel, M.; Schork, N. Artificial intelligence and machine learning in clinical development: A translational perspective. NPJ Digit. Med. 2019, 2, 69. [Google Scholar] [CrossRef] [PubMed]
  12. Quer, G.; Arnaout, R.; Henne, M.; Arnaout, R. Machine learning and the future of cardiovascular care: JACC state-of-the-art review. J. Am. Coll. Cardiol. 2021, 77, 300–313. [Google Scholar] [CrossRef] [PubMed]
  13. Wu, J.; Roy, J.; Stewart, W.F. Prediction modeling using EHR data: Challenges, strategies, and a comparison of machine learning approaches. Med. Care. 2010, 48 (Suppl. S6), S106–S113. [Google Scholar] [CrossRef] [PubMed]
  14. Syed, Z.; Guttag, J. Unsupervised Similarity-Based Risk Stratification for Cardiovascular Events Using Long-Term Time-Series Data. J. Mach. Learn. Res. 2011, 12, 999–1024. [Google Scholar]
  15. Churpek, M.M.; Yuen, T.C.; Park, S.Y.; Gibbons, R.; Edelson, D.P. Using electronic health record data to develop and validate a prediction model for adverse outcomes in the wards. Crit Care Med. 2014, 42, 841–848. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  16. Dong, W.; Huang, Z.; Ji, L.; Duan, H. A genetic fuzzy system for unstable angina risk assessment. BMC Med. Inform. Decis. Mak. 2014, 14, 12. [Google Scholar] [CrossRef] [PubMed]
  17. Verma, L.; Srivastava, S.; Negi, P.C. A hybrid data mining model to predict coronary artery disease cases using non-invasive clinical data. J. Med. Syst. 2016, 40, 178. [Google Scholar] [CrossRef] [PubMed]
  18. Krittanawong, C.; Zhang, H.; Wang, Z.; Aydar, M.; Kitai, T. Artificial intelligence in precision cardiovascular medicine. J. Am. Coll. Cardiol. 2017, 69, 2657–2664. [Google Scholar] [CrossRef] [PubMed]
  19. Muthukumarasamy, S.; Tamilarasan, A.K.; Ayeelyan, J.; Adimoolam, M. Machine learning in healthcare diagnosis. Blockchain Mach. Learn. E-Healthc. Syst. 2020, 13, 343–366. [Google Scholar]
  20. Popa-Fotea, N.M.; Calmac, L.; Micheu, M.M.; Cosmin, M.; Scarlatescu, A.; Zamfir, D.; Itu, L.M.; Tache, I.A.; Stoian, D.; Hatfaludi, C.A.; et al. A cloud-based platform for clinical decision support in acute coronary syndrome patients: Study methodology. Kardiol. Pol. Pol. Heart J. 2022, 80, 604–607. [Google Scholar] [CrossRef]
  21. Johnson, K.W.; Torres Soto, J.; Glicksberg, B.S.; Shameer, K.; Miotto, R.; Ali, M.; Ashley, E.; Dudley, J.T. Artificial intelligence in cardiology. J. Am. Coll. Cardiol. 2018, 71, 2668–2679. [Google Scholar] [CrossRef] [PubMed]
  22. Huang, X.; Khetan, A.; Cvitkovic, M.; Karnin, Z. Tabtransformer: Tabular data modeling using con-textual embeddings. arXiv 2020, arXiv:2012.06678. [Google Scholar]
  23. Han, J.; Pei, J.; Tong, H. Data Mining: Concepts and Techniques; Morgan kaufmann: Burlington, MA, USA, 2022. [Google Scholar]
  24. Hastie, T.; Tibshirani, R.; Friedman, J.H.; Friedman, J.H. The Elements of Statistical Learning: Data Mining, Inference, and Prediction; Springer: New York, NY, USA, 2009. [Google Scholar]
  25. Tsay, R.S. Analysis of Financial Time Series; John Wiley & Sons: Hoboken, NJ, USA, 2005. [Google Scholar]
  26. Little, R.J.A.; Rubin, D.B. Statistical Analysis with Missing Data; John Wiley & Sons: Hoboken, NJ, USA, 2019. [Google Scholar]
  27. Barnett, V.; Lewis, T. Outliers in Statistical Data; Wiley: New York, NY, USA, 1994. [Google Scholar]
  28. Arik, S.Ö.; Pfister, T. Tabnet: Attentive interpretable tabular learning. Proc. AAAI Conf. Artif. Intell. 2021, 35, 6679–6687. [Google Scholar] [CrossRef]
  29. Joseph, M.; Raj, H. Gate: Gated additive tree ensemble for tabular classification and regression. arXiv 2022, arXiv:2207.08548. [Google Scholar]
  30. Gorishniy, Y.; Rubachev, I.; Khrulkov, V.; Babenko, A. Revisiting deep learning models for tabular data. Adv. Neural Inf. Process. Syst. 2021, 34, 18932–18943. [Google Scholar]
  31. Türker, R.; Zhang, L.; Koutraki, M.; Sack, H. Knowledge-based short text categorization using entity and category embedding. In Proceedings of the Semantic Web: 16th International Conference, ESWC 2019, Portorož, Slovenia, 2–6 June 2019; Proceedings 16. Springer International Publishing: Cham, Switzerland; pp. 346–362. [Google Scholar]
  32. Zhang, X.; Wang, X.; Xu, L.; Liu, J.; Ren, P.; Wu, H. The predictive value of machine learning for mortality risk in patients with acute coronary syndromes: A systematic review and meta-analysis. Eur. J. Med. Res. 2023, 28, 451. [Google Scholar] [CrossRef] [PubMed]
  33. Kasim, S.; Malek, S.; Song, C.; Ahmad, W.A.W.; Fong, A.; Ibrahim, K.S.; Safiruz, M.S.; Aziz, F.; Hiew, J.H.; Ibrahim, N. In-hospital mortality risk stratification of Asian ACS patients with artificial intelligence algorithm. PLoS ONE 2022, 17, e0278944. [Google Scholar] [CrossRef] [PubMed]
  34. Rajkomar, A.; Oren, E.; Chen, K.; Dai, A.M.; Hajaj, N.; Hardt, M.; Liu, P.J.; Liu, X.; Marcus, J.; Sun, M.; et al. Scalable and accurate deep learning with electronic health records. NPJ Digit. Med. 2018, 1, 1–10. [Google Scholar] [CrossRef] [PubMed]
  35. Angraal, S.; Krumholz, H.M.; Schulz, W.L. Blockchain technology: Applications in health care. Circ. Cardiovasc. Qual. Outcomes 2017, 10, e003800. [Google Scholar] [CrossRef]
  36. Topol, E.J. High-performance medicine: The convergence of human and artificial intelligence. Nat. Med. 2019, 25, 44–56. [Google Scholar] [CrossRef]
  37. Beam, A.L.; Kohane, I.S. Big data and machine learning in health care. JAMA 2018, 319, 1317–1318. [Google Scholar] [CrossRef] [PubMed]
  38. Gulshan, V.; Peng, L.; Coram, M.; Stumpe, M.C.; Wu, D.; Narayanaswamy, A.; Venugopalan, S.; Widner, K.; Madams, T.; Cuadros, J.; et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 2016, 316, 2402–2410. [Google Scholar] [CrossRef] [PubMed]
  39. Esteva, A.; Robicquet, A.; Ramsundar, B.; Kuleshov, V.; DePristo, M.; Chou, K.; Cui, C.; Corrado, G.; Thrun, S.; Dean, J. A guide to deep learning in healthcare. Nat. Med. 2019, 25, 24–29. [Google Scholar] [CrossRef] [PubMed]
  40. Obermeyer, Z.; Emanuel, E.J. Predicting the future—Big data, machine learning, and clinical medicine. N. Engl. J. Med. 2016, 375, 1216. [Google Scholar] [CrossRef]
Figure 1. TabTransformer structure based on gate mechanism.
Figure 1. TabTransformer structure based on gate mechanism.
Bioengineering 11 00551 g001
Figure 2. System architecture of acute coronary syndrome management platform.
Figure 2. System architecture of acute coronary syndrome management platform.
Bioengineering 11 00551 g002
Figure 3. Comparative ROC curves for multiple models.
Figure 3. Comparative ROC curves for multiple models.
Bioengineering 11 00551 g003
Figure 4. Comparison of gating mechanisms in transformer models.
Figure 4. Comparison of gating mechanisms in transformer models.
Bioengineering 11 00551 g004
Figure 5. Comparative analysis of PR curves across models.
Figure 5. Comparative analysis of PR curves across models.
Bioengineering 11 00551 g005
Figure 6. Comparative confusion matrices for multiple models. (a): Confusion matrix for the model combining the TabTransformer architecture with a gating mechanism. (b) Confusion matrix for the basic TabTransformer model. (c) Confusion matrix for the FTTransformer model enhanced with a gating mechanism. (d) Confusion matrix for the basic FTTransformer model.
Figure 6. Comparative confusion matrices for multiple models. (a): Confusion matrix for the model combining the TabTransformer architecture with a gating mechanism. (b) Confusion matrix for the basic TabTransformer model. (c) Confusion matrix for the FTTransformer model enhanced with a gating mechanism. (d) Confusion matrix for the basic FTTransformer model.
Bioengineering 11 00551 g006
Figure 7. Model performance in predicting ischemic events.
Figure 7. Model performance in predicting ischemic events.
Bioengineering 11 00551 g007
Figure 8. Model performance in predicting bleeding events.
Figure 8. Model performance in predicting bleeding events.
Bioengineering 11 00551 g008
Figure 9. Model performance in predicting infection events.
Figure 9. Model performance in predicting infection events.
Bioengineering 11 00551 g009
Figure 10. Model performance in predicting fever events.
Figure 10. Model performance in predicting fever events.
Bioengineering 11 00551 g010
Figure 11. Model performance in predicting vascular reconstruction.
Figure 11. Model performance in predicting vascular reconstruction.
Bioengineering 11 00551 g011
Figure 12. Patient information entry snapshot.
Figure 12. Patient information entry snapshot.
Bioengineering 11 00551 g012
Figure 13. Patient risk prediction snapshot.
Figure 13. Patient risk prediction snapshot.
Bioengineering 11 00551 g013
Table 1. Data distribution.
Table 1. Data distribution.
Data TypesQuantityProportion
Basic information86%
Medical history3123.1%
Laboratory test results3022.3%
Follow-up 2518.6%
ECHO2015%
Angiography2015%
Table 2. Model selection parameter setting.
Table 2. Model selection parameter setting.
ParameterValue
Learning Rate5 × 10−6
Input embedding dimension128
Batch size256
Epoch200
Dropout0.1
Attention heads 8
Attention blocks6
Attention Dropout0.1
AddNorm Dropout0.1
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mei, Y.; Jin, Z.; Ma, W.; Ma, Y.; Deng, N.; Fan, Z.; Wei, S. Optimizing Acute Coronary Syndrome Patient Treatment: Leveraging Gated Transformer Models for Precise Risk Prediction and Management. Bioengineering 2024, 11, 551. https://doi.org/10.3390/bioengineering11060551

AMA Style

Mei Y, Jin Z, Ma W, Ma Y, Deng N, Fan Z, Wei S. Optimizing Acute Coronary Syndrome Patient Treatment: Leveraging Gated Transformer Models for Precise Risk Prediction and Management. Bioengineering. 2024; 11(6):551. https://doi.org/10.3390/bioengineering11060551

Chicago/Turabian Style

Mei, Yingxue, Zicai Jin, Weiguo Ma, Yingjun Ma, Ning Deng, Zhiyuan Fan, and Shujun Wei. 2024. "Optimizing Acute Coronary Syndrome Patient Treatment: Leveraging Gated Transformer Models for Precise Risk Prediction and Management" Bioengineering 11, no. 6: 551. https://doi.org/10.3390/bioengineering11060551

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop