Cardiovascular Diseases Diagnosis Using an ECG Multi-Band Non-Linear Machine Learning Framework Analysis

Ribeiro, Pedro; Sá, Joana; Paiva, Daniela; Rodrigues, Pedro Miguel

doi:10.3390/bioengineering11010058


CBQF—Centro de Biotecnologia e Química Fina, Laboratório Associado, Escola Superior de Biotecnologia, Universidade Católica Portuguesa, Rua de Diogo Botelho 1327, 4169-005 Porto, Portugal

* Author to whom correspondence should be addressed.

Bioengineering 2024, 11(1), 58; https://doi.org/10.3390/bioengineering11010058

Submission received: 7 November 2023 / Revised: 13 December 2023 / Accepted: 5 January 2024 / Published: 7 January 2024

(This article belongs to the Section Biosignal Processing)


Abstract

Background: cardiovascular diseases (CVDs), which encompass heart and blood vessel disorders, are the leading cause of mortality worldwide. Methods: the present study aims to discriminate between seven well-known CVDs (bundle branch block, cardiomyopathy, myocarditis, myocardial hypertrophy, myocardial infarction, valvular heart disease, and dysrhythmia) and one healthy control group by feeding a set of machine learning (ML) models with 10 non-linear features extracted every 1 s from the electrocardiography (ECG) lead signals of a well-known ECG database (PTB diagnostic ECG database), using multi-band analysis performed by discrete wavelet transform (DWT). The ML models were trained and tested using a leave-one-out cross-validation approach, assessing the individual and combined capabilities of the features, per lead or across combined leads, to distinguish between pairs of study groups and to conduct a comprehensive all vs. all analysis. Results: the $Accuracy$ discrimination results ranged between 73% and 100%, the $Recall$ between 68% and 100%, and the $AUC$ between 0.42 and 1. Conclusions: the results suggest that our method is a good tool for distinguishing CVDs, offering significant advantages over other studies that used the same dataset, including a multi-class comparison group (all vs. all), a wider range of binary comparisons, and the use of classical non-linear analysis under ECG multi-band analysis performed by DWT.

Keywords:

ECG signals; cardiovascular diseases; machine learning models; discrete wavelet transform; non-linear analysis; discrimination

1. Introduction

Heart and blood vessel problems, known as cardiovascular diseases (CVDs), are the main reason why many people die around the world [1]. According to the World Health Organization, 32% of global mortality is attributed to cardiovascular diseases, with the most prevalent being arrhythmias, cardiac arrests, and heart failure. It is estimated that CVDs take about 17.9 million lives every year [2]. Focusing on cardiac pathology and considering how much work the heart constantly does, it is amazing that it functions so well for a long time for many people. However, it can also experience problems and stop working properly due to risk factors like cholesterol, high blood pressure, cigarette smoking, diabetes mellitus, and adiposity [3].

Heart disease is a term for health issues that affect the heart’s function and condition. There are different types of heart disease, including: (1) Cardiomyopathy: heart muscle structural and functional abnormality without underlying coronary issues [4]; (2) Endocarditis: infection and inflammation of heart valves and inner lining [5]; (3) Myocarditis: inflammation of middle heart wall layer affecting blood pumping [6]; (4) Pericarditis: inflammation of the thin sac surrounding the heart [7]; (5) Coronary artery disease: cholesterol-filled plaques blocking heart arteries [8]; (6) Heart attack: sudden blockage of blood flow to heart muscle [9]; (7) Heart failure: symptoms include breathlessness, ankle swelling, and fatigue [10]; (8) Heart rhythm disorders (arrhythmias): irregular heartbeats [11]; (9) Sudden cardiac arrest: sudden stoppage of heartbeat [12]; (10) Heart valve disorders: issues with valves controlling blood flow [13]; (11) Congenital heart disease: heart abnormalities present from birth [14].

The beginning of the diagnosis of heart disease involves evaluating the patient’s medical history and conducting a physical examination. Afterwards, laboratory tests and/or additional non-invasive and invasive diagnostic exams can be performed [2]. Natriuretic peptides are the most common laboratory tests used to diagnose heart diseases. They can help identify individuals at higher risk of sudden cardiac death in the general population or patients with coronary artery disease [11]. However, several other non-invasive and invasive tests can be performed: (1) Electrocardiogram (ECG) and ambulatory monitoring: the 12-lead ECG is a key diagnostic test for cardiovascular diseases, assessing risk, and identifying arrhythmias [11]. Choose monitoring time based on symptom frequency. Holter for daily arrhythmias, patient-activated ECG for less frequent events, and ILRs for serious cases [11,15]; (2) Stress tests: monitor the heart during treadmill/bike exercise to assess response and detect exercise-related disorders like arrhythmias, ventricular tachycardia, coronary artery disease, and long QT syndrome [11,16]. Exercise tests aid in diagnosing long QT syndrome by measuring the QTc interval after 4 min of exercise [16]; (3) Imaging tests: essential for assessing heart function and detecting problems like cardiomyopathies [17]. Negative results may indicate primary electrical diseases [4]; (4) Electrophysiological study: exam to diagnose and guide treatment, involving measuring cardiac intervals, controlling electrical stimulation, and mapping heart structures. Effectiveness varies based on heart condition, presence of spontaneous ventricular tachycardia, medication use, and stimulation mode [18]; (5) Provocative diagnostic tests: use sodium channel blockers, adenosine, or epinephrine to detect syndromes. 
Use acetylcholine or ergonovine to assess coronary spasm as the cause of ventricular fibrillation [19]; (6) Genetic testing: next-gen sequencing made genetic testing accessible. Comprehensive gene panels reveal variations causing or modifying features in syndromes like Brugada, long QT, and hypertrophic and dilated cardiomyopathy [20]; (7) Cardiac catheterisation: catheter inserted into a blood vessel, and guided to the heart with X-ray images and dye to check for blockages [11].

In recent years, there has been a notable surge in computational power, driven by advanced hardware, parallel computing, cloud resources, and increased data accessibility. These developments have significantly enhanced the applications of machine learning (ML) in the diagnosis of CVDs [21]. The role of ML in CVDs is pivotal, as it harnesses data from medical tests to improve diagnostics and management, reducing human error, improving efficiency, and enhancing patient outcomes [22]. It contributes to early disease detection, precise risk assessment, advanced image analysis, predictive modelling, tailored treatment plans, remote patient monitoring, and expedited drug discovery [23]. However, ongoing research is essential to enhance this critical field further and save lives. Thus, for this study, our method will focus on ML-based ECG signal analysis approaches for discriminating CVDs, and thus we present in Table 1 the state of the art of this topic. The heart, operating as a non-linear system, manifests its electrical activity through the ECG signal [24]. The inherent non-linearity underscores the inadequacy of traditional linear analyses and standard clinical features in comprehensively capturing the intricate dynamics of the ECG signal. This complexity is further underscored by the challenges posed to deep learning tools, as their extraction of features may lack explainable understanding. Consequently, a deeper comprehension of how these tools reach and compute features becomes imperative for a more robust interpretation of ECG signals. Unlike prevailing state-of-the-art methods for the topic (Table 1), which have typically abstained from incorporating non-linear feature extraction in their methodology, our study aims to explore a non-linear approach to ECG analysis supported by classical ML tools. 
By doing so, we seek a more comprehensive understanding that embraces the inherent complexities of the heart’s electrical behaviour in order to improve CVD diagnosis. To that end, we defined three objectives for this study:

Introduce the utilisation of 10 non-linear features (entropies—approximate, logarithmic, and Shannon, correlation dimension, detrended fluctuation analysis, energy, Higuchi fractal dimension, Hurst exponent, Katz fractal dimension, and Lyapunov exponent) extracted under discrete wavelet transform multi-band ECG signal analysis for characterising CVDs.
Enhance the evaluation of distinguishing between various CVDs by assessing and comparing the individual and combined power of non-linear features.
Evaluate the discriminatory performance of these features by inputting them into a comprehensive set of ML models.

The fulfilment of the goals will provide insights into the predictive power of these non-linear features, both independently and synergistically, contributing to a comprehensive understanding of their impact on CVD discrimination.

The paper is structured in five major sections. Section 2 explains the applied methodology, including the database, signal processing, and feature extraction. The study results are presented in Section 3 and discussed in Section 4. Finally, Section 5 draws the study conclusions.

2. Methodology

This proposed methodology, illustrated in Figure 1, is split into 4 main parts:

Data collection/pre-processing and artifacts removal;
Feature extraction;
Data compressor;
Machine Learning classification and statistical analysis.

Figure 1. Workflow diagram.

2.1. Experimental Setup

This study involved the use of two distinct programming languages: MATLAB and Python. MATLAB (version R2022a) was employed to eliminate noise from the ECG signal, extract non-linear features from the ECG data, and compress and structure the data for classification purposes. Python (version 3.9.12) was utilised to develop and implement various ML models and generate a discrimination report based on the obtained results. The choice between MATLAB and Python programming languages is driven by optimisation needs: MATLAB is particularly proficient in signal processing and feature extraction with highly optimised toolboxes, whereas Python takes the lead in optimising ML models.

This research was conducted using a MacBook Pro 14 equipped with an M1 Pro chip featuring an 8-core CPU, a 14-core GPU, and 16 GB of RAM.

2.2. Database Characterisation

The PTB diagnostic ECG database [43] comprises data from seven distinct cardiovascular disease groups as well as a healthy control group. This dataset consists of a total of 512 ECG records, each containing 12 conventional leads (I, II, III, aVR, aVL, aVF, V1, V2, V3, V4, V5, and V6), along with 3 ECG Frank leads (Vx, Vy, and Vz). The ECG data have been digitised at a sampling frequency of 1000 Hz.

Each lead contains ECG signal samples of 10 s and has been recorded by an electrocardiograph with: (1) input voltage: ±16 mV; (2) input resistance: 100 Ω (DC); (3) bandwidth: 0–1 kHz; and (4) noise voltage: 10 μV.

Table 2 represents the number of ECGs per diagnostic class present in the database.

2.3. Artifacts Removal

The raw ECG signals in the database showed artifacts. To ensure signal quality, every record containing artifacts was removed in its entirety. The database initially had 512 records. After the removal stage, the number of signals available for the following tasks was reduced to 483 ECG records. Table 3 represents the number of ECGs per diagnostic class after the removal.

2.4. Signal Normalisation

The ECG signals, $x(n)$, were loaded into MATLAB® and normalised according to the following equation [44]:

$$x(n) = \frac{x(n)}{\sum_{n = 0}^{N - 1} x^{2}(n)}, \tag{1}$$

where $N$ represents the signal’s length. The mean value of the normalised signal was then removed.
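As an illustration only, the normalisation of Equation (1) followed by mean removal can be sketched in Python as shown below. The study performed this step in MATLAB; the function name `normalise_ecg` and the sample values are hypothetical.

```python
def normalise_ecg(x):
    """Divide every sample by the signal energy (Equation (1)), then remove the mean."""
    energy = sum(v * v for v in x)           # sum of x^2(n) over n = 0..N-1
    if energy == 0:
        raise ValueError("signal has zero energy and cannot be normalised")
    scaled = [v / energy for v in x]
    mean = sum(scaled) / len(scaled)
    return [v - mean for v in scaled]        # zero-mean output


# Hypothetical samples standing in for one ECG lead.
segment = [1.0, 2.0, 3.0, 2.0]
normalised = normalise_ecg(segment)
```

After this step the output sums to zero (up to rounding), which is a convenient check before moving on to the decomposition.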

Multi-Band Decomposition via Wavelet Transform and Features Extraction

The discrete-time wavelet transform (DWT) is a powerful technique used to analyse discrete-time signals with finite energy. It involves breaking down the signal into a set of basis functions composed of a limited number of prototype sequences and their time-shifted variations. This process, as described by Guido in 2022 [45], offers significant advantages for analysing signals in the time–frequency domain. By seamlessly transitioning between the time and frequency domains, it enables the frequency components of a signal to be localised in time.

To perform the decomposition and subsequent reconstruction, an octave-band critically decimated filter bank is employed. This approach, pioneered by Malvar in 1992 and further developed by Vetterli in 1995 [46,47], provides an effective framework. When considering only the positive frequencies, the $m$th sub-band of the transform is confined to the range

$$W_{m} = \begin{cases} [0, \pi / 2^{S}], & m = 0, \\ [\pi / 2^{S - m + 1}, \pi / 2^{S - m}], & m = 1, 2, \dots, S, \end{cases} \tag{2}$$

where $S$ is the number of levels, $S + 1$ is the number of sub-bands, and $\pi$ is the normalised angular frequency equivalent to half the sampling rate.
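To make the sub-band ranges of Equation (2) concrete, the short sketch below maps the normalised frequency $\pi$ to half the sampling rate and lists the band edges in Hz. It uses the 1000 Hz sampling frequency and the three decomposition levels stated in this paper; the function name `subband_edges_hz` is hypothetical.

```python
def subband_edges_hz(fs, levels):
    """Return the (low, high) edges in Hz of the S + 1 octave sub-bands of Equation (2)."""
    nyquist = fs / 2.0                               # pi corresponds to fs / 2
    bands = [(0.0, nyquist / 2 ** levels)]           # m = 0: approximation band
    for m in range(1, levels + 1):                   # m = 1..S: detail bands
        low = nyquist / 2 ** (levels - m + 1)
        high = nyquist / 2 ** (levels - m)
        bands.append((low, high))
    return bands


bands = subband_edges_hz(fs=1000, levels=3)
for m, (low, high) in enumerate(bands):
    print(f"sub-band m={m}: {low} Hz to {high} Hz")
```

With these settings the four sub-bands cover 0 to 62.5 Hz, 62.5 to 125 Hz, 125 to 250 Hz, and 250 to 500 Hz.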

The DWT employs an analysis scale function, denoted as $\tilde{\phi}_{1}(n)$, and an analysis wavelet function, denoted as $\tilde{\psi}_{1}(n)$, which are defined as follows:

$$\tilde{\phi}_{1}(n) = h_{LP}(n) \tag{3}$$

and

$$\tilde{\psi}_{1}(n) = h_{HP}(n), \tag{4}$$

where $h_{LP}(n)$ and $h_{HP}(n)$ represent the impulse responses of the analysis filters for the half-band low-pass and high-pass components, respectively.

Defining the following recursion formulas

$$\tilde{\phi}_{i + 1}(n) = \tilde{\phi}_{i}(n / 2) * \tilde{\phi}_{1}(n), \tag{5}$$

$$\tilde{\psi}_{i + 1}(n) = \tilde{\phi}_{i}(n) * \tilde{\psi}_{1}(n / 2^{i}), \tag{6}$$

where the symbol “∗” signifies the convolution operation, the analysis filter corresponding to the $m$th sub-band is expressed as follows:

$$h_{m}(n) = \begin{cases} \tilde{\phi}_{S}(n), & m = 0, \\ \tilde{\psi}_{S + 1 - m}(n), & m = 1, 2, \dots, S. \end{cases} \tag{7}$$

The $m$th sub-band signal is computed as

$$x_{m}(n) = \begin{cases} \sum_{k = -\infty}^{\infty} x(k)\, h_{m}(2^{S} n - k), & m = 0, \\ \sum_{k = -\infty}^{\infty} x(k)\, h_{m}(2^{S - m + 1} n - k), & m = 1, 2, \dots, S. \end{cases} \tag{8}$$

In this research, the DWT was employed to decompose each ECG segment of 1 s length into sub-bands ($x_{m}(n)$) up to level three ($S = 3$). The applied wavelet was Symlet7, which has proved suitable for ECG signal analysis up to decomposition level 3 [48,49]. To ensure consistency with the original sampling rate, the sub-band signals, $x_{m}(n)$, underwent re-sampling using the wavelet interpolation method [50]. After that, 10 non-linear features (see Table 4 for more information) were extracted from each 1 s sub-band segment of the 10 s signal. The resulting time series, one per feature and sub-band, were then compressed over time by 6 distinct statistical functions: average ($Avg$), standard deviation ($Std$), 95th percentile ($P95$), variance ($Var$), median ($Med$), and kurtosis ($Kur$) [49]. At the end of the process, the data matrix, comprising all 10-second time series vectors of features extracted from all sub-bands over time for all patients, underwent normalisation using the z-score method [51].
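The compression step can be sketched as follows: a feature time series with one value per 1 s segment is reduced to the six statistics named above. This is a minimal Python illustration rather than the MATLAB code used in the study. The sample-variance denominator, the linear-interpolation percentile, and the moment-ratio kurtosis are assumptions, and `compress_feature_series` is a hypothetical name.

```python
import math
import statistics


def compress_feature_series(series):
    """Summarise one feature time series with the six statistics named in the text."""
    n = len(series)
    avg = sum(series) / n
    var = statistics.variance(series)        # sample variance (n - 1 denominator), assumed
    std = math.sqrt(var)
    med = statistics.median(series)

    # 95th percentile by linear interpolation between order statistics (assumed method).
    ordered = sorted(series)
    pos = 0.95 * (n - 1)
    lo = int(pos)
    hi = min(lo + 1, n - 1)
    p95 = ordered[lo] + (pos - lo) * (ordered[hi] - ordered[lo])

    # Kurtosis as fourth central moment over squared second central moment (assumed form).
    m2 = sum((v - avg) ** 2 for v in series) / n
    m4 = sum((v - avg) ** 4 for v in series) / n
    kur = m4 / (m2 * m2) if m2 > 0 else float("nan")

    return {"Avg": avg, "Std": std, "P95": p95, "Var": var, "Med": med, "Kur": kur}


# Hypothetical feature values, one per 1 s segment of a 10 s record.
feature_series = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0]
summary = compress_feature_series(feature_series)
```

Each (feature, sub-band, lead) combination therefore contributes six scalar values to the final data matrix.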

2.5. Data Driven Framework Analysis

2.5.1. Individual Feature Power Analysis over Binary Groups

The discriminating power of each feature distribution between pairs of study groups, such as $VHD$ vs. $M$, $VHD$ vs. $MI$, $VHD$ vs. $MH$, $VHD$ vs. $HC$, $VHD$ vs. $Dis$, $VHD$ vs. $CardMyo$, $VHD$ vs. $BBB$, $M$ vs. $MI$, $M$ vs. $MH$, $M$ vs. $HC$, $M$ vs. $Dis$, $M$ vs. $CardMyo$, $M$ vs. $BBB$, $MI$ vs. $MH$, $MI$ vs. $HC$, $MI$ vs. $Dis$, $MI$ vs. $CardMyo$, $MI$ vs. $BBB$, $MH$ vs. $HC$, $MH$ vs. $Dis$, $MH$ vs. $CardMyo$, $MH$ vs. $BBB$, $HC$ vs. $Dis$, $HC$ vs. $CardMyo$, and $HC$ vs. $BBB$, was evaluated using the XROC classifier [58], a binary classifier working within a leave-one-out cross-validation process, and using the Mann–Whitney test. A total of 3600 features per participant (10 non-linear feature time series, each compressed over time by 6 statistical measures, over 4 sub-bands, for each of the 15 leads) were individually assessed to measure their potential to differentiate between these groups. The methodology variation used for the individual feature assessment is indicated in Figure 1. It should be noted that the normality and homoscedasticity of each time series feature vector distribution were assessed, for distinguishing binary classes, with the MATLAB function $ktest$, which performs the Kolmogorov–Smirnov and Levene tests, respectively. The assumptions of parametric tests were not met, so a non-parametric test, the Mann–Whitney test, was applied.
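The counts quoted in this section follow from simple arithmetic, which the sketch below reproduces: the number of features per lead and per participant, and the number of distinct binary comparisons that eight study groups allow. The variable names are illustrative only.

```python
from itertools import combinations

# Group abbreviations as used in this paper (seven CVD classes plus healthy controls).
groups = ["VHD", "M", "MI", "MH", "HC", "Dis", "CardMyo", "BBB"]
pairs = list(combinations(groups, 2))        # every unordered pair of groups

n_features, n_statistics, n_subbands, n_leads = 10, 6, 4, 15
features_per_lead = n_features * n_statistics * n_subbands
features_per_participant = features_per_lead * n_leads

print(len(pairs), features_per_lead, features_per_participant)
```

Eight groups yield 28 possible pairs, and the per-lead and per-participant totals come out as 240 and 3600 features.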

2.5.2. Combined Features Power Analysis for Groups Discrimination Using Sci-Learn ML Models

In this case, the model’s performance for discriminating between pairs of study groups and between $All$ vs. $All$ was evaluated by feeding 19 selected Sci-learn ML models [59], presented in Table 5, with combined features: 240 features (10 features extracted from 4 sub-bands and compressed by 6 statistics) for the individual lead case, or 3600 features (240 features per lead × 15 leads) for the combined leads case, for each group comparison, within a leave-one-out cross-validation procedure. The methodology variation used for the combined feature assessment is indicated in Figure 1.
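The leave-one-out procedure can be illustrated with a minimal Python sketch: each record is held out once, a model is fitted on the remaining records, and the held-out record is predicted. The nearest-centroid classifier below is only a simple stand-in chosen to keep the example self-contained; it is not one of the 19 models evaluated in the study, and the two-feature data are hypothetical.

```python
def nearest_centroid_predict(train_X, train_y, x):
    """Predict the label whose class centroid is closest to x (squared Euclidean distance)."""
    centroids = {}
    for label in set(train_y):
        rows = [row for row, lab in zip(train_X, train_y) if lab == label]
        centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]

    def dist2(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))

    return min(centroids, key=lambda lab: dist2(centroids[lab], x))


def leave_one_out(X, y):
    """Hold out each sample once, train on the rest, and collect the predictions."""
    predictions = []
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        predictions.append(nearest_centroid_predict(train_X, train_y, X[i]))
    return predictions


# Hypothetical z-scored feature vectors for two well-separated groups.
X = [[0.1, 0.2], [0.0, 0.3], [0.2, 0.1], [1.0, 1.1], [0.9, 1.2], [1.1, 0.9]]
y = ["HC", "HC", "HC", "BBB", "BBB", "BBB"]
predictions = leave_one_out(X, y)
accuracy = sum(p == t for p, t in zip(predictions, y)) / len(y)
```

Because every record serves as the test sample exactly once, the procedure makes full use of small classes, at the cost of fitting the model as many times as there are records.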

2.5.3. Classification Metrics

The model’s performance evaluation was carried out using 9 metrics: $Accuracy$, $Precision$, $Recall$, $F1$-$Score$, $AUC$, $Kappa$, $MCC$, $CSI$, and $Gmean$.

The $Accuracy$ represents the proportion of correctly classified cases among all cases [60] and can be defined as

$$Accuracy = \frac{TP + TN}{TP + TN + FP + FN} \times 100\%, \tag{9}$$

where $TP$, $TN$, $FP$, and $FN$ are, respectively, the true positives, true negatives, false positives, and false negatives [61].

The $Precision$, also known as the positive predictive value, shows the proportion of correctly classified positive cases among all cases predicted as positive [62]. The $Precision$ can be defined as

$$Precision = \frac{TP}{TP + FP} \times 100\%. \tag{10}$$

The $Recall$, defined as

$$Recall = \frac{TP}{TP + FN} \times 100\%, \tag{11}$$

represents the proportion of correctly predicted positive cases among the total number of positive cases [62].

The $F1$-$Score$ is the harmonic mean of the $Recall$ and the $Precision$ [63], and is defined as

$$F1\text{-}Score = \frac{2 \times Precision \times Recall}{Precision + Recall} \times 100\%. \tag{12}$$

The $Kappa$ normalises the $Accuracy$ by the possibility of agreement by chance [64] and is defined as

$$Kappa = \frac{2 \times (TP \times TN - FN \times FP)}{(TP + FP) \times (FP + TN) + (TP + FN) \times (FN + TN)}. \tag{13}$$

The $MCC$ is useful for imbalanced data [65]. It varies between −1 and 1, with −1 indicating total disagreement between predictions and true labels, 0 indicating chance-level prediction, and 1 indicating perfect prediction. It is defined as

$$MCC = \frac{TP \times TN - FP \times FN}{\sqrt{(TP + FP) \times (TP + FN) \times (TN + FP) \times (TN + FN)}}. \tag{14}$$

The $CSI$ provides a more nuanced evaluation of a binary classification model’s effectiveness by considering both the correct identification of positive instances and the ability to avoid false positives [66]. The $CSI$ can be defined as

$$CSI = \frac{TP}{TP + FP + FN}. \tag{15}$$

The $Gmean$ is a measure that balances the performance across both classes. The higher its value, the lower the risk of the model over-fitting to one class. It is defined as

$$Gmean = \sqrt{Recall \times Specificity}, \tag{16}$$

where $Specificity$ is defined as

$$Specificity = \frac{TN}{TN + FP}. \tag{17}$$
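As a compact illustration, the confusion-matrix metrics of Equations (9) to (17) can be computed in Python as shown below. Values are returned as fractions rather than percentages, $Specificity$ is taken as $TN / (TN + FP)$ (the standard true negative rate), and a metric is set to 0 when its denominator is zero; that last convention, the function name `binary_metrics`, and the counts in the example are assumptions.

```python
import math


def binary_metrics(tp, tn, fp, fn):
    """Compute the confusion-matrix metrics of Equations (9)-(17) as fractions in [0, 1]."""
    def ratio(num, den):
        return num / den if den else 0.0     # assumed convention for empty denominators

    precision = ratio(tp, tp + fp)
    recall = ratio(tp, tp + fn)
    specificity = ratio(tn, tn + fp)         # true negative rate
    return {
        "accuracy": ratio(tp + tn, tp + tn + fp + fn),
        "precision": precision,
        "recall": recall,
        "specificity": specificity,
        "f1": ratio(2 * precision * recall, precision + recall),
        "kappa": ratio(2 * (tp * tn - fn * fp),
                       (tp + fp) * (fp + tn) + (tp + fn) * (fn + tn)),
        "mcc": ratio(tp * tn - fp * fn,
                     math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))),
        "csi": ratio(tp, tp + fp + fn),
        "gmean": math.sqrt(recall * specificity),
    }


# Hypothetical counts for a reasonably balanced comparison.
balanced = binary_metrics(tp=8, tn=9, fp=1, fn=2)

# Hypothetical counts for a heavily imbalanced comparison with one positive case.
imbalanced = binary_metrics(tp=0, tn=91, fp=0, fn=1)
```

The second example shows why accuracy alone can mislead on imbalanced groups: a classifier that never predicts the minority class still reaches an accuracy above 98%, while its recall, MCC, and Gmean are all 0.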

The area under the curve ($AUC$) of the receiver operating characteristic (ROC) curve is a metric that evaluates how well a model can distinguish between positive and negative classes. It achieves this by comparing the $TP$ rate against the $FP$ rate at different classification thresholds. The value of $AUC$ ranges between 0 and 1, with a perfect classifier resulting in a value of 1, while a random classifier has an $AUC$ of 0.5. Using $AUC$ allows for a single-value measure of a model’s performance. This is especially useful for comparing models and assessing performance in scenarios where there is an imbalance between classes [67].

3. Results

Table 6 displays the individual features that yielded the best discrimination results in the statistical and XROC analyses conducted across all 28 binary comparisons ($VHD$ vs. $M$, $VHD$ vs. $MI$, $VHD$ vs. $MH$, $VHD$ vs. $HC$, $VHD$ vs. $Dis$, $VHD$ vs. $CardMyo$, $VHD$ vs. $BBB$, $M$ vs. $MI$, $M$ vs. $MH$, $M$ vs. $HC$, $M$ vs. $Dis$, $M$ vs. $CardMyo$, $M$ vs. $BBB$, $MI$ vs. $MH$, $MI$ vs. $HC$, $MI$ vs. $Dis$, $MI$ vs. $CardMyo$, $MI$ vs. $BBB$, $MH$ vs. $HC$, $MH$ vs. $Dis$, $MH$ vs. $CardMyo$, $MH$ vs. $BBB$, $HC$ vs. $Dis$, $HC$ vs. $CardMyo$, $HC$ vs. $BBB$, $Dis$ vs. $CardMyo$, $Dis$ vs. $BBB$, and $CardMyo$ vs. $BBB$).

Table 7 shows the number of occasions on which a feature distribution proved significant ($p < 0.05$) for separating binary classes.

Figure 2 illustrates the violin plots for the comparison groups where there was a significant difference, reported in Table 6.

The classification results regarding combined feature power analysis performed by ML classifiers can be found as a heatmap in Figure 3.

The direct comparison between the individual and combined feature power analyses for discrimination is shown in Figure 4.

4. Discussion

For a more comprehensive discussion, we divided this section into three subsections: two covering the variations of analysis employed in this study (individual and combined feature power analyses for discrimination) and one comparing our results with state-of-the-art results. While acknowledging that, in medicine, $Accuracy$ may not fully capture the balance between $Recall$ and $Specificity$, our discussion will primarily focus on $Accuracy$ for checking the model’s performance, as it enables a more direct comparison of our results with those achieved by state-of-the-art methods.

4.1. Data Driven Analysis—Individual Feature Power Analysis

From a broader perspective, we can observe in Table 6 that the top-performing feature, compressor, and wavelet sub-band were the $CorrDim$, $Avg$, and 2nd sub-band (DWT details 2nd level), respectively. Notably, they were present in 100% of the best results for comparison groups, encompassing all 28 binary comparison groups.

In addition, 12 of the 28 binary comparisons were shown to be statistically significant (42.86% of all analyses) and, out of the 15 leads utilised in this study, 8 exhibited at least one analysis with statistically significant differences. The most frequently represented lead in the table was $V2$, appearing in 16% of the cases (4 out of 28 binary groups).

The classes most frequently represented in binary groups with significant differences were the $HC$ and the $BBB$ classes. Both classes were present in 5 out of the 12 comparison groups with significant differences (41.67% of the cases). Additionally, the $CardMyo$ and $MH$ classes were the only ones that did not show significant differences when compared with the $BBB$ class, and the $M$ and $Dis$ classes did not show significant differences when compared with the $HC$ class.

Regarding each binary analysis:

$VHD$ vs. $M$ analysis yielded a significant p-value of 0.0339, and an $Accuracy$ and $Recall$ of 100%. The feature $CorrDim$, with the compressor $Avg$ and lead $V4$, extracted from the 2nd sub-band, provided excellent results. As shown in Figure 2a, the violin plot clearly illustrates a distinct separation between these two classes.
$VHD$ vs. $MI$ statistical analysis produced a significant result (p-value = 0.0486). The XROC analysis achieved an $Accuracy$ of 98.91% and a $Recall$ of 0% for this binary comparison. The $Recall$ of 0% underscores one of the primary limitations of the dataset, namely its imbalance, with the XROC over-fitting to the predominant class, $MI$; it also corroborates the difficulty of separating the groups with the statistical test. In Figure 2b, we can see some outlier values, but the highest density of the data is located close to the median.
$VHD$ vs. $MH$ statistical analysis revealed a non-significant p-value. The XROC metrics $Accuracy$ and $Recall$ demonstrated strong performance for discrimination between groups, with values of 87.50% and 75.00%, respectively.
$VHD$ vs. $HC$ group analysis displayed a significant difference, with a p-value of 0.0301. It achieved an $Accuracy$ of 94.94% and a $Recall$ of 0%. Figure 2c indicates some outliers in the $HC$ class, but the highest data density is close to the median. Despite the good statistical analysis results, the XROC once more over-fits to the predominant class, $HC$, achieving a $Recall$ of 0% for discriminating the class $VHD$. The imbalanced database and the large number of outliers in the $HC$ class contribute to these results. The XROC employs an averaging method at its core, which assigns significant weight to outliers in the final results.
$VHD$ vs. $Dis$ analysis resulted in a non-significant p-value, an $Accuracy$ of 80.00%, and a $Recall$ of 100%. Despite being statistically non-significant, the XROC results showed good discrimination performance.
$VHD$ vs. $CardMyo$ analysis resulted in a non-significant p-value. The XROC metrics $Accuracy$ and $Recall$ were 78.94% and 50.00%, respectively.
$VHD$ vs. $BBB$ statistical analysis yielded a p-value of 0.0308, and the XROC achieved an $Accuracy$ of 84.62% and a $Recall$ of 75.00%. Figure 2d illustrates a higher density of the $BBB$ data located close to the median.
$M$ vs. $MI$ statistical analysis yielded a non-significant p-value. The $Accuracy$ reached 99.18% and the $Recall$ was 0%.
$M$ vs. $MH$ statistical analysis exhibited a non-significant p-value. The $Accuracy$ and $Recall$ reached 85.71% and 100%, respectively.
$M$ vs. $HC$ analysis also showed a non-significant p-value. $Accuracy$ and $Recall$ reached 96.15% and 0%, respectively. This underscores the difficulty of the XROC classifier in accurately discriminating between groups with unbalanced sample sizes.
$M$ vs. $Dis$ analysis also revealed a non-significant p-value, with an $Accuracy$ and $Recall$ of 85.71% and 33.33%, respectively.
$M$ vs. $CardMyo$ analysis resulted in a non-significant p-value, an $Accuracy$ of 94.44%, and a $Recall$ of 66.67%. Despite being statistically non-significant, the XROC showed good behaviour for discriminating between classes.
$M$ vs. $BBB$ statistical analysis indicated significant differences with a p-value of 0.0126. The $Accuracy$ and $Recall$ reached 100%. In Figure 2e, the violin plot displayed an outlier in the $BBB$ class, but the highest data density was slightly below the median. It is worth noting that there was a clear separation between the two classes.
$MI$ vs. $MH$ analysis resulted in a non-significant p-value. Despite that, the XROC performed well, with a discrimination $Accuracy$ of 98.91% and a $Recall$ of 100%.
$MI$ vs. $HC$ statistical analysis yielded a p-value of 0.0017. The $Accuracy$ and the $Recall$ achieved values of 82.84% and 100%, respectively. In Figure 2f, the violin plot exhibited some outliers, but the highest data density was close to the median for both classes.
$MI$ vs. $Dis$ analysis was shown to be non-significant. The XROC metrics $Accuracy$ and $Recall$ were high, with values of 97.05% and 100%, respectively.
$MI$ vs. $CardMyo$ statistical analysis revealed a p-value of 0.0326 alongside strong classification metrics, with an $Accuracy$ of 96.29% and a $Recall$ of 100%. In Figure 2g, the violin plot exhibited some outliers, but the highest data density was close to the median for both classes.
$MI$ vs. $BBB$ analysis showed statistical significance, and the $Accuracy$ stood at 97.57%, with a $Recall$ of 100%. Figure 2h shows some outliers in both classes, but the majority of the data were located close to the median.
$MH$ vs. $HC$ analysis achieved a significant p-value of 0.0078. The $Accuracy$ was 94.94% and the $Recall$ was 0%, which clearly illustrates the imbalance of the dataset. Figure 2i shows the $HC$ class, with the highest density of data close to the median.
$MH$ vs. $Dis$ analysis exhibited a non-significant p-value, with an $Accuracy$ and $Recall$ of 80.00% and 50.00%, respectively.
$MH$ vs. $CardMyo$ demonstrated an $Accuracy$ of 84.21% and a $Recall$ of 50.00%, while the $MH$ vs. $BBB$ group yielded an $Accuracy$ of 92.31% and a $Recall$ of 75.00%, with neither analysis reaching statistical significance. Although statistical significance was not reached, the $Accuracy$ and $Recall$ values point to the potential of the model for discriminating between different conditions within the studied groups.
$H C$ vs. $D i s$ analysis showed non-significant difference. The $A c c u r a c y$ and $R e c a l l$ displayed great performance, with values of 87.21% and 100%, respectively.
$H C$ vs. $C a r d M y o$ showed a significant difference, with a p-value of 0.0071. The $A c c u r a c y$ and $R e c a l l$ exhibited strong performance, with values of 83.33% and 94.67%, respectively. In Figure 2j, the violin plot displayed some outliers in the $H C$ class, but the highest data density was close to the median.
$H C$ vs. $B B B$ analysis provided a significant p-value of 0.0047 accompanied by an $A c c u r a c y$ of 89.29% and an impressive $R e c a l l$ of 100%. Figure 2k shows the violin plot with some outliers in the $H C$ class, but there was a higher density of data close to the median.
$D i s$ vs. $C a r d M y o$ comparison analysis yielded a non-significant p-value, with an $A c c u r a c y$ of 73.08% and a $R e c a l l$ of 54.54%.
$D i s$ vs. $B B B$ comparison analysis showed a p-value of 0.0167, achieving an $A c c u r a c y$ of 80.00% and a $R e c a l l$ of 81.81%. Figure 2l shows a violin plot with a couple of outliers in both classes, but the highest density of data was close to the median.
$C a r d M y o$ vs. $B B B$ analysis provided a non-significant p-value, with an $A c c u r a c y$ of 75.00% and a $R e c a l l$ of 73.33%.
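
The combination of a high $Accuracy$ with a $Recall$ of 0% reported for $MH$ vs. $HC$ can be reproduced with a small confusion-matrix calculation. The sketch below is illustrative only: the class counts (75 and 4) are hypothetical values chosen because they yield the same 94.94% figure, not counts taken from the study, and the function name `accuracy_and_recall` is an assumption.

```python
def accuracy_and_recall(tp: int, fn: int, fp: int, tn: int) -> tuple[float, float]:
    """Return (accuracy, recall) for a binary confusion matrix."""
    total = tp + fn + fp + tn
    accuracy = (tp + tn) / total
    # Recall is undefined when there are no positive samples; report 0.0 in that case.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, recall


# Hypothetical counts: 4 positive (minority) and 75 negative (majority) samples.
# A classifier that labels every sample as negative gets all 75 negatives right
# and misses all 4 positives.
acc, rec = accuracy_and_recall(tp=0, fn=4, fp=0, tn=75)
print(f"Accuracy = {acc:.2%}, Recall = {rec:.2%}")
# Accuracy = 94.94%, Recall = 0.00%
```

A classifier that never predicts the minority class therefore still scores well on $Accuracy$, which is why $Recall$ and the other metrics need to be read alongside it.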

Table 7 reports the number of occasions on which each feature proved significant, per binary group and in total. It should be noted that each originally defined feature generated 360 features per analysis; for more information, see Section 2.5.1. While $CorrDim$ emerged as a standout performer individually, the results emphasise that the other nine features also demonstrated statistical significance in distinguishing between classes, on more occasions than $CorrDim$ itself (523 vs. 600). $MI$ vs. $HC$ and $HC$ vs. $BBB$ showed the highest number of results with significant differences, 1185 and 1184, respectively. $VHD$ vs. $MI$, $VHD$ vs. $BBB$, $M$ vs. $BBB$, $MH$ vs. $HC$, and $Dis$ vs. $BBB$ were the binary groups with the lowest number of significant feature distributions, 237 each.

4.2. Data Driven Analysis—Combined Feature Power Analysis

Figure 3 presents the classification metrics report for the comparison groups provided by 19 scikit-learn ML classifiers with combined features as inputs. The heatmap uses a gradient of green shades to illustrate the method’s discrimination capabilities for $Accuracy$, $Recall$, $Precision$, $F1$-$Score$, $AUC$, $Kappa$, $MCC$, $CSI$, and $Gmean$ in each comparative analysis. Lighter shades of green represent lower discriminatory power, while deeper greens signify higher effectiveness. The results show that $VHD$ vs. $M$, $VHD$ vs. $MI$, $VHD$ vs. $HC$, $VHD$ vs. $Dis$, $VHD$ vs. $CardMyo$, $M$ vs. $MH$, $M$ vs. $Dis$, $M$ vs. $CardMyo$, $M$ vs. $BBB$, and $MH$ vs. $BBB$ obtained 100% on all evaluation metrics. Compared with the individual power discrimination results presented in Table 6, the discrimination results generally increased, and the proportion of binary analyses reaching 100% on all evaluated metrics rose from 2/28 to 10/28. Comparing the $Accuracy$ results achieved through combined feature power analysis with those obtained through individual feature power analysis (Figure 4 gives a visual overview for each binary comparison, making it easy to assess the performance differences between the two approaches described in Section 2.5.1 and Section 2.5.2), we observe an overall improvement. Among the 28 comparisons conducted, 17 exhibit enhanced discrimination $Accuracy$ when utilising combined feature analysis. In five instances, the $Accuracy$ remained the same as that observed in individual feature power analysis. There are only six cases where we notice a decrease in $Accuracy$ compared with individual power analysis ($VHD$ vs. $MH$, $MI$ vs. $MH$, $MI$ vs. $CardMyo$, $MH$ vs. $CardMyo$, $Dis$ vs. $CardMyo$, and $Dis$ vs. $BBB$).
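
The figure of 28 binary comparisons follows from pairing the eight study groups (seven CVDs plus $HC$), and the three outcome counts quoted above sum to that total. A minimal sketch of this bookkeeping is given below; the variable names are illustrative assumptions.

```python
from itertools import combinations

# The eight study groups, using the abbreviations from the text.
groups = ["VHD", "M", "MI", "MH", "HC", "Dis", "CardMyo", "BBB"]

# Every unordered pair of groups is one binary comparison.
binary_comparisons = [f"{a} vs. {b}" for a, b in combinations(groups, 2)]
print(len(binary_comparisons))   # 28
print(binary_comparisons[:3])    # ['VHD vs. M', 'VHD vs. MI', 'VHD vs. MH']

# Outcome counts reported for combined vs. individual feature analysis.
improved, unchanged, decreased = 17, 5, 6
assert improved + unchanged + decreased == len(binary_comparisons)
```

Adding the multi-class $All$ vs. $All$ analysis to these 28 pairs gives the 29 comparison groups referred to elsewhere in this section.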

Returning to the analysis of Figure 3, the $All$ vs. $All$ comparison achieved an $Accuracy$ of 81.16%, a $Recall$ of 72.93%, a $Precision$ of 81.16%, an $F1$-$Score$ of 76.34%, a $Kappa$ of 0.4018, an $MCC$ of 0.4399, a $CSI$ of 0.6713, a $Gmean$ of 0.7417, and an $AUC$ of 0.5552. The leads ensemble combination was the most represented in Figure 3, corresponding to 28% of the total appearances. The most frequent classifier was $LinSVC$, representing 24% of the cases. The binary groups $VHD$ vs. $MH$, $MI$ vs. $HC$, $MH$ vs. $CardMyo$, $Dis$ vs. $CardMyo$, and $Dis$ vs. $BBB$ exhibited $Precision$ values below 90%. This difficulty in classifying correctly can be attributed to the close relationship between $CardMyo$ and $MH$ or $Dis$. In a clinical context, it is common for patients to present with $CardMyo$ alongside either $VHD$ or $Dis$ [68,69]. This clinical overlap makes accurate differentiation challenging. Understanding and addressing these interconnected conditions is essential for improving classification $Accuracy$ in these scenarios. The $MI$ vs. $HC$ classification, with an 82.84% $Precision$, presents challenges due to the potential presence of acute $MI$ within the $HC$ class. Additionally, some patients who have recovered from $MI$ may be categorised as $HC$ [70]. These factors contribute to a slightly lower classification performance of the ML models in this context.

Moreover, upon assessing the various models and their performance metrics, a notable observation is the impact of the imbalanced dataset, particularly evident in comparison groups involving either the $MI$ or the $HC$ class. In such instances, we observed $AUC$ results ranging from 0.4272 to 0.6667 across all nine comparison groups where at least one of these two classes was present. These findings underscore the substantial challenge of distinguishing between unevenly represented classes. The $Gmean$ further shows that in 71.43% of cases (five out of seven binary comparisons) where the $MI$ class is pitted against another class, the metric yields a result of 0. However, in comparisons of $MI$ against $Dis$ and $BBB$, a lower risk of over-fitting is evident, with $Gmean$ values of 0.9865 and 0.9891, respectively. The $CSI$ metric reveals that the majority of comparison groups, specifically 17 out of 29, exhibit results surpassing 0.9. This observation underscores a notable challenge in classification, particularly when dealing with classes characterised by higher data abundance. The $MCC$ metric highlights a notable trend, with 31% of the comparison groups (9 out of 29) achieving perfect predictions, each obtaining the maximum value of 1. Notably, the $MI$ class demonstrates the least favourable outcomes, with its highest $MCC$ value capped at 0.3297 when included in a comparison group. The $Kappa$ metric reveals a similar pattern, with 31% of the comparison groups (9 out of 29) achieving perfect agreement, each attaining the maximum value of 1. Additionally, 41.37% of the groups reach a $Kappa$ value higher than 0.083.
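
To make the behaviour of these metrics under class imbalance concrete, the sketch below computes $Gmean$, $MCC$, $Kappa$, and $CSI$ from a binary confusion matrix. It uses the standard textbook definitions; the paper's own definitions sit in its methods section, which is not reproduced here, so the formulas should be read as assumptions. The counts are hypothetical and the function name `imbalance_metrics` is illustrative.

```python
import math


def imbalance_metrics(tp: int, fn: int, fp: int, tn: int) -> dict[str, float]:
    """Gmean, MCC, Cohen's kappa and CSI from a binary confusion matrix."""
    n = tp + fn + fp + tn
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    gmean = math.sqrt(sensitivity * specificity)

    # MCC: the denominator is zero when a whole row or column is empty; use 0.0 then.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0

    # Cohen's kappa: observed agreement corrected for chance agreement.
    p_observed = (tp + tn) / n
    p_chance = ((tp + fn) * (tp + fp) + (tn + fp) * (tn + fn)) / (n * n)
    kappa = (p_observed - p_chance) / (1 - p_chance) if p_chance != 1 else 0.0

    # Critical success index: hits over hits + misses + false alarms.
    csi = tp / (tp + fn + fp) if (tp + fn + fp) else 0.0
    return {"Gmean": gmean, "MCC": mcc, "Kappa": kappa, "CSI": csi}


# Hypothetical case: 90 majority-class and 10 minority-class samples, with the
# majority treated as the positive class and every sample predicted positive.
degenerate = imbalance_metrics(tp=90, fn=0, fp=10, tn=0)
# Same class sizes, but every sample classified correctly.
perfect = imbalance_metrics(tp=90, fn=0, fp=0, tn=10)
print(degenerate)
print(perfect)
```

In the degenerate case the $CSI$ is 0.9 while $Gmean$, $MCC$, and $Kappa$ are all 0, so a high $CSI$ on its own does not rule out a classifier that ignores the smaller class.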

4.3. Study Results vs. State-of-the-Art Results

Table 1 shows that our results closely match or slightly surpass those of the state of the art, offering valuable insights for enhancing robustness. In particular, when considering the eight state-of-the-art studies that utilised the PTB database, our results are lower in the binary comparisons of $MI$ vs. $HC$ and $HC$ vs. $CardMyo$, with differences of less than 13% and 0.76%, respectively. Furthermore, the present study offers significant advantages over other studies, as it includes a multi-class comparison group ($All$ vs. $All$), a greater variety of binary comparisons, and the application of classical non-linear analysis under ECG multi-band analysis performed by DWT. These particularities allow a high capacity for differentiating each class present in the database, a level of detail not typically found in state-of-the-art articles. Moreover, the developed algorithm relies on ECG signals, which have distinctive advantages over alternative diagnostic sources such as stress tests, imaging tests, electrophysiological studies, provocative diagnostic tests, genetic tests, and cardiac catheterisation, among others. The affordability, non-invasiveness, widespread clinical use, and user-friendly nature of ECG make it an optimal choice. These properties not only facilitate the adoption of our algorithm globally but also address the needs of patients unable to leave their hospital beds. This highlights the algorithm’s versatility and accessibility in diverse healthcare settings.

5. Conclusions

For this research, 10 non-linear features ($En$, $ApEn$, $LogEn$, $ShaEn$, $EH$, $Elay$, $H$, $K$, $CorrDim$, and $DFA$) were extracted from a well-known ECG database (PTB diagnostic ECG database). For each of the 15 leads recorded per patient (12 conventional leads and 3 Frank leads), the signal underwent a non-overlapping 1 s windowing process, and the 10 non-linear features were extracted from each window. At the end of the process, each feature time series was compacted by six statistics. The individual and combined power of the features was assessed for discriminating between different cardiovascular pathologies ($VHD$ vs. $M$, $VHD$ vs. $MI$, $VHD$ vs. $MH$, $VHD$ vs. $HC$, $VHD$ vs. $Dis$, $VHD$ vs. $CardMyo$, $VHD$ vs. $BBB$, $M$ vs. $MI$, $M$ vs. $MH$, $M$ vs. $HC$, $M$ vs. $Dis$, $M$ vs. $CardMyo$, $M$ vs. $BBB$, $MI$ vs. $MH$, $MI$ vs. $HC$, $MI$ vs. $Dis$, $MI$ vs. $CardMyo$, $MI$ vs. $BBB$, $MH$ vs. $HC$, $MH$ vs. $Dis$, $MH$ vs. $CardMyo$, $MH$ vs. $BBB$, $HC$ vs. $Dis$, $HC$ vs. $CardMyo$, $HC$ vs. $BBB$, $Dis$ vs. $CardMyo$, $Dis$ vs. $BBB$, and $CardMyo$ vs. $BBB$) and in one multi-class comparison ($All$ vs. $All$).
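
The windowing and compaction steps summarised above can be sketched as follows. This is a minimal pure-Python illustration, not the authors' implementation: the sampling rate of 1000 Hz, the use of signal energy as the per-window feature, and the particular six summary statistics are all assumptions made for the example, and the DWT multi-band step is omitted.

```python
import math
import statistics


def window_energy(signal: list[float], fs: int = 1000) -> list[float]:
    """Energy of each non-overlapping 1 s window (fs samples per window).

    A trailing window shorter than 1 s is discarded.
    """
    n_windows = len(signal) // fs
    return [
        sum(x * x for x in signal[i * fs:(i + 1) * fs])
        for i in range(n_windows)
    ]


def compact(series: list[float]) -> dict[str, float]:
    """Summarise a feature time series with six statistics (illustrative choice)."""
    return {
        "mean": statistics.fmean(series),
        "std": statistics.pstdev(series),
        "median": statistics.median(series),
        "min": min(series),
        "max": max(series),
        "range": max(series) - min(series),
    }


# Synthetic 5.5 s "lead": a 1 Hz sine sampled at 1000 Hz (a stand-in for an ECG lead).
fs = 1000
lead = [math.sin(2 * math.pi * t / fs) for t in range(5500)]
energies = window_energy(lead, fs)   # one value per complete 1 s window
summary = compact(energies)          # six numbers describing the whole recording
print(len(energies), summary)
```

Each recording is thus reduced to a fixed-length vector regardless of its duration, which is what allows recordings of different lengths to be fed to the same classifier.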

The $Accuracy$ discrimination results ranged between 81% and 100%. The results demonstrate that the applied method serves as a robust tool for effectively distinguishing cardiovascular diseases (CVDs) through the analysis of ECG signals. The level of detail and discrimination achieved surpasses what is typically observed in state-of-the-art studies using the same dataset. Although our results indicate a strong diagnostic ability of the proposed method, offering medical doctors an alternative avenue to arrive at more confident diagnoses, this study had some limitations. (1) The inherently technical nature of features extracted from ECG signals that are not standard in clinical practice may hinder complete interpretability from a clinician’s standpoint. This could pose a challenge to the method’s rapid and widespread integration into clinical practice. (2) The high computation time of multi-band analysis led us to choose just one wavelet (Symlet7) from tens of candidate wavelets, with the level of decomposition set to 3, based on prior work [48,49]. A more meticulous analysis needs to be carried out in future to choose the wavelet and level of decomposition that best fit each CVD activity. (3) The results should be further validated with a larger and more balanced population to ensure a more reliable generalisation and to allow a hold-out split for classification (e.g., 70% for training and 30% for testing) instead of cross-validation. Another possible solution, in future work, would be to reduce the number of cases in the largest classes to lessen the uneven data distribution. (4) Additional CVDs should be studied and evaluated in future work to enhance the discriminative capabilities of our algorithm (e.g., arrhythmias such as premature atrial contraction, premature ventricular contraction, and atrial fibrillation).
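
Limitation (3) mentions two remedies: reducing the largest classes and evaluating on a hold-out split. A minimal sketch of both is given below, using only the Python standard library. The function names, the fixed random seed, and the toy class sizes are assumptions made for illustration and are not taken from the study.

```python
import random
from collections import defaultdict


def undersample(labels: list[str], seed: int = 0) -> list[int]:
    """Return sample indices after randomly reducing every class to the smallest class size."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    target = min(len(idxs) for idxs in by_class.values())
    kept = []
    for idxs in by_class.values():
        kept.extend(rng.sample(idxs, target))
    return sorted(kept)


def stratified_holdout(labels: list[str], indices: list[int],
                       train_fraction: float = 0.7, seed: int = 0):
    """Split indices into train/test sets, keeping the class proportions in each."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx in indices:
        by_class[labels[idx]].append(idx)
    train, test = [], []
    for idxs in by_class.values():
        rng.shuffle(idxs)
        cut = round(train_fraction * len(idxs))
        train.extend(idxs[:cut])
        test.extend(idxs[cut:])
    return sorted(train), sorted(test)


# Toy labels: 100 majority-class and 10 minority-class samples.
labels = ["majority"] * 100 + ["minority"] * 10
balanced = undersample(labels)
train_idx, test_idx = stratified_holdout(labels, balanced)
print(len(balanced), len(train_idx), len(test_idx))   # 20 14 6
```

The example also shows the cost of undersampling: balancing to the smallest class leaves only 20 of the 110 toy samples, which is why a larger population is listed as the preferred remedy.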

Nevertheless, upon reviewing state-of-the-art works (refer to Table 1), it becomes apparent that many have encountered similar limitations. These constraints predominantly revolved around imbalances in data distribution, as a significant portion of these studies relied on the same database. Additionally, limitations in computational time and resources, and a restricted variety and diversity of CVD classes were commonly shared among these works. This collective set of limitations across the consulted literature underscores the need for addressing data imbalances and expanding the diversity of CVD classes in future research efforts.

Author Contributions

Conceptualization, P.R. and P.M.R.; methodology, P.R. and P.M.R.; validation, P.M.R.; investigation, P.R. and P.M.R.; writing—original, P.R., J.S., D.P. and P.M.R.; writing—review and editing, P.R. and P.M.R.; supervision, P.M.R.; funding acquisition, P.M.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

This work was supported by National Funds from FCT—Fundação para a Ciência e a Tecnologia through project UIDB/50016/2020.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. American Heart Association. What is Cardiovascular Disease? 2017. Available online: https://www.heart.org/en/health-topics/consumer-healthcare/what-is-cardiovascular-disease (accessed on 5 October 2023).
2. World Health Organization. Cardiovascular Diseases CVDs. 2021. Available online: https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds) (accessed on 5 October 2023).
3. Visseren, F.L.J.; Mach, F.; Smulders, Y.M.; Carballo, D.; Koskinas, K.C.; Bäck, M.; Benetos, A.; Biffi, A.; Boavida, J.M.; Capodanno, D.; et al. 2021 ESC Guidelines on cardiovascular disease prevention in clinical practice. Eur. Heart J. 2021, 42, 3227–3337.
4. Arbelo, E.; Protonotarios, A.; Gimeno, J.; Arbustini, E.; Barriales-Villa, R.; Basso, C.; Bezzina, C.; Biagini, E.; Blom, N.; Boer, R.; et al. 2023 ESC Guidelines for the management of cardiomyopathies. Eur. Heart J. 2023, 44, 3503–3626.
5. Delgado, V.; Marsan, N.A.; de Waha, S.; Bonaros, N.; Brida, M.; Burri, H.; Caselli, S.; Doenst, T.; Ederhy, S.; Erba, P.A.; et al. 2023 ESC Guidelines for the management of endocarditis. Eur. Heart J. 2023, 44, 3948–4042.
6. Brito, D.; Cardim, N.; Rocha-Lopes, L.; Freitas, A.; Lacerda, A.P.D.; Menezes, M.; Belo, A.; Martins, E.; Peres, M.; Goncalves, L.; et al. P3514 Diagnosis and treatment of acute myocarditis in Portugal. Data from the national multicenter registry on myocarditis. Eur. Heart J. 2017, 38, ehx504.P3514.
7. Adler, Y.; Charron, P.; Imazio, M.; Badano, L.; Barón-Esquivias, G.; Bogaert, J.; Brucato, A.; Gueret, P.; Klingel, K.; Lionis, C.; et al. 2015 ESC Guidelines for the diagnosis and management of pericardial diseases. Eur. Heart J. 2015, 36, 2921–2964.
8. Hamm, C.; Bassand, J.P.; Agewall, S.; Bax, J.; Boersma, E.; Bueno, H.; Caso, P.; Dudek, D.; Gielen, S.; Huber, K.; et al. ESC Guidelines for the management of acute coronary syndromes in patients presenting without persistent ST-segment elevation: The Task Force for the management of acute coronary syndromes (ACS) in patients presenting without persistent ST-segment elevation of the European Society of Cardiology (ESC). Eur. Heart J. 2011, 32, 2999–3054.
9. Byrne, R.; Rossello, X.; Coughlan, J.; Barbato, E.; Berry, C.; Chieffo, A.; Claeys, M.; Dan, G.A.; Dweck, M.; Galbraith, M.; et al. 2023 ESC Guidelines for the management of acute coronary syndromes: Developed by the task force on the management of acute coronary syndromes of the European Society of Cardiology (ESC). Eur. Heart J. 2023, 44, 3720–3826.
10. McDonagh, T.A.; Metra, M.; Adamo, M.; Gardner, R.S.; Baumbach, A.; Böhm, M.; Burri, H.; Butler, J.; Čelutkienė, J.; Chioncel, O.; et al. 2021 ESC Guidelines for the diagnosis and treatment of acute and chronic heart failure. Eur. Heart J. 2021, 42, 3599–3726.
11. Zeppenfeld, K.; Tfelt-Hansen, J.; de Riva, M.; Winkel, B.G.; Behr, E.R.; Blom, N.A.; Charron, P.; Corrado, D.; Dagres, N.; de Chillou, C.; et al. 2022 ESC Guidelines for the management of patients with ventricular arrhythmias and the prevention of sudden cardiac death. Eur. Heart J. 2022, 43, 3997–4126.
12. Kim, M.; Yoon, M.; Yang, P.; Kim, T.; Uhm, J.; Kim, J.; Pak, H.; Lee, M.; Joung, B. P6422 Sex-based disparities in incidence, treatment, and outcomes of sudden cardiac arrest. Eur. Heart J. 2017, 38, ehx493.P6422.
13. Authors/Task Force Members; Vahanian, A.; Alfieri, O.; Andreotti, F.; Antunes, M.J.; Barón-Esquivias, G.; Baumgartner, H.; Borger, M.A.; Carrel, T.P.; Bonis, M.D.; et al. Guidelines on the management of valvular heart disease (version 2012). Eur. Heart J. 2012, 33, 2451–2496.
14. Baumgartner, H.; Backer, J.D.; Babu-Narayan, S.V.; Budts, W.; Chessa, M.; Diller, G.P.; Lung, B.; Kluin, J.; Lang, I.M.; Meijboom, F.; et al. 2020 ESC Guidelines for the management of adult congenital heart disease. Eur. Heart J. 2020, 42, 563–645.
15. Kligfield, P.; Gettes, L.S.; Bailey, J.J.; Childers, R.; Deal, B.J.; Hancock, E.W.; van Herpen, G.; Kors, J.A.; Macfarlane, P.; Mirvis, D.M.; et al. Recommendations for the Standardization and Interpretation of the Electrocardiogram. Circulation 2007, 115, 1306–1324.
16. Garner, K.; Pomeroy, W.; Arnold, J. Exercise Stress Testing: Indications and Common Questions. Am. Acad. Fam. Physicians 2017, 96, 293–299.
17. Maron, B. American College of Cardiology/European Society of Cardiology Clinical Expert Consensus Document on Hypertrophic Cardiomyopathy a Report of the American College of Cardiology Foundation Task Force on Clinical Expert Consensus Documents and the European Society of Cardiology Committee for Practice Guidelines. Eur. Heart J. 2003, 24, 1965–1991.
18. Perrot, B.; Clozel, J.P.; de la Chaise, A.T.; Cherrier, F.; Faivre, G. Electrophysiological effects of intravenous prostacyclin in man. Eur. Heart J. 1984, 5, 883–889.
19. Maseri, A. Safety of provocative tests of coronary artery spasm and prediction of long-term outcome: Need for an innovative clinical research strategy. Eur. Heart J. 2012, 34, 252–254.
20. Grondin, S.; Davies, B.; Cadrin-Tourigny, J.; Steinberg, C.; Cheung, C.C.; Jorda, P.; Healey, J.S.; Green, M.S.; Sanatani, S.; Alqarawi, W.; et al. Importance of genetic testing in unexplained cardiac arrest. Eur. Heart J. 2022, 43, 3071–3081.
21. Azmi, J.; Arif, M.; Nafis, M.T.; Alam, M.A.; Tanweer, S.; Wang, G. A systematic review on machine learning approaches for cardiovascular disease prediction using medical big data. Med. Eng. Phys. 2022, 105, 103825.
22. Rodrigues, P.M.; Madeiro, J.P.; Marques, J.A.L. Enhancing Health and Public Health through Machine Learning: Decision Support for Smarter Choices. Bioengineering 2023, 10, 792.
23. Krittanawong, C.; Virk, H.U.H.; Bangalore, S.; Wang, Z.; Johnson, K.W.; Pinotti, R.; Zhang, H.; Kaplin, S.; Narasimhan, B.; Kitai, T.; et al. Machine learning prediction in cardiovascular diseases: A meta-analysis. Sci. Rep. 2020, 10, 16057.
24. Qu, Z.; Hu, G.; Garfinkel, A.; Weiss, J. Nonlinear and stochastic dynamics in the heart. Phys. Rep. 2014, 543, 61–162.
25. Haraldsson, H.; Edenbrandt, L.; Ohlsson, M. Detecting acute myocardial infarction in the 12-lead ECG using Hermite expansions and neural networks. Artif. Intell. Med. 2004, 32, 127–136.
26. Begum, R.; Ramesh, M. Detection of cardiomyopathy using support vector machine and artificial neural network. Int. J. Comput. Appl. 2016, 133, 29–34.
27. Chowdhuryy, H.; Sultana, M.; Ghosh, R.; Ahamed, J.; Mahmood, M. AI Assisted Portable ECG for Fast and Patient Specific Diagnosis. In Proceedings of the 2018 International Conference on Computer, Communication, Chemical, Material and Electronic Engineering (IC4ME2), Rajshahi, Bangladesh, 8–9 February 2018.
28. Kachuee, M.; Fazeli, S.; Sarrafzadeh, M. ECG Heartbeat Classification: A Deep Transferable Representation. In Proceedings of the 2018 IEEE International Conference on Healthcare Informatics (ICHI), New York, NY, USA, 4–7 June 2018.
29. Baloglu, U.; Talo, M.; Yildirim, O.; Tan, R.; Acharya, U. Classification of myocardial infarction with multi-lead ECG signals and deep CNN. Pattern Recognit. Lett. 2019, 122, 23–30.
30. Ali, L.; Niamat, A.; Khan, J.A.; Golilarz, N.A.; Xingzhong, X.; Noor, A.; Nour, R.; Bukhari, S.A.C. An Optimized Stacked Support Vector Machines Based Expert System for the Effective Prediction of Heart Failure. IEEE Access 2019, 7, 54007–54014.
31. Ali, L.; Rahman, A.; Khan, A.; Zhou, M.; Javeed, A.; Khan, J.A. An Automated Diagnostic System for Heart Disease Prediction Based on χ² Statistical Model and Optimally Configured Deep Neural Network. IEEE Access 2019, 7, 34938–34945.
32. Baghel, N.; Dutta, M.K.; Burget, R. Automatic diagnosis of multiple cardiac diseases from PCG signals using convolutional neural network. Comput. Methods Programs Biomed. 2020, 197, 105750.
33. Ahamed, A.; Hasan, K.; Monowar, K.; Mashnoor, N.; Hossain, A. ECG Heartbeat Classification Using Ensemble of Efficient Machine Learning Approaches on Imbalanced Datasets. In Proceedings of the 2020 2nd International Conference on Advanced Information and Communication Technology (ICAICT), Dhaka, Bangladesh, 28–29 November 2020.
34. Makimoto, H.; Höckmann, M.; Lin, T.; Glöckner, D.; Gerguri, S.; Clasen, L.; Schmidt, J.; Assadi-Schmidt, A.; Bejinariu, A.; Müller, P.; et al. Performance of a convolutional neural network derived from an ECG database in recognizing myocardial infarction. Sci. Rep. 2020, 10, 8445.
35. Khan, A.; Hussain, M.; Malik, M. Cardiac Disorder Classification by Electrocardiogram Sensing Using Deep Neural Network. Complexity 2021, 2021, 5512243.
36. Premanand, S.; Narayanan, S. A Tree Based Machine Learning Approach for PTB Diagnostic Dataset. J. Phys. Conf. Ser. 2021, 2115, 012042.
37. Kavitha, M.; Gnaneswar, G.; Dinesh, R.; Sai, Y.; Suraj, R. Heart Disease Prediction using Hybrid machine Learning Model. In Proceedings of the 2021 6th International Conference on Inventive Computation Technologies (ICICT), Coimbatore, India, 20–22 January 2021.
38. Elhoseny, M.; Mohammed, M.; Mostafa, S.; Abdulkareem, K.; Maashi, M.; Garcia-Zapirain, B.; Mutlag, A.; Maashi, M. A New Multi-Agent Feature Wrapper Machine Learning Approach for Heart Disease Diagnosis. Comput. Mater. Contin. 2021, 67, 51–71.
39. Ahmad, G.; Fatima, H.; Ullah, S.; Saidi, A.; Imdadullah. Efficient Medical Diagnosis of Human Heart Diseases Using Machine Learning Techniques with and without GridSearchCV. IEEE Access 2022, 10, 80151–80173.
40. Ahmad, S.; Asghar, M.; Alotaibi, F.; Alotaibi, Y. Diagnosis of cardiovascular disease using deep learning technique. Soft Comput. 2022, 27, 8971–8990.
41. Mhamdi, L.; Dammak, O.; Cottin, F.; Dhaou, I. Artificial Intelligence for Cardiac Diseases Diagnosis and Prediction Using ECG Images on Embedded Systems. Biomedicines 2022, 10, 2013.
42. Karthik, S.; Santhosh, M.; Kavitha, M.S.; Christopher Paul, A. Automated Deep Learning Based Cardiovascular Disease Diagnosis Using ECG Signals. Comput. Syst. Sci. Eng. 2022, 42, 183–199.
43. Bousseljot, R.D.; Kreiseler, D.; Schnabel, A. Nutzung der EKG-Signaldatenbank CARDIODAT der PTB über das Internet. Biomed. Tech. Biomed. Eng. 1995, 40, 317–318.
44. Rodrigues, P.; Bispo, B.; Garrett, C.; Alves, D.; Teixeira, J.; Freitas, D. Lacsogram: A New EEG Tool to Diagnose Alzheimer’s Disease. IEEE J. Biomed. Health Inform. 2021, 25, 3384–3395.
45. Guido, R. Wavelets behind the scenes: Practical aspects, insights, and perspectives. Phys. Rep. 2022, 985, 1–23.
46. Malvar, H. Signal Processing with Lapped Transforms; Artech House: Norwood, MA, USA, 1992.
47. Vetterli, M.; Kovačević, J. Wavelets and Subband Coding; Prentice Hall: Englewood Cliffs, NJ, USA, 1995.
48. Chen, C.C.; Tsui, F.R. Comparing different wavelet transforms on removing electrocardiogram baseline wanders and special trends. BMC Med. Inform. Decis. Mak. 2020, 20, 343.
49. Ribeiro, P.; Marques, J.A.L.; Pordeus, D.; Zacarias, L.; Leite, C.F.; Sobreira-Neto, M.A.; Peixoto, A.A.; de Oliveira, A.; do Vale Madeiro, J.P.; Rodrigues, P.M. Machine learning-based cardiac activity non-linear analysis for discriminating COVID-19 patients with different degrees of severity. Biomed. Signal Process. Control. 2024, 87, 105558.
50. Rioul, O.; Vetterli, M. Wavelets and signal processing. IEEE Signal Process. Mag. 1991, 8, 14–38.
51. Peck, R.; Olsen, C.; Devore, J. Introduction to Statistics and Data Analysis; Cengage Learning: Boston, MA, USA, 2008; p. 880.
52. Caesarendra, W.; Kosasih, B.; Tieu, K.; Moodie, C. An application of nonlinear feature extraction - A case study for low speed slewing bearing condition monitoring and prognosis. In Proceedings of the 2013 IEEE/ASME International Conference on Advanced Intelligent Mechatronics, Wollongong, NSW, Australia, 9–12 July 2013; pp. 1713–1718.
53. Hardstone, R.; Poil, S.S.; Schiavone, G.; Jansen, R.; Nikulin, V.; Mansvelder, H.; Linkenkaer-Hansen, K. Detrended Fluctuation Analysis: A Scale-Free View on Neuronal Oscillations. Front. Physiol. 2012, 3, 450.
54. Sundararajan, D. Discrete Wavelet Transform a Signal Processing Approach, 1st ed.; John Wiley & Sons: Hoboken, NJ, USA, 2015.
55. Silva, M.; Ribeiro, P.; Bispo, B.C.; Rodrigues, P.M. Detecção da Doença de Alzheimer através de Parâmetros Não-Lineares de Sinais de Fala. In Proceedings of the Anais do XLI Simpósio Brasileiro de Telecomunicações e Processamento de Sinais. Sociedade Brasileira de Telecomunicações, São José dos Campos, SP, Brazil, 8–11 October 2023.
56. Garcia, A.; Garcia, C.; Villasenor-Pineda, L.; Montoya, O. Biosignal Processing and Classification Using Computational Learning and Intelligence Principles, Algorithms, and Applications; Academic Press: London, UK, 2022; pp. 59–91.
57. Silva, G.; Batista, P.; Rodrigues, P.M. COVID-19 activity screening by a smart-data-driven multi-band voice analysis. J. Voice 2022, in press.
58. Nakas, C.; Yiannoutsos, C. Ordered multiple-class ROC analysis with continuous measurements. Stat. Med. 2004, 23, 3437–3449.
59. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830.
60. Sammut, C.; Webb, G.I. (Eds.) Accuracy. In Encyclopedia of Machine Learning and Data Mining; Springer: New York, NY, USA, 2017; p. 8.
61. Doğan, O. Data Linkage Methods for Big Data Management in Industry 4.0. In Optimizing Big Data Management and Industrial Systems with Intelligent Techniques; IGI Global: Hershey, PA, USA, 2019; pp. 108–127.
62. Ting, K.M. Precision and Recall. In Encyclopedia of Machine Learning and Data Mining; Springer: New York, NY, USA, 2017; pp. 990–991.
63. Goutte, C.; Gaussier, E. A Probabilistic Interpretation of Precision, Recall and F-Score, with Implication for Evaluation. In Advances in Information Retrieval; Springer: Berlin/Heidelberg, Germany, 2005; pp. 345–359.
64. Vieira, S.M.; Kaymak, U.; Sousa, J.M.C. Cohen’s kappa coefficient as a performance measure for feature selection. In Proceedings of the International Conference on Fuzzy Systems, Barcelona, Spain, 18–23 July 2010.
65. Chicco, D.; Jurman, G. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genom. 2020, 21, 6.
66. Larner, A. Assessing cognitive screeners with the critical success index. Prog. Neurol. Psychiatry 2021, 25, 33–37.
67. Nahm, F. Receiver operating characteristic curve: Overview and practical use for clinicians. Korean J. Anesthesiol. 2022, 75, 25–36.
68. Spirito, P.; Bellone, P.; Harris, K.M.; Bernabò, P.; Bruzzi, P.; Maron, B.J. Magnitude of Left Ventricular Hypertrophy and Risk of Sudden Death in Hypertrophic Cardiomyopathy. N. Engl. J. Med. 2000, 342, 1778–1785.
69. Sossalla, S.; Vollmann, D. Arrhythmia-Induced Cardiomyopathy. Dtsch. Ärzteblatt Int. 2018, 115, 335.
70. Sun, B.; Wang, L.; Guo, W.; Chen, S.; Ma, Y.; Wang, D. New treatment methods for myocardial infarction. Front. Cardiovasc. Med. 2023, 10, 1251669.

Figure 2. Violin plots of binary group distributions with significant differences (individual feature power analysis for discrimination). (a) $VHD$ vs. $M$; (b) $VHD$ vs. $MI$; (c) $VHD$ vs. $HC$; (d) $VHD$ vs. $BBB$; (e) $M$ vs. $BBB$; (f) $MI$ vs. $HC$; (g) $MI$ vs. $CardMyo$; (h) $MI$ vs. $BBB$; (i) $MH$ vs. $HC$; (j) $HC$ vs. $CardMyo$; (k) $HC$ vs. $BBB$; (l) $Dis$ vs. $BBB$.

Figure 3. Heatmap classification report regarding combined feature discriminant power analysis: the best $Accuracy$, $Recall$, $Precision$, $F1$-$Score$, $AUC$, $Kappa$, $MCC$, $CSI$, and $Gmean$ results for each comparison group, plus the lead and ML classifier applied for signal analysis.

Figure 4. Direct comparison using $Accuracy$ between individual and combined feature power analyses for binary groups’ discrimination performed by ML models.

Table 1. State-of-the-art literature report on CVD detection, with information about the database, the comparison groups, the features extracted, the classifiers used, limitations, validation, and $Accuracy$.

Ref	Year	Database	Comparison Group (Number of Participants)	Feature Extracted	Classifier	Limitations	Validation	$Accuracy$
[25]	2004	University Hospital in Lund database	Normal (1119) vs. Myocardial infarction (1119)	Hermite decomposition	ANN	Exclusive assessment of myocardial infarction. Lack of diversity of CVDs.	Cross-validation	94%
[26]	2016	PTB diagnostic ECG database	Normal (49) vs. Cardiomyopathy (14)	ECG PR, QT, RR and QRS intervals	Feed-forward back-propagation Neural Network	Small and unbalanced dataset. Exclusive assessment of cardiomyopathy. Lack of diversity of CVDs.	Cross-validation	95.2%
[27]	2018	PTB diagnostic ECG database	Normal (25) vs. Myocardial infarction (36)	Features extracted from DNN	DNN (InceptionV3)	Small database for hold-out. Lack of diversity of CVDs.	Hold-out	99.64%
[28]	2018	PTB diagnostic ECG database	Normal (52) vs. Myocardial infarction (148)	Features extracted from CNN	CNN	Small and unbalanced database for hold-out. It is impossible to know what features were extracted due to the nature of deep learning algorithms. Lack of diversity of CVDs.	Hold-out	95.9%
[29]	2019	PTB diagnostic ECG database	Healthy (52) vs. Myocardial infarction (148)	Features extracted from CNN	CNN	Small and unbalanced dataset. There was no discrimination of diseases outside of the MI class. Lack of diversity of CVDs.	Cross-validation	99.78%
[30]	2019	Cleveland heart disease database	Healthy (150) vs. Heart Disease (147)	Patient clinical information	SVM	Small dataset. There was no discrimination of diseases outside of the Heart Disease class. Lack of diversity of CVDs.	Hold-out	92.22%
[31]	2019	Cleveland heart disease database	Healthy (150) vs. Heart Disease (147)	Patient clinical information	DNN	Small dataset. There was no discrimination of diseases outside of the Heart Disease class. Lack of diversity of CVDs.	Hold-out	93.33%
[32]	2020	Heart sound dataset	Normal (400) vs. Mitral valve prolapse (400) vs. Mitral stenosis (400) vs. Mitral regurgitation (400) vs. Aortic stenosis (400)	Feature extracted from CNN	CNN	Low variety of classes. There is a high risk of over-fitting because of the augmentation technique used.	Cross-validation	98.6%
[33]	2020	PTB diagnostic ECG database	Normal (313) vs. Abnormal (318)	Features extracted from ANN	Ensemble	Relies on class weights during ANN training to address the class imbalance problem.	Hold-out	94.14%
[34]	2020	PTB diagnostic ECG database	No Myocardial infarction (141) vs. Myocardial infarction (148)	Features extracted from CNN	CNN	Small dataset. Discrimination limited to MI-related classes.	Cross-validation	81%
[35]	2021	Ch. Pervaiz Elahi Institute of Cardiology Multan Dataset	Normal (3408) vs. Abnormal (2796) vs. Myocardial infarction (2880) vs. Previous history of Myocardial infarction (2064)	Features extracted from SSD MobileNetV2 (CNN)	SSD MobileNetV2 (CNN)	Discrimination limited to MI-related classes. It is impossible to know what features were extracted due to the nature of deep learning algorithms.	Hold-out	98.33%
[36]	2021	PTB diagnostic ECG database	Healthy (52) vs. Abnormal (216)	Domain features and disease-specific features	XGBoost	Small and unbalanced dataset. There is no disease discrimination, just normal vs. abnormal. Lack of diversity of CVDs.	Cross-validation	98.23%
[37]	2021	Cleveland dataset	Normal (170) vs. Abnormal (140)	Patient clinical information	Decision tree and Random forest combined	Small database for hold-out. There is no disease discrimination, just normal vs. abnormal. Lack of diversity of CVDs.	Hold-out	88.00%
[38]	2021	Cleveland HD dataset	Healthy (135) vs. Heart disease (135)	Feature extracted from MAFW	CNN model with the MAFW	Small dataset. A limited number of classifiers were used. High computational cost and time complexity. Runtime is not considered as an evaluation criterion. Lack of diversity of CVDs.	Cross-validation	90.1%
[39]	2022	Cleveland, Hungary, Switzerland, and Long Beach V datasets	Healthy (500) vs. Heart disease (550)	Patient clinical information	Extreme gradient boosting	No disease discrimination, just normal vs. abnormal. Lack of diversity of CVDs.	Cross-validation	100%
[40]	2022	UC Irvine Machine Learning Repository CVD datasets	Healthy (500) vs. Heart disease (550)	Patient clinical information	CNN and BiLSTM hybrid	No disease discrimination, just normal vs. abnormal. Lack of diversity of CVDs.	Hold-out	94.51%
[41]	2022	ECG dataset of Cardiac and COVID-19 Patients and ECG dataset of Cardiac Patients	Normal (284) vs. Abnormal (233) vs. MI (239) vs. Previous history of MI (102)	Features extracted from MobileNet V2 (CNN)	MobileNet V2 (CNN)	Small database for hold-out; lack of a truly independent test group. Optimisation techniques were not considered.	Hold-out	95.18%
[42]	2022	PTB-XL dataset	Normal (1608) vs. Abnormal (1357)	Features extracted from DNN	XGBoost	Lack of diversity of CVDs. Unbalanced dataset.	Hold-out	78.65%
Present Work	2023	PTB diagnostic ECG database	$VHD$ vs. $M$; $VHD$ vs. $MI$; $VHD$ vs. $MH$; $VHD$ vs. $HC$; $VHD$ vs. $Dis$; $VHD$ vs. $CardMyo$; $VHD$ vs. $BBB$; $M$ vs. $MI$; $M$ vs. $MH$; $M$ vs. $HC$; $M$ vs. $Dis$; $M$ vs. $CardMyo$; $M$ vs. $BBB$; $MI$ vs. $MH$; $MI$ vs. $HC$; $MI$ vs. $Dis$; $MI$ vs. $CardMyo$; $MI$ vs. $BBB$; $MH$ vs. $HC$; $MH$ vs. $Dis$; $MH$ vs. $CardMyo$; $MH$ vs. $BBB$; $HC$ vs. $Dis$; $HC$ vs. $CardMyo$; and $HC$ vs. $BBB$	Approximate Entropy, Logarithmic Entropy, Shannon Entropy, Correlation Dimension, Detrended Fluctuation Analysis, Energy, Higuchi Fractal Dimension, Hurst Exponent, Katz Fractal Dimension, and Lyapunov Exponent	19 ML Classifiers	Small data sample for some classes and unbalanced dataset.	Cross-validation	73–100%

Table 2. Number of ECGs per diagnosis class.

Diagnostic Class	Number of ECGs
Bundle branch block ( $B B B$ )	17
Cardiomyopathy ( $C a r d M y o$ )	20
Healthy controls ( $H C$ )	80
Myocarditis (M)	4
Myocardial hypertrophy ( $M H$ )	4
Myocardial infarction ( $M I$ )	367
Valvular heart disease ( $V H D$ )	6
Dysrhythmia ( $D i s$ )	16

Table 3. Number of ECGs per diagnosis class after signal quality assessment and artifact removal.

Diagnostic Class	Number of ECGs
Bundle branch block	9
Cardiomyopathy	15
Healthy controls	75
Myocarditis	3
Myocardial hypertrophy	4
Myocardial infarction	362
Valvular heart disease	4
Dysrhythmia	11

Table 4. The extracted features with the corresponding equations and definitions.

Feature	Equation	Definition
Approximate Entropy ($ApEn$)	$ApEn(m, r) = \lim_{N \to \infty} \left[ \Theta^{m}(r) - \Theta^{m+1}(r) \right],$ where $\Theta$ is the Heaviside step function and $m$ is the dimension [52].	$ApEn$ evaluates the likelihood that similar patterns within the data will remain similar when additional data points are included. The lower the $ApEn$ value is, the more regular or predictable the data are, whereas a higher $ApEn$ value suggests greater complexity or irregularity.
Correlation Dimension ($CorrDim$)	$CorrDim = \lim_{M \to \infty} \frac{2 \sum_{i=1}^{M-k} \sum_{j=i+k}^{M} \Theta(l - |X_{i} - X_{j}|)}{M^{2}},$ where $\Theta(x)$ is the Heaviside step function, $X_{i}$ and $X_{j}$ are the position vectors on the attractor, $l$ is the distance under consideration, $k$ is the summation offset, and $M$ is the number of vectors reconstructed from $x(n)$ [52].	$CorrDim$ is used to measure self-similarity; higher values of $CorrDim$ indicate a higher degree of complexity and less similarity.
Detrended Fluctuation Analysis ($DFA$)	$DFA(n) = \sqrt{\frac{\sum_{k=1}^{N} [y(k) - y_{n}(k)]^{2}}{N}},$ where $N$ is the length, $y_{n}(k)$ is the local trend, and $y(k)$ is defined as $y(k) = \sum_{i=1}^{k} [x(i) - \bar{x}],$ with $x(i)$ as the inter-beat interval and $\bar{x}$ as its average [53].	$DFA$ is a technique for measuring the power scaling observed through R/S analysis.
Energy ($En$)	$En = \sum_{n=0}^{N-1} |x(n)|^{2}$	$En$ is the capacity of a system to perform work [54].
Higuchi Fractal Dimension ($H$)	$H = \frac{\ln(L(k))}{\ln(\frac{1}{k})},$ where $k$ is the number of composed sub-series and $L(k)$ is the averaged curve size.	$H$ estimates the fractal dimension of a time series signal [55].
Hurst Exponent ($EH$)	$K_{q}(\tau) \sim \left(\frac{\tau}{\nu}\right)^{q EH(q)},$ with $K_{q}(\tau) = \frac{\langle |X(t+\tau) - X(t)|^{q} \rangle}{\langle |X(t)|^{q} \rangle},$ where $q$ is the order of the moments of the distribution of increments, $\nu$ is the time resolution, $\tau$ is the incorporation time delay of the attractor, and $t$ is the period of a given time series signal $X(t)$ [56].	$EH$ quantifies how chaotic or unpredictable a time series is.
Katz Fractal Dimension ($K$)	$K = \frac{\log(n)}{\log(n) + \log\left(\frac{\max_{n}\left(\sqrt{(n-1)^{2} + (x(n) - x(1))^{2}}\right)}{\sum_{n=2}^{N} \sqrt{1 + (x(n-1) - x(n))^{2}}}\right)}$	$K$ estimates the fractal dimension through a waveform analysis of a time series [56].
Logarithmic Entropy ($LogEn$)	$LogEn = \sum_{n=1}^{N} \log_{2}[|x(n)|^{2}]$	$LogEn$ quantifies the average amount of information (in bits) needed to represent each event in the probability distribution. Higher logarithmic entropy values indicate greater unpredictability or randomness in the distribution, while lower values suggest more certainty or order [54].
Lyapunov Exponent ($Elay$)	$Elay(x_{0}) = \lim_{n \to \infty} \frac{\sum_{k=1}^{n} \ln |f'(x_{k-1})|}{n},$ where $f'$ is the derivative of $f$ [57].	$Elay$ evaluates the system’s predictability and sensitivity to change.
Shannon Entropy ($ShaEn$)	$ShaEn = - \sum_{n=1}^{N} |x(n)|^{2} \log_{2}[|x(n)|^{2}]$	$ShaEn$ is measured in bits when the base-2 logarithm ($\log_{2}$) is used. This means that the result quantifies the average number of bits required to represent each outcome in a given probability distribution. Higher entropy values indicate greater uncertainty, unpredictability, or randomness in the distribution, while lower values suggest more order or certainty [54].
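As an illustration of how the closed-form entries of Table 4 translate into code, the following minimal sketch implements $En$, $LogEn$, $ShaEn$, and $K$ for a single signal window. It is not the authors' implementation: the function names, the `eps` guard against $\log_{2}(0)$, the convention $0 \cdot \log_{2} 0 = 0$, and the choice $n = N - 1$ (number of steps) in the Katz formula are all assumptions made here.

```python
import math

def energy(x):
    """En: sum of squared magnitudes of the samples."""
    return sum(abs(v) ** 2 for v in x)

def log_entropy(x, eps=1e-12):
    """LogEn: sum of log2(|x(n)|^2); eps (assumed) avoids log2(0)."""
    return sum(math.log2(abs(v) ** 2 + eps) for v in x)

def shannon_entropy(x):
    """ShaEn: -sum(|x(n)|^2 * log2(|x(n)|^2)), skipping zero samples."""
    total = 0.0
    for v in x:
        p = abs(v) ** 2
        if p > 0:
            total -= p * math.log2(p)
    return total

def katz_fd(x):
    """Katz fractal dimension following the Table 4 expression.

    Assumption: n in log(n) is the number of steps, N - 1.
    """
    if len(x) < 3:
        raise ValueError("need at least 3 samples")
    n_steps = len(x) - 1
    # d: largest planar distance from the first sample to any sample.
    d = max(math.sqrt(i ** 2 + (v - x[0]) ** 2) for i, v in enumerate(x))
    # L: total curve length with unit spacing between samples.
    L = sum(math.sqrt(1.0 + (x[i - 1] - x[i]) ** 2) for i in range(1, len(x)))
    return math.log(n_steps) / (math.log(n_steps) + math.log(d / L))
```

For a constant window the curve is a straight line, so `katz_fd` returns 1, the dimension of a line; more irregular windows lengthen $L$ relative to $d$ and push the value above 1.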

Table 5. Configurations of the 19 scikit-learn ML classifiers.

Classifier	Hyperparameters
AdaBoostClassifier (AdaBoost)	Default parameters
BaggingClassifier (BaggC)	Default parameters
DecisionTreeClassifier (DeTreeC)	max_depth: 5
ExtraTreesClassifier (ExTreeC)	n_estimators: 300
GaussianNB (GauNB)	Default parameters
GaussianProcessClassifier (GauPro)	1.0 × RBF(1.0)
GradientBoostingClassifier (GradBoost)	Default parameters
KNearestNeighborsClassifier (KNN)	Default parameters
LinearDiscriminantAnalysis (LinDis)	Default parameters
LinearSVC (LinSVC)	Default parameters
LogisticRegression (LogReg)	solver: “lbfgs”
LogisticRegressionCV (LogRegCV)	cv: 3
MLPClassifier (MLP)	alpha: 1, max_iter: 1000
OneVsRestClassifier (OvsR)	random_state: 0
RandomForestClassifier (RF)	max_depth: 5, n_estimators: 300, max_features: 1
SGDClassifier (SGD)	max_iter: 100, tol: 0.001
SGDClassifierMod (SGDCMod)	Default parameters
Support-vector Machines (SVC)	gamma: “auto”
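A few of the Table 5 configurations can be instantiated directly in scikit-learn and scored with leave-one-out cross-validation, the validation scheme named in the abstract. The sketch below is illustrative only: it covers four of the listed classifiers, the helper names `build_classifiers` and `loo_accuracy` are hypothetical, and the two-cluster data are synthetic stand-ins for the real feature matrices.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import LeaveOneOut, cross_val_predict
from sklearn.naive_bayes import GaussianNB
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

def build_classifiers():
    """Subset of Table 5, keyed by the abbreviations used there."""
    return {
        "DeTreeC": DecisionTreeClassifier(max_depth=5),
        "GauNB": GaussianNB(),                    # default parameters
        "LogReg": LogisticRegression(solver="lbfgs"),
        "SVC": SVC(gamma="auto"),
    }

def loo_accuracy(clf, X, y):
    """Accuracy when each sample is predicted by a model fit on all others."""
    y_pred = cross_val_predict(clf, X, y, cv=LeaveOneOut())
    return float(np.mean(y_pred == y))

# Synthetic, well-separated two-class data (assumption, for illustration).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 0.5, size=(10, 2)),
               rng.normal(10.0, 0.5, size=(10, 2))])
y = np.array([0] * 10 + [1] * 10)

results = {name: loo_accuracy(clf, X, y)
           for name, clf in build_classifiers().items()}
```

Leave-one-out suits very small groups such as those in Table 3 because every recording is used for testing exactly once while the model is trained on all remaining recordings.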

Table 6. Statistical and XROC results for individual feature power analysis per binary groups, where N.S. means no significance.

Comparison Group	Feature	Compressor	mth Sub-Band	Lead	p-Value	$Recall$	$Accuracy$
$V H D$ vs. $M$	$C o r r D i m$	$A v g$	2	$V 4$	0.0339	100%	100%
$V H D$ vs. $M I$	$C o r r D i m$	$A v g$	2	I	0.0486	0%	98.91%
$V H D$ vs. $M H$	$C o r r D i m$	$A v g$	2	$V 1$	N.S.	75.00%	87.50%
$V H D$ vs. $H C$	$C o r r D i m$	$A v g$	2	$V y$	0.0301	0%	94.94%
$V H D$ vs. $D i s$	$C o r r D i m$	$A v g$	2	$V 5$	N.S.	100%	80.00%
$V H D$ vs. $C a r d M y o$	$C o r r D i m$	$A v g$	2	$V y$	N.S.	50.00%	78.94%
$V H D$ vs. $B B B$	$C o r r D i m$	$A v g$	2	$V 1$	0.0308	75.00%	84.62%
$M$ vs. $M I$	$C o r r D i m$	$A v g$	2	$V 4$	N.S.	0%	99.18%
$M$ vs. $M H$	$C o r r D i m$	$A v g$	2	$I I I$	N.S.	100%	85.71%
$M$ vs. $H C$	$C o r r D i m$	$A v g$	2	$V 6$	N.S.	0%	96.15%
$M$ vs. $D i s$	$C o r r D i m$	$A v g$	2	$V 2$	N.S.	33.33%	85.71%
$M$ vs. $C a r d M y o$	$C o r r D i m$	$A v g$	2	$V 2$	N.S.	66.67%	94.44%
$M$ vs. $B B B$	$C o r r D i m$	$A v g$	2	$V 4$	0.0126	100%	100%
$M I$ vs. $M H$	$C o r r D i m$	$A v g$	2	$I I$	N.S.	100%	98.91%
$M I$ vs. $H C$	$C o r r D i m$	$A v g$	2	$V 6$	0.0017	100%	82.84%
$M I$ vs. $D i s$	$C o r r D i m$	$A v g$	2	$V y$	N.S.	100%	97.05%
$M I$ vs. $C a r d M y o$	$C o r r D i m$	$A v g$	2	$V 6$	0.0326	100%	96.02%
$M I$ vs. $B B B$	$C o r r D i m$	$A v g$	2	$a V r$	0.0027	100%	97.57%
$M H$ vs. $H C$	$C o r r D i m$	$A v g$	2	$I I$	0.0078	0%	94.94%
$M H$ vs. $D i s$	$C o r r D i m$	$A v g$	2	$V z$	N.S.	50.00%	80.00%
$M H$ vs. $C a r d M y o$	$C o r r D i m$	$A v g$	2	$V z$	N.S.	50.00%	84.21%
$M H$ vs. $B B B$	$C o r r D i m$	$A v g$	2	$V 2$	N.S.	75.00%	92.31%
$H C$ vs. $D i s$	$C o r r D i m$	$A v g$	2	$V 1$	N.S.	100%	87.21%
$H C$ vs. $C a r d M y o$	$C o r r D i m$	$A v g$	2	$I I I$	0.0071	94.67%	83.33%
$H C$ vs. $B B B$	$C o r r D i m$	$A v g$	2	$V 2$	0.0047	100%	89.29%
$D i s$ vs. $C a r d M y o$	$C o r r D i m$	$A v g$	2	$V 3$	N.S.	54.54%	73.08%
$D i s$ vs. $B B B$	$C o r r D i m$	$A v g$	2	$a V r$	0.0167	81.81%	80.00%
$C a r d M y o$ vs. $B B B$	$C o r r D i m$	$A v g$	2	$a V r$	N.S.	73.33%	75.00%
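Several rows of Table 6 pair a $Recall$ of 0% with an $Accuracy$ above 94%. A short calculation with the class sizes from Table 3 shows that these accuracies coincide with what a rule assigning every recording to the larger group would score. This is an observation about the arithmetic, not a statement from the article about how the models behaved; the function name `majority_baseline` is hypothetical, and $Recall$ is assumed to refer to the first-named group.

```python
# Class sizes after artifact removal, taken from Table 3.
COUNTS = {"BBB": 9, "CardMyo": 15, "HC": 75, "M": 3,
          "MH": 4, "MI": 362, "VHD": 4, "Dis": 11}

def majority_baseline(first, second, counts=COUNTS):
    """Accuracy (%) and first-group recall (%) of an always-majority rule."""
    n_first, n_second = counts[first], counts[second]
    accuracy = 100.0 * max(n_first, n_second) / (n_first + n_second)
    # Recall for the first group is 100% only if that group is the majority.
    recall_first = 100.0 if n_first >= n_second else 0.0
    return round(accuracy, 2), recall_first

# Pairs reported in Table 6 with 0% recall.
zero_recall_pairs = [("VHD", "MI"), ("VHD", "HC"), ("M", "MI"),
                     ("M", "HC"), ("MH", "HC")]
baseline = {pair: majority_baseline(*pair) for pair in zero_recall_pairs}
```

For example, $VHD$ vs. $MI$ gives $362 / (4 + 362) = 98.91\%$, the $Accuracy$ listed in Table 6 for that pair, which is why $Accuracy$ alone is a weak indicator for the strongly unbalanced comparisons.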

Table 7. Total number of occasions on which each feature was statistically significant ($p < 0.05$) across all sub-band analyses and leads.


Comparison Group	$ApEn$	$CorrDim$	$DFA$	$En$	H	$EH$	K	$LogEn$	$Elay$	$ShaEn$	Total
$V H D$ vs. $M$	48	42	48	48	48	48	48	48	48	48	474
$V H D$ vs. $M I$	24	21	24	24	24	24	24	24	24	24	237
$V H D$ vs. $M H$	0	0	0	0	0	0	0	0	0	0	0
$V H D$ vs. $H C$	48	42	48	48	48	48	48	48	48	48	474
$V H D$ vs. $D i s$	0	0	0	0	0	0	0	0	0	0	0
$V H D$ vs. $C a r d M y o$	0	0	0	0	0	0	0	0	0	0	0
$V H D$ vs. $B B B$	24	21	24	24	24	24	24	24	24	24	237
$M$ vs. $M I$	0	0	0	0	0	0	0	0	0	0	0
$M$ vs. $M H$	0	0	0	0	0	0	0	0	0	0	0
$M$ vs. $H C$	0	0	0	0	0	0	0	0	0	0	0
$M$ vs. $D i s$	0	0	0	0	0	0	0	0	0	0	0
$M$ vs. $C a r d M y o$	0	0	0	0	0	0	0	0	0	0	0
$M$ vs. $B B B$	24	21	24	24	24	24	24	24	24	24	237
$M I$ vs. $M H$	0	0	0	0	0	0	0	0	0	0	0
$M I$ vs. $H C$	120	105	120	120	120	120	120	120	120	120	1185
$M I$ vs. $D i s$	0	0	0	0	0	0	0	0	0	0	0
$M I$ vs. $C a r d M y o$	48	42	48	48	48	48	48	48	48	48	474
$M I$ vs. $B B B$	48	41	48	48	48	48	48	48	48	48	473
$M H$ vs. $H C$	24	21	24	24	24	24	24	24	24	24	237
$M H$ vs. $D i s$	0	0	0	0	0	0	0	0	0	0	0
$M H$ vs. $C a r d M y o$	0	0	0	0	0	0	0	0	0	0	0
$M H$ vs. $B B B$	0	0	0	0	0	0	0	0	0	0	0
$H C$ vs. $D i s$	0	0	0	0	0	0	0	0	0	0	0
$H C$ vs. $C a r d M y o$	48	42	48	48	48	48	48	48	48	48	474
$H C$ vs. $B B B$	120	104	120	120	120	120	120	120	120	120	1184
$D i s$ vs. $C a r d M y o$	0	0	0	0	0	0	0	0	0	0	0
$D i s$ vs. $B B B$	24	21	24	24	24	24	24	24	24	24	237
$C a r d M y o$ vs. $B B B$	0	0	0	0	0	0	0	0	0	0	0
Total	600	523	600	600	600	600	600	600	600	600	5923

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ribeiro, P.; Sá, J.; Paiva, D.; Rodrigues, P.M. Cardiovascular Diseases Diagnosis Using an ECG Multi-Band Non-Linear Machine Learning Framework Analysis. Bioengineering 2024, 11, 58. https://doi.org/10.3390/bioengineering11010058

