Applying Recurrent Neural Networks for Anomaly Detection in Electrocardiogram Sensor Data

Minic, Ana; Jovanovic, Luka; Bacanin, Nebojsa; Stoean, Catalin; Zivkovic, Miodrag; Spalevic, Petar; Petrovic, Aleksandar; Dobrojevic, Milos; Stoean, Ruxandra

doi:10.3390/s23249878

Open AccessArticle

Applying Recurrent Neural Networks for Anomaly Detection in Electrocardiogram Sensor Data

by

Ana Minic

¹

,

Luka Jovanovic

²

,

Nebojsa Bacanin

²

,

Catalin Stoean

^3,*

,

Miodrag Zivkovic

²

,

Petar Spalevic

⁴

,

Aleksandar Petrovic

²

,

Milos Dobrojevic

²

and

Ruxandra Stoean

³

¹

Teacher Education Faculty, University of Pristina in Kosovska Mitrovica, 38220 Kosovska Mitrovica, Serbia

²

Faculty of Informatics and Computing, Singidunum University, 160622 Belgrade, Serbia

³

Department of Computer Science, Faculty of Sciences, University of Craiova, 200585 Craiova, Romania

⁴

Faculty of Technical Science, University of Pristina in Kosovska Mitrovica, Filipa Visnjica bb, 38220 Kosovska Mitrovica, Serbia

^*

Author to whom correspondence should be addressed.

Sensors 2023, 23(24), 9878; https://doi.org/10.3390/s23249878

Submission received: 29 October 2023 / Revised: 28 November 2023 / Accepted: 14 December 2023 / Published: 17 December 2023

(This article belongs to the Section Biomedical Sensors)

Download

Browse Figures

Versions Notes

Abstract

:

Monitoring heart electrical activity is an effective way of detecting existing and developing conditions. This is usually performed as a non-invasive test using a network of up to 12 sensors (electrodes) on the chest and limbs to create an electrocardiogram (ECG). By visually observing these readings, experienced professionals can make accurate diagnoses and, if needed, request further testing. However, the training and experience needed to make accurate diagnoses are significant. This work explores the potential of recurrent neural networks for anomaly detection in ECG readings. Furthermore, to attain the best possible performance for these networks, training parameters, and network architectures are optimized using a modified version of the well-established particle swarm optimization algorithm. The performance of the optimized models is compared to models created by other contemporary optimizers, and the results show significant potential for real-world applications. Further analyses are carried out on the best-performing models to determine feature importance.

Keywords:

electrocardiogram; diagnosis; particle swarm optimization; optimization; recurrent neural networks

1. Introduction

Cardiovascular health is a global concern as the leading cause of death globally [1]. Many factors contribute to this, among which is the more static lifestyle that has become a standard for most people, and this is especially troublesome as the lack of physical activity leads to various health issues. Other factors include age, stress, and diet irregularities. People who smoke and drink alcohol regularly are also found to be more prone to heart-related diseases [2]. All these factors can be attributed to personal choices and, as Keeney et al. [3] suggest, around 25% of cardiovascular diseases can be avoided just by altering personal choices without even considering genetics and other factors that influence an individual’s health. Long periods of time are usually required for health conditions regarding cardiovascular health to develop. This is due to their nature, as cardiovascular problems develop over time and increase in severity. Furthermore, such illnesses are harder to spot, which jeopardizes the health of the patient.

Traditional ways of monitoring cardiovascular health are open to improvements, with the emphasis being on the time it takes to diagnose a disease. The heart emits electrical signals that can be monitored by instruments, and the most applied principle is the electrocardiogram (ECG) [4]. The standard system consists of 10 sensors [5] distributed over the human body for a precise reading, but other variations with more sensors exist as well. The primary point for the placement of the sensors is the chest due to the position of the heart. The rest are distributed over the limbs. For example, the standard system with 10 electrodes produces 12 leads, which are represented by waveforms. The activity of the heart from specific angles is represented by leads [6]. The ECG provides a graph of the heartbeat and its rhythm, allowing medical personnel to detect possible irregularities that can be indicators of diseases. The results of the ECG are visual and readable by professionals only, but this is not the only hindrance, as the data can have imperfections due to the nonlinearity and complexity of signals, as well as the low amplitude of recordings, and noise [7].

Considering the previously mentioned shortcomings of ECG systems, possible solutions are being explored to improve the determination of cardiovascular health. As artificial intelligence (AI) techniques are beginning to improve daily aspects of humans’ lives, the medical field is not an exception in this trend [8]. Improvements in ECG systems aim for faster recognition of patterns, leading to quicker diagnoses. This is of great importance as it allows patients to begin treatment sooner and reduces the risk of improper medication, which can occur with manual result interpretation. The ECG data problem is formulated as a time-series task, which makes it suitable for AI applications. As Wolpert et al. [9] state in the “no free lunch theorem” (NFL), not one model is perfect and provides equally optimal results for all problems. For this particular type of problem neural networks provide the best results, as they are suited for time-series problems due to their architecture being modeled after the human nervous system. Regardless of their previously achieved exceptional performance in this field, such solutions are not without shortcomings. This is usually solved by applying an optimization technique that can tune the performance of the main solution by providing the optimal subset of its configuration parameters. Every problem requires customized frameworks as the NFL states, and the combinations of such solutions are vast.

For this study, a recurrent neural network (RNN) is selected as the predictor of heart conditions due to its high performance with time-series prediction tasks [10]. RNNs can form feedforward along with feedback connections, with the latter being activated with a delay to ensure the forming of long-term dependencies. This aspect of the RNN architecture makes them well-suited for detecting heart-related diseases, given that these conditions develop slowly over time. The idea is to provide a model that can allow for early detection of these conditions, which has not been achieved in this manner to date. In this context, early detection and rapid diagnostics are crucial as they allow timely intervention and management, potentially preventing complications and improving patient outcomes. The particle swarm optimization (PSO) algorithm, belonging to the swarm intelligence algorithm family, has been selected as an optimizer for the RNN hyperparameters. By applying this principle, the experimentation can result in a model that is as close as possible to optimal, as defined by the NFL. The problem of health predictions based on waveforms belongs to the group of problems of non-deterministic polynomial time complexity (NP-hard). For this type of problem, swarm metaheuristics have proven excellent optimizers. The proposed method was tested against other RNN models optimized by other high-performing metaheuristics for the purpose of results comparison.

An extensive literature survey has shown that there is a research gap in this domain, particularly, RNN has not been applied with PSO in this domain. Therefore, the primary goal of this manuscript is to apply PSO to tune the RNN’s hyperparameters for this specific problem, aiming to develop a lightweight RNN architecture that is capable of achieving good results in ECG analysis.

The summarized main contributions of this work are provided:

The proposal of a lightweight solution for an essential issue of cardiovascular health diagnosis through a robust AI-based framework.
RNN predictor application for the time-series problem of hearth electrical signal waveforms.
Swarm intelligence PSO algorithm optimizer for the specific point in question of RNN hyperparameter tuning.
An extensive analysis of high-end metaheuristic optimizers for RNN optimization.

The organization of the sections is briefly provided: Section 2 provides the fundamentals of the research, Section 3 explains the inner workings of the original optimization algorithm and the performed improvements, and Section 4 provides a basis for experimentation. The experimentation outcomes are presented in Section 5. Finally, Section 6 provides a summary of the problem formulation, the accomplishments of the research, and grounds for future work.

2. Background

The utilization of AI in the field of medicine has gained significant attention from researchers, primarily driven by various compelling factors. Among these factors, the continually growing demand for healthcare services and the increasing need for rigorous scrutiny during the diagnostic process serve as powerful motivators for researchers to explore the integration of automation into the medical domain [11,12]. Moreover, the evolving landscape of networking and the internet of things (IoT) [13] has generated a heightened demand for enhanced security measures. Applications of IoT networks in combination with AI have shown admirable outcomes when applied to issues associated with healthcare [14,15].

One intriguing area where AI finds practical application is in the realm of time-series analysis. These algorithms enable the observation and prediction of trends within continuous datasets, facilitating the determination of data patterns, directions, and correlations. Algorithms that can effectively account for temporal aspects within data have exhibited promising results when applied to complex real-world challenges [16,17]. Furthermore, advanced data decomposition techniques have been combined with time-series data, further enhancing their performance by breaking down signals into a series of component signals. This approach often leads to improved forecasting outcomes, as complex signals are inherently challenging to predict, while a series of simpler signals can be more readily managed and analyzed [18,19].

2.1. AI Approaches in Electrocardiogram Analysis

AI methods have been increasingly employed in the analysis and interpretation of ECGs to aid in the diagnosis of cardiovascular diseases [20,21]. CNNs are effective for image-based tasks, and ECG signals can be treated as 1D images. CNNs can automatically learn hierarchical features from ECG data and can be useful in routine clinical practice, as shown by [22,23,24].

RNNs, and their variant long short-term memory networks (LSTMs), are useful for capturing temporal dependencies in ECG signals, making them suitable for tasks such as arrhythmia detection. These models are typically lightweight, simple, and show promising efficiency and accuracy, as discussed by [25,26]. Hybrid methods have also been considered, such as the CNN-LSTM approach introduced in [27].

Machine learning algorithms have been considered for this problem as well [28,29]. Random forests and decision trees may be used for classification tasks, such as identifying different types of arrhythmia [30,31,32]. On the other hand, support vector machines (SVMs) may be effective for binary classification tasks and have been applied to identify specific cardiac conditions in ECGs [33].

2.2. Recurrent Neural Networks

With the goal of creating a neural network that is more suitable for problems that require sequential data analysis, the RNN was created. The difference from the basic neural network is the existence of recurrent connections between neurons, allowing for future inputs memory storage. Sequential layers with neurons are connected similarly to the basic neural network, as well as the weights and biases for connection input evaluation, decision-making, and output generation. RNNs require optimization in terms of architecture in addition to the control parameters for optimal performance. The benefits of using RNNs less complex in their structure are observable in data interpretation and training. More complex architecture is required for problems that have to keep track of complex nonlinear relationships.

The hidden state in the previous time step

h_{t - 1}

is combined with the current at the time step t. The process is described by Equation (1).

a_{t} = b + W h_{t - 1} + U x_{t},

(1)

where

a_{t}

is the input activation, b the bias term, and W and U represent the weight matrices of recurrent and input connections, respectively.

Equation (2) describes the process of the alteration of the hidden state after every input with the

ϕ

activation function over the

a_{t}

.

h_{t} = ϕ (a_{t})

(2)

Based on the prediction goal, different functions can be used as

ϕ

. The output of the network is derived from the hidden state. The previously described process is mathematically formulated by Equation (3).

y_{t} = W h_{t}

(3)

While RNNs have a unique ability of working with and reacting to changes in sequential data, certain drawbacks can be observed in the basic model. This class of networks is particularly sensitive to vanishing and exploding gradients [34], making good models difficult to construct. Furthermore, RNNs can only retain an influence of one previous input, which can limit their applicability for longer term forecasts. Certain methods have been developed to deal with these issues. such as the gated recurrent unit (GRU) and long short-term memory networks (LSTMs). However, while these versions of RNNs offer some advantages, they come at the cost of an increased complexity relative to the base algorithm.

2.3. Metaheuristics

The field of metaheuristic algorithms became popular due to the algorithms’ proficiency in solving NP-hard problems. The main challenge is to find solutions to these problems within a reasonable timeframe, while also maintaining reasonable hardware requirements. The algorithms can be divided further into subgroups, but there is no formal definition. The grouping that is recognized by most researchers includes the differentiation by the phenomena used for inspiration of the algorithm. In this manner, the different groups include swarm, genetic, physics, human, and the most novel group of these, the mathematically inspired algorithms.

Swarm-inspired solutions take inspiration from species that live in large groups and the aspects of their lives that benefit from group behavior [35]. This is often the case when a single unit is incapable of completing a task on its own, and that is where other units of the same species come into play. The swarm group of algorithms has provided excellent results with solutions to NP-hard problems, but to reach their maximum potential, hybridization with similar solutions is advised. The issue of these stochastic population-based algorithms is that they usually favor one of the two phases between exploration and exploitation, which can be overcome by incorporating a mechanism from a different solution. Notable algorithms from the swarm family include PSO [36], genetic algorithm (GA) [37], sine cosine algorithm (SCA) [38], firefly algorithm (FA) [39], grey wolf optimizer (GWO) [40], reptile search algorithm (RSA) [41], as well as the COLSHADE [42] algorithm.

Swarm metaheuristics find application in a wide range of real-world problems. Some of the implementations include glioma MRI classification [43], detection of credit card fraud [44,45], global optimization problems and engineering optimization [46,47,48], cloud computing [49], prediction of the number of COVID-19 cases [50], feature selection [51], and wireless sensor networks [52,53].

Ahmadpour et al. [54] developed a genetic-algorithm-based solution to track subjects’ blood pressure, significantly improving their overall quality of life and allowing for the early detection of preventable diseases. Khan et al. [55] explore an IoT environment that enables all-day monitoring of patients’ conditions and greatly improves their cardiovascular health.

Examples of AI-assisted medical diagnosis include diabetic retinopathy detection [56], skin lesion classification [57], lung cancer classification [58], and magnetic resonance imaging (MRI), among diverse other applications in medicine.

Although it is one of the first metaheuristics algorithms, proposed over twenty years ago, PSO is still considered a very powerful optimizer. Recently, the PSO algorithm has been successfully implemented, either in a basic or modified version, to tackle numerous problems in the medical and other domains. Notable examples include tuning LSTM for ECG-based biometric analysis [59], CNN-based classification of cardiac arrhytmias and healthcare monitoring [60,61], and RNN-based cloud balancing [62], to mention a few.

3. Methods

The following section describes the base algorithm that serves as a basis for modification. The algorithm is selected empirically, based on previous research where significant potential has been observed. Following the description of the basic algorithm, its initialization, and search mechanisms, we highlight observed limitations and present a potential solution. Finally, the pseudocode of the final modified approach is provided.

3.1. The Original PSO

The original PSO was introduced in 1995 by Kennedy and Eberhart [36]. The flocking of birds and fish was the main inspiration for this metaheuristic. Particles are represented as search agents and are considered a part of the population. Discrete as well as continuous problems can be solved by the PSO.

The model of the algorithm works such that every particle is assigned an initial velocity, which can be regarded as the position in the population. During one iteration the particles change their location in search of a better one. The weight component describes how fast the particles move. The weights are the old velocity, the best obtained so far, and the best yet obtained by the neighboring particle.

\{\begin{matrix} \vec{v_{i}} \leftarrow \vec{v_{i}} + \vec{U} (0, ϕ_{1}) ⨂ (\vec{p_{i}} - \vec{x_{i}}) + \vec{U} (0, ϕ_{2}) ⨂ (\vec{p_{g}} - \vec{x_{i}}) \\ \vec{x_{i}} \leftarrow \vec{x_{i}} + \vec{v_{i}}, \end{matrix}

(4)

In Equation (4), the component-wise multiplier is shown as ⨂, the range of all the components from

v_{i}

is

[- V_{m a x}, + V_{m a x}]

, and the vector

\vec{U} (0, ϕ_{1})

represents every particle randomly generated and uniformly distributed in the range

[0, ϕ_{i}]

. The value of

p_{i}

denotes the best solution for particle i while

p g

denotes the global best particle. Any of the particles is a possible solution in a D-dimensional space and its position is defined by Equation (5), the best position obtained before the update is shown in Equation (6), and the velocities are given in Equation (7):

X_{i} = (x_{i 1}, x_{i 2}, \dots, x_{i D})

(5)

P_{i} = (p_{i 1}, p_{i 2}, \dots, p_{i D})

(6)

V_{i} = (v_{i 1}, v_{i 2}, \dots, v_{i D})

(7)

The best solutions overall and in the group are noted as

p_{i}

and

p_{g}

, respectively. The search agent takes both pieces of information into account before deciding on the next move in terms of the distance currently between its position and

p_{i}

and

p_{g}

.

With the application of the inertia weight approach, this behavior can be modeled as in Equation (8):

v_{i d} = W * v_{i d} + c_{1} * r_{1} * (P_{i d} - X_{i d}) + c_{2} * r_{2} * (P_{g d} - X_{i d})

(8)

The relative influence inertia factors are shown in Equation (8) as w,

c_{1}

, and

c_{2}

, used for cognitive and social components, respectively.

r_{1}

and

r_{2}

are random numbers, while the particle velocity and the current position are given, respectively, as

v_{i d}

and

x_{i d}

.

p_{i d}

and

p_{g d}

are the

p_{i}

and

p_{g}

, respectively.

Equation (9) describes the inertia factor.

w_{m a x}

is the initial weight, while

w_{m i n}

is the final weight; T is the maximum number of iterations, and the current iterations are given as t.

w = w_{m a x} - \frac{w_{m a x} - w_{m i n}}{T} \cdot t

(9)

3.2. Genetically Inspired PSO (GIPSO)

Although the original PSO shows good performance, it exhibits certain shortcomings when evaluated using standard CEC [63] evaluation functions. To address these issues and enhance the PSO, this study introduces hybridization techniques. Drawing inspiration from the genetic algorithm (GA) [37], we create a new algorithm known as the genetically inspired PSO (GIPSO).

In the GIPSO algorithm, a novel introduction mechanism is activated after each iteration. It selects a random agent and combines it with the best solution obtained so far. The algorithm uniformly combines their parameters, and this combination is governed by a control parameter, denoted as

p c

. Empirically, the optimal value for

p c

has been determined to be

0.1

. Furthermore, an additional modification involves parameter mutation. When triggered, this process selects a random value within a specified parameter constraint. Half of this selected value is either added to or subtracted from the parameter, depending on the mutation direction parameter

m d

. Once again, the value for

m d

is determined empirically, set to

0.1

.

After generating a new solution, the worst-performing solution in the swarm is replaced by the new agent. The evaluation of the new solution is deferred until the next iteration, maintaining the computational complexity of the original algorithm.

From a mathematical perspective, the introduced algorithm follows the randomness influence initialization pattern of the the PSO algorithm as described in the original study [36]. Updates are also also performed as described in the original algorithm. Each agent A contains a vector of values that represent the genetic structure as

A_{i} = (a_{1}, a_{2}, \dots a_{D})

(10)

where

A_{i}

represents a given agent and a a given parameter and D the number of parameters dependent on the dimensionality of the search space. Once crossover is initialized, two agents are selected and recombined:

c_{j} = α \cdot a_{j} + (1 - α) \cdot b_{j}

(11)

where

c_{j}

denotes the resulting child parameter,

a_{j}

and

b_{j}

are the j-th parameters of agents A and a randomly selected second agent B, and

α

is a random factor. The agent parameters are then further mutated as

A_{i k} = A_{i k} + m d \cdot r n d or A_{i k} = A_{i k} - m d \cdot r n d

(12)

This equation represents the mutation of the k-th parameter in an agent, where

m d

is the mutation direction parameter, and

r n d

is a random value within a specified parameter constraint. The mutation can either add or subtract half of

r n d

from the original parameter value, depending on the mutation direction. Once an agent has been combined and mutated, the worst solution in the population based on an objective function is replaced. The objective function

f (o b j)

can be adjusted depending on the optimization problem being tackled.

To provide a comprehensive understanding of the algorithm, we present its pseudocode in Algorithm 1.

Algorithm 1 Pseudocode of the introduced GIPSO.

Initialize a population, denoted as P

while t is less than T do

Evaluate the solutions in P using the objective function

for Each solution X in P do

Update agent locations by applying the PSO search

Generate a new solution, referred to as

N S

, using a genetically inspired mechanism

Mutate the parameters of

N S

Replace the worst solution in P with

N S

end for

end while

return The best solution attained within P

The computational complexity of the introduced algorithm remains the same relative to the original as evaluations are only carried out after a new solution has been generated and the worst option replaced. Nevertheless, it is important to note that the implementation complexity of the introduced modified version might be slightly higher compared to the original. However, this potential drawback is considered acceptable when considering the increased boosts to performance.

4. Experimental Setup

The dataset exploited in this research is the heart rate time series from the Massachusetts Institute of Technology (MIT), publicly available online (https://www.kaggle.com/datasets/ahmadsaeed1007/heart-rate-time-series-mitbih-database, accessed on 13 December 2023). The data are prepared for ML processing and consist of 1800 measurements evenly spaced at intervals of 0.5 s, measured for up to a total of 15 min of monitoring. Readings are captured from 12 sensors (electrodes) on the chest. A visualization of the dataset can be observed in Figure 1. The shown features are time steps that have shown the highest importance following the best-model analysis described later in this work. Normal activity is indicated by a white background while anomalous activity has a red background. Anomalous activity can be considered as any irregular or abnormal heartbeat, otherwise known as an arrhythmia [64].

In the experimentation, 15 lags of data, approximately equivalent to

\frac{1}{8}

of a second of EEG readings, are used. These inputs are provided to 15 input neurons of an RNN. The number of hidden layers is optimized within

[1, 3]

, favoring lighter networks in order to reduce computational demands, and neuron counts in each of the hidden layers are selected in a range of

[\frac{l a g s}{2}, l a g s]

. In this research, neuron optimization is, therefore, carried out in a range of

[8, 15]

. The training parameters are also optimized. The number of training epochs is tuned from the range

[30, 60]

, learning rate from the range

[0.01, 0.0001]

, and dropout from

[0.2, 0.05]

.

The large search space presented by the hyperparameters warrants the use of algorithms capable of effectively addressing complex optimizations. Several contemporary optimization metaheuristics are employed alongside the introduced GIPSO. The metaheuristics are implemented under identical conditions with a population size of 6, and allocated 8 iterations to improve the population quality. To account for variations due to random factors associated with metaheuristic algorithms, the experiments are repeated over 30 independent runs. This also provides a base for further statistical validation of the outcomes. The algorithms included in the comparative performance analysis include the original algorithms used as inspiration, the PSO [36] and GA [37] algorithms, as well as other well-established optimizers, the FA [39], SCA [38] GWO [40], RSA [41], and COLSHADE [42] algorithms.

During the experiments, the initial 70% of the data are used to train the models with parameters selected by the metaheuristic algorithms. A subsequent 10% of the remaining data are used as validation data. The constructed models are repeatedly evaluated using the validation dataset, and their control parameters updated accordingly using metaheuristics. To ensure a valid comparison, the remaining 20% of the data are reserved only for testing. The best constructed models are verified to ensure that no over-fitting has occurred using the elbow method, and early stopping is used to help prevent over-training, with a patience of one-third of the total number of selected epochs.

To provide a comprehensive assessment of the constructed models in comparison to those constructed by other contemporary optimizers, a battery of standard classification metrics including accuracy, precision, recall, and F1 score [65] are utilized. The error rate is used as the objective function for optimization, determined as

E r r o r = 1 - a c c u r a c y

. Further metrics include Cohen’s kappa [66], described in Equation (13), which gives a more complete assessment in cases when unbalanced datasets are utilized:

κ = \frac{z_{o} - z_{e}}{1 - z_{e}}

(13)

in which

z_{o}

and

z_{e}

represent the observed and expected classification values. Cohen’s kappa is used as the indicator function during the optimizations.

Finally, Figure 2 illustrates the experimental framework flowchart.

5. Experimental Outcomes

The experimental outcomes, in terms of the best and worst, as well as the mean and median outcomes throughout 30 independent executions, are provided in Table 1. Further outcomes, in terms of the standard deviation and variance that demonstrate the stability of each optimum, are also provided. Indicator function outcomes are showcased in the same format in Table 2.

Overall the metrics indicate that the introduced GIPSO algorithm attained the best outcome in the best-case scenario. However, the relatively novel RSA shows an impressive performance, attaining better outcomes for the worst-case execution, and thereby showing better outcomes in terms of the mean and median. This improvement carries over to algorithm stability, with the RSA demonstrating higher levels of stability in terms of objective as well as indicator functions. Nevertheless, the modifications applied to the PSO algorithm demonstrate improvements, with the GIPSO algorithm outperforming the original algorithm as well as the GA across all test cases and showing a significant improvement in terms of stability.

Comparisons in terms of algorithm stability across the objective and indicator functions are further reinforced by the outcomes shown in Figure 3.

Clear stability improvements over the best PSO and GA can be observed for the GIPSO algorithm. However, the admirable performance of the RSA also needs to be noted, as the metaheuristic demonstrated impressive stability in comparison to competing metaheuristics.

Improvements in the convergence can be observed in the convergence graphs for the best-performing models shown in Figure 4.

It is important to note that the exploitation power of the algorithm plays a significant factor in this optimization. And it is evident that the introduced algorithm managed to avoid local optima and locate the most promising region within the local search space that presents the best outcomes. Further, detailed metrics for the best-performing models formulated by each metaheuristic are provided in Table 3.

As can be observed in Table 3, the best-performing model constructed by the introduced metaheuristic demonstrates a clear superiority in comparison to other metaheuristics, attaining the best evaluation score across all but one metric. The best-performing model is further assessed with the ROC and PR curves demonstrated in Figure 5 and the confusion matrix in Figure 6.

It can be deduced that the introduced algorithms constructed a model with parameters optimized for the task of ECG anomaly detection, altering the model to the provided data in such a way that the attained classification accuracy for normal activity reached

91.4 %

with misclassifications occurring in only

8.6 %

of cases. Anomalous activity is detected with an even higher accuracy of

99.1 %

, with misclassifications occurring in only

0.9 %

of cases.

To encourage experimental repeatability by independent researchers, the parameters selected by each optimization algorithm for their respective best-performing model are provided in Table 4.

5.1. Statistical Validation of Outcomes

Because experimental results alone are frequently insufficient to state that one algorithm surpasses its competitors, scientists in modern computer research must establish if the offered enhancements are statistically significant. This study put eight techniques for configuring RNN networks for time-series classification of aberrant cardiac activity to the test, including the proposed GIPSO metaheuristics.

According to the recommendations in [67], statistical tests in such scenarios should involve creating a representative collection of outcomes for each method, which involves creating a sample of outcomes by determining average objective values over several independent executions for each problem. However, this technique may not be appropriate when dealing with outliers that originate from a non-normal distribution, perhaps leading to misleading findings. According to [67], an unresolved debate remains about whether using the mean objective function value in statistical tests is acceptable for comparing stochastic approaches. Despite these possible disadvantages, the classification error rate objective function was averaged across 30 separate runs to compare 10 approaches for detecting ECG anomalies.

After performing the Shapiro–Wilk test [68] for single-problem analysis using the described procedure, a decision was made: a data sample was constructed for each algorithm and each problem by gathering the results of each run, and the corresponding p-values were computed for all method–problem combinations. Table 5 shows the resultant p-values.

These outcomes are further enforced in Figure 7, showing the distributions of objective function outcomes of each optimizer over 30 independent runs.

The null hypothesis may be rejected since the p-values in Table 5 are all less than the significance threshold, denoted as

α

, which is set to

0.05

. As a result, the data samples for solutions do not all come from the Gaussian distribution, implying that using the average objective value in future statistical tests is not appropriate. As a consequence, the best results were used for further statistical analysis in this study. Because the normalcy assumption was not met, parametric tests were inapplicable. As a consequence, in the next stage, the non-parametric Wilcoxon signed-rank [69] test was utilized. This test can be conducted on the same data series that contains the best values obtained in each metaheuristic run.

In this test, the developed algorithm serves as the control algorithm, and the Wilcoxon signed-rank test was performed on the supplied data series. In all three observed occurrences, the estimated p-values were less than

0.05

. Using the

a l p h a = 0.1

significance threshold, these findings reveal that the new algorithm statistically outperformed all competing techniques by a significant margin. The total results of the Wilcoxon signed-rank test are shown in Table 6.

5.2. Best Model Interpretation

It is increasingly important to build trust in ML models, especially when tackling important topics such as healthcare and diagnosis. Further model analysis can highlight issues with the model and help improve methods of data collection. There are several methods for tackling model interpretation. Determining model sensitivity is one approach. However, when dealing with complex multi-layer networks it is often helpful to consider methods that apply model approximations.

The Shapley additive explanations (SHAP) [70] technique relies on game theory in order to build a better understanding of features and their impact on model decisions. Furthermore, this unique approach allows us to consider individual interactions between feature contributions. This work applies the Python implementation of SHAP to analyze the best-constructed anomaly detection model. The outcomes are demonstrated in Figure 8.

The impact of the values in each step of the ECG sequence can be observed in the interpretation outcomes. Each individual sample in the ECG is numerically labeled. It can be observed that early samples have the highest influence of anomalous ECG activity, latter sequences also indicate abnormal readings. However, a fairly consistent importance is observed across all features.

6. Conclusions

The conducted research applied metaheuristic algorithms in order to optimize an RNN architecture and training parameters in order to construct models that demonstrate the best results when applied to ECG anomaly detection and classification. A total of seven contemporary metaheuristics were assessed for their ability to optimize hyperparameters and the constructed models were applied and evaluated on a publicly available real-world dataset. An additional modified metaheuristic was introduced that combined ideas borrowed from the GA and incorporated them into the PSO algorithm to improve performance. Several metrics were included in order to provide in-depth comparisons between algorithms. The introduced algorithm created the single best-performing model, outperforming the base PSO algorithm as well as the GA, attaining an accuracy of

91.8695 %

. Improvements to the exploration and exploitation power of the algorithm were observed in the modified metaheuristic. Additionally, it is important to note the observed great performance of the RSA as well, which demonstrated good performance despite not attaining an optimal model. The constructed models were rigorously statistically validated to confirm the improvements, and the best-performing model was subjected to rigorous sensitivity analysis to attain further insight into the decision-making process.

Some limitations exist within this research. Due to limited computational resources, smaller populations and limited executions were carried out. Only a small subset of available optimization algorithms were evaluated on this problem. Additionally, the capabilities of RNN variants such as LSTM and GRU were explored due to computational constraints. These algorithms present a potential future point of focus in subsequent works. Nevertheless, the proposed method may be used in clinical monitoring for near-real-time analysis. The hardware dictates the execution speed much more than the algorithm itself. Additionally, the execution period is negligibly small compared to optimization. Results may be obtained almost in real-time; only the initial lag must pass as a delay.

In future works wehope to address some of the limitations of this research, expand on the available set of tools for experts and care providers, and offer a better diagnostic technique. Additionally, the potential of recognizing and providing specific disease diagnoses will be explored to further enhance the clinical utility of the model. Furthermore, the potential of the introduced modified algorithm will be explored on other pressing optimization challenges. Finally, modified versions of the RSA will be explored for tackling optimization for ECG data in combination with other algorithms, as significant potential has been observed in this research.

Author Contributions

Conceptualization, A.M., L.J. and N.B.; methodology, A.M., N.B. and M.Z.; software, A.M. and L.J.; validation, M.Z., R.S., P.S. and M.D.; formal analysis, A.P. and M.D.; investigation, A.M., L.J. and N.B.; resources, A.P.; data curation, A.M. and L.J.; writing—original draft preparation, A.M.; writing—review and editing, P.S., C.S., A.P. and R.S.; visualization, A.M. and L.J.; supervision, N.B.; project administration, N.B.; funding acquisition, N.B. and C.S. All authors have read and agreed to the published version of the manuscript.

Funding

This article is partially based upon work from COST Action ROAR-NET-Randomised Optimisation Algorithms Research Network, CA22137, supported by COST (European Cooperation in Science and Technology).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The dataset utilized in this research is the heart rate time series from the Massachusetts Institute of Technology (MIT), which is publicly available online (https://www.kaggle.com/datasets/ahmadsaeed1007/heart-rate-time-series-mitbih-database, accessed on 13 December 2023). All data supporting the reported results can be accessed via the provided link.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

RNN	Recurrent neural network
ECG	Electrocardiogram
GA	Genetic algorithm
PSO	Particle swarm optimization
RSA	Random search algorithm
LSTM	Long short-term memory
GRU	Gated recurrent unit
SCA	Simulated annealing
FA	Firefly algorithm
GWO	Grey wolf optimization
RSA	Random search algorithm

References

Mc Namara, K.; Alzubaidi, H.; Jackson, J.K. Cardiovascular disease as a leading cause of death: How are pharmacists getting involved? Integr. Pharm. Res. Pract. 2019, 8, 1–11. [Google Scholar] [CrossRef] [PubMed]
Ezzati, M.; Obermeyer, Z.; Tzoulaki, I.; Mayosi, B.M.; Elliott, P.; Leon, D.A. Contributions of risk factors and medical care to cardiovascular mortality trends. Nat. Rev. Cardiol. 2015, 12, 508–530. [Google Scholar] [CrossRef] [PubMed]
Keeney, R.L. Personal decisions are the leading cause of death. Oper. Res. 2008, 56, 1335–1347. [Google Scholar] [CrossRef]
Berkaya, S.K.; Uysal, A.K.; Gunal, E.S.; Ergin, S.; Gunal, S.; Gulmezoglu, M.B. A survey on ECG analysis. Biomed. Signal Process. Control 2018, 43, 216–235. [Google Scholar] [CrossRef]
Zhang, Q.; Frick, K. All-ECG: A least-number of leads ECG monitor for standard 12-lead ECG Tracking during Motion. In Proceedings of the 2019 IEEE Healthcare Innovations and Point of Care Technologies, (HI-POCT), Bethesda, MD, USA, 20–22 November 2019; pp. 103–106. [Google Scholar]
Antczak, K. Deep recurrent neural networks for ECG signal denoising. arXiv 2018, arXiv:1807.11551. [Google Scholar]
Hassaballah, M.; Wazery, Y.M.; Ibrahim, I.E.; Farag, A. Ecg heartbeat classification using machine learning and metaheuristic optimization for smart healthcare systems. Bioengineering 2023, 10, 429. [Google Scholar] [CrossRef]
Bohr, A.; Memarzadeh, K. The rise of artificial intelligence in healthcare applications. In Artificial Intelligence in Healthcare; Elsevier: Amsterdam, The Netherlands, 2020; pp. 25–60. [Google Scholar]
Wolpert, D.H.; Macready, W.G. No free lunch theorems for optimization. IEEE Trans. Evol. Comput. 1997, 1, 67–82. [Google Scholar] [CrossRef]
Smith, C.; Jin, Y. Evolutionary multi-objective generation of recurrent neural network ensembles for time series prediction. Neurocomputing 2014, 143, 302–311. [Google Scholar] [CrossRef]
Velichko, A.; Huyut, M.T.; Belyaev, M.; Izotov, Y.; Korzun, D. Machine learning sensors for diagnosis of COVID-19 disease using routine blood values for internet of things application. Sensors 2022, 22, 7886. [Google Scholar] [CrossRef]
Akgönüllü, S.; Özgür, E.; Denizli, A. Quartz crystal microbalance-based aptasensors for medical diagnosis. Micromachines 2022, 13, 1441. [Google Scholar] [CrossRef]
Sadhu, P.K.; Yanambaka, V.P.; Abdelgawad, A.; Yelamarthi, K. Prospect of internet of medical things: A review on security requirements and solutions. Sensors 2022, 22, 5517. [Google Scholar] [CrossRef] [PubMed]
Al-Kahtani, M.S.; Khan, F.; Taekeun, W. Application of internet of things and sensors in healthcare. Sensors 2022, 22, 5738. [Google Scholar] [CrossRef] [PubMed]
Phan, D.T.; Nguyen, C.H.; Nguyen, T.D.P.; Tran, L.H.; Park, S.; Choi, J.; Lee, B.i.; Oh, J. A flexible, wearable, and wireless biosensor patch with internet of medical things applications. Biosensors 2022, 12, 139. [Google Scholar] [CrossRef] [PubMed]
Islam, M.M.; Islam, M.Z.; Asraf, A.; Al-Rakhami, M.S.; Ding, W.; Sodhro, A.H. Diagnosis of COVID-19 from X-rays using combined CNN-RNN architecture with transfer learning. BenchCouncil Trans. Benchmarks Stand. Eval. 2022, 2, 100088. [Google Scholar] [CrossRef]
Kamble, D.D.; Kale, P.H.; Nitture, S.P.; Waghmare, K.V.; Aher, C.N. Heart disease detection through deep learning model RNN. In Proceedings of the Smart Intelligent Computing and Applications, Volume 2: Proceedings of Fifth International Conference on Smart Computing and Informatics (SCI 2021); Springer: Berlin/Heidelberg, Germany, 2022; pp. 469–480. [Google Scholar]
Djemili, R.; Djemili, I. Nonlinear and chaos features over EMD/VMD decomposition methods for ictal EEG signals detection. Comput. Methods Biomech. Biomed. Eng. 2023, 1–20. [Google Scholar] [CrossRef] [PubMed]
Pandey, P.; Seeja, K. Subject independent emotion recognition from EEG using VMD and deep learning. J. King Saud Univ.-Comput. Inf. Sci. 2022, 34, 1730–1738. [Google Scholar] [CrossRef]
Saini, S.K.; Gupta, R. Artificial intelligence methods for analysis of electrocardiogram signals for cardiac abnormalities: State-of-the-art and future challenges. Artif. Intell. Rev. 2022, 55, 1519–1565. [Google Scholar] [CrossRef]
Mincholé, A.; Camps, J.; Lyon, A.; Rodríguez, B. Machine learning in the electrocardiogram. J. Electrocardiol. 2019, 57, S61–S64. [Google Scholar] [CrossRef]
Fiorina, L.; Maupain, C.; Gardella, C.; Manenti, V.; Salerno, F.; Socie, P.; Li, J.; Henry, C.; Plesse, A.; Narayanan, K.; et al. Evaluation of an ambulatory ECG analysis platform using deep neural networks in routine clinical practice. J. Am. Heart Assoc. 2022, 11, e026196. [Google Scholar] [CrossRef]
Jin, Y.; Li, Z.; Qin, C.; Liu, J.; Liu, Y.; Zhao, L.; Liu, C. A novel attentional deep neural network-based assessment method for ECG quality. Biomed. Signal Process. Control 2023, 79, 104064. [Google Scholar] [CrossRef]
Li, W.; Tang, Y.M.; Yu, K.M.; To, S. SLC-GAN: An automated myocardial infarction detection model based on generative adversarial networks and convolutional neural networks with single-lead electrocardiogram synthesis. Inf. Sci. 2022, 589, 738–750. [Google Scholar] [CrossRef]
Boda, S.; Mahadevappa, M.; Dutta, P.K. An automated patient-specific ECG beat classification using LSTM-based recurrent neural networks. Biomed. Signal Process. Control 2023, 84, 104756. [Google Scholar] [CrossRef]
Lee, J.A.; Kwak, K.C. Personal identification using an ensemble approach of 1D-LSTM and 2D-CNN with electrocardiogram signals. Appl. Sci. 2022, 12, 2692. [Google Scholar] [CrossRef]
Rai, H.M.; Chatterjee, K. Hybrid CNN-LSTM deep learning model and ensemble technique for automatic detection of myocardial infarction using big ECG data. Appl. Intell. 2022, 52, 5366–5384. [Google Scholar] [CrossRef]
Wasimuddin, M.; Elleithy, K.; Abuzneid, A.S.; Faezipour, M.; Abuzaghleh, O. Stages-based ECG signal analysis from traditional signal processing to machine learning approaches: A survey. IEEE Access 2020, 8, 177782–177803. [Google Scholar] [CrossRef]
Celin, S.; Vasanth, K. ECG signal classification using various machine learning techniques. J. Med. Syst. 2018, 42, 241. [Google Scholar] [CrossRef] [PubMed]
Maturo, F.; Verde, R. Pooling random forest and functional data analysis for biomedical signals supervised classification: Theory and application to electrocardiogram data. Stat. Med. 2022, 41, 2247–2275. [Google Scholar] [CrossRef]
Myrovali, E.; Hristu-Varsakelis, D.; Tachmatzidis, D.; Antoniadis, A.; Vassilikos, V. Identifying patients with paroxysmal atrial fibrillation from sinus rhythm ECG using random forests. Expert Syst. Appl. 2023, 213, 118948. [Google Scholar] [CrossRef]
Ma, C.; Wang, Z.; Yang, M.; Li, J.; Liu, C. Decision Tree-based Model for Signal Quality Scanning in Wearable ECG. In Proceedings of the 2022 Computing in Cardiology (CinC), Tampere, Finland, 4–7 September 2022; Volume 498, pp. 1–4. [Google Scholar]
Botros, J.; Mourad-Chehade, F.; Laplanche, D. CNN and SVM-Based Models for the Detection of Heart Failure Using Electrocardiogram Signals. Sensors 2022, 22, 9190. [Google Scholar] [CrossRef]
Mattheakis, M.; Protopapas, P. Recurrent neural networks: Exploding vanishing gradients & reservoir computing. In Advanced Topics in Data Science; Harvard Press: Cambridge, MA, USA, 2019. [Google Scholar]
Beni, G. Swarm intelligence. In Complex Social and Behavioral Systems: Game Theory and Agent-Based Models; Springer: New York, NY, USA, 2020; pp. 791–818. [Google Scholar]
Eberhart, R.; Kennedy, J. Particle swarm optimization. In Proceedings of the IEEE International Conference on Neural Networks, Perth, WA, Australia, 27 November–1 December 1995; Volume 4, pp. 1942–1948. [Google Scholar]
Mirjalili, S.; Mirjalili, S. Genetic algorithm. In Evolutionary Algorithms and Neural Networks: Theory and Applications; Springer: Cham, Switzerland, 2019; pp. 43–55. [Google Scholar]
Mirjalili, S. SCA: A sine cosine algorithm for solving optimization problems. Knowl.-Based Syst. 2016, 96, 120–133. [Google Scholar] [CrossRef]
Yang, X.S.; Slowik, A. Firefly algorithm. In Swarm Intelligence Algorithms; CRC Press: Boca Raton, FL, USA, 2020; pp. 163–174. [Google Scholar]
Faris, H.; Aljarah, I.; Al-Betar, M.A.; Mirjalili, S. Grey wolf optimizer: A review of recent variants and applications. Neural Comput. Appl. 2018, 30, 413–435. [Google Scholar] [CrossRef]
Abualigah, L.; Abd Elaziz, M.; Sumari, P.; Geem, Z.W.; Gandomi, A.H. Reptile Search Algorithm (RSA): A nature-inspired meta-heuristic optimizer. Expert Syst. Appl. 2022, 191, 116158. [Google Scholar] [CrossRef]
Gurrola-Ramos, J.; Hernàndez-Aguirre, A.; Dalmau-Cedeño, O. COLSHADE for real-world single-objective constrained optimization problems. In Proceedings of the 2020 IEEE Congress on Evolutionary Computation (CEC), Glasgow, UK, 19–24 July 2020; pp. 1–8. [Google Scholar]
Bezdan, T.; Zivkovic, M.; Tuba, E.; Strumberger, I.; Bacanin, N.; Tuba, M. Glioma brain tumor grade classification from mri using convolutional neural networks designed by modified fa. In Proceedings of the International Conference on Intelligent and Fuzzy Systems, Istanbul, Turkey, 21–23 July 2020; Springer: Cham, Switzerland, 2020; pp. 955–963. [Google Scholar]
Jovanovic, D.; Antonijevic, M.; Stankovic, M.; Zivkovic, M.; Tanaskovic, M.; Bacanin, N. Tuning machine learning models using a group search firefly algorithm for credit card fraud detection. Mathematics 2022, 10, 2272. [Google Scholar] [CrossRef]
Petrovic, A.; Bacanin, N.; Zivkovic, M.; Marjanovic, M.; Antonijevic, M.; Strumberger, I. The adaboost approach tuned by firefly metaheuristics for fraud detection. In Proceedings of the 2022 IEEE World Conference on Applied Intelligence and Computing (AIC), Sonbhadra, India, 17–19 June 2022; pp. 834–839. [Google Scholar]
Strumberger, I.; Tuba, E.; Zivkovic, M.; Bacanin, N.; Beko, M.; Tuba, M. Dynamic search tree growth algorithm for global optimization. In Technological Innovation for Industry and Service Systems: Proceedings of the 10th IFIP WG 5.5/SOCOLNET Advanced Doctoral Conference on Computing, Electrical and Industrial Systems, DoCEIS 2019, Costa de Caparica, Portugal, May 8–10, 2019, Proceedings 10; Springer: Cham, Switzerland, 2019; pp. 143–153. [Google Scholar]
Zamani, H.; Nadimi-Shahraki, M.H.; Gandomi, A.H. Starling murmuration optimizer: A novel bio-inspired algorithm for global and engineering optimization. Comput. Methods Appl. Mech. Eng. 2022, 392, 114616. [Google Scholar] [CrossRef]
Nadimi-Shahraki, M.H.; Zamani, H. DMDE: Diversity-maintained multi-trial vector differential evolution algorithm for non-decomposition large-scale global optimization. Expert Syst. Appl. 2022, 198, 116895. [Google Scholar] [CrossRef]
Bacanin, N.; Bezdan, T.; Tuba, E.; Strumberger, I.; Tuba, M.; Zivkovic, M. Task scheduling in cloud computing environment by grey wolf optimizer. In Proceedings of the 2019 27th Telecommunications Forum (TELFOR), Belgrade, Serbia, 26–27 November 2019; pp. 1–4. [Google Scholar]
Zivkovic, M.; Venkatachalam, K.; Bacanin, N.; Djordjevic, A.; Antonijevic, M.; Strumberger, I.; Rashid, T.A. Hybrid genetic algorithm and machine learning method for COVID-19 cases prediction. In Proceedings of the International Conference on Sustainable Expert Systems: ICSES 2020; Springer: Singapore, 2021; pp. 169–184. [Google Scholar]
Bezdan, T.; Cvetnic, D.; Gajic, L.; Zivkovic, M.; Strumberger, I.; Bacanin, N. Feature selection by firefly algorithm with improved initialization strategy. In Proceedings of the 7th Conference on the Engineering of Computer Based Systems, Novi Sad, Serbia, 26–27 May 2021; pp. 1–8. [Google Scholar]
Zivkovic, M.; Bacanin, N.; Tuba, E.; Strumberger, I.; Bezdan, T.; Tuba, M. Wireless sensor networks life time optimization based on the improved firefly algorithm. In Proceedings of the 2020 International Wireless Communications and Mobile Computing (IWCMC), Limassol, Cyprus, 15–19 June 2020; pp. 1176–1181. [Google Scholar]
Zivkovic, M.; Bacanin, N.; Zivkovic, T.; Strumberger, I.; Tuba, E.; Tuba, M. Enhanced grey wolf algorithm for energy efficient wireless sensor networks. In Proceedings of the 2020 Zooming Innovation in Consumer Technologies Conference (ZINC), Novi Sad, Serbia, 26–27 May 2020; pp. 87–92. [Google Scholar]
Ahmadpour, M.R.; Ghadiri, H.; Hajian, S.R. Model predictive control optimisation using the metaheuristic optimisation for blood pressure control. IET Syst. Biol. 2021, 15, 41–52. [Google Scholar] [CrossRef] [PubMed]
Khan, M.A.; Algarni, F. A healthcare monitoring system for the diagnosis of heart disease in the IoMT cloud environment using MSSO-ANFIS. IEEE Access 2020, 8, 122259–122269. [Google Scholar] [CrossRef]
Gupta, A.; Chhikara, R. Diabetic retinopathy: Present and past. Procedia Comput. Sci. 2018, 132, 1432–1440. [Google Scholar] [CrossRef]
Mahbod, A.; Schaefer, G.; Wang, C.; Ecker, R.; Ellinge, I. Skin lesion classification using hybrid deep neural networks. In Proceedings of the ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 12–17 May 2019; pp. 1229–1233. [Google Scholar]
Ren, Z.; Zhang, Y.; Wang, S. A hybrid framework for lung cancer classification. Electronics 2022, 11, 1614. [Google Scholar] [CrossRef]
Zhang, Y.; Zhao, Z.; Deng, Y.; Zhang, X.; Zhang, Y. ECGID: A human identification method based on adaptive particle swarm optimization and the bidirectional LSTM model. Front. Inf. Technol. Electron. Eng. 2021, 22, 1641–1654. [Google Scholar] [CrossRef]
Baños, F.S.; Romero, N.H.; Mora, J.C.S.T.; Marín, J.M.; Vite, I.B.; Fuentes, G.E.A. A Novel Hybrid Model Based on Convolutional Neural Network with Particle Swarm Optimization Algorithm for Classification of Cardiac Arrhytmias. IEEE Access 2023, 11, 55515–55532. [Google Scholar] [CrossRef]
Karthiga, M.; Santhi, V.; Sountharrajan, S. Hybrid optimized convolutional neural network for efficient classification of ECG signals in healthcare monitoring. Biomed. Signal Process. Control 2022, 76, 103731. [Google Scholar] [CrossRef]
Predić, B.; Jovanovic, L.; Simic, V.; Bacanin, N.; Zivkovic, M.; Spalevic, P.; Budimirovic, N.; Dobrojevic, M. Cloud-load forecasting via decomposition-aided attention recurrent neural network tuned by modified particle swarm optimization. Complex Intell. Syst. 2023, 1–21. [Google Scholar] [CrossRef]
Jiang, S.; Yang, S.; Yao, X.; Tan, K.C.; Kaiser, M.; Krasnogor, N. Benchmark Functions for the cec’2018 Competition on Dynamic Multiobjective Optimization; Technical Report; Newcastle University: Newcastle upon Tyne, UK, 2018. [Google Scholar]
Kennedy, H.L. Silent atrial fibrillation: Definition, clarification, and unanswered issues. Ann. Noninvasive Electrocardiol. 2015, 20, 518–525. [Google Scholar] [CrossRef]
Hossin, M.; Sulaiman, M.N. A review on evaluation metrics for data classification evaluations. Int. J. Data Min. Knowl. Manag. Process 2015, 5, 1. [Google Scholar]
Warrens, M.J. Five ways to look at Cohen’s kappa. J. Psychol. Psychother. 2015, 5. [Google Scholar] [CrossRef]
Eftimov, T.; Korošec, P.; Seljak, B.K. Disadvantages of statistical comparison of stochastic optimization algorithms. In Proceedings of the Bioinspired Optimizaiton Methods and Their Applications, BIOMA, Bled, Slovenia, 18–20 May 2016; pp. 105–118. [Google Scholar]
Shapiro, S.S.; Francia, R. An approximate analysis of variance test for normality. J. Am. Stat. Assoc. 1972, 67, 215–216. [Google Scholar] [CrossRef]
Wilcoxon, F. Individual comparisons by ranking methods. In Breakthroughs in Statistics: Methodology and Distribution; Springer: Berlin/Heidelberg, Germany, 1992; pp. 196–202. [Google Scholar]
Lundberg, S.M.; Lee, S.I. A Unified Approach to Interpreting Model Predictions. In Advances in Neural Information Processing Systems 30; Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R., Eds.; Curran Associates, Inc.: New York, NY, USA, 2017; pp. 4765–4774. [Google Scholar]

Figure 1. Dataset visualization.

Figure 2. Experimental framework flowchart.

Figure 3. Objective and indicator function distributions.

Figure 4. Objective and indicator function convergence graphs.

Figure 5. Best-performing model’s ROC and PR curves.

Figure 6. Best-performing model’s confusion matrix.

Figure 7. Objective function outcome KDE plots for each metaheuristic.

Figure 8. SHAP analysis summary and feature impact outcomes.

Table 1. Objective function outcomes over 30 independent runs. Best results are written in bold.

Method	Best	Worst	Mean	Median	Std	Var
RNN-GIPSO	0.020718	0.022790	0.021984	0.022445	0.000906	8.21 × 10 $^{- 7}$
RNN-PSO	0.021754	0.029006	0.026473	0.028660	0.003340	1.12 × 10 $^{- 5}$
RNN-GA	0.023481	0.032459	0.026819	0.024517	0.004010	1.61 × 10 $^{- 5}$
RNN-SCA	0.024862	0.027624	0.025783	0.024862	0.001302	1.70 × 10 $^{- 6}$
RNN-FA	0.022790	0.024862	0.023596	0.023135	0.000906	8.21 × 10 $^{- 7}$
RNN-GWO	0.021409	0.023135	0.021984	0.021409	0.000814	6.62 × 10 $^{- 7}$
RNN-RSA	0.021409	0.022099	0.021639	0.021409	0.000326	1.06 × 10 $^{- 7}$
RNN-COLSHADE	0.022445	0.029351	0.026128	0.026588	0.002838	8.05 × 10 $^{- 6}$

Table 2. Indicator function outcomes over 30 independent runs. Best results are written in bold.

Method	Best	Worst	Mean	Median	Std	Var
RNN-GIPSO	0.918695	0.912862	0.914520	0.912862	0.002973	8.84 × 10 $^{- 6}$
RNN-PSO	0.915662	0.884873	0.895986	0.887422	0.013952	1.95 × 10 $^{- 4}$
RNN-GA	0.907855	0.873338	0.895263	0.904598	0.015561	2.42 × 10 $^{- 4}$
RNN-SCA	0.902434	0.891184	0.898684	0.902434	0.005303	2.81 × 10 $^{- 5}$
RNN-FA	0.911728	0.902800	0.907884	0.909124	0.003749	1.41 × 10 $^{- 5}$
RNN-GWO	0.916143	0.909804	0.913765	0.915348	0.002819	7.95 × 10 $^{- 6}$
RNN-RSA	0.915985	0.913923	0.915662	0.915985	0.001308	1.71 × 10 $^{- 6}$
RNN-COLSHADE	0.912168	0.885572	0.897502	0.894767	0.011029	1.22 × 10 $^{- 4}$

Table 3. Detailed metrics of the best-performing models. Best results are written in bold.

Method	Metric	Normal	Anomalous	Accuracy	Macro Avg	Weighted Avg
RNN-GIPSO	Precision	0.948357	0.984615	0.979282	0.966486	0.979081
	Recall	0.914027	0.991035	0.979282	0.952531	0.979282
	F1 score	0.930876	0.987815	0.979282	0.959345	0.979124
RNN-PSO	Precision	0.931663	0.986569	0.978246	0.959116	0.978189
	Recall	0.925339	0.987775	0.978246	0.956557	0.978246
	F1 score	0.928490	0.987172	0.978246	0.957831	0.978215
RNN-GA	Precision	0.938967	0.982996	0.976519	0.960982	0.976276
	Recall	0.904977	0.989405	0.976519	0.947191	0.976519
	F1 score	0.921659	0.986190	0.976519	0.953925	0.976341
RNN-SCA	Precision	0.800000	0.800000	0.902434	0.890642	0.902434
	Recall	0.995475	0.802434	0.902434	0.890368	0.902696
	F1 score	0.887096	0.801214	0.902434	0.890458	0.902860
RNN-FA	Precision	0.927273	0.986156	0.977210	0.956715	0.977169
	Recall	0.923077	0.986960	0.977210	0.955018	0.977210
	F1 score	0.925170	0.986558	0.977210	0.955864	0.977189
RNN-GWO	Precision	0.906143	0.926253	0.916143	0.904172	0.916143
	Recall	0.970588	0.926445	0.916143	0.916517	0.916207
	F1 score	0.904225	0.926348	0.916143	0.911682	0.916567
RNN-RSA	Precision	0.946009	0.984211	0.978591	0.965110	0.978380
	Recall	0.911765	0.990628	0.978591	0.951196	0.978591
	F1 score	0.928571	0.987409	0.978591	0.957990	0.978429
RNN-COLSHADE	Precision	0.939394	0.984191	0.977555	0.961793	0.977354
	Recall	0.911765	0.989405	0.977555	0.950585	0.977555
	F1 score	0.925373	0.986791	0.977555	0.956082	0.977417
	Support	442	2454

Table 4. Indicator function outcomes over 30 independent runs.

Method	Learning Rate	Dropout	Epochs	Layers	Neurons L1	Neurons L2
RNN-GIPSO	0.007225	0.150573	58	1	12	/
RNN-PSO	0.008380	0.102960	60	2	15	12
RNN-GA	0.010000	0.050000	54	1	15	/
RNN-SCA	0.007023	0.185083	53	1	8	/
RNN-FA	0.010000	0.200000	49	2	15	15
RNN-GWO	0.003352	0.198888	53	1	13	/
RNN-RSA	0.007580	0.078741	53	2	14	15
RNN-COLSHADE	0.008961	0.112279	48	1	5	/

Table 5. Shapiro–Wilk test scores for the single-problem analysis.

Problem	GIPSO	PSO	GA	SCA	FA	GWO	RSA	COLSHADE
ECG	0.024	0.026	0.017	0.021	0.029	0.035	0.031	0.039

Table 6. Wilcoxon signed-rank test values exhibiting p-values for experiments (GIPSO vs. others).

Problem	PSO	GA	SCA	FA	GWO	RSA	COLSHADE
ECG-RNN	0.003	0.018	0.037	0.04	0.027	0.031	0.028

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Minic, A.; Jovanovic, L.; Bacanin, N.; Stoean, C.; Zivkovic, M.; Spalevic, P.; Petrovic, A.; Dobrojevic, M.; Stoean, R. Applying Recurrent Neural Networks for Anomaly Detection in Electrocardiogram Sensor Data. Sensors 2023, 23, 9878. https://doi.org/10.3390/s23249878

AMA Style

Minic A, Jovanovic L, Bacanin N, Stoean C, Zivkovic M, Spalevic P, Petrovic A, Dobrojevic M, Stoean R. Applying Recurrent Neural Networks for Anomaly Detection in Electrocardiogram Sensor Data. Sensors. 2023; 23(24):9878. https://doi.org/10.3390/s23249878

Chicago/Turabian Style

Minic, Ana, Luka Jovanovic, Nebojsa Bacanin, Catalin Stoean, Miodrag Zivkovic, Petar Spalevic, Aleksandar Petrovic, Milos Dobrojevic, and Ruxandra Stoean. 2023. "Applying Recurrent Neural Networks for Anomaly Detection in Electrocardiogram Sensor Data" Sensors 23, no. 24: 9878. https://doi.org/10.3390/s23249878

APA Style

Minic, A., Jovanovic, L., Bacanin, N., Stoean, C., Zivkovic, M., Spalevic, P., Petrovic, A., Dobrojevic, M., & Stoean, R. (2023). Applying Recurrent Neural Networks for Anomaly Detection in Electrocardiogram Sensor Data. Sensors, 23(24), 9878. https://doi.org/10.3390/s23249878

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Applying Recurrent Neural Networks for Anomaly Detection in Electrocardiogram Sensor Data

Abstract

1. Introduction

2. Background

2.1. AI Approaches in Electrocardiogram Analysis

2.2. Recurrent Neural Networks

2.3. Metaheuristics

3. Methods

3.1. The Original PSO

3.2. Genetically Inspired PSO (GIPSO)

4. Experimental Setup

5. Experimental Outcomes

5.1. Statistical Validation of Outcomes

5.2. Best Model Interpretation

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI