1. Introduction
Excessive exercise is a primary cause of sports injuries and sudden death. Therefore, fatigue detection is of vital importance [
1,
2,
3]. As a vital sign monitoring technology, an ECG is extensively utilized in modern healthcare devices, owing to its noninvasiveness and convenience. The HRV features of ECG signals are measured by the corresponding R-peak location of ECG signals, providing key insights into heart health, stress response, and exercise performance. Sivanantham used linear and nonlinear HRV features for cardiac arrhythmia detection and achieved an accuracy of 90.26% [
4]. Jos Vicente focused on the detection of driver’s drowsiness by means of HRV analysis and achieved a sensitivity of 85% for drowsy minutes classification [
5]. Meng proposed a sleep stages classification method based on HRV features, achieving an average accuracy of 88.67% for subject-specific classifiers [
6]. Mustafa Radha used Long Short-Term Memory (LSTM) neural networks to classify sleep stages based on HRV features, achieving an accuracy of 77.00 ± 8.90% across the entire database [
7]. Marta Vigier used HRV features for cancer classification, achieving a classification accuracy of 86% for the ensemble model [
8]. To enhance the accuracy of classification, methods based on fusion features have been adopted. Chen integrated HRV features and ECG image features for exercise fatigue classification and the accuracy value improved to 94.32% [
9]. Cheol Ho Song developed a multi-dimensional feature fusion-based stress classification method by combining LSTM and Xception, which overcame the limitations of traditional HRV-based stress classification and achieved an accuracy of 99.51% [
10]. Ahmed S. Eltrass extracted ECG features by integrating HRV features, other linear and nonlinear features extracted from ECG, and features of the deep neural network method, and the corresponding proposed diagnostic system achieved an accuracy of 98.75% [
11]. Prakash put forward a modified 1-D U-Net architecture to identify electrocardiogram fiducial points and segment the signals. Subsequently, features were extracted and classified using the Random Forest algorithm. The proposed method was validated on two publicly available databases, namely LUDB and MIT-BIH. Experimental results demonstrate that the performance of the proposed method was superior to that of state-of-the-art techniques [
12]. Although all the aforementioned methods have introduced effective features and achieved good results, none of them have focused on enhancing the reliability of HRV features, which may limit further improvements in performance.
As mentioned above, HRV features are important physiological indicators for analyzing ECG signals. Note that, the correct R-peak detection is the base of the accurate HRV features. Recently, R-peak detection techniques have been predominantly focused on the medical field, where the ECG signals are relatively stable and the noise interference is small and controlled. So far, the R-peak detection research can be mainly divided into two categories: the classical threshold-based methods and deep learning methods. The classical R-peak detection algorithms primarily enhance the energy of the QRS complex, followed by the amplitude-threshold-based R-peak detection [
13]. For example, JP Martínez et al. developed a wavelet-based detector that remains outstanding in standard databases [
14]. These algorithms could rapidly detect R-peaks, but few consider robustness to noise ECG signals. Deep learning methods have shown remarkable performance in R-peak detection. Wang proposed two parallel 1D residual neural networks to capture the time-domain characteristics of QRS waveform and attained a sensitivity value of 99.92% on the MIT-BIH Arrhythmia dataset (MITDB) [
15]. Laitala applied LSTM networks to construct the R-peak detector [
16]. Vijayrangan combined U-Net, Inception, and Residual blocks to detect R-peak on a combined dataset and achieved nearly an accuracy of 98.37% [
17]. Mehri proposed an R-peak detection architecture for 3D vectorcardiogram (VCG) that requires no preprocessing or post-processing steps. Experiments were conducted on four different public databases. Using the proposed approach, high F1 scores of 99.80% and 99.64% were achieved in the leave-one-out cross-validation and cross-database validation protocols [
18]. Wang proposed an end-to-end electrocardiogram waveform detection method (ECT-net) based on a convolutional neural network (CNN) and transformer. It performed best among comparison methods and achieved F1 scores of 94.27% for the P wave, 97.32% for the QRS wave, and 93.92% for the T wave [
19].
At present, most of the research focuses on feature analysis and the classification of medical ECG signals. Unfortunately, it is unreasonable to directly apply these methods to analyze exercise ECG signals. The reason is that exercise ECG signals fluctuate greatly under different exercise intensities, and strong noises are generated during exercise. Thus, for exercise ECG signals, the accuracy of HRV features derived from R-peak detection and the effectiveness of the resulting ECG classification all face the great challenges. Therefore, in order to improve the reliability of HRV features and explore more abundant ECG features for exercise fatigue classification, we have developed an adaptive R-peak detection algorithm suitable for high-intensity exercise and introduced more suitable descriptor features and more subtle deep learning features.
The main goal of this work is to devise an effective solution for exercise fatigue detection based on ECG. Compared with previous studies, in order to enhance the reliability of HRV, we bolster the reliability of R-peak detection. Specifically, we introduce an innovative ECG data generator. This device is purposefully engineered to generate noisy ECG signals, which act as essential training materials for nurturing a robust R-peak detector. At the same time, to overcome the difficulties caused by the scarcity of publicly available exercise ECG data and the lack of precise labels, we carry out intra-class segmentation. This methodology significantly increases the number of samples in our dataset, laying a more stable groundwork for further analysis. Based on the above-mentioned generated training data, a novel R-peak detection method based on BILSTM and multi-scale convolution (called BILSTM-CNN) is proposed, whose inputs are hybrid time–frequency signals. To the best of our knowledge, this is the first attempt to detect R-peaks using time–frequency features. And, an adaptive filtering function is proposed to further filter out unnatural R-peaks during high-intensity exercise. Up to this point, the algorithm specifically designed for detecting R-peaks in ECG signals during high-intensity exercise has been completed. Based on the proposed R-peak detector BILSTM-CNN, accurate HRV features are obtained and then the accuracy of exercise ECG classification is improved. Furthermore, by integrating HRV features with statistical descriptor features and deep learning features extracted from ECG, the proposed ECG classification framework has an accuracy of up to 99.1%, which provides a strong base for exercise fatigue monitoring based on ECG signals. This method provides a certain basis for sports fatigue detection.
The rest of this article is organized as follows:
Section 2 describes the datasets. In
Section 3, pre-processing, the proposed robust and adaptive R-peak detection model BILSTM-CNN and the resulting multi-feature fusion model for exercise ECG classification are introduced.
Section 4 introduces experiments and results, which demonstrate the effectiveness of the proposed R-peak detection model BILSTM-CNN and the novel ECG classification method based on multi-feature fusion. Finally, conclusions and future work are provided in
Section 5.
2. Datasets
In this work, two prominent available databases, the GUDB database [
20] and the EPFL database (
https://zenodo.org/records/5727800; accessed on 25 November 2021) [
21], are used to train and test our R-peak detection model. Clearly, the more accurate the R-peak detection, the higher the accuracy of ECG classification. Thus, the experiment results of ECG classification could further validate the effectiveness of the proposed R-peak detection method. Here, the EPFL database is employed in exercise ECG classification for fatigue detection due to the standard classification of fatigue levels shown in
Figure 1.
(1) GUDB: The GUDB database includes 24 ECG records of 120 s duration from 24 subjects. The ECG signals were recorded using an Attys Bluetooth data acquisition board. This board has a sampling rate of 250 Hz and a resolution of 24-bit over a range of ±2.42 V. All the records are collected during 3 physical activities: treadmill walking, hand-bike usage, and jogging. Since the GUDB dataset has the R-labeled dataset, it is used to train and test R-peak detection for exercise ECG data.
(2) EPFL: The EPFL database contains 100 ECG records from 20 subjects and 5 different segments, each record has a 20s duration. The ECG signal was sampled at 500 Hz and then downsampled at 250 Hz. The original ECG signals were measured at a maximum of 10 mV. Then, they were scaled down by a factor of 1000; hence, the data are represented in uV. In the EPEL dataset, exercise intensities are divided into 5 segments based on the ventilation threshold and VO2max, including before and after the so-called second ventilatory threshold or VT2, before and in the middle of VO2max, and during the recovery after exhaustion shown in
Figure 1. Thus, since the EPFL database is R-labeled and exercise fatigue-labeled, the EPFL database is applied in the R-peak detection and ECG classification estimation during exercise.
5. Discussions and Future Work
In this study, we explore a multi-feature fusion method to improve the exercise ECG classification for fatigue monitoring. Here, three contributions guarantee the effectiveness of the proposed method. First, an adaptive and robust R-peak detection model has been proposed, which achieves more accurate R-peak detection, and more precise HRV features are captured. The proposed R-peak detection method includes the ECG data generator, BILSTM-CNN model, and adaptive post-processing. The ECG data generator ensures robustness; the constructed BILSTM-CNN could capture multi-scale features of exercise ECG signals, and post-processing guarantees the correction of misjudged R-peaks. Second, descriptor features of ECG signals have been verified the effectiveness for classifying exercise ECG signals. Finally, multi-feature fusion method are presented by combining HRV features derived from the proposed R-peak detection model, descriptor features, and deep learning features. Experiment results show that our R-peak detection approach achieves better performance on real exercise ECG data and ECG signals with added noise (MIT-BIH noise and Gaussian noise). In terms of classification performance, extensive experiments validate the superiority of our multi-feature fusion approach over state-of-the-art classification algorithms. While we have made significant progress, there are still potential research directions worth exploring further. Future studies could focus on improving the accuracy and robustness of R-peak detection, as well as exploring more complex deep learning models and feature fusion methods to enhance exercise intensity detection performance.