Detection of Atrial Fibrillation in Holter ECG Recordings by ECHOView Images: A Deep Transfer Learning Study †
Abstract
:1. Introduction
1.1. Atrial Fibrillation Detection in Holter ECG Recordings: Challenges and Current Practices
1.2. Advanced Colormap Modality for ECG Interpretation
1.3. Deep Transfer Learning for ECG Signals
1.4. DNN Models Interpretability
1.5. Research Objectives
2. Materials and Methods
2.1. Holter ECG Database and ECHOView Images
- Width: N pixels, corresponding to the number of QRS complexes within the analysis interval, which varies with heart rate.
- Height: 300 pixels (equivalent to 1.5 s at a 200 Hz sampling rate), defining the resolution of the ECHOView segment. This segment represents the heartbeat pattern within a ±0.75 s window around the QRS annotation.
2.2. Deep Transfer Learning
- Top Layer Retraining
- 2.
- Fine-Tuning
2.3. Performance Evaluation
- Accuracy metrics:
- 2.
- Computational cost:
- Inference time ratio: The ratio of a model’s inference time relative to the inference time of the fastest model. It provides a normalized comparison of processing time.
- Number of parameters (weights and biases) in the model: This directly impacts memory usage, computational complexity, and overall efficiency.
2.4. ECHOView Image Importance
3. Results
3.1. Comparative Study of Retrained and Fine-Tuned ImageNet DNNs for AF Detection
- Retraining of the top layer with 513 to 2049 trainable parameters (Table 2) used learning rate = 1 × 10−3, optimizer = ‘Adam’, loss function = ‘binary cross-entropy’, batch size = 64.
- Fine-tuning of 0.38 M to 23.5 M trainable parameters (Table 2) used frozen batch normalization layers, learning rate = 1 × 10−5, optimizer = ‘Adam’, loss function = ‘binary cross-entropy’, batch size = 32.
- Test 1: Test accuracy vs. computational cost;
- Test 2: Robustness across different patients;
- Test 3: Robustness across ECG leads;
- Test 4: Effect of shortening the analysis duration;
- Test 5: Noise robustness over full-length Holter ECG recordings.
3.1.1. Test Accuracy vs. Computational Cost
- Retrained DNNs: EfficientNetV2B1 (1.92 inference time ratio; 96.3% accuracy) is the most accurate model and has a moderate computational cost. EfficientNetV2B2 (2.33; 95.8%) ranks second but has a higher inference time. EfficientNetV2B0 (1.5; 95.3%) and EfficientNetB1 (2.17; 95.3%) rank third, with a 1% accuracy drop, presenting low and moderate computational costs, respectively.
- Fine-tuned DNNs: MobileNetV2 (α = 1.4) is both fast and highly accurate (1.33; 97.4%). DenseNet121 (1.92; 97.5%) and DenseNet201 (2.75; 97.6%) achieve the highest accuracy but at moderate to high computational costs. InceptionV3 (1.58; 97.3%), EfficientNetV2B1 (1.92; 97.3%), ResNet50 (2.08; 97.3%), and DenseNet169 (2.33; 97.2%) follow closely, balancing strong performance with varying inference times.
3.1.2. Robustness Across Different Patients
3.1.3. Robustness Across ECG Leads
3.1.4. Effect of Shortening the Analysis Duration
3.1.5. Noise Robustness over Full-Length Holter ECG Recordings
3.1.6. DNN Model Rankings Across All Tests
3.2. DNN Model Interpretability for AF Detection: Case Studies
3.2.1. Sinus Rhythm: Correctly Detected Cases
- Electromyographic (EMG) artifacts: The heatmap-enhanced ECG shows that AF detection is not triggered during the intensive EMG artifacts (from 0 to 8 s in trace (A), and from 17 to 27 s in trace (B)), although the ECHOView image presents artifact-induced alterations.
- Motion artifacts: Although a motion-induced baseline drift in trace (D) persists throughout the entire recording and affects the entire ECHOView image, the DNN highlights only a few ECG segments resembling AF, primarily in the isoelectric line between the T and P waves, significantly altered by the noise.
- Multiple ventricular extrasystoles: The presented records show frequent extrasystoles, causing rhythm irregularity. Combined with the challenge of detecting P-waves due to recording artifacts, this makes the diagnosis of sinus rhythm more uncertain. In the presented cases, the DNN is not activated by the ventricular extrasystoles in trace (B). However, in trace (C), DNN is activated only after at least two bigeminy cycles, specifically in the region of the shortened RnRn+1 interval, including the entire extrasystole waveform. Nevertheless, these small activation segments on the heatmap do not affect the DNN’s final decision for non-AF.
3.2.2. Atrial Fibrillation: Correctly Detected Cases
- QRS complexes, corresponding to the most irregular and shortened RR-intervals in traces (A) and (D);
- Atrial fibrillatory f-waves, highlighted on the ECG signal near the isoelectric line in trace (B) and overlapping the T-waves in trace (C). The activated regions on the heatmap align with specific areas of the ECHOView image before and after the central QRS peak (Rn). This is an important notification in case (B), where low ECHOView contrast—due to long RR-intervals and low-amplitude f and T-waves—may impact the diagnostic interpretation.
3.2.3. Atrial Fibrillation: Disputable Cases
- In trace (A), a transition from AF to sinus rhythm is observed. The initial portion of the ECG (about 15 s) shows AF, with identifiable f-waves—key indicators for AF detection—visible either on the isoelectric line or superimposed on the T-waves, as highlighted by the heatmap. However, these regions are limited and insufficient for a definitive AF diagnosis (pAF = 0.395).
- Trace (B) begins with a sinus rhythm, which is disrupted at 16 s by an artifact and a rhythm change. The heatmap is activated immediately after the abrupt rhythm change, where shortened RR intervals are clearly visible in ECHOView. Although the activation corresponds to a relatively short ECG segment of just over 5 s, the DNN classifies it as strong enough for definitive AF detection (pAF = 0.997). However, visual inspection of the trace does not confirm AF, and another type of supraventricular tachycardia cannot be ruled out.
- Trace (C) presents an example of sinus tachycardia, as determined by cardiologists through visual inspection of the ECG record. The heatmap highlights a sustained rhythm with very short RR intervals, which was initially misclassified as AF by the algorithm (pAF = 0.943). However, this classification was later corrected when the RR intervals were prolonged. This example suggests that the DNN output may be influenced by heart rate, especially in rhythms with very short RR intervals.
- Trace (D) illustrates an example of atrial flutter, diagnosed through visual inspection by cardiologists. The heatmap shows strong activation in the region of F-waves in the middle part of the recording, where the rhythm is characterized by a rapid heart rate and pronounced RR interval variability. This activation led to the misclassification of atrial flutter as AF, with a high AF probability (pAF = 0.999). This suggests that the DNN was trained using mixed AF and atrial flutter annotations, which may explain why the algorithm misclassified atrial flutter as AF.
4. Discussion
4.1. DNN Model Interpretability for AF Detection
4.2. Comparative Analysis of Transfer Learning DNNs for AF Detection
- Retrained EfficientNetV2B1 has a 96.3% test accuracy with 2.1% inter-patient and 0.3% inter-lead drops; detects AF in shorter episodes with an accuracy drop of 0.6% (20 s) and 15% (10 s); presents 97.4% long-term monitoring accuracy.
- Fine-tuned EfficientNetV2B1 has a 97.3% test accuracy with 1.5% inter-patient and 1.2% inter-lead drops, detects AF in shorter episodes with an accuracy drop of 0.4% (20 s) and 8% (10 s), and presents 98.2% long-term monitoring accuracy.
- Three fine-tuned DenseNet models (121, -169, -201) have a 97.2–97.6% test accuracy with 1.4–1.6% inter-patient and 0.5–0.9% inter-lead drops; detect AF in shorter episodes with accuracy drop of 0.2–0.5% (20 s) and 4–6% (10 s); and present 98.3–98.5% long-term monitoring accuracy.
4.3. Comparative Analysis with Published Studies
- ECG acquisition devices: The IRIDIA-AF dataset was recorded using Holter devices (200 Hz sampling rate, unspecified bandwidth), while ambulatory ECG recorders were used for the MIT-BIH-AF (250 Hz, 0.1–40 Hz) and MIT-BIH-Arrhythmia datasets (360 Hz, 0.1–100 Hz). The CinC Challenge 2017 dataset, in contrast, was collected using AliveCor ECG sensors (300 Hz, 0.5–40 Hz). Despite variations in sampling frequency and precision, the IRIDIA-AF, MIT-BIH-AF, and MIT-BIH-Arrhythmia datasets share similar acquisition conditions—recordings were taken from chest-mounted devices by healthcare professionals over extended periods (10–24 h) during daily activities. Conversely, the CinC Challenge 2017 dataset consists of short ECG recordings (9–61 s) captured via hand-held AliveCor sensors, with signals acoustically transmitted to a smartphone. This method is prone to acquisition errors, such as improper sensor placement, bad electrode contact, or inverted electrodes [97], potentially impacting AF detection accuracy as reported in [20].
- AF rhythm types: The IRIDIA-AF dataset explicitly includes only paroxysmal AF cases, excluding persistent and permanent AF. Similarly, the AF episodes in the MITBIH-AF dataset are described as “mostly paroxysmal”. However, the MIT-BIH Arrhythmia and CinC Challenge 2017 datasets do not provide specific details regarding AF types. The potential presence of multiple AF types (paroxysmal, persistent, and permanent) in these datasets could contribute to variability in detection performance and may have influenced the TPR reported in [20].
- Non-AF rhythm types: The IRIDIA-AF and MIT-BIH-AF datasets include ECG recordings from patients with at least one paroxysmal AF episode, without specific selections based on non-AF rhythm variety. In contrast, half of the recordings in the MITBIH-Arrhythmia dataset have been selected to include less common but clinically significant arrhythmias that would not be well-represented in a small random sample [96]. Similarly, the large patient cohort in the CinC Challenge 2017 dataset suggests a wide range of heart disease diagnoses, grouped under the annotation “Other rhythms”. The increased variability in non-AF rhythms within the MIT-BIH Arrhythmia and CinC Challenge 2017 datasets may contribute to the lower TNR observed in [19,20] compared to [18] and this study.
- Noise presence: The IRIDIA-AF dataset explicitly excludes ECG recordings with insufficient quality or excessive noise. We hypothesize that this quality-based selection process was based on observations from only segments of the Holter ECG recordings, as we observed significant noise consistently present throughout the entire 24–72 h duration of the records. In contrast, the CinC Challenge 2017 dataset includes a separate ECG class labeled “too noisy to be classified”. Therefore, the classification task is different, presenting one mixed class of AF and noise in this study versus two separate classes of AF and noise in [20]. Information on the presence of noise in MITBIH-AF and MITBIH-Arrhythmia datasets is not available, and relevant conclusions regarding noise impact cannot be derived.
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Van Gelder, I.C.; Rienstra, M.; Bunting, K.V.; Casado-Arroyo, R.; Caso, V.; Crijns, H.J.G.M.; De Potter, T.J.R.; Dwight, J.; Guasti, L.; Hanke, T.; et al. 2024 ESC Guidelines for the management of atrial fibrillation developed in collaboration with the European Association for Cardio-Thoracic Surgery (EACTS). Eur. Heart J. 2024, 45, 3314–3414. [Google Scholar] [CrossRef] [PubMed]
- Bedford, J.P.; Ferrando-Vivas, P.; Redfern, O.; Rajappan, K.; A Harrison, D.; Watkinson, P.J.; Doidge, J.C. New-onset atrial fibrillation in intensive care: Epidemiology and outcomes. Eur. Heart J. Acute Cardiovasc. Care 2022, 11, 620–628. [Google Scholar] [CrossRef]
- Mannina, C.; Jin, Z.; Matsumoto, K.; Ito, K.; Biviano, A.; Elkind, M.S.; Rundek, T.; Homma, S.; Sacco, R.L.; Di Tullio, M.R. Frequency of cardiac arrhythmias in older adults: Findings from the subclinical atrial fibrillation and risk of is chemic stroke (SAFARIS) study. Int. J. Cardiol. 2021, 337, 64–70. [Google Scholar] [CrossRef] [PubMed]
- Healey, J.S.; Connolly, S.J.; Gold, M.R.; Israel, C.W.; Van Gelder, I.C.; Capucci, A.; Lau, C.; Fain, E.; Yang, S.; Bailleul, C.; et al. Subclinical atrial fibrillation and the risk of stroke. N. Engl. J. Med. 2012, 366, 120–129. [Google Scholar] [CrossRef] [PubMed]
- Kaufman, E.S.; Israel, C.W.; Nair, G.M.; Armaganijan, L.; Divakaramenon, S.; Mairesse, G.H.; Brandes, A.; Crystal, E.; Costantini, O.; Sandhu, R.K.; et al. Positive predictive value of device-detected atrial high-rate episodes at different rates and durations: An analysis from ASSERT. Heart Rhythm. 2012, 9, 1241–1246. [Google Scholar] [CrossRef]
- Miyazawa, K.; Pastori, D.; Martin, D.T.; Choucair, W.K.; Halperin, J.L.; Lip, G.Y.H. Characteristics of patients with atrial high rate episodes detected by implanted defibrillator and resynchronization devices. Europace 2022, 24, 375–383. [Google Scholar] [CrossRef]
- Simantirakis, E.; Papakonstantinou, P.; Chlouverakis, G.; Kanoupakis, E.; Mavrakis, H.; Arkolaki, E.; Vardas, P. Asymptomatic versus symptomatic episodes in patients with paroxysmal atrial fibrillation via long-term monitoring with implantable loop recorders. Int. J. Cardiol. 2017, 231, 125–130. [Google Scholar] [CrossRef] [PubMed]
- Jekova, I.; Krasteva, V.; Leber, R.; Schmid, R.; Twerenbold, R.; Reichlin, T.; Müller, C.; Abächerli, R. A real-time quality monitoring system for optimal recording of 12-lead resting ECG. Biomed. Signal Process. Control 2017, 34, 126–133. [Google Scholar] [CrossRef]
- Vitolo, M.; Imberti, J.F.; Maisano, A.; Albini, A.; Bonini, N.; Valenti, A.C.; Malavasi, V.L.; Proietti, M.; Healey, J.S.; Lip, G.Y.; et al. Device-detected atrial high rate episodes and the risk of stroke/thromboembolism and atrial fibrillation incidence: A systematic review and meta-analysis. Eur. J. Intern. Med. 2021, 92, 100–106. [Google Scholar] [CrossRef]
- Schiller’s Medilog® Holter System. 2010. Available online: https://www.ntnu.edu/documents/221360533/1261844795/Medilog+AR-4.pdf/7fd6e572-ae78-4382-a86f-042e13d4b7c4 (accessed on 2 March 2024).
- Li, D.; Tian, F.; Rengifo, S.; Xu, G.; Wang, M.; Borjigin, J. Electrocardiomatrix: A new method for beat-by-beat visualization and inspection of cardiac signals. J. Integr. Cardiol. 2015, 1, 124–128. [Google Scholar] [CrossRef]
- Xu, G.; Dodaballapur, S.; Mihaylova, T.; Borjigin, J. Electrocardiomatrix facilitates qualitative identification of diminished heart rate variability in critically ill patients shortly before cardiac arrest. J. Electrocardiol. 2018, 51, 955–961. [Google Scholar] [CrossRef] [PubMed]
- Lee, V.; Xu, G.; Liu, V.; Farrehi, P.; Borjigin, J. Accurate detection of atrial fibrillation and atrial flutter using the electrocardiomatrix technique. J. Electrocardiol. 2018, 51, S121–S125. [Google Scholar] [CrossRef] [PubMed]
- Brown, D.L.; Xu, G.; Krzyske, A.M.B.; Buhay, N.C.; Blaha, M.; Wang, M.M.; Farrehi, P.; Borjigin, J. Electrocardiomatrix Facilitates Accurate Detection of Atrial Fibrillation in Stroke Patients. Stroke 2019, 50, 1676–1681. [Google Scholar] [CrossRef]
- Naydenov, S.; Jekova, I.; Krasteva, V. Recognition of Supraventricular Arrhythmias in Holter ECG Recordings by ECHOView Color Map: A Case Series Study. J. Cardiovasc. Dev. Dis. 2023, 10, 360. [Google Scholar] [CrossRef] [PubMed]
- Sharma, K.; Rao, B.M.; Marwaha, P.; Kumar, A. Accurate detection of congestive heart failure using electrocardiomatrix technique. Multimed. Tools Appl. 2022, 81, 30007–30023. [Google Scholar] [CrossRef]
- Rao, B.M.; Kumar, A. Congestive Heart Failure Detection Based on Electrocardiomatrix Method with ECG signal. Comput. Assist. Methods Eng. Sci. 2023, 30, 291–304. [Google Scholar] [CrossRef]
- Salinas-Martínez, R.; de Bie, J.; Marzocchi, N.; Sandberg, F. Automatic Detection of Atrial Fibrillation Using Electrocardiomatrix and Convolutional Neural Network. In Proceedings of the 2020 Computing in Cardiology, Rimini, Italy, 13–16 September 2020; Volume 47. [Google Scholar] [CrossRef]
- Salinas-Martínez, R.; de Bie, J.; Marzocchi, N.; Sandberg, F. Detection of Brief Episodes of Atrial Fibrillation Based on Electrocardiomatrix and Convolutional Neural Network. Front. Physiol. 2021, 12, 673819. [Google Scholar] [CrossRef]
- Lee, H.; Shin, M. Learning Explainable Time-Morphology Patterns for Automatic Arrhythmia Classification from Short Single-Lead ECGs. Sensors 2021, 21, 4331. [Google Scholar] [CrossRef]
- Fuster-Barceló, C.; Peris-Lopez, P.; Camara, C. ELEKTRA: ELEKTRokardiomatrix application to biometric identification with convolutional neural networks. Neurocomputing 2022, 506, 37–49. [Google Scholar] [CrossRef]
- Ismail, S.N.M.S.; Razak, S.F.A.; Aziz, N.A.A. ECG-based transfer learning for cardiovascular disease: A scoping review. Int. J. Cogn. Comput. Eng. 2025, 6, 280–297. [Google Scholar] [CrossRef]
- Russakovsky, O.; Deng, J.; Su, H.; Krause, J.; Satheesh, S.; Ma, S.; Huang, Z.; Karpathy, A.; Khosla, A.; Bernstein, M.; et al. ImageNet large scale visual recognition challenge. Int. J. Comput. Vision. 2015, 115, 211–252. [Google Scholar] [CrossRef]
- Peimankar, A.; Ebrahimi, A.; Wiil, U.K. xECG-Beats: An explainable deep transfer learning approach for ECG-based heartbeat classification. Netw. Model. Anal. Health Inform. Bioinform. 2024, 13, 45. [Google Scholar] [CrossRef]
- Zhang, W. 2D Transfer Learning for ECG Classification using Continuous Wavelet Transform. medRxiv 2024. [Google Scholar] [CrossRef]
- Stoyanov, T.; Krasteva, V.; Naydenov, S.; Schmid, R.; Jekova, I. Deep Transfer Learning for Detection of Atrial Fibrillation Using Holter ECG Color Maps. In Proceedings of the 2024 Computing in Cardiology Conference (CinC), Karlsruhe, Germany, 8–11 September 2024; Volume 51, pp. 1–4. [Google Scholar] [CrossRef]
- Akkuzu, N.; Ucan, M.; Kaya, M. Classification of Multi-Label Electrocardiograms Utilizing the EfficientNet CNN Model. In Proceedings of the 2023 4th International Conference on Data Analytics for Business and Industry (ICDABI), Bahrain, 25–26 October 2023; pp. 268–272. [Google Scholar] [CrossRef]
- Ghaffari, A.; Madani, N. Atrial fibrillation identification based on a deep transfer learning approach. Biomed. Phys. Eng. Express 2019, 5, 035015. [Google Scholar] [CrossRef]
- Khorsheed, M.S.; Karim, A.A. Analyzing the impact of transfer learning on explanation accuracy in deep learning-based ECG recognition systems. Open Eng. 2024, 14, 20240066. [Google Scholar] [CrossRef]
- Kolhar, M.; Al Rajeh, A.M. Deep learning hybrid model ECG classification using AlexNet and parallel dual branch fusion network model. Sci. Rep. 2024, 14, 26919. [Google Scholar] [CrossRef]
- Naaz, M.; Kumar, L.V.R.; Sai, Y.P. A new transfer learning approach to detect cardiac arrhythmia from ECG signals. Signal Image Video Process. 2022, 16, 1945–1953. [Google Scholar] [CrossRef]
- Sabeenian, R.S.; Sree Janani, K.K. Transfer learning-based electrocardiogram classification using wavelet scattered features. Biomed. Biotechnol. Res. J. 2023, 7, 52–59. [Google Scholar] [CrossRef]
- Rahuja, N.; Valluru, S.K. A Comparative Analysis of Deep Neural Network Models using Transfer Learning for Electrocardiogram Signal Classification. In Proceedings of the 2021 International Conference on Recent Trends on Electronics, Information, Communication & Technology (RTEICT), Bangalore, India, 27–28 August 2021; pp. 285–290. [Google Scholar] [CrossRef]
- Cao, M.; Zhao, T.; Li, Y.; Zhang, W.; Benharash, P.; Ramezani, R. ECG Heartbeat classification using deep transfer learning with Convolutional Neural Network and STFT technique. J. Phys. Conf. Ser. 2023, 2547, 012031. [Google Scholar] [CrossRef]
- Weimann, K.; Conrad, T.O.F. Transfer learning for ECG classification. Sci. Rep. 2021, 11, 5251. [Google Scholar] [CrossRef]
- Mavaddati, S. ECG arrhythmias classification based on deep learning methods and transfer learning technique. Biomed. Signal Process. Control 2025, 101, 107236. [Google Scholar] [CrossRef]
- Bassiouni, M.M.; Hegazy, I.; Rizk, N. Deep Learning Approach Based on Transfer Learning with Different Classifiers for ECG Diagnosis. Int. J. Intell. Comput. Inf. Sci. 2022, 22, 44–62. [Google Scholar] [CrossRef]
- Bortolan, G.; Christov, I.; Simova, I. Potential of Rule-Based Methods and Deep Learning Architectures for ECG Diagnostics. Diagnostics 2021, 11, 1678. [Google Scholar] [CrossRef] [PubMed]
- Bortolan, G. 3D ECG display with deep learning approach for identification of cardiac abnormalities from a variable number of leads. Physiol. Meas. 2023, 44, 024002. [Google Scholar] [CrossRef]
- Abdelazez, M.; Rajan, S.; Chan, A.D.C. Transfer Learning for Detection of Atrial Fibrillation in Deterministic Compressive Sensed ECG. In Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Montreal, QC, Canada, 20–24 July 2020. [Google Scholar] [CrossRef]
- Silveira, E.C. Automated atrial fibrillation recognition in 12-lead electrocardiographic records: A signal to image and transfer learning approach. Precis. Future Med. 2021, 5, 184–189. [Google Scholar] [CrossRef]
- Victor, J.O.; Chew, X.Y.; Khaw, K.W.; Lee, M.H. A Cost-Based Dual ConvNet-Attention Transfer Learning Model for ECG Heartbeat Classification. J. Inform. Web Eng. 2023, 2, 90–110. [Google Scholar] [CrossRef]
- Jang, J.-H.; Kim, T.Y.; Yoon, D. Effectiveness of Transfer Learning for Deep Learning-Based Electrocardiogram Analysis. Healthc. Inform. Res. 2021, 27, 19–28. [Google Scholar] [CrossRef] [PubMed]
- Cao, X.-C.; Yao, B.; Chen, B.-Q. Atrial Fibrillation Detection Using an improved Multi-scale Decomposition Enhanced Residual Convolutional Neural Network. IEEE Access 2019, 7, 89152–89161. [Google Scholar] [CrossRef]
- Shi, H.; Wang, H.; Qin, C.; Zhao, L.; Liu, C. An incremental learning system for atrial fibrillation detection based on transfer learning and active learning. Comput. Methods Programs Biomed. 2020, 187, 105219. [Google Scholar] [CrossRef]
- Ng, Y.; Liao, M.-T.; Chen, T.-L.; Lee, C.-K.; Chou, C.-Y.; Wang, W. Few-shot transfer learning for personalized atrial fibrillation detection using patient-based siamese network with single-lead ECG records. Artif. Intell. Med. 2023, 144, 102644. [Google Scholar] [CrossRef]
- Gong, Y.; Wei, L.; Yan, S.; Zuo, F.; Zhang, H.; Li, Y. Transfer learning based deep network for signal restoration and rhythm analysis during cardiopulmonary resuscitation using only the ECG waveform. Inf. Sci. 2023, 626, 754–772. [Google Scholar] [CrossRef]
- Ahsan, M.M.; Siddique, Z. Machine learning-based heart disease diagnosis: A systematic literature review. Artif. Intell. Med. 2022, 128, 102289. [Google Scholar] [CrossRef] [PubMed]
- Jahmunah, V.; Ng, E.Y.K.; Tan, R.S.; Oh, S.L.; Acharya, U.R. Explainable detection of myocardial infarction using deep learning models with Grad-CAM technique on ECG signals. Comput. Biol. Med. 2022, 146, 105550. [Google Scholar] [CrossRef]
- Taniguchi, H.; Takata, T.; Takechi, M.; Furukawa, A.; Iwasawa, J.; Kawamura, A.; Taniguchi, T.; Tamura, Y. Explainable Artificial Intelligence Model for Diagnosis of Atrial Fibrillation Using Holter Electrocardiogram Waveforms. Int. Heart J. 2021, 62, 534–539. [Google Scholar] [CrossRef] [PubMed]
- Ganeshkumar, M.; Vinayakumar, R.; Sowmya, V.; Gopalakrishnan, E.A.; Soman, K.P. Explainable Deep Learning-Based Approach for Multilabel Classification of Electrocardiogram. IEEE Trans. Eng. Manag. 2023, 70, 2787–2799. [Google Scholar] [CrossRef]
- Han, C.; Sun, J.; Bian, Y.; Que, W.; Shi, L. Automated detection and localization of myocardial infarction with interpretability analysis based on deep learning. IEEE Trans. Instrum. Meas. 2023, 72, 2508512. [Google Scholar] [CrossRef]
- Goettling, M.; Hammer, A.; Malberg, H.; Schmidt, M. xECGArch: A trustworthy deep learning architecture for interpretable ECG analysis considering short-term and long-term features. Sci. Rep. 2024, 14, 13122. [Google Scholar] [CrossRef]
- Suzuki, S.; Motogi, J.; Umemoto, T.; Hirota, N.; Nakai, H.; Matsuzawa, W.; Takayanagi, T.; Hyodo, A.; Satoh, K.; Arita, T.; et al. Lead-Specific Performance for Atrial Fibrillation Detection in Convolutional Neural Network Models Using Sinus Rhythm Electrocardiography. Circ. Rep. 2024, 6, 46–54. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Mahim, S.M.; Hossen, M.E.; Al Hasan, S.; Islam, M.K.; Iqbal, Z.; Alibakhshikenari, M. TransMixer-AF: Advanced Real-Time Detection of Atrial Fibrillation Utilizing Single-Lead Electrocardiogram Signals. IEEE Access 2024, 12, 143149–143162. [Google Scholar] [CrossRef]
- Akter, M.; Islam, N.; Ahad, A.; Chowdhury, M.A.; Apurba, F.F.; Khan, R. An Embedded System for Real-Time Atrial Fibrillation Diagnosis Using a Multimodal Approach to ECG Data. Eng 2024, 1, 2728–2751. [Google Scholar] [CrossRef]
- Sadeghi, A.; Hajati, F.; Rezaee, A.; Sadeghi, M.; Argha, A.; Alinejad-Rokny, H. 3DECG-Net: ECG fusion network for multi-label cardiac arrhythmia detection. Comput. Biol. Med. 2024, 182, 109126. [Google Scholar] [CrossRef] [PubMed]
- Ivaturi, P.; Gadaleta, M.; Pandey, A.C.; Pazzani, M.; Steinhubl, S.R.; Quer, G. A Comprehensive Explanation Framework for Biomedical Time Series Classification. IEEE J. Biomed. Health Inform. 2021, 25, 2398–2408. [Google Scholar] [CrossRef]
- Wickramasinghe, N.L.; Athif, M. Multi-label classification of reduced-lead ECGs using an interpretable deep convolutional neural network. Physiol. Meas. 2022, 42, 064002. [Google Scholar] [CrossRef] [PubMed]
- Jekova, I.; Christov, I.; Krasteva, V. Atrioventricular Synchronization for Detection of Atrial Fibrillation and Flutter in One to Twelve ECG Leads Using a Dense Neural Network Classifier. Sensors 2022, 22, 6071. [Google Scholar] [CrossRef] [PubMed]
- Krasteva, V.; Christov, I.; Naydenov, S.; Stoyanov, T.; Jekova, I. Application of dense neural networks for detection of atrial fibrillation and ranking of augmented ECG feature set. Sensors 2021, 21, 6848. [Google Scholar] [CrossRef]
- Mousavi, S.; Afghah, F.; Acharya, U.R. HAN-ECG: An interpretable atrial fibrillation detection model using hierarchical attention networks. Comput. Biol. Med. 2020, 127, 104057. [Google Scholar] [CrossRef] [PubMed]
- Li, Z.; Jin, Y.; Tian, Y.; Liu, J.; Wang, M.; Wei, X.; Zhao, L.; Liu, C. M-XAF: Medical explainable diagnosis system of atrial fibrillation based on medical knowledge and semantic representation fusion. Eng. Appl. Artif. Intell. 2024, 136, 108890. [Google Scholar] [CrossRef]
- Gilon, C.; Grégoire, J.M.; Mathieu, M.; Carlier, S.; Bersini, H. IRIDIA-AF, a large paroxysmal atrial fibrillation long-term electrocardiogram monitoring database. Sci. Data. 2023, 10, 714. [Google Scholar] [CrossRef]
- ANSI/AAMI EC38:2007; Medical Electrical Equipment—Part 2–47: Particular Requirements for the Safety, Including Essential Performance, or Ambulatory Electrocardiographic Systems. Association for the Advancement of Medical Instrumentation: Washington, DC, USA, 2007. Available online: https://webstore.ansi.org/standards/aami/ansiaamiec382007 (accessed on 9 March 2025).
- Schiller’s Medilog Holter System. 2025. Available online: https://www.schiller.ch/schiller_images_pdfs/devices/holter/2.500548_2025-01_en_medilog_holter-system_pp_lq.pdf (accessed on 5 March 2025).
- Keras Applications. 2025. Available online: https://keras.io/api/applications/ (accessed on 5 March 2025).
- Sandler, M.; Howard, A.; Zhu, M.; Zhmoginov, A.; Chen, L. MobileNetV2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 4510–4520. [Google Scholar] [CrossRef]
- Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. In Proceedings of the 3rd International Conference on Learning Representations (ICLR), San Diego, CA, USA, 7–9 May 2015; pp. 1–14. [Google Scholar] [CrossRef]
- Zoph, B.; Vasudevan, V.; Shlens, J.; Le, Q.V. Learning transferable architectures for scalable image recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 8697–8710. [Google Scholar] [CrossRef]
- Tan, M.; Le, Q.V. EfficientNet: Rethinking model scaling for convolutional neural networks. In Proceedings of the 36th International Conference on Machine Learning, ICML, Long Beach, CA, USA, 9–15 June 2019; pp. 6105–6114. Available online: http://proceedings.mlr.press/v97/tan19a.html (accessed on 9 March 2025).
- Tan, M.; Le, Q.V. EfficientNetV2: Smaller Models and Faster Training. arXiv 2021, arXiv:2104.00298v3. [Google Scholar] [CrossRef]
- Szegedy, C.; Vanhoucke, V.; Ioffe, S.; Shlens, J.; Wojna, Z. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 2818–2826. [Google Scholar] [CrossRef]
- Chollet, F. Xception: Deep Learning with Depthwise Separable Convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 1800–1807. [Google Scholar] [CrossRef]
- Huang, G.; Liu, Z.; Van Der Maaten, L.; Weinberger, K.Q. Densely Connected Convolutional Networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 2261–2269. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar] [CrossRef]
- Transfer Learning & Fine-Tuning. 2023. Available online: https://www.tensorflow.org/guide/keras/transfer_learning (accessed on 5 March 2025).
- Selvaraju, R.R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. Int. J. Comput. Vis. 2020, 128, 336–359. [Google Scholar] [CrossRef]
- Dawn, K. GradCAM—Enhancing Neural Network Interpretability in the Realm of Explainable AI. 2023. Available online: https://learnopencv.com/intro-to-gradcam/ (accessed on 27 February 2025).
- Ucan, M.; Kaya, B.; Kaya, M. Generating Medical Reports with a Novel Deep Learning Architecture. Int. J. Imaging Syst. Technol. 2025, 35, e70062. [Google Scholar] [CrossRef]
- Ucan, M.; Kaya, B.; Kaya, M.; Alhajj, R. Medical Report Generation from Medical Images Using Vision Transformer and Bart Deep Learning Architectures. Lect. Notes Comput. Sci. 2025, 15214, 257–267. [Google Scholar] [CrossRef]
- Svennberg, E.; Friberg, L.; Frykman, V.; Al-Khalili, F.; Engdahl, J.; Rosenqvist, M. Clinical outcomes in systematic screening for atrial fibrillation (STROKESTOP): A multicentre, parallel group, unmasked, randomised controlled trial. Lancet 2021, 398, 1498–1506. [Google Scholar] [CrossRef] [PubMed]
- De With, R.R.; Erküner, Ö.; Rienstra, M.; Nguyen, B.-O.; Körver, F.W.J.; Linz, D.; Ten, H.C.; Spronk, H.; A Kroon, A.; Maass, A.H.; et al. Temporal patterns and short-term progression of paroxysmal atrial fibrillation: Data from RACE V. Europace 2020, 22, 1162–1172. [Google Scholar] [CrossRef]
- Lubitz, S.A.; Yin, X.; Rienstra, M.; Schnabel, R.B.; Walkey, A.J.; Magnani, J.W.; Rahman, F.; McManus, D.D.; Tadros, T.M.; Levy, D.; et al. Long-term outcomes of secondary atrial fibrillation in the community: The Framingham Heart Study. Circulation 2015, 131, 1648–1655. [Google Scholar] [CrossRef] [PubMed]
- Sgreccia, D.; Manicardi, M.; Malavasi, V.L.; Vitolo, M.; Valenti, A.C.; Proietti, M.; Lip, G.Y.H.; Boriani, G. Comparing Outcomes in Asymptomatic and Symptomatic Atrial Fibrillation: A Systematic Review and Meta-Analysis of 81,462 Patients. J. Clin. Med. 2021, 10, 3979. [Google Scholar] [CrossRef] [PubMed]
- Kao, Y.-H.; Lin, C.-L. Enhancing Diabetic Retinopathy Detection Using Pixel Color Amplification and EfficientNetV2: A Novel Approach for Early Disease Identification. Electronics 2024, 13, 2070. [Google Scholar] [CrossRef]
- Xi, C.; Liu, R. Detection of Small Floating Target on Sea Surface Based on Gramian Angular Field and Improved EfficientNet. Remote Sens. 2022, 14, 4364. [Google Scholar] [CrossRef]
- Kanchana, K.; Kavitha, S.; Anoop, K.J.; Chinthamani, B. Enhancing Skin Cancer Classification Using Efficient Net B0-B7 through Convolutional Neural Networks and Transfer Learning with Patient-Specific Data. Asian Pac. J. Cancer Prev. 2024, 25, 1795–1802. [Google Scholar] [CrossRef]
- Sandhya Devi, R.S.; Vijay Kumar, V.R.; Sivakumar, P. EfficientNetV2 Model for Plant Disease Classification and Pest Recognition. Comput. Syst. Sci. Eng. CSSE 2023, 45, 2249–2263. [Google Scholar] [CrossRef]
- Ayub, N.; Tayyaba; Hussain, S.; Ullah, S.S.; Iqbal, J. An Efficient Optimized DenseNet Model for Aspect-Based Multi-Label Classification. Algorithms 2023, 16, 548. [Google Scholar] [CrossRef]
- Chauhan, T.; Palivela, H.; Tiwari, S. Optimization and fine-tuning of DenseNet model for classification of COVID-19 cases in medical imaging. Int. J. Inf. Manag. Data Insights 2021, 2021, 100020. [Google Scholar] [CrossRef]
- Tan, L.; Fu, Q.; Li, J. An Improved Neural Network Model Based on DenseNet for Fabric Texture Recognition. Sensors 2024, 24, 7758. [Google Scholar] [CrossRef] [PubMed]
- Ucan, S.; Ucan, M.; Kaya, M. Deep Learning Based Approach with EfficientNet and SE Block Attention Mechanism for Multiclass Alzheimer’s Disease Detection. In Proceedings of the 2023 4th International Conference on Data Analytics for Business and Industry (ICDABI), Bahrain, 25–26 October 2023; pp. 285–289. [Google Scholar] [CrossRef]
- Moody, G.B.; Mark, R.G. A new method for detecting atrial fibrillation using R-R intervals. Comput. Cardiol. 1983, 10, 227–230. [Google Scholar]
- Goldberger, A.L.; Amaral, L.A.; Glass, L.; Hausdorff, J.M.; Ivanov, P.C.; Mark, R.G.; Mietus, J.E.; Moody, G.B.; Peng, C.K.; Stanley, H.E. PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation 2000, 101, e215–e220. [Google Scholar] [CrossRef] [PubMed]
- Moody, G.B.; Mark, R.G. The impact of the MIT-BIH Arrhythmia Database. IEEE Eng. Med. Biol. 2001, 20, 45–50. [Google Scholar] [CrossRef] [PubMed]
- Clifford, G.D.; Liu, C.; Moody, B.; Lehman, L.H.; Silva, I.; Li, Q.; Johnson, A.E.; Mark, R.G. AF Classification from a Short Single Lead ECG Recording: The PhysioNet/Computing in Cardiology Challenge 2017. In Proceedings of the 2017 Computing in Cardiology (CinC), Rennes, France, 24–27 September 2017; Volume 44, pp. 1–4. [Google Scholar] [CrossRef]
- Artificial Intelligence and Machine Learning in Software as a Medical Device. 2025. Available online: https://www.fda.gov/medical-devices/software-medical-device-samd/artificial-intelligence-and-machine-learning-software-medical-device#regulation (accessed on 24 March 2025).
Training | Validation | Test | |
---|---|---|---|
Records | 000–071 | 000–071 | 072–166 |
Non-AF samples (Lead I/Lead II) | 3384/3384 | 1656/1656 | 2850/2850 |
AF samples (Lead I/Lead II) | 3384/3384 | 1656/1656 | 2850/2850 |
Total samples | 13,536 | 6624 | 11,400 |
Pretrained DNNs | Retrained DNNs | Fine-Tuned DNNs | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Name | Input D | Total Params (Depth) | Inference Time Ratio | Trainable Params | TPR (%) | TNR (%) | Acc (%) | AUROC | Trainable Params | TPR (%) | TNR (%) | Acc (%) | AUROC |
MobileNetV2 (α = 0.35) [68] | 224 | 0.41 M (53) | 1.00 | 1281 | 92.0 | 98.0 | 95.0 | 0.988 | 0.38 M | 93.7 | 98.9 | 96.3 | 0.990 |
MobileNetV2 (α = 1) [68] | 224 | 2.26 M (53) | 1.17 | 1281 | 91.2 | 97.6 | 94.4 | 0.986 | 2.19 M | 94.1 | 99.1 | 96.6 | 0.990 |
MobileNetV2 (α = 1.4) [68] | 224 | 4.37 M (53) | 1.33 | 1793 | 89.3 | 96.9 | 93.1 | 0.981 | 4.27 M | 95.4 # | 99.5 # | 97.4 # | 0.991 |
VGG16 [69] | 224 | 14.72 M (16) | 2.42 | 513 | 91.1 | 96.4 | 93.7 | 0.980 | 14.72 M | 95.1 # | 99.2 # | 97.1 # | 0.991 |
NASNetMobile [70] | 224 | 4.27 M (88) | 2.33 | 1057 | 88.2 | 97.5 | 92.9 | 0.980 | 4.20 M | 93.4 | 99.1 | 96.3 | 0.992 |
EfficientNetB0 [71] | 224 | 4.05 M (16) | 1.58 | 1281 | 91.6 | 98.2 | 94.9 | 0.989 | 3.97 M | 93.6 | 99.4 | 96.5 | 0.991 |
EfficientNetV2B0 [72] | 224 | 5.92 M (11) | 1.50 | 1281 | 92.2 | 98.5 | 95.3 | 0.991 | 5.78 M | 94.0 | 99.1 | 96.5 | 0.994 |
EfficientNetB1 [71] | 240 | 6.58 M (18) | 2.17 | 1281 | 92.1 | 98.4 | 95.3 | 0.991 | 6.45 M | 94.5 | 99.4 | 96.9 | 0.993 |
EfficientNetV2B1 [72] | 240 | 6.93 M (16) | 1.92 | 1281 | 93.9 * | 98.7 * | 96.3 * | 0.993 | 6.79 M | 95.3 # | 99.4 # | 97.3 # | 0.994 |
EfficientNetB2 [71] | 260 | 7.77 M (18) | 2.67 | 1409 | 91.7 | 98.3 | 95.0 | 0.988 | 7.63 M | 94.6 | 99.4 | 97.0 | 0.993 |
EfficientNetV2B2 [72] | 260 | 8.77 M (19) | 2.33 | 1409 | 93.3 | 98.3 | 95.8 | 0.993 | 8.61 M | 93.7 | 99.3 | 96.5 | 0.993 |
InceptionV3 [73] | 224 | 21.80 M (48) | 1.58 | 2049 | 89.8 | 96.8 | 93.3 | 0.981 | 21.75 M | 95.2 # | 99.4 # | 97.3 # | 0.992 |
Xception [74] | 224 | 20.86 M (71) | 1.92 | 2049 | 91.2 | 97.3 | 94.2 | 0.986 | 20.75 M | 94.4 | 99.0 | 96.7 | 0.988 |
DenseNet121 [75] | 224 | 7.04 M (121) | 1.92 | 1025 | 89.1 | 98.3 | 93.7 | 0.988 | 6.87 M | 95.6 # | 99.3 # | 97.5 # | 0.991 |
Densenet169 [75] | 224 | 12.64 M (169) | 2.33 | 1665 | 89.7 | 98.3 | 94.0 | 0.987 | 12.33 M | 95.1 # | 99.4 # | 97.2 # | 0.994 |
DenseNet201 [75] | 224 | 18.32 M (201) | 2.75 | 1921 | 91.6 | 98.7 | 95.1 | 0.991 | 17.87 M | 95.8 # | 99.5 # | 97.6 # | 0.989 |
ResNet50 [76] | 224 | 23.59 M (50) | 2.08 | 2049 | 92.1 | 98.4 | 95.2 | 0.990 | 23.48 M | 95.4 # | 99.3 # | 97.3 # | 0.992 |
ResNet50V2 [76] | 224 | 23.57 M (50) | 1.58 | 2049 | 89.8 | 97.6 | 93.7 | 0.986 | 23.48 M | 94.3 | 99.3 | 96.8 | 0.992 |
Study | DNNs (Model Parameters) | Input (Analysis Interval) | Database | TPR, % | TNR, % | Acc, % |
---|---|---|---|---|---|---|
This study | Retrained EfficientNetV2B1 (6.93 M) | ECHOView images (30 s) | IRIDIA-AF | 93.9 | 98.7 | 96.3 |
Fine-tuned EfficientNetV2B1 (6.93 M) | 95.3 | 99.4 | 97.3 | |||
Fine-tuned DenseNet-121 (7.04 M) | 95.6 | 99.3 | 97.5 | |||
Fine-tuned DenseNet-169 (12.64 M) | 95.1 | 99.4 | 97.2 | |||
Fine-tuned DenseNet-201 (18.32 M) | 95.8 | 99.5 | 97.6 | |||
Salinas-Martínez et al., 2020 [18] | CNN (11,257) | ECM images (10 beats + 3 s) | MITBIH-AF | 78.3 * | 91.2 * | 85.1 * |
Salinas-Martínez et al., 2021 [19] | CNN (10,217) | ECM images (10 beats + 3 s) | MITBIH-AF | 79.7 & | 92.4 & | 86.4 & |
MITBIH- Arrhythmia | 91.7 & | 77.1 & | 78.6 & | |||
Lee et al., 2021 [20] | CNN (2.01 M) | ECM images (1 min) | CinC Challenge 2017 | 80.5 # | 85.6 # | 85.1 # |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Krasteva, V.; Stoyanov, T.; Naydenov, S.; Schmid, R.; Jekova, I. Detection of Atrial Fibrillation in Holter ECG Recordings by ECHOView Images: A Deep Transfer Learning Study. Diagnostics 2025, 15, 865. https://doi.org/10.3390/diagnostics15070865
Krasteva V, Stoyanov T, Naydenov S, Schmid R, Jekova I. Detection of Atrial Fibrillation in Holter ECG Recordings by ECHOView Images: A Deep Transfer Learning Study. Diagnostics. 2025; 15(7):865. https://doi.org/10.3390/diagnostics15070865
Chicago/Turabian StyleKrasteva, Vessela, Todor Stoyanov, Stefan Naydenov, Ramun Schmid, and Irena Jekova. 2025. "Detection of Atrial Fibrillation in Holter ECG Recordings by ECHOView Images: A Deep Transfer Learning Study" Diagnostics 15, no. 7: 865. https://doi.org/10.3390/diagnostics15070865
APA StyleKrasteva, V., Stoyanov, T., Naydenov, S., Schmid, R., & Jekova, I. (2025). Detection of Atrial Fibrillation in Holter ECG Recordings by ECHOView Images: A Deep Transfer Learning Study. Diagnostics, 15(7), 865. https://doi.org/10.3390/diagnostics15070865