Next Article in Journal
Image-Guided Minimally Invasive Treatment Options for Degenerative Lumbar Spine Disease: A Practical Overview of Current Possibilities
Previous Article in Journal
Differential Diagnosis of OKC and SBC on Panoramic Radiographs: Leveraging Deep Learning Algorithms
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Novel Approach to Dual Feature Selection of Atrial Fibrillation Based on HC-MFS

1
Business School, University of Shanghai for Science and Technology, Shanghai 200093, China
2
Chongming Hospital, Shanghai University of Medicine & Health Sciences, Shanghai 202150, China
3
Collaborative Innovation Center for Biomedicine, Shanghai University of Medicine & Health Sciences, Shanghai 201318, China
4
Antai College of Economics and Management, Shanghai Jiao Tong University, Shanghai 200030, China
*
Author to whom correspondence should be addressed.
Diagnostics 2024, 14(11), 1145; https://doi.org/10.3390/diagnostics14111145
Submission received: 29 April 2024 / Revised: 23 May 2024 / Accepted: 24 May 2024 / Published: 30 May 2024
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)

Abstract

:
This investigation sought to discern the risk factors for atrial fibrillation within Shanghai’s Chongming District, analyzing data from 678 patients treated at a tertiary hospital in Chongming District, Shanghai, from 2020 to 2023, collecting information on season, C-reactive protein, hypertension, platelets, and other relevant indicators. The researchers introduced a novel dual feature-selection methodology, combining hierarchical clustering with Fisher scores (HC-MFS), to benchmark against four established methods. Through the training of five classification models on a designated dataset, the most effective model was chosen for method performance evaluation, with validation confirmed by test set scores. Impressively, the HC-MFS approach achieved the highest accuracy and the lowest root mean square error in the classification model, at 0.9118 and 0.2970, respectively. This provides a higher performance compared to existing methods, thanks to the combination and interaction of the two methods, which improves the quality of the feature subset. The research identified seasonal changes that were strongly associated with atrial fibrillation (pr = 0.31, FS = 0.11, and DCFS = 0.33, ranked first in terms of correlation); LDL cholesterol, total cholesterol, C-reactive protein, and platelet count, which are associated with inflammatory response and coronary heart disease, also indirectly contribute to atrial fibrillation and are risk factors for AF. Conclusively, this study advocates that machine-learning models can significantly aid clinicians in diagnosing individuals predisposed to atrial fibrillation, which shows a strong correlation with both pathological and climatic elements, especially seasonal variations, in the Chongming District.

1. Introduction

Atrial fibrillation (AF) is the third most common cardiovascular disease after hyper-tension and coronary heart disease. Although AF is not immediately life-threatening, it can lead to embolic complications and stroke, making it a significant predisposing factor for cerebrovascular events. According to the Global Burden of Disease Study 2019, there are more than 38 million cases of AF worldwide, and the disability-adjusted life years due to AF is 121.62 per 100,000 people in our country [1]. This puts a huge strain on the global health system. In 2021, statistical studies revealed significant differences in the prevalence of AF up to 12-fold across geographic regions [2], with North America, Europe, Southeast Asia, and China being particularly affected. In China, as a region with rapid population growth, the burden of AF will increase [3]. The variation in AF prevalence is due to a variety of factors, such as age, diet, and other diseases, making AF an important challenge in the field of smart healthcare. However, machine learning has shown great promise in improving the prediction of diseases, including atrial fibrillation [4]. It has been proven to be effective in processing large-scale healthcare data and discovering underlying disease patterns, risk factors, and trends, thus contributing to improved rates of early diagnosis of disease, personalized treatment, and patient management [5].
AF is a cardiac arrhythmia that has a significant impact on global health. According to the Global Burden of Disease study [1], AF is a leading contributor to 369 diseases and injuries worldwide, based on data from 2019. This systematic analysis has robustly substantiated the significance of AF in the context of the Global Burden of Disease study, emphasizing its impact on global health. Following this, a multinational cohort study involving 153,152 middle-aged individuals [2] revealed significant variations in the global prevalence, treatment approaches, and health implications of AF. The study disclosed that individuals with AF demonstrated an annual stroke incidence of 1.1% and a mortality rate of 2.6%, which were five-fold and two-fold as high as those in the healthy population, respectively. These elevated rates of morbidity and mortality associated with AF, coupled with the current lack of high-quality therapies, have positioned AF as a focal point for clinical research.
Scholars are currently investigating the risk factors associated with AF. Hyperuricemia has been identified as an independent risk factor for AF, raising clinical concerns for patients with hyperuricemia [6]. The relationship between alcohol and AF has also garnered significant attention. Voskoboinik et al. [7] explored this correlation, shedding light on the potential impact of alcohol consumption on AF. Furthermore, in a study examining the connection between epicardial fat distribution and left heart size [8], Nattel S [9] emphasized the influence of cardiac anatomy on AF, contributing to a deeper understanding of its pathogenesis. Earlier research has revealed associations between AF and climatic factors, such as air pollution [10,11,12] and temperature [13]. In a large Danish population-based study, Murphy N F et al. [14] observed a significant seasonal correlation between temperature and the incidence of AF, particularly a higher occurrence during colder seasons [15], in hospitalized patients with acute ischemic stroke. Their data are compelling and strongly support the association between temperature and AF, opening a new avenue for investigating the role of climatic factors in AF development. However, it is important to note that existing studies predominantly focus on the correlation between specific factors and AF, such as total cholesterol and lipoproteins, which are strongly associated with cardiovascular disease, including atrial fibrillation (AF) [16,17]. With the emergence of the big data era, the extraction of key insights from vast amounts of high-dimensional data is crucial for disease prediction. In the context of AF, a notable research gap exists. Therefore, this paper aims to concentrate on the influence of biochemical indicators, such as platelets and uric acid, on AF, while considering the climatic characteristics of the Chongming District. The goal is to extract key features from high-dimensional data [18], providing a fresh perspective for a more in-depth understanding of AF pathogenesis. Additionally, this paper aims to conduct a comprehensive examination of the relationships between these factors.
In recent years, substantial strides have been achieved in the realm of machine learning applied to the medical domain, particularly in its pivotal role in disease prediction [19]. A notable area of focus in the current research on AF revolves around two key aspects: feature selection and AF prediction. Within the realm of feature selection, researchers predominantly concentrate on extracting features from electrocardiogram signals and intracardiac electrical tracings [20,21]. However, there exists a notable gap in the extraction of key features from high-dimensional biochemical metrics. While the filter remains a widely used feature-selection method in this field, it tends to overlook the intricate interrelationships between features. Optimized feature-selection methods, such as hierarchical clustering feature selection [22], mutual information and hierarchical clustering [23], mutual information feature selection, and max-relevance and min-redundancy, among others, offer an improvement by computing the correlation between each feature and the target variable, as well as the correlation among features. These methods comprehensively consider the balance between these correlations. Nevertheless, these approaches typically follow a static greedy search mechanism, where the selected features remain unchanged or are not later adjusted. This limitation hampers their ability to find a subset of features with optimal performance. To address this shortfall, Hancer E et al. introduced MIRFFS [24], which utilizes forward and backward search techniques to identify the optimal feature subset. Building upon this concept, this paper innovatively proposes a dual feature-selection method based on hierarchical clustering and mean Fisher scores. In this approach, features are classified into relevant and irrelevant classes through hierarchical clustering. The Fisher scores are then employed to compute intra- and inter-class distances, determining the importance of features [25]. The combination of these two methods filters a high-performance subset of features. In the experiments aimed at finding the optimal feature subset, forward search and backward search were introduced to mitigate the performance loss caused by the static greedy mechanisms in previous studies. Additionally, data preprocessing includes the standardization of datasets, as well as handling missing values and outliers. Comparative experiments were conducted on four datasets with or without normalization and outlier processing, using five classification models (K-nearest Neighbor (KNN), Random Forest (RF), Support Vector Machine (SVM), Naive Bayes (NB), and the Logistic Regression Model (LR)) to observe the impact of normalization and outlier processing on prediction performance in order to identify the best-performing datasets and feature subsets. The experimental results demonstrate that the HC-MFS enhances the classification performance of the model, revealing a strong correlation between AF and platelets and season.
The structure of this paper is as follows: Section 2 presents the sources of the experimental data and outlines the experimental methods; Section 3 describes the execution of the experiments and the acquisition of the results; Section 4 discusses the findings and offers pertinent recommendations; and Section 5 provides a summary and conclusion.

2. Data Sources and Methods

2.1. Data Sources

This paper presents a retrospective study utilizing the AF dataset acquired from January 2020 to September 2023 at the Cardiovascular Department of a tertiary hospital in Chongming District, Shanghai. Furthermore, climatic data for the Chongming District was gathered from both the Shanghai Meteorological Bureau” http://sh.cma.gov.cn (accessed on 14 September 2023)” and the Shanghai Bureau of Ecology and Environment “https://sthj.sh.gov.cn (accessed on 14 September 2023)”. A comprehensive set of 20 features was meticulously selected by recommendations from coronary heart disease experts at a tertiary hospital in Chongming District and insights derived from prior research. Notably, the dataset exhibited missing values, prompting the removal of entries where more than 5 feature values were absent for the same ID. Consequently, 678 records were extracted, encompassing 339 patients diagnosed with AF and an equal number of patients without AF. To enhance the dataset’s quality, various preprocessing steps were implemented, including handling missing values, identifying and addressing outliers, and standardization. Subsequent phases of this study involved comparative experiments to evaluate the classification performance of the dataset with and without standardization and outlier treatments.

2.2. Statistical Analysis

The data were analyzed using Python 3.9, the measurement information was expressed as ± s, the counting information was expressed as the number of cases (%), comparisons between the groups were made using the independent samples t-test or the χ2 test, and the difference was considered statistically significant at p < 0.05. CO, minimum temperature, CRP, platelets, platelet pressure product, LDL, HDL, and TC were lower in the atrial fibrillation group than in the non-atrial fibrillation group (control); age, PM10, NO2, erythrocyte distribution width, erythrocyte pressure product, platelet distribution width, large platelet ratio, mean platelet volume, and uric acid were higher than those in the control group; and the differences were all statistically significant (p < 0.05), as shown in Table 1.

2.3. Dual Feature-Selection Method Based on Hierarchical Clustering and Mean Fisher Score HC-MFS

The inclusion of weak or irrelevant features within disease data often compromises the performance of disease prediction models. Traditional feature-selection methods rigidly lock in selected features, hindering the identification of an optimal subset for improved model performance. To address these challenges, we present the HC-MFS method. The HC-MFS method employs hierarchical clustering to categorize features into classes, distinguishing irrelevant ones and reducing the similarity within each class. Simultaneously, Fisher scoring evaluates feature importance by computing intra- and inter-class distances, thereby selecting features most relevant to the target. The combination of these two methods, coupled with forward and backward search strategies, results in the identification of a feature subset with superior performance. The HC-MFS process is shown in Figure 1. The method unfolds in the following steps:
Step 1. Ward Clustering: Apply Ward clustering to the original feature sequence F = {f1, f2, …, fn}.
Step 2. Fisher Score Calculation: Compute Fisher scores (FS) for features, rank them, and determine each category’s mean Fisher score (MFS).
Step 3. Initial Feature Subset Selection: Include features with FS higher than MFS in the initial feature subset FFSS, emphasizing superior class differentiation ability.
Step 4. Iterative Refinement: Based on FFSS, iteratively enhance model accuracy by adding features through forward and backward searches until all features are considered.
Step 5. Accuracy Comparison: Compare the accuracy of the forward search (F) and backward search (R), selecting the subset with higher accuracy. In the case of a tie, prioritize the subset with fewer features.

3. Empirical Analysis

3.1. Parameter Settings

In this paper, we conducted experiments using Python 3.9 to compare HC-MFS with FS, Relief_F (R_F), mutual information (MI), and the Distance Correlation Factor (DCFS) on the same dataset using the KNN, RF, SVM, NB, and LR models. We aimed to find the optimal combinations of the model’s parameters using lattice scavenging and 5-fold cross-validation. The experimental results show that datasets with missing-value processing and standardization (IMV + SS) reach optimal performance more frequently, indicating that most datasets benefit from standardization treatment in improving model performance. At the same time, some models require outlier treatment (OR) to further optimize performance. However, the use of mean substitution for outliers in this paper was found not to achieve higher performance in RF models. Complex methods such as median substitution, truncation, and distribution-based methods can be considered to better preserve the original data characteristics. Abbreviations in the text are explained in Table 2. The specific parameter settings of the model are shown in Table 3, and Further details experiments are shown in Table A1 in Appendix A.

3.2. Clustering Results

The results of Ward’s method of clustering are presented in Figure 2 The names of the features are displayed at the lower end of the clustering dendrogram. The results indicate that it is more appropriate to classify the features into three categories. The first major class is dominated by the seasonal and cholesterol classes, the second class is dominated by the age and red blood cell classes, and platelets are a separate class. The increased gap between classes allows subsequent feature selection to reduce the correlation between features and improve the quality of the feature subset. The separate category of platelets, on the other hand, provides a new direction for our study, which is analyzed in detail in the subsequent Section 4.1.3.

3.3. Model Evaluation and Comparison

This paper investigates the correlation between all features and atrial fibrillation using different methods and evaluates their performance using a classification model. The training set achieved optimal performance in the backward search of the RF model by the HC-MFS method, with the corresponding feature subset containing 12 features. To maintain the same feature dimensions, we ranked the features according to the FS, R_F, MI, and DCFS methods, and selected the top 12 features as the feature subset for comparison experiments with HC-MFS. The mean correlation and initial feature dimensions of HC-MFS are given in Table 4, and the final feature subsets of the five methods are given in Table 5. The experimental results show that all five methods achieve optimal performance in the RF model, with accuracies of 0.7868, 0.7574, 0.7206, 0.7574, and 0.7647, respectively. HC-MFS performs better, as demonstrated in Table 6. The accuracy of HC-MFS was improved by up to 9.19%, and the root mean square error was reduced by up to 9.69%, compared with the existing methods, details are shown in Table 7. These results indicate that the HC-MFS method plays a significant role in enhancing the performance of the model for predicting atrial fibrillation diseases. Moreover, a recent study by Andrea Bernardini et al. [26] on the prediction of atrial fibrillation based on machine learning showed that an AUC of 0.7790 ± 0.0160 was obtained, and our experiment is superior to this result, which is attributed to the introduction of hierarchical clustering to reduce feature redundancy.

4. Discussion and Suggestions

4.1. Feature Analysis

4.1.1. Season Correlation Analysis

A significant association between season and AF was observed, with the DCFS value reaching 0.3298, the FS being 0.1043, and the Relief_F score 180 being 0.0603, all of which were at the top of the characteristic correlations. It is worth emphasizing that the Chongming District is located in the subtropical region, and its winter and summer last for more than 230 days, with an average annual number of rainy days as high as 93 to 183 days, and the annual rainfall is on an increasing trend, which is nearly twice as much as the national average annual rainfall. This unique climatic feature leads to elevated air humidity, which may aggravate the burden on the respiratory and circulatory systems, providing a potential physiological explanation for the onset of AF. In addition, the continuous decrease in the average annual minimum temperature in the Chongming District over the past three years, from 15 degrees to 13 degrees Celsius, has been more pronounced, especially in the spring and winter of 2022. This decreasing temperature trend may be associated with the increasing incidence of AF, providing a substantial observational basis for a further in-depth investigation of the relationship between climatic factors and AF. Therefore, we suggest that the unique climatic conditions and the decreasing temperature trend in the Chongming District may be important influences on the incidence of AF.

4.1.2. Serum Total Cholesterol (TC) and Low-Density Lipoprotein Correlation Analysis

Metabolic syndrome is widely recognized as an independent risk factor for AF, which mainly encompasses hypertension, diabetes mellitus, and dyslipidemia [27,28]. A 5-year follow-up study [29] demonstrated that subjects with incident AF were typically older, had higher rates of obesity, and more severe alcohol abuse, and their baseline BMI, TC, and glucose levels were higher compared with those without AF. In particular, the high variability in TC levels resulted in a 7.5% increased risk of AF. The reasons for this were analyzed: First, high levels of TC and LDL are strongly associated with the development of atherosclerosis. Hardened arteries lead to the obstruction of blood flow and increase the burden on the heart. Second, high levels of TC and LDL may trigger inflammatory responses and oxidative stress, and these inflammatory processes may adversely affect cardiac electrophysiology and atrial tissue. Again, high levels of TC and LDL may lead to myocardial fibrosis, abnormal proliferation, and the hardening of cardiac muscle tissue, which affects electrical signaling in the atria. In addition, high cholesterol may lead to alterations in platelet aggregation and the clotting process, increasing the risk of blood clots, which can travel through the bloodstream to the atria. Finally, persistently high levels of cholesterol and LDL can lead to the enlargement of the atria and heart failure. These pathological conditions increase the pressure and volume load on the left atrium, leading to atrial fibrosis and electrical conduction abnormalities, thereby increasing the risk of atrial fibrillation.

4.1.3. Platelet Count Correlation Analysis

The experimental results presented a striking finding of a positive correlation between platelets and AF, a conclusion that provides an answer to a question that previous studies have failed to clarify. Specifically, the mutual information values were 0.296 for P-LCR and AF and 0.2627 for PLT, which were at the forefront in terms of characteristic correlation. We used PLT as a sole indicator to predict AF. The experimental results showed that the accuracy of predicting AF using only platelet count was over 60%, with a recall rate of 64.29%. This indicates that platelet count has a certain role in predicting AF. Although the predictive capability of PLT as a single indicator is limited, its performance still demonstrates some clinical value. By combining PLT with other biomarkers and clinical characteristics, the accuracy and reliability of AF prediction could be further improved. The reasons for this were analyzed: First, patients with AF are usually accompanied by AF and irregular heartbeats, which can increase the risk of platelet involvement in thrombosis. AF may lead to the retention of blood in the atria and the formation of blood clots, which directly increases the risk of embolism. Second, platelets play a key role in the pathophysiological process of inflammation and vascular endothelial injury, which is one of the key factors in the development of AF. In this process, platelets contribute to the inflammatory response and repair of endothelial damage through adhesion and the release of inflammatory mediators, thereby exacerbating the progression of cardiovascular disease. In addition, abnormal platelet activity and function may lead to the formation of microthrombi in blood vessels. The presence of these microthrombi may cause microcirculatory disorders in the atria, which in turn promote the development of AF. The above analysis provides a more comprehensive understanding of the association between platelets and AF and provides useful clues for an in-depth exploration of the pathogenesis of AF.

4.1.4. C-Reactive Protein (CRP) Correlation Analysis

Our study found that CRP was positively correlated with AF, with a high mutual information value of 0.5809, which ranked first in the ranking of characteristics, a result consistent with previous studies. The pathogenesis of AF is complex and involves the interaction of inflammatory responses and atrial fibrosis. These processes lead to the electrical and structural remodeling of the atria [30]. There is growing evidence that chronic inflammation can alter the electrophysiology of the atria, driving myocardial fibrosis and structural changes that affect the electrophysiological properties of the heart and may increase susceptibility to AF. In addition, elevated CRP levels are associated with thrombosis and vascular endothelial dysfunction. CRP can directly impair endothelial function, leading to a pro-thrombotic state and increasing the risk of thromboembolic events in patients with AF. Endothelial dysfunction also contributes to atrial remodeling and AF progression. In AF, irregular atrial contractions lead to blood retention in the atria, increasing the risk of thrombosis. High CRP levels may indirectly increase the likelihood of thrombosis by causing vascular endothelial damage and inflammatory responses.

4.2. Suggestions

4.2.1. Suggestions to Patients

To reduce the risk of AF, patients are advised to follow these guidelines:
(1) Stay warm in cold weather by wearing adequate clothing to reduce the risk of hypothermia and minimize outdoor activity.
(2) Adopt a heart-healthy diet that includes reducing saturated fat and cholesterol intake and increasing the intake of fruits and vegetables, grains, fish, and healthy fats (e.g., nuts), as well as quitting smoking and limiting alcohol intake.
(3) Exercise for at least 150 min of moderate-intensity aerobic exercise per week, such as brisk walking and cycling.
(4) Undergo regular health checkups and cardiac screening for early detection and the management of risk factors if you are in a group at high risk of AF.

4.2.2. Suggestions for Hospitals

To improve the prevention and treatment of AF, the following guidelines are recommended:
(1) Provide health promotion education on the potential risks to heart health during the cold season, as well as preventive measures.
(2) Increase medical resources by ensuring that hospitals have adequate equipment and staff to meet patient needs during the winter months and that they have a dedicated cardiac emergency room.
(3) Prevent the spread of infection by providing influenza vaccines, improving hand hygiene, and taking isolation measures during the winter months when influenza and other respiratory infections are more likely to spread.
(4) Regularly screen patients for CRP, TC, PLT, and other biochemical indicators, especially during physical exams and outpatient clinics, to focus on high-risk groups.
(5) Establish a multidisciplinary team of cardiovascular specialists, internists, dietitians, and psychologists to actively research and innovate to continuously improve the prevention and treatment of AF. Enhance cooperation and coordination with other healthcare providers, insurance companies, government agencies, and non-profit organizations to provide more comprehensive AF management and care.

5. Conclusions

In this study, we developed a novel dual feature-selection method for atrial fibrillation (AF) by combining hierarchical clustering and Fisher score methods, termed HC-MFS. We used patient data from a tertiary hospital in Chongming District, Shanghai, and climate data from the Chongming District provided by the Shanghai Meteorological Bureau and the Shanghai Bureau of Ecology and Environment. We validated HC-MFS with empirical examples and compared its performance with established methods such as feature selection, Relief_F, mutual information, and DCFS. Our results show that HC-MFS outperforms these methods, achieving the lowest root mean square error. Key features identified include season, total cholesterol, low-density lipoprotein, platelet count, and C-reactive protein. This study offers valuable insights for disease prediction and serves as a reference for physicians during diagnosis.

6. Limitations

Several important limitations of this study must be emphasized. The amount of data obtained for this study was limited, and there were some limitations in model training. Moreover, hospital inpatient data were used for all data; outpatient diagnoses were not included in our analysis and may have been underestimated. Echocardiographic, biological, and imaging parameters were also not available. In other words, to improve the model quality, there is a need for more data. We also encourage other researchers to explore these directions for AF development.

Author Contributions

Conceptualization and formal analysis, H.L. and L.L.; writing—discussion of the original draft, L.L. and H.X.; data curation, L.L. and Z.L.; methodology, H.X. and L.L.; redrafting and editing, H.L., H.X. and Z.L.; discussion of reviewer(s) comments and giving important suggestions, C.F., Z.L., L.F. and H.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the Shanghai University of Medicine & Health Sciences Local High-level University Construction Project (Grant No. 22MC2022001), Three-Year Action Plan for Strengthening the Construction of Public Health System in Shanghai (2023-2025) of Key discipline construction project (Grant No. GWVI-11.1-49), and Three-Year Action Program of the Shanghai Municipality for Strengthening the Construction of the Public Health System (Grant No. GWVI-6).

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Ethics Committee of Chongming Hospital, Shanghai University of Medicine & Health Sciences (Protocol code CMEC-2023-KT-05; Approval Date: 24 February 2023).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Due to the sensitive nature of the data used in this study, which include personal and confidential patient information, the datasets are not publicly available. Data sharing is restricted to safeguard the privacy and confidentiality of the participants as per applicable laws and regulations. Detailed data access policies and requests for collaboration may be directed to the corresponding author, subject to approval by the relevant ethics and data protection authorities.

Acknowledgments

The authors would like to acknowledge the support from all the funding agencies that made this research possible. Special thanks to the administrative and technical staff at a tertiary hospital in Chongming District, Shanghai, for their invaluable support throughout the project.

Conflicts of Interest

We, the authors of this manuscript titled “A novel Approach to Dual Feature Selection of AF Based on HC-MFS”, declare that we have no conflicts of interest that could potentially bias the results or interpretation of the research presented in this paper. A conflict of interest exists when an author’s financial, personal, or professional affiliations or relationships could influence their judgment, objectivity, or the integrity of the research. We affirm that there are no financial relationships or connections with any organization or individual that could be perceived as a conflict of interest regarding this work. Furthermore, we confirm that we have no personal relationships or conflicts of interest with any individuals, such as colleagues or competitors, that would impact our ability to conduct this research objectively and report the findings impartially.

Appendix A

Table A1. Performance Metrics by Model and Dataset.
Table A1. Performance Metrics by Model and Dataset.
MethodsModelDatasetsAccuracyPrecisionRecallF1AUC
HC_MFSKNNIMV0.59410.58400.64810.61380.6095
IMV+SS0.69380.66470.78150.71710.7316
IMV+OR0.59410.58430.64810.61340.6060
IMV+OR+SS0.69740.68220.74070.70760.7323
RFIMV0.76190.73440.81800.77310.8298
IMV+SS0.76190.73440.81800.77310.8299
IMV+OR0.72700.71630.74740.73100.8055
IMV+OR+SS0.72700.71630.74740.73100.8052
SVMIMV0.71030.70150.71820.70720.7699
IMV+SS0.69730.68580.73330.70510.7687
IMV+OR0.68990.67990.72980.70350.7248
IMV+OR+SS0.69380.68230.73820.70790.7444
NBIMV0.70110.69380.72290.70650.7534
IMV+SS0.72520.70160.79050.74300.7516
IMV+OR0.68630.68370.69340.68750.7367
IMV+OR+SS0.69000.67950.71110.69460.7427
LRIMV0.68080.67300.70000.68520.7369
IMV+SS0.67160.66870.67780.67110.7288
IMV+OR0.66790.66950.66670.66530.7191
IMV+OR+SS0.66790.65990.69260.67470.7320
FSKNNIMV0.57200.57650.52590.54960.5729
IMV+SS0.67710.66160.72220.68970.7214
IMV+OR0.57210.56730.59630.57950.5934
IMV+OR+SS0.66970.65280.72590.68700.7034
RFIMV0.74540.74050.75120.74550.8198
IMV+SS0.74540.74050.75120.74550.8198
IMV+OR0.74370.73100.76970.74910.8103
IMV+OR+SS0.74370.73100.76970.74910.8103
SVMIMV0.66240.65270.69140.67020.7264
IMV+SS0.68060.67100.73250.70010.7392
IMV+OR0.66590.66020.66280.66080.7207
IMV+OR+SS0.66050.65050.67040.65850.7394
NBIMV0.68820.66990.74810.70570.7433
IMV+SS0.67540.65870.74110.69670.7424
IMV+OR0.66600.66930.68500.67600.7348
IMV+OR+SS0.64030.64440.67420.65600.7185
LRIMV0.67530.67060.68520.67700.7348
IMV+SS0.67530.66900.68890.67780.7358
IMV+OR0.66050.66000.65930.65910.7204
IMV+OR+SS0.65130.65160.64810.64890.7230
R_FKNNIMV0.58690.57920.62960.60300.6066
IMV+SS0.67150.64510.77410.70220.7032
IMV+OR0.58480.57280.64810.60770.6047
IMV+OR+SS0.65310.62350.77410.68880.6888
RFIMV0.70300.68320.75100.71490.7849
IMV+SS0.70300.67850.76580.71920.7920
IMV+OR0.69750.67240.76580.71560.7847
IMV+OR+SS0.69750.67240.76580.71560.7839
SVMIMV0.67330.66910.68520.67670.7209
IMV+SS0.66440.64930.66400.65420.7171
IMV+OR0.66040.67330.67410.67260.7081
IMV+OR+SS0.67710.67390.69620.68410.7209
NBIMV0.69730.67750.76210.71620.7354
IMV+SS0.69740.66670.76450.71120.7413
IMV+OR0.68820.67820.70540.69080.7426
IMV+OR+SS0.68980.68560.71480.69830.7413
LRIMV0.68080.68170.68520.68180.7319
IMV+SS0.67900.68120.68150.67950.7330
IMV+OR0.66240.66200.66670.66210.7248
IMV+OR+SS0.66410.66160.67780.66880.7154
MIKNNIMV0.56650.55720.64440.59720.5948
IMV+SS0.69180.67170.74810.70690.7291
IMV+OR0.58490.57970.60740.59290.5926
IMV+OR+SS0.67150.65630.72220.68660.7174
RFIMV0.74360.72950.76970.74840.8226
IMV+SS0.74360.72950.76970.74840.8228
IMV+OR0.72880.73310.71410.72260.8138
IMV+OR+SS0.73060.69840.80680.74840.8203
SVMIMV0.70480.68440.72600.70430.7608
IMV+SS0.64760.63650.67630.65550.7131
IMV+OR0.68270.67770.72120.69870.7444
IMV+OR+SS0.64750.63970.67780.65660.7086
NBIMV0.69350.67970.75080.71220.7360
IMV+SS0.67340.64060.73550.68470.7320
IMV+OR0.67520.69760.66180.67500.7423
IMV+OR+SS0.66240.66060.65940.65840.7219
LRIMV0.65870.65650.67410.66390.7048
IMV+SS0.66610.66880.66300.66440.7296
IMV+OR0.64390.64310.64810.64430.6991
IMV+OR+SS0.64570.64230.65560.64810.6981
DCFSKNNIMV0.57210.56100.64440.59910.6100
IMV+SS0.68630.66300.75930.70680.7374
IMV+OR0.57200.56550.60740.58510.5749
IMV+OR+SS0.67710.66210.72220.69040.7247
RFIMV0.72330.71660.73260.72380.8093
IMV+SS0.71960.71290.72890.71990.8074
IMV+OR0.72880.71830.74740.73190.8085
IMV+OR+SS0.72880.71830.74740.73190.8087
SVMIMV0.66430.65790.69630.67600.7286
IMV+SS0.67160.65730.70170.67690.7455
IMV+OR0.66790.65020.67070.65940.7246
IMV+OR+SS0.67530.66990.68270.67570.7172
NBIMV0.69360.67380.76650.71650.7473
IMV+SS0.70480.68300.76360.72050.7484
IMV+OR0.68820.69350.69090.69170.7490
IMV+OR+SS0.66230.66440.66650.66520.7403
LRIMV0.67900.67720.68520.67960.7373
IMV+SS0.68080.67680.68890.68170.7376
IMV+OR0.65500.65770.64810.65110.7088
IMV+OR+SS0.65680.65410.66300.65780.7226

References

  1. Vos, T.; Lim, S.S.; Abbafati, C.; Abbas, K.M.; Abbasi, M.; Abbasifard, M.; Abbasi-Kangevari, M.; Abbastabar, H.; Abd-Allah, F.; Abdelalim, A.; et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: A systematic analysis for the Global Burden of Disease Study 2019. Lancet 2020, 396, 1204–1222. [Google Scholar] [CrossRef]
  2. Joseph, P.G.; Healey, J.S.; Raina, P.; Connolly, S.J.; Ibrahim, Q.; Gupta, R.; Avezum, A.; Dans, A.L.; Lopez-Jaramillo, P.; Yeates, K.; et al. Global variations in the prevalence, treatment, and impact of atrial fibrillation in a multinational cohort of 153 152 middle-aged individuals. Cardiovasc. Res. 2021, 117, 1523–1531. [Google Scholar] [CrossRef] [PubMed]
  3. Chen, Y.; Gue, Y.; Calvert, P.; Gupta, D.; McDowell, G.; Azariah, J.L.; Namboodiri, N.; Bucci, T.; Jabir, A.; Tse, H.F.; et al. Predicting stroke in Asian patients with atrial fibrillation using machine learning: A report from the KERALA-AF registry, with external validation in the APHRS-AF registry. Curr. Probl. Cardiol. 2024, 49, 102456. [Google Scholar] [CrossRef] [PubMed]
  4. Tseng, A.S.; Noseworthy, P.A. Prediction of Atrial Fibrillation Using Machine Learning: A Review. Front. Physiol. 2021, 12, 752317. [Google Scholar] [CrossRef] [PubMed]
  5. Al Snousy, M.B.; El-Deeb, H.M.; Badran, K.; Al Khlil, I.A. Suite of decision tree-based classification algorithms on cancer gene expression data. Egypt. Inform. J. 2011, 12, 73–82. [Google Scholar] [CrossRef]
  6. Kuwabara, M.; Niwa, K.; Nishihara, S.; Nishi, Y.; Takahashi, O.; Kario, K.; Yamamoto, K.; Yamashita, T.; Hisatome, I. Hyperuricemia is an independent competing risk factor for atrial fibrillation. Int. J. Cardiol. 2017, 231, 137–142. [Google Scholar] [CrossRef] [PubMed]
  7. Voskoboinik, A.; Prabhu, S.; Ling, L.H.; Kalman, J.M.; Kistler, P.M. Alcohol and atrial fibrillation: A sobering review. J. Am. Coll. Cardiol. 2016, 68, 2567–2576. [Google Scholar] [CrossRef] [PubMed]
  8. Iacobellis, G.; Ribaudo, M.C.; Zappaterreno, A.; Iannucci, C.V.; Leonetti, F. Relation between epicardial adipose tissue and left ventricular mass. Am. J. Cardiol. 2004, 94, 1084–1087. [Google Scholar] [CrossRef] [PubMed]
  9. Nattel, S. New ideas about atrial fibrillation 50 years on. Nature 2002, 415, 219–226. [Google Scholar] [CrossRef]
  10. Erkapic, D.; Aleksic, M.; Roussopoulos, K.; Weipert, K.F.; Sözener, K.; Kostev, K.; Allendörfer, J.; Rosenbauer, J.; Guenduez, D.; Tanislav, C. Microembolizations in the Arterial Cerebral Circulation in Patients with Atrial Fibrillation Ablation Using the Cryoballoon Technique—Protocol and Methodology of a Prospective Observational Study. Diagnostics 2023, 13, 1660. [Google Scholar] [CrossRef]
  11. Bennett, M.; Nault, I.; Koehle, M.; Wilton, S. Air Pollution and Arrhythmias. Can. J. Cardiol. 2023, 39, 1253–1262. [Google Scholar] [CrossRef] [PubMed]
  12. Kim, I.S.; Yang, P.S.; Lee, J.; Yu, H.T.; Kim, T.H.; Uhm, J.S.; Pak, H.N.; Lee, M.H.; Joung, B. Long-term exposure of fine particulate matter air pollution and incident atrial fibrillation in the general population: A nationwide cohort study. Int. J. Cardiol. 2019, 283, 178–183. [Google Scholar] [CrossRef] [PubMed]
  13. Dahlquist, M.; Frykman, V.; Kemp-Gudmunsdottir, K.; Svennberg, E.; Wellenius, G.A.; Ljungman, P.L.S. Short-term associations between ambient air pollution and acute atrial fibrillation episodes. Environ. Int. 2020, 141, 105765. [Google Scholar] [CrossRef]
  14. Murphy, N.F.; Stewart, S.; MacIntyre, K.; Capewell, S.; McMurray, J.J. Seasonal variation in morbidity and mortality related to atrial fibrillation. Int. J. Cardiol. 2004, 97, 283–288. [Google Scholar] [CrossRef] [PubMed]
  15. Mottillo, S.; Filion, K.B.; Genest, J.; Joseph, L.; Pilote, L.; Poirier, P.; Rinfret, S.; Schiffrin, E.L.; Eisenberg, M.J. The Metabolic Syndrome and Cardiovascular Risk: A Systematic Review and Meta-Analysis. J. Am. Coll. Cardiol. 2010, 56, 1113–1132. [Google Scholar] [CrossRef] [PubMed]
  16. Watanabe, H.; Tanabe, N.; Watanabe, T.; Darbar, D.; Roden, D.M.; Sasaki, S.; Aizawa, Y. Metabolic syndrome and risk of development of atrial fibrillation: The Niigata Preventive Medicine Study. Circulation 2008, 117, 1255–1260. [Google Scholar] [CrossRef] [PubMed]
  17. Beam, A.L.; Motsinger-Reif, A.; Doyle, J. Bayesian neural networks for detecting epistasis in genetic association studies. BMC Bioinform. 2014, 15, 368. [Google Scholar] [CrossRef] [PubMed]
  18. Altamirano-Flores, J.S.; Alvarado-Hernández, L.; Cuevas-Tello, J.C.; Tino, P.; Guerra-Palomares, S.E.; Garcia-Sepulveda, C.A. Identification of Clinically Relevant HIV Vif Protein Motif Mutations through Machine Learning and Undersampling. Cells 2023, 12, 772. [Google Scholar] [CrossRef]
  19. Schilling, C.; Keller, M.; Scherr, D.; Oesterlein, T.; Haïssaguerre, M.; Schmitt, C.; Dössel, O.; Luik, A. Fuzzy decision tree to classify complex fractionated atrial electrograms. Biomed. Eng./Biomed. Tech. 2015, 60, 245–255. [Google Scholar] [CrossRef]
  20. Duque, S.; Orozco-Duque, A.; Kremen, V.; Novak, D.; Tobón, C.; Bustamante, J. Feature subset selection and classification of intracardiac electrograms during atrial fibrillation. Biomed. Signal Process. Control 2017, 38, 182–190. [Google Scholar] [CrossRef]
  21. Rivera-Caravaca, J.M.; Roldan, V.; Vicente, V.; Lip, G.Y.H.; Marin, F. Particulate Matter and Temperature: Increased Risk of Adverse Clinical Outcomes in Patients with Atrial Fibrillation. Mayo Clin. Proc. 2020, 95, 2360–2369. [Google Scholar] [CrossRef] [PubMed]
  22. Tayyib, N.; Coyer, F.; Lewis, P.A. A Two-Arm Cluster Randomized Control Trial to Determine the Effectiveness of a Pressure Ulcer Prevention Bundle for Critically Ill Patients. J. Nurs. Sch. 2015, 47, 237–247. [Google Scholar] [CrossRef]
  23. Dutta, D.; Dutta, P.; Sil, J. Simultaneous feature selection and clustering with mixed features by multi objective genetic algorithm. Int. J. Hybrid Intell. Syst. 2014, 11, 41–54. [Google Scholar] [CrossRef]
  24. Hancer, E.; Xue, B.; Zhang, M. Differential evolution for filter feature selection based on information theory and feature ranking. Knowl.-Based Syst. 2018, 140, 103–119. [Google Scholar] [CrossRef]
  25. Gu, Q.; Li, Z.; Han, J. Generalized fisher score for feature selection. arXiv 2012, arXiv:1202.3725. [Google Scholar]
  26. Bernardini, A.; Bindini, L.; Antonucci, E.; Berteotti, M.; Giusti, B.; Testa, S.; Palareti, G.; Poli, D.; Frasconi, P.; Marcucci, R. Machine learning approach for prediction of outcomes in anticoagulated patients with atrial fibrillation. Int. J. Cardiol. 2024, 407, 132088. [Google Scholar] [CrossRef] [PubMed]
  27. Hindricks, G.; Potpara, T.; Dagres, N.; Arbelo, E.; Bax, J.J.; Blomström-Lundqvist, C.; Boriani, G.; Castella, M.; Dan, G.A.; Dilaveris, P.E.; et al. 2020 ESC Guidelines for the diagnosis and management of atrial fibrillation developed in collaboration with the European Association for Cardio-Thoracic Surgery (EACTS) The Task Force for the diagnosis and management of atrial fibrillation of the European Society of Cardiology (ESC) Developed with the special contribution of the European Heart Rhythm Association (EHRA) of the ESC. Eur. Heart J. 2021, 42, 373–498. [Google Scholar] [CrossRef] [PubMed]
  28. Alonso, A.; Yin, X.; Roetker, N.S.; Magnani, J.W.; Kronmal, R.A.; Ellinor, P.T.; Chen, L.Y.; Lubitz, S.A.; McClelland, R.L.; McManus, D.D.; et al. Blood Lipids and the Incidence of Atrial Fibrillation: The Multi-Ethnic Study of Atherosclerosis and the Framingham Heart Study. J. Am. Heart Assoc. 2014, 3, e001211. [Google Scholar] [CrossRef] [PubMed]
  29. Lee, S.R.; Choi, E.K.; Han, K.D.; Lee, S.H.; Oh, S. Effect of the variability of blood pressure, glucose level, total cholesterol level, and body mass index on the risk of atrial fibrillation in a healthy population. Heart Rhythm 2020, 17, 12–19. [Google Scholar] [CrossRef]
  30. Pauklin, P.; Zilmer, M.; Eha, J.; Tootsi, K.; Kals, M.; Kampus, P. Markers of Inflammation, Oxidative Stress, and Fibrosis in Patients with Atrial Fibrillation. Oxidative Med. Cell. Longev. 2022, 2022, 4556671. [Google Scholar] [CrossRef]
Figure 1. Flowchart of HC-MFS method.
Figure 1. Flowchart of HC-MFS method.
Diagnostics 14 01145 g001
Figure 2. Hierarchical clustering results. Because platelets (PLT) are a unique category and quite special, they are highlighted in blue. The orange color has no special significance.
Figure 2. Hierarchical clustering results. Because platelets (PLT) are a unique category and quite special, they are highlighted in blue. The orange color has no special significance.
Diagnostics 14 01145 g002
Table 1. Basic overview of AF and statistical analysis.
Table 1. Basic overview of AF and statistical analysis.
FeaturesNon-AF Group (n = 339)AF Group (n = 339)t/χ2p-Value
Age/years76.53 ± 11.1679.48 ± 9.45−3.717<0.001
CO/ppm14.94 ± 4.4413.82 ± 4.763.1870.002
Minimum Temperature/°C13.87 ± 9.4511.78 ± 9.492.8740.004
Seasons 36.091<0.001
Spring/n68 (20.1%)112 (33.0%)
Summer/n65 (19.2%)99 (29.2%)
Autumn/n95 (28.0%)61 (18.0%)
Winter/n111 (32.7%)67 (19.8%)
CRP/(mg·L−1)27.74 ± 55.009.52 ± 12.032.9500.003
Platelets/(L−1)194.91 ± 76.04169.49 ± 56.804.168<0.001
Platelet Distribution Width/(%)12.43 ± 2.3713.31 ± 2.88−4.328<0.001
Large Platelet Ratio/(%)29.90 ± 8.3331.86 ± 8.11−3.709<0.001
Platelet Crit/(%)0.21 ± 0.080.19 ± 0.693.1210.0019
Mean Platelet Volume/(fL)10.63 ± 1.0910.91 ± 1.11−4.009<0.001
LDL/(mmol·L−1)2.38 ± 0.992.04 ± 0.794.600<0.001
Uric Acid/(μmol·L−1)0.34 ± 0.130.38 ± 0.13−5.352<0.001
TC/(mmol·L−1)4.38 ± 1.263.90 ± 1.034.534<0.001
PM10/(μg/m3)31.97 ± 16.7534.99 ± 17.35−2.3060.021
NO2/(ppbv)22.14 ± 12.5424.24 ± 14.69−2.0020.046
Diabetes/n 2.4460.015
Diabetes/n129 (38.1%)99 (29.2%)
Not Diabetes/nic210 (61.9%)240 (70.8%)
Hypertension/n −2.6640.024
Hypertension/n236 (69.6%)262 (77.3%)
Not Hypertension/n103 (30.4%)77 (22.7%)
Erythrocyte Distribution Width/(fL)45.13 ± 6.1246.14 ± 6.46−2.0720.039
Erythrocyte Pressure/(L/L)37.38 ± 6.9238.37 ± 6.30−1.9590.049
High-Density Lipoprotein (HDL)1.61 ± 0.651.51 ± 0.462.1680.031
Table 2. Abbreviation explanation.
Table 2. Abbreviation explanation.
AbbreviationExplanation
HC-MFSA novel dual feature-selection methodology, combining hierarchical clustering with Fisher scores
FSFisher score feature-selection method
R_FRelief_F feature-selection method
MIMutual information feature-selection method
DCFSDistance Correlation Factor feature-selection method
KNNA classification model: K-nearest Neighbor
RFA classification model: Random Forest
SVMA classification model: Support Vector Machine
NBA classification model: Naive Bayes
LRA classification model: the Logistic Regression Model
IMVOnly missing values are processed on the dataset.
IMV+SSThe dataset is processed for missing values and standardized.
IMV+ORThe dataset is processed for missing values and outliers.
IMV+OR+SSThe dataset is processed for missing values, outlier values, and standardization.
Table 3. The parameter settings for the five categorical prediction models.
Table 3. The parameter settings for the five categorical prediction models.
MethodsModelParameterParameter ValueDatasets
HC-MFSKNNn_neighbors19IMV+OR+SS
p1
weightsdistance
RFmax_depth16IMV+SS
random_state2
n_estimators300
SVMC1IMV
kernellinear
NB/GaussianNBIMV+SS
LRsolverliblinearIMV
penaltyL1
C10
FSKNNn_neighbors30IMV+SS
p/
weightsuniform
RFmax_depth5IMV+SS
random_state2
n_estimators300
SVMC1IMV+SS
kernellinear
NB/GaussianNBIMV
LRsolverliblinearIMV+SS
penaltyL2
C1
R_FKNNn_neighbors18IMV+SS
p1
weightsdistance
RFmax_depth3IMV+SS
random_state1
n_estimators100
SVMC1IMV+OR+SS
kernellinear
NB/GaussianNBIMV+OR+SS
LRsolverliblinearIMV
penaltyL1
C10
MIKNNn_neighbors20IMV+SS
p1
weightsdistance
RFmax_depth19IMV+SS
random_state1
n_estimators200
SVMC1IMV
kernellinear
NB/GaussianNBIMV
LRsolverliblinearIMV+SS
penaltyL2
C0.1
DCFSKNNn_neighbors28IMV+SS
p/
weightsuniform
RFmax_depth4IMV+OR+SS
random_state2
n_estimators100
SVMC1IMV+SS
kernellinear
NB/GaussianNBIMV+SS
LRsolverliblinearIMV+SS
penaltyL2
C1
Table 4. Initial feature subset selection for HC-MFS.
Table 4. Initial feature subset selection for HC-MFS.
MethodClassificationMeanInitial Feature Subsets
HC-MFS10.02608
20.0111
30.0257
Table 5. Feature subsets.
Table 5. Feature subsets.
MethodFeature Subsets
HC-MFS{Season, uric acid, low-density lipoprotein, total cholesterol, platelet distribution width, platelets, mean platelet volume, age, large platelet ratio, C-reactive protein, high-density lipoprotein, and erythrocyte distribution width}
FS{Season, uric acid, LDL, total cholesterol, platelet distribution width, platelets, mean platelet volume, age, large platelet ratio, CO, platelet pressure, and C-reactive protein}
R_F{Season, hypertension, large platelet ratio, diabetes mellitus, high-density lipoprotein, erythrocyte pressure product, C-reactive protein, total cholesterol, low-density lipoprotein, platelet pressure product, platelets, and age}
MI{C-reactive protein, total cholesterol, LDL, large platelet ratio, platelets, HDL, erythrocyte pressure volume, erythrocyte distribution width, season, platelet distribution width, uric acid, and PM10}
DCFS{Season, uric acid, total cholesterol, LDL, platelet distribution width, age, platelets, mean platelet volume, large platelet ratio, CO, HDL, and C-reactive protein}
Table 6. Model results.
Table 6. Model results.
MethodAccuracyPrecisionRecallF1AUCDatasets
HC-MFS0.78680.78870.80000.79430.7964IMV+SS
FS0.75740.74030.81830.77550.7556IMV+SS
R_F0.72060.71620.75710.73610.7195IMV+SS
MI0.75740.74670.80000.77240.7561IMV+SS
DCFS0.76470.76390.78570.77460.7641IMV+OR+SS
Table 7. Model errors.
Table 7. Model errors.
MethodRMSERstd
HC-MFS0.47740.54370.4769
FS0.49260.51560.4899
R_F0.52860.44050.5278
MI0.49260.51460.4912
DCFS0.48510.52880.4848
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, H.; Lu, L.; Xiong, H.; Fan, C.; Fan, L.; Lin, Z.; Zhang, H. A Novel Approach to Dual Feature Selection of Atrial Fibrillation Based on HC-MFS. Diagnostics 2024, 14, 1145. https://doi.org/10.3390/diagnostics14111145

AMA Style

Liu H, Lu L, Xiong H, Fan C, Fan L, Lin Z, Zhang H. A Novel Approach to Dual Feature Selection of Atrial Fibrillation Based on HC-MFS. Diagnostics. 2024; 14(11):1145. https://doi.org/10.3390/diagnostics14111145

Chicago/Turabian Style

Liu, Hong, Lifeng Lu, Honglin Xiong, Chongjun Fan, Lumin Fan, Ziqian Lin, and Hongliu Zhang. 2024. "A Novel Approach to Dual Feature Selection of Atrial Fibrillation Based on HC-MFS" Diagnostics 14, no. 11: 1145. https://doi.org/10.3390/diagnostics14111145

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop