Next Article in Journal
A Byzantine Sensing Network Based on Majority-Consensus Data Aggregation Mechanism
Previous Article in Journal
Experimental Implementation and Performance Evaluation of an IoT Access Gateway for the Modbus Extension
Previous Article in Special Issue
Estimation of the Respiratory Rate from Localised ECG at Different Auscultation Sites
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Expert Hypertension Detection System Featuring Pulse Plethysmograph Signals and Hybrid Feature Selection and Reduction Scheme

1
Department of Electronics Engineering, University of Engineering and Technology Taxila, Taxila 47050, Pakistan
2
Department of Electrical and Computer Engineering, COMSATS University Islamabad, Wah Campus, Wah Cantonment, Islamabad 45550, Pakistan
3
Department of Computer and Software Engineering, College of Electrical and Mechanical Engineering, National University of Sciences and Technology, Islamabad 44000, Pakistan
4
Department of Computer Science and Engineering, Soonchunhyang University, Asan 31538, Korea
5
Department of Computer Science, HITEC University, Taxila 47080, Pakistan
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Sensors 2021, 21(1), 247; https://doi.org/10.3390/s21010247
Submission received: 1 December 2020 / Revised: 22 December 2020 / Accepted: 24 December 2020 / Published: 2 January 2021
(This article belongs to the Special Issue Signal Processing Using Non-invasive Physiological Sensors)

Abstract

:
Hypertension is an antecedent to cardiac disorders. According to the World Health Organization (WHO), the number of people affected with hypertension will reach around 1.56 billion by 2025. Early detection of hypertension is imperative to prevent the complications caused by cardiac abnormalities. Hypertension usually possesses no apparent detectable symptoms; hence, the control rate is significantly low. Computer-aided diagnosis based on machine learning and signal analysis has recently been applied to identify biomarkers for the accurate prediction of hypertension. This research proposes a new expert hypertension detection system (EHDS) from pulse plethysmograph (PuPG) signals for the categorization of normal and hypertension. The PuPG signal data set, including rich information of cardiac activity, was acquired from healthy and hypertensive subjects. The raw PuPG signals were preprocessed through empirical mode decomposition (EMD) by decomposing a signal into its constituent components. A combination of multi-domain features was extracted from the preprocessed PuPG signal. The features exhibiting high discriminative characteristics were selected and reduced through a proposed hybrid feature selection and reduction (HFSR) scheme. Selected features were subjected to various classification methods in a comparative fashion in which the best performance of 99.4% accuracy, 99.6% sensitivity, and 99.2% specificity was achieved through weighted k-nearest neighbor (KNN-W). The performance of the proposed EHDS was thoroughly assessed by tenfold cross-validation. The proposed EHDS achieved better detection performance in comparison to other electrocardiogram (ECG) and photoplethysmograph (PPG)-based methods.

Graphical Abstract

1. Introduction

Hypertension, also known as high blood pressure, is one of the most common risk factor for cardiovascular disease (CVD) [1]. It is a very common condition in which a large amount of force from the blood pushes on the walls of the arteries leading towards heart diseases [2]. The main risk factors for hypertension include age, genetics, gender, lack of physical activity, bad diet practices, high cholesterol, excessive salt consumption, less intake of vegetables and fruit, smoking, obesity, family history, and other diseases such as kidney disease or diabetes [3]. According to the World Health Organization (WHO) statistics, 1.13 million of the world population suffers from hypertension, and more men are affected than women. One out of every four men suffers from high blood pressure issues [3]. It is a silent killer that affects the most significant tissues of the human body [4]. Indeed, many people are not aware they have hypertension [5]. In the US, an estimated 13 million people are unaware of their condition [6], while in China, 59% of people with hypertension are unaware of their condition [5]. In Pakistan, 18% of the adults are affected by hypertension, and 33% of the adults above the age of 45 were affected according to the National Health Survey Pakistan [7]. Prevalence rates of hypertension based on genetic and ethnic variations ranges from about 29% for Asians, 45% for black men, and around 46.3% for women [8].
Table 1 describes a blood pressure ranges of normal and hypertension in terms of systolic and diastolic pressures. Some of the common symptoms of hypertension include headaches, dizziness, migraine, lightheadedness, changes in vision, or fainting episodes [9]. Hypertension serves as the first step towards CVDs, but the most chronic effect of unchecked hypertension is stroke, which can lead to permanent paralysis of certain body parts. Prolonged and undetected hypertension can be fatal; therefore, its detection in the preliminary stages is crucial.
Moreover, the world is currently suffering from the outbreak of a pandemic COVID-19 caused by the coronavirus SARS-CoV-2. It was reported that there are some specific comorbidities associated with a high risk of infection and increased severity of lung injury. Most of the common comorbidities in COVID-19 patients are hypertension (30%), cardiovascular disease (8%), and diabetes (19%) [10]. Therefore, it is not entirely surprising that the COVID-19 patients experiencing worst complications are hypertensive since hypertension is most frequent in older people and these elderlies are particularly at risk of being infected by a coronavirus [11]. Given the above information and statistics, it is clear that we need a technique for recognizing hypertension as early as possible to avoid significant damage to one’s body.
Various techniques including physiological signals such as electrocardiogram (ECG) and photoplethymograph (PPG) are currently being used to detect hypertension. Detection of hypertension from PPG signals (MIMIC database) using continuous wavelet transform (CWT) and the GoogLeNet deep learning model [12] achieved an F1 score of 92.55%. This work relies on a deep learning model so it requires high processing power, large scale data sets, and more training time. The authors of [13] proposed a method based on pulse arrival time (PAT) features extracted from PPG and ECG signals. The k-nearest neighbor (KNN) classification method was employed to predict hypertension with an F1 score of 94.84%. The research achieved acceptable results but missing consideration of preprocessing the PPG signals as PPG suffers from motion artifacts and variation in light intensity. Identification of hypertension [14] from heart rate variability (HRV) signals yielded an accuracy of 85.47% using standard deviation of all NN intervals and multiple instance learning (MIL). However, HRV feature extraction for long-term data requires significant processing resources. A model [15] to detect hypertension obtained 93.33% accuracy using Savitzky–Golay filtering (SGF), entropy features extracted from ECG, and a support vector machine (SVM) classifier. The method achieved a considerable performance on a comparatively small data set consisting of 48 participants.
In [16], the authors proposed a hypertension detection framework based on five principal components extracted from HRV signals to achieve the highest accuracy of 85.5% with quadratic discriminant analysis (QDA). Rajput et al. [17] proposed a scheme to identify the low and high risk of hypertension. The scheme yielded classification accuracy 100% using optimal orthogonal wavelet filter back (OWFB), log, and fractal dimension features extracted from ECG. Despite promising results, this work suffers from a data imbalance problem. The authors in [18] proposed a method to detect ECG hypertensive signals using empirical mode decomposition (EMD) for preprocessing of the signals, yielding an accuracy of 97.7% through the KNN classifier. The extracted features were selected physically, making this process laborious. The method was only trained on a small data set. A system to detect hypertension using morphological descriptors derived from PPG with 92.31% accuracy is discussed in [19]. Identification of hypertension patients from ballistocardiograms (BCG) is presented in [20]. The system achieved a mean accuracy of 84.4% using class association rules (CAR) classifier and morphological features. The BCG signals were collected from patients lying on a smart mattress which has a limited availability.
Medical devices in hospitals can easily get affected by electromagnetic interference (EMI) in a complex electromagnetic environment [21,22]. ECG signals are usually affected by the EMI and preconditioning circuits. Changes in temperature and ambient lighting conditions impact the PPG signal acquisition. PPG signal acquisition is exposed to motion artifacts as well [23]. The frequency of the PPG signal is about 1–3 Hz [24], so it also requires a high order filter for signal denoising. The motivation behind this research was to investigate the feasibility of a new signal modality, i.e., pulse plethysmograph (PuPG). In contrast to PPG that uses light to detect the volume of blood flow in the finger, the PuPG senses the pressure changes in blood flow.

Main Contributions

In this research, novel PuPG signals were used to design the hypertension detection system. The PuPG signal includes considerable cardiac health characteristic information [25,26,27]. The PuPG signals are recently being used for emotions classification [28] and biometric systems [29] as well. The main contributions of this work are listed as follows:
  • This is the first study that used PuPG-based signals for the detection of hypertension.
  • To accurately detect the hypertension pattern, we extract a large number of multi-domain features from preprocessed PuPG signals through discrete wavelet transform (DWT) and EMD.
  • To reduce the feature dimensions and redundancy while improving the discriminative power of features, we proposed a hybrid feature selection and reduction (HFSR) scheme.
  • The proposed expert hypertension detection system (EHDS) comprises preprocessing through EMD, followed by the feature extraction, kernel principal component analysis (KPCA), and weighted k-nearest neighbor (KNN-W) classifier, achieved an accuracy of 99.4%, sensitivity of 99.6%, and specificity of 99.4%.
The rest of this paper is structured as follows: Section 2 gives details about the materials used in this study. Section 3 describes the details about the methods. Next, we present the results in Section 4 and Section 5 discuss the proposed method and its comparative analysis. Section 6 concludes this research paper.

2. Materials

2.1. Data Acquisition

In this study, a portable pulse plethysmograph (PuPG) sensor PTN-104 (NISensors, iWorx Systems Inc., Dover, UK) in combination with NI myDAQ (National Instruments Corporation, Austin, TX, USA) was used for PuPG data acquisition. PTN-104 sensor is attached to the index finger of the subject to convert pulse pressure into an electrical voltage signal. The real-time integral of its output signal produces the same volume pulse signal as the expensive infrared PPG sensor. The PTN-104 is a rugged non-magnetic accelerometer, which is made up of piezoelectric material. IX-myDAQ (National Instruments Corporation, Austin, TX, USA) is a breakout board used for connecting the PTN-104 sensor and myDAQ (National Instruments Corporation, Austin, TX, USA) for data acquisition via mini DIN7 port. NI myDAQ is a low-cost data acquisition tool that converts analog signals to digital format and allows the users to analyze real-time data in NI LabVIEW software (National Instruments Corporation, Austin, TX, USA) on PC. The sampling frequency was set to be 1 kHz for PuPG data acquisition.
It is very essential to highlight the difference between PPG and PuPG signals and sensors. Both of them operate on completely different principles with different input parameters. Table 2 presents a comparison between various properties of both sensors such as input parameters, working principles, and the impact of noise on a signal acquisition. Figure 1 illustrates the output signals acquired from both sensors. It was observed that the PuPG signal carries more information as compared to the PPG signal since multiple frequencies contribute towards the dicrotic notch for PuPG.

2.2. Data Set Description

Raw PuPG signals were acquired from the subjects. The data acquisition was carried out for developing a two-class data bank; one was hypertension and the other normal. A total of 700 signals were collected from hypertension subjects and 709 signals from normal subjects, with a timestamp of 10 s per signal and a sampling frequency of 1000 Hz. Subjects were advised to keep calm and remain static during data acquisition activity. Informed consent was obtained from all participants included in the research. Recording activity was performed between breakfast and lunch time. None of the involved subjects were smokers or diabetic. Table 3 shows the details of the subjects and the acquired data for this study. Figure 2 shows a comparison of raw PuPG signals collected from a normal subject and a subject suffering from hypertension. Sometimes acquired signals (both normal and hypertension) were affected by the circuit noise. The noise/power line distortion incurred due the embedded electronics of data acquisition setup can be seen as a sinusoidal oscillatory component (50 Hz) in the normal PuPG Hat of Figure 2.

3. Methods

3.1. Design of the Study

The proposed methodology in this research adopts the machine learning paradigm shown in Figure 3. It consists of four main stages, namely (i) preprocessing; (ii) feature extraction; (iii) hybrid feature selection and reduction, and (iv) classification. These stages are separated through a dotted line in Figure 3. Each step is elaborated in detail in forthcoming sections. This research adopts a comparative approach between two pattern analysis frameworks, i.e., method I and method II. Method I is comprised of discrete wavelet transform (DWT)-based preprocessing while method II adopts empirical mode decomposition (EMD) for signal denoising. The rest of the framework for both methods is the same. The feature values extracted, reduced feature vectors, and the performance of the classifiers vary for both methods due to the difference in preprocessing methods. All experiments were performed on MATLAB 2018a (The MathWorks, Inc., Natick, MA, USA) running on a personal computer with Core i7 (Intel Corporation, Santa Clara, CA, USA) processors and 32 GB RAM.
1.
Preprocessing: It removes the irrelevant information and artifacts from the acquired PuPG signal data of normal and hypertension classes. Method I employs discrete wavelet transform (DWT) for signal denoising through frequency and mean relative energy-based criteria. Method II adopts empirical mode decomposition (EMD) for noise elimination through analysis of mean frequencies and energies of individual signal components extracted from normal and hypertension classes.
2.
Feature extraction: It extracts a combination of 102 features from preprocessed PuPG through DWT and EMD separately. These include time, frequency, spectral, texture, and cepstral features. The difference between signal classes is best captured through the extraction of a wide range of informative features.
3.
Feature selection and reduction: This step eliminates features with redundant information through a hybrid feature selection and reduction (HFSR) method that is a combination of multiple feature ranking and transformation schemes. A high-dimensional feature vector is reduced through a new strategy of the averaging outcome of seven feature ranking methods, thus providing more reliable results. Next, we employed kernel principal component analysis (KPCA) to further decrease the feature dimension and represent significant information in fewer parameters. Extracted features in both method I and II are fed to the HFSR scheme to reduce the dimension of the resultant feature vector.
4.
Classification:The final feature vectors extracted in both methods I and II of hypertension and normal classes are fed to a range of different classifiers, i.e, support vector machines (SVM), k-nearest neighbors (KNN), ensemble methods, decision trees (DT), and logistic regression (LR). Classification performance of both methods is evaluated through a baseline tenfold cross-validation strategy and compared with 5-, 15-, 20-, and 25-fold cross-validation.

3.2. Preprocessing

The acquired PuPG data were contaminated with noise and artifacts and include redundant information (Figure 2). These noise components needs to be eliminated for a robust performance of the proposed system. Therefore, we employed DWT and EMD-based preprocessing for signal denoising. Later on, we compared the preprocessing performance of both methods.

3.2.1. Discrete Wavelet Transform

The discrete wavelet transform (DWT) is a widely applied approach in biomedical signal processing applications [30,31,32]. DWT decomposes a signal into different resolutions by using a combination of high-pass and low-pass filters. Figure 4 illustrates the complete process of wavelet-based denoising [33] adopted in this research. Numerous filter coefficients have been developed for diverse types of signal analysis applications—for instance, Daubechies, Symlets, and Coiflets coefficients, etc.
In this study, we employed the Symlet wavelet due to its similarity with the shape of the PuPG signal under consideration [34,35]. Symlet wavelet yields the best results as compared to others due to its resemblances with the morphological characteristics of the PuPG signal.
Table 4 exhibits information about decomposition levels, frequency ranges, and mean relative energies of normal and hypertension data classes of PuPG signals. It can be observed that D 1 , D 2 , D 3 , and D 4 signal components have high frequency range and include low mean relative energies; therefore, these components were eliminated while reconstructing a denoised signal. This is also endorsed by the fact that the PuPG signal has a very low frequency (normally less than 60 Hz). Figure 5 provides a graphical illustration of wavelet decomposition for normal and hypertension PuPG signals. Figure 6 presents the denoised signal generated as a result of applying DWT. High frequency noise visible in raw PuPG signal (Figure 2) is eliminated in the denoised version.

3.2.2. Empirical Mode Decomposition

EMD is an adaptive method that derives fundamental functions directly from the data [36]. EMD does not require any previously known value of the signal for its computation. The principal task for computing EMD of a given signal is to empirically determine the intrinsic oscillatory components through their particular time scales in a signal and subsequently disintegrate the signal into intrinsic mode functions (IMFs) [37]. Therefore, EMD provides remarkably better results for nonlinear and non-stationary biomedical signals.
Selection criteria of IMF have to satisfy two conditions;
  • In the entire signal, the total number of local extrema and zero crossings must be equal to each other or differ by a maximum one.
  • The average of the envelopes computed through local minima and local maxima must be zero.
The systematic approach to disintegrate the signal into its IMFs is known as the “sifting” process, explained in Figure 7.
The basic objective of applying EMD for preprocessing the PuPG signal was to decompose the distorted signal into its constituent IMFs as depicted in Figure 8. Considering the fact that some IMFs carry discriminative and characteristic information about various data classes while others include redundant and noisy content, the determination of the proper number of IMFs is a crucial step towards creating an effective signal denoising strategy.
It is perceived from Figure 8 and Table 5 that the first IMF includes mainly high-frequency content. Table 5 provides mean frequency and energy information of each IMF for normal and hypertension data classes of the PuPG signal. The first IMF also holds very little mean relative energy components for both classes, i.e., 0.00% and 1.02% for normal and hypertension classes, respectively. Therefore, it was discarded while reconstructing the denoised signal. All other IMFs and residual signals were added to form a denoised version of the PuPG signal. Figure 9 illustrates the PuPG signal denoised through the EMD process for normal and hypertension data. It is clear that high frequency noise that was visible in raw PuPG signal (Figure 2) is eliminated now.

3.3. Feature Extraction

The objective of the feature extraction stage is to extract significant features from the biomedical signals of various classes that contributes towards an effective classification performance. In this study, a total of 102 features were separately computed from the PuPG signal denoised through DWT and EMD. Table 6 lists all the extracted features along with their statistical measures of mean and standard deviation (STD) for method I (DWT) and method II (EMD). We extracted time domain [38,39,40,41,42,43,44,45], spectral [46,47], fractal and chaos [48,49], chroma [50,51], cepstral [52], and texture features [53] and analyzed them statistically.
These features were subjected to the feature selection step (HFSR) to recognize the features with maximum discriminative content among normal and hypertension classes.

3.4. Hybrid Feature Selection and Reduction

Feature selection is one of the key steps in the modern pattern recognition and machine learning paradigms. The extracted features may include redundant information and irrelevant and noisy parameters. A two-stage hybrid feature selection and reduction (HFSR) strategy was designed to select and transform the best distinctive features as shown in Figure 10. The first stage ranks the input features through seven different methods and the second stage transforms the selected ranked features to further reduce dimensionality.

3.4.1. Feature Selection Scheme

Feature selection routines serve to enhance the performance of classifiers by reducing the feature dimensions as well as decreasing the computational time [54,55]. Feature selection methods are categorized as filter methods and wrapper methods. Filter type feature selection methods employ feature ranking techniques based on the applied statistical measure for selecting a suitable feature. In wrapper type feature selection techniques, a feature subset is selected recursively based on the overall model performance. The selection criterion computes the variation in model performance that decides the addition or removal of a feature from the subset.
To address the limitations of individual feature selection approaches, we employed a hybrid scheme of feature selection by combining seven feature ranking methods through a voting strategy. Figure 10 illustrates the hybrid scheme of feature selection and reduction. In this scheme, seven state-of-the-art feature ranking techniques, namely student t-test (TT), Kullback–Leibler distance (KLD) [56], Bhattacharya distance (BD) [57], Mann–Whitney’s test (MWT), ReliefF (RRF) [58], minimum redundancy maximum relevance (MRMR) [59,60], and receiver operating characteristic curve (ROC) were employed to rank the feature individually. Ranking assigned to each feature by all feature ranking methods is combined to calculate the mean rank (MR) value. A threshold is applied to MR value for feature selection.
Table 7 provides the sorted lists of the best forty features with the highest MR values for method I (features extracted from signal preprocessed through DWT). Rank assigned to individual features by each ranking method is also computed. The top 24 features are highlighted in Table 6 were forwarded to the next stage. It was perceived that if a feature ranking method assigns a high rank to a particular feature that failed to get high scores from other methods, it gets rejected due to the hybrid scheme of feature selection. For instance, consider the Root Sum of Squares feature that received the rank value of 99 from the ROC method, but gets scores of 53, 49, 52, 18, 58, and 72 from TT, KLD, BD, MWT, MRMR, and RRF, respectively. It achieved an MR value of 57.29 that is below the selection criterion, so it was rejected from the final feature vector of 1 × 24 dimensions. Table 8 enlists the top forty features with the highest MR values for method II, i.e., features extracted from the signal preprocessed through EMD. The rank value assigned by an individual feature ranking method to a specific feature can be examined. One to one comparison of the top ten MR values of method I in Table 7 and method II in Table 8 reveals that the magnitude of MR values of method II (81–70) is higher than that for method I (73–67).

3.4.2. Feature Reduction Using Kernel PCA

PCA applies orthogonal transformation to transform a group of likely correlated features into a set of linearly independent features known as principal components. These principal components represent the normalized linear combinations of the original features. It includes information about the most powerful variations present in the data set. The first principal component holds maximum variance information of the data set.
Kernel PCA (KPCA) [61,62] enhances the original PCA to non-linear data distribution problems through a kernel function. A kernel function projects low-dimensional feature data to a higher-dimensional feature space, where it becomes linearly separable [63].
The previous stage of hybrid feature selection reduced the feature dimensions to 1 × 24 which are fed to KPCA to further decrease dimensions for both methods I and II. Components of KPCA were selected recursively based on the classification performance through tenfold cross-validation. Separate sets of 5, 7, 10, 12, 15, and 17 components were picked for methods I and II to investigate the classification performance for differentiating normal and hypertension signal classes of PuPG signals.

3.5. Classification

To perform the classification of normal and hypertension classes of PuPG signal data set, this study employed a range of classification methods through tenfold cross-validation schemes. The classification methods opted in this study were SVM-Linear (SVM-L), SVM-Quadratic (SVM-Q), SVM-Cubic (SVM-C), SVM-Fine Gaussian (SVM-FG), SVM-Medium Gaussian (SVM-MG), KNN-Fine (KNN-F), KNN-Medium (KNN-M), KNN-Cosine distance (KNN-Cos), KNN-Cubic (KNN-C), KNN-Weighted (KNN-W), Decision Trees (DT), Linear Discriminant (LD), Logistic Regression (LR), Gaussian Naive Baise (NBG), Kernel Naive Baise (NBK), Ensemble Boosted Trees (Eboost), Ensemble Bagged Trees (EBT), Ensemble Subspace Discriminant (ESD), and Ensemble Subspace KNN (ESKNN). The tenfold cross-validation was also compared with 5-, 15-, and 20-fold cross-validation and 80–20% and 75–25% train-test experiments. All experiments were implemented on MATLAB 2018a on a personal computer with Core i7 with 32 GB RAM.

4. Results

In this study, the PuPG signal data set comprising two classes (Normal and Hypertension) was first preprocessed through DWT and EMD to develop methods I and II respectively. We obtained 102 features for each method, i.e., DWT and EMD. These features were subjected to the HFSR framework to reduce the computational complexity and feature vector dimensions. Standard statistical parameters of Accuracy (Acc), Sensitivity (Sen), Specificity (Sp), and Error rate (Err) were used to measure the classification performance.

4.1. Method I

In this research, a comparative analysis was performed via preprocessing the PuPG signal through DWT and EMD. This section presents the results yielded by preprocessing through DWT and succeeding processes of feature extraction, selection, and classification. Various feature sets, namely S 1 , S 2 , S 3 , S 4 , S 5 , and S 6 were formed by randomly choosing 5, 7, 10, 12, 15, and 17 transformed features. These feature components were fed to several classification methods to examine the diagnostic performance through tenfold cross-validation. Table 9 presents consolidated result analysis of various classification methods for features sets S 1 (5 components), S 2 (7 components), and S 3 (10 components). Table 10 illustrates comprehensive analysis of classification performance over different classifiers for feature sets S 4 (12 components), S 5 (15 components), and S 6 (17 components). As expressed in Table 10, Ensemble Subspace KNN classifier scores highest average accuracy of 98.4%, for 12 feature components, i.e., S 4 feature set.
Figure 11 shows the performance in terms of accuracy for different feature sets in various classifiers for distinguishing normal and hypertension classes using PuPG signals. Figure 12 demonstrates the specificity performance of several classifiers for various features sets from DWT based preprocessing method. Figure 13 presents a graphical comparison of the sensitivity performance of several classifiers for different feature combinations.
NBG classifier achieves highest specificity performance of 100% for feature sets S 3 , S 4 , S 5 , and S 6 (Figure 12), but it reaches maximum sensitivities of 26%, 26%, 32%, and 34% for the same feature sets (Figure 13); therefore, it results in significant reduction of overall classifier accuracy of NBG. The sensitivity performance is 100% for several classifiers (LD, LR, NBG, SVM-FG, SVM-MG, EBT) for feature set S 1 (Figure 13), but the specificity performance is comparatively low.
Figure 14 shows the classification performance results in the form of a confusion matrix for best configurations such as ESKNN classifier with S 4 (12 feature components). The sensitivity of classification is 99%, which means that out of 700 PuPG signals of hypertension, 693 were correctly predicted as hypertension data class while testing, whereas only seven were misclassified as healthy class. The classifier achieved a 98% specificity performance. Out of 709 healthy PuPG signal samples, 695 were accurately predicted as healthy class, whereas the remaining 14 signals were misclassified.
Table 11 includes the extensive experimentation results to avoid the classifier overfitting. The selected configuration was tested through 5-, 10-, 15-, and 20-fold cross-validation and 20% and 25% train-test holdout validations.

4.2. Method II

This section is primarily focused on the second method that is under discussion for this research. It encompasses the results of the classification of the features extracted after the preprocessing of the PuPG signal via EMD. A certain number of feature sets were chosen that were the result of the HFSR. The feature sets comprising of 5, 7, 10, 12, 15, and 17 transformed features were chosen and named S 1 , S 2 , S 3 , S 4 , S 5 , and S 6 , respectively. These feature components were fed to a various number of classifiers for classification and their performance was tested through tenfold cross-validation.
Table 12 depicts the results obtained after the classification of the feature set S 1 (5 components), S 2 (7 components), and S 3 (10 components) on using a selection of various classifiers. Table 13 shows the outcomes of various classification techniques applied on feature sets S 4 (12 components), S 5 (15 components), and S 6 (17 components). Analysis of both Table 12 and Table 13 show that a maximum average accuracy using the least number of features is 99.4%. This accuracy is the result of the weighted KNN classification method applied on the feature set S 1 .
Figure 15 shows a comparison of the performance of various classifiers based on the accuracy achieved as a result of distinguishing hypertension and normal PuPG signal. Figure 16 depicts the comparison result of various classifiers based on their specificities after using EMD as the preprocessing technique. Figure 17 represents the comparison of the sensitivities of various classification methods. NBG classifier achieves the highest specificity performance of 100% for feature sets S 3 (Figure 16), but it reaches maximum sensitivities of 26% for the same feature set (Figure 17). The sensitivity performance is 100% for several classifiers (LD, NBG, SVM-MG, ESD) for feature set S 1 (Figure 17), but the specificity performance is comparatively low.
Figure 18 illustrates the best classification performance in the form of a confusion matrix for selected features set ( S 1 ) with KNN-W classifier. The sensitivity of classification is more than 99%, which means only one out of 700 PuPG signals was wrong predicted as hypertension data class, whereas the remaining 699 PuPG signals were correctly identified as hypertension. Out of 709 healthy PuPG signals, 702 were correctly predicted as healthy, achieving specificity of 99%. The overall average classification accuracy in the best configuration with the KNN-W classifier was 99.4%.
Table 14 includes the results of comprehensive experimentation which is performed to avoid the classifier overfitting. The selected framework was examined through 5-, 10-, 15-, and 20-fold cross-validation and 20% and 25% train-test holdout validations. For all experimental settings, the proposed scheme achieved more than 98% accuracy.

4.3. Method I versus Method II: A Comparative Analysis

This section aims to compare both methods I and II analytically. Based on this comparison, we figure out the best working solution for the detection of hypertension through PuPG signals. Method I comprises of preprocessing of PuPG signals through DWT, followed by feature extraction. Extracted features were subjected to the HFSR scheme and finally classified through Ensemble Subspace KNN. Method II consists of EMD-based signal preprocessing followed by feature extraction. Features were fed to KNN-W classifier for distinguishing normal and hypertension data classes after being reduced through the HFSR approach.
Table 15 shows the performance comparison of methods I and II in terms of average accuracy, sensitivity, specificity, error, and number of features. Method I achieves classification performance of 98.4% accuracy, 97% sensitivity, and 99% specificity using 12 transformed features. Method II obtains 99.4%, 99.2%, and 99.6% results of classification accuracy, sensitivity, and specificity respectively through only five reduced features.
Comparative analysis of both methods establishes that method II outperforms method I in terms of achieving better classification accuracy on a reduced number of features. This might be due to the fact that the accuracy achieved in the case of DWT highly depends on the proper wavelet basis selection [64]. The selection of an appropriate basis is challenging especially for non-stationary data [65]. On the other hand, EMD is a fully data-driven, adaptive, and basis-less transformation [66]. Moreover, the IMF selection process of EMD based on relative energy and mean frequency has assisted the selection of useful discriminative signal characteristics.
Figure 19 presents the finalized EHDS (expert hypertension detection system) based on PuPG signal analysis. EHDS first takes raw PuPG signal as input and performs preprocessing through EMD by rejecting the irrelevant IMFs. Next, only 24 significant features highlighted by the hybrid selection scheme are extracted and reduced through KPCA. The final transformed 1 × 5 feature vector is fed to KNN-W to distinguish the normal and hypertension data classes. Figure 20 illustrates the classification performance of the proposed EHDS as a function of the number of transformed features. It can be observed that the proposed EHDS achieves the optimum performance on only five transformed features. The classification performance shows no notable improvement with the increase in the number of features.

5. Discussion

Human blood vessels and the microcirculation system experience transformations with the rise in blood pressure (BP); these changes are exceptionally obvious for patients with severe hypertension. PuPG signals carry a wealth of information about the cardiac health [25,26,27]. The PuPG signal reflects physical changes in blood volume pressure in blood vessels during the cardiac cycle. The features extracted in this study indicate the changes in Normal and Hypertension PuPG signals acquired from various subjects. The high classification performance of EHDS reflects the association of extracted transformed features with the physiological characteristics of the cardiac condition of the subject. Thus, the proposed expert system may provide a good approximation of the presence or absence of non-communicable diseases such as hypertension.
Table 16 presents a performance comparison of the recent studies. A diagnostic index for the classification of low and high-risk hypertension classes attaining accuracy of 100% was proposed by [17]. In contrast, our work is targeted towards the classification of Normal and Hypertension classes through PuPG signals. In another study, [18] developed a computational intelligence tool based on ECG signals for the classification of normal and hypertension. EMD was employed in the signal preprocessing stage, followed by nonlinear feature extraction from the decomposed IMFs. Extracted features were ranked through Student’s t-test. The highest classification accuracy of 97.70% was obtained through the KNN classifier with tenfold cross-validation. A photoplethysmograph (PPG) based detection of hypertension was proposed by [19]. A total of 125 features of various types were extracted and reduced through MRMR. The authors reported the best classification performance with KNN-W, specifically to be 100%, 85.71%, and 92.31% for positive predictive value, sensitivity, and F1-score, respectively.
The current research is focused on the classification between normal and hypertension data through PuPG signals. To the best of author’s knowledge, this is the first study that uses the PuPG signals for discriminating among normal and hypertension with high precision. The current method achieves better performance than the existing ECG- [15,17,18], PPG- [12,19], HRV- [14,16], and BCG-based [20] approaches. Our method also outperforms the fusion-based method for detection of hypertension that utlized a combination of PPG and ECG [13].
The proposed expert system could play a vital role in the early detection of hypertension in low- and middle-income countries. It is important to mention that an estimated 1.04 billion population suffered from hypertension in 2010 [67]. A non-invasive technique based on PuPG signals analysis proposed in this research could be used for the detection of non-communicable diseases.

6. Conclusions

Early detection of hypertension or high blood pressure is extremely significant since it does not cause any obvious symptoms in many people; hence, it can harm the heart, the kidneys, and even the brain. In this study, we proposed an automated detection system for hypertension from PuPG signals for timely and precise screening of disease. First, PuPG signals were preprocessed through EMD, followed by feature extraction of various types. Highly discriminative features were selected through the proposed HFSR scheme that consisted of feature reduction and selection methods. The resultant reduced features of dimension 1 × 5 were subjected to various classification methods. The KNN-W classifier achieved the best performance in terms of accuracy, sensitivity, and specificity of 99.4%, 99.2%, and 99.6%, respectively. To compute the model performance and avoid overfitting, 5-, 10-, 15-, and 20-fold cross-validations were employed. The proposed method was also compared with the DWT based preprocessing scheme followed by the same feature extraction, selection (HFSR), and classification pipeline. The main advantages of this research are as follows:
  • The proposed EHDS system is based on the non-invasive methodology of PuPG signals.
  • The EHDS is reliable and less computational intensive with high accuracy.
  • The EHDS avoids overfitting as it is validated through 5-, 10-, 15-, and 20-fold cross-validation.
  • The proposed approach does not only rely on morphological characteristics of the acquired signal.
  • The method can be completely automated, and it works with all qualities of PuPG signals.
Despite the enormous advantages of the proposed method, it has a few limitations.
  • The data set used in this research is yet small, with each sample with a length of 10 s.
  • The procedure of initial feature extraction and selection of proper IMFs in EMD made the overall process strenuous and time-consuming.
The proposed study conducted a comprehensive comparison of preprocessing schemes (DWT and EMD), feature analysis, selection, and classification as illustrated in Figure 3. The computational complexity of the proposed is significantly low due to the fact that it operates on trained classifier models, therefore eliminating the training computational cost (Figure 19). The proposed system has the potential to be deployed in clinical environments and intensive care units where it can contribute to lessen the workload of medical professionals through its accurate detection and timely diagnosis. In future works, our research group aims to increase the data set size and apply deep learning models to automate the feature extraction process. The proposed framework is intended to be implemented on portable embedded platforms.

Author Contributions

Conceptualization, M.U.K. and S.A.; methodology, M.U.K. and S.A.; software, K.I. and T.A.; validation, F.A. and K.I.; formal analysis, T.A. and K.I.; investigation, K.I. and F.A.; writing—original draft preparation, S.A. and M.U.K.; writing—review and editing, M.A.K. and Y.N.; funding acquisition, Y.N.; project administration, T.A.; data curation, F.A. and M.A.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by Korea Institute for Advancement of Technology (KIAT) grant funded by the Korea Government (MOTIE) (P0012724, The Competency Development Program for Industry Specialist) and the Soonchunhyang University Research Fund.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data will be available on request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Tayefi, M.; Esmaeili, H.; Karimian, M.S.; Zadeh, A.A.; Ebrahimi, M.; Safarian, M.; Nematy, M.; Parizadeh, S.M.R.; Ferns, G.A.; Ghayour-Mobarhan, M. The application of a decision tree to establish the parameters associated with hypertension. Comput. Methods Programs Biomed. 2017, 139, 83–91. [Google Scholar] [CrossRef] [PubMed]
  2. WHO. Cardiovascular Diseases. 2017. Available online: https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds) (accessed on 23 October 2020).
  3. WHO. Hypertension, 13 September 2019. Available online: https://www.who.int/news-room/fact-sheets/detail/hypertension (accessed on 23 October 2020).
  4. Poddar, M.; Kumar, V.; Sharma, Y.P. Heart rate variability based classification of normal and hypertension cases by linear-nonlinear method. Def. Sci. J. 2014, 64, 542. [Google Scholar] [CrossRef]
  5. Li, W.; Gu, H.; Teo, K.K.; Bo, J.; Wang, Y.; Yang, J.; Wang, X.; Zhang, H.; Sun, Y.; Jia, X.; et al. Hypertension prevalence, awareness, treatment, and control in 115 rural and urban communities involving 47,000 people from China. J. Hypertens. 2016, 34, 39–46. [Google Scholar] [CrossRef] [PubMed]
  6. Wall, H.K.; Hannan, J.A.; Wright, J.S. Patients with undiagnosed hypertension: Hiding in plain sight. JAMA 2014, 312, 1973–1974. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Saleem, F.; Hassali, A.A.; Shafie, A.A. Hypertension in Pakistan: Time to take some serious action. Br. J. Gen. Pract. 2010, 60, 449–450. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Benjamin, E.J.; Blaha, M.J.; Chiuve, S.E.; Cushman, M.; Das, S.R.; Deo, R.; De Ferranti, S.D.; Floyd, J.; Fornage, M.; Gillespie, C.; et al. Heart disease and stroke statistics—2017 update. Circulation 2017, 135, e146–e603. [Google Scholar] [CrossRef]
  9. Goodhart, A.K. Hypertension from the patient’s perspective. Br. J. Gen. Pract. 2016, 66, 570. [Google Scholar] [CrossRef]
  10. Huang, C.; Wang, Y.; Li, X.; Ren, L.; Zhao, J.; Hu, Y.; Zhang, L.; Fan, G.; Xu, J.; Gu, X.; et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 2020, 395, 497–506. [Google Scholar] [CrossRef] [Green Version]
  11. Schiffrin, E.L.; Flack, J.M.; Ito, S.; Muntner, P.; Webb, R.C. Hypertension and COVID-19. Am. J. Hypertens. 2020, 33, 373–374. [Google Scholar] [CrossRef]
  12. Liang, Y.; Chen, Z.; Ward, R.; Elgendi, M.J.B. Photoplethysmography and deep learning: Enhancing hypertension risk stratification. Biosensors 2018, 8, 101. [Google Scholar] [CrossRef] [Green Version]
  13. Liang, Y.; Chen, Z.; Ward, R.; Elgendi, M. Hypertension assessment via ECG and PPG signals: An evaluation using MIMIC database. Diagnostics 2018, 8, 65. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Lan, K.C.; Raknim, P.; Kao, W.F.; Huang, J.H. Toward hypertension prediction based on PPG-derived HRV signals: A feasibility study. J. Med. Syst. 2018, 42, 103. [Google Scholar] [CrossRef] [PubMed]
  15. Ni, H.; Cho, S.; Mankoff, J.; Yang, J.J.; Computing, H. Automated recognition of hypertension through overnight continuous HRV monitoring. J. Ambient. Intell. Humaniz. Comput. 2018, 9, 2011–2023. [Google Scholar] [CrossRef]
  16. Kublanov, V.S.; Dolganov, A.Y.; Belo, D.; Gamboa, H.J. Comparison of machine learning methods for the arterial hypertension diagnostics. Appl. Bionics Biomech. 2017, 2017, 5985479. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Rajput, J.S.; Sharma, M.; Acharya, U.R. Hypertension Diagnosis Index for Discrimination of High-Risk Hypertension ECG Signals Using Optimal Orthogonal Wavelet Filter Bank. Int. J. Environ. Res. Public Health 2019, 16, 4068. [Google Scholar] [CrossRef] [Green Version]
  18. Soh, D.C.K.; Ng, E.; Jahmunah, V.; Oh, S.L.; San, T.R.; Acharya, U.R. A computational intelligence tool for the detection of hypertension using empirical mode decomposition. Medicine 2020, 118, 103630. [Google Scholar] [CrossRef]
  19. Liang, Y.; Chen, Z.; Ward, R.; Elgendi, M. Hypertension assessment using photoplethysmography: A risk stratification approach. J. Clin. Med. 2019, 8, 12. [Google Scholar] [CrossRef] [Green Version]
  20. Liu, F.; Zhou, X.; Wang, Z.; Cao, J.; Wang, H.; Zhang, Y. Unobtrusive mattress-based identification of hypertension by integrating classification and association rule mining. Sensors 2019, 19, 1489. [Google Scholar] [CrossRef] [Green Version]
  21. Baranchuk, A.; Kang, J.; Shaw, C.; Campbell, D.; Ribas, S.; Hopman, W.M.; Alanazi, H.; Redfearn, D.P.; Simpson, C.S. Electromagnetic interference of communication devices on ECG machines. Clin. Cardiol. Int. Index. Peer Rev. J. Adv. Treat. Cardiovasc. Dis. 2009, 32, 588–592. [Google Scholar] [CrossRef]
  22. Klein, A.; Djaiani, G. Mobile phones in the hospital–past, present and future. Anaesthesia 2003, 58, 353–357. [Google Scholar] [CrossRef]
  23. Kranjec, J.; Beguš, S.; Geršak, G.; Drnovšek, J. Non-contact heart rate and heart rate variability measurements: A review. Biomed. Signal Process. Control. 2014, 13, 102–112. [Google Scholar] [CrossRef]
  24. Ave, A.; Fauzan, H.; Adhitya, S.R.; Zakaria, H. Early detection of cardiovascular disease with photoplethysmogram (PPG) sensor. In Proceedings of the 2015 International Conference on Electrical Engineering and Informatics (ICEEI), Denpasar, Indonesia, 10–11 August 2015; pp. 676–681. [Google Scholar]
  25. Khan, M.U.; Aziz, S.; Malik, A.; Imtiaz, M.A. Detection of Myocardial Infarction using Pulse Plethysmograph Signals. In Proceedings of the 2019 International Conference on Frontiers of Information Technology (FIT), Islamabad, Pakistan, 16–18 December 2019; pp. 95–955. [Google Scholar]
  26. Khan, M.U.; Aziz, S.; Amjad, F.; Mohsin, M. Detection of Dilated Cardiomyopathy using Pulse Plethysmographic Signal Analysis. In Proceedings of the 2019 22nd International Multitopic Conference (INMIC), Islamabad, Pakistan, 29–30 November 2019; pp. 1–5. [Google Scholar]
  27. Khan, M.U.; Aziz, S.; Iqtidar, K.; Zainab, A.; Saud, A. Prediction of Acute Coronary Syndrome Using Pulse Plethysmograph. In Proceedings of the 2019 4th International Conference on Emerging Trends in Engineering, Sciences and Technology (ICEEST), Karachi, Pakistan, 10–11 December 2019; pp. 1–6. [Google Scholar]
  28. Naqvi, S.Z.H.; Aziz, S.; Khan, M.U.; Asghar, N.; Rasool, G. Emotion Recognition System using Pulse Plethysmograph. In Proceedings of the 2020 International Conference on Emerging Trends in Smart Technologies (ICETST), Karachi, Pakistan, 26–27 March 2020; pp. 1–6. [Google Scholar]
  29. Khan, M.U.; Aziz, S.; Naqvi, S.Z.H.; Zaib, A.; Maqsood, A. Pattern Analysis Towards Human Verification using Photoplethysmograph Signals. In Proceedings of the 2020 International Conference on Emerging Trends in Smart Technologies (ICETST), Karachi, Pakistan, 26–27 March 2020; pp. 1–6. [Google Scholar]
  30. Zhang, G.; Si, Y.; Yang, W.; Wang, D. A Robust Multilevel DWT Densely Network for Cardiovascular Disease Classification. Sensors 2020, 20, 4777. [Google Scholar] [CrossRef] [PubMed]
  31. Aileni, R.M.; Pasca, S.; Florescu, A. EEG-Brain Activity Monitoring and Predictive Analysis of Signals Using Artificial Neural Networks. Sensors 2020, 20, 3346. [Google Scholar] [CrossRef] [PubMed]
  32. Alturki, F.A.; AlSharabi, K.; Abdurraqeeb, A.M.; Aljalal, M. EEG Signal Analysis for Diagnosing Neurological Disorders Using Discrete Wavelet Transform and Intelligent Techniques. Sensors 2020, 20, 2505. [Google Scholar] [CrossRef] [PubMed]
  33. Gradolewski, D.; Redlarski, G. Wavelet-based denoising method for real phonocardiography signal recorded by mobile devices in noisy environment. Comput. Biol. Med. 2014, 52, 119–129. [Google Scholar] [CrossRef] [PubMed]
  34. Ebrahim, M.P.; Heydari, F.; Redoute, J.M.; Yuce, M.R. Accurate heart rate detection from on-body continuous wave radar sensors using wavelet transform. In Proceedings of the 2018 IEEE SENSORS, New Delhi, India, 28–31 October 2018; pp. 1–4. [Google Scholar]
  35. Ebrahim, M.P.; Heydari, F.; Walker, K.; Joe, K.; Redoute, J.M.; Yuce, M.R. Systolic Blood Pressure Estimation Using Wearable Radar and Photoplethysmogram Signals. In Proceedings of the 2019 IEEE International Conference on Systems, Man and Cybernetics (SMC), Bari, Italy, 6–9 October 2019; pp. 3878–3882. [Google Scholar]
  36. Asghar, M.A.; Khan, M.J.; Rizwan, M.; Mehmood, R.M.; Kim, S.H. An Innovative Multi-Model Neural Network Approach for Feature Selection in Emotion Recognition Using Deep Feature Clustering. Sensors 2020, 20, 3765. [Google Scholar] [CrossRef] [PubMed]
  37. Aziz, S.; Khan, M.U.; Alhaisoni, M.; Akram, T.; Altaf, M. Phonocardiogram Signal Processing for Automatic Diagnosis of Congenital Heart Disorders through Fusion of Temporal and Cepstral Features. Sensors 2020, 20, 3790. [Google Scholar] [CrossRef]
  38. Tsimpiris, A.; Kugiumtzis, D. Feature selection for classification of oscillating time series. Expert Syst. 2012, 29, 456–477. [Google Scholar] [CrossRef]
  39. Goyal, D.; Choudhary, A.; Pabla, B.; Dhami, S. Support vector machines based non-contact fault diagnosis system for bearings. J. Intell. Manuf. 2019, 31, 1275–1289. [Google Scholar] [CrossRef]
  40. Fasil, O.; Rajesh, R. Time-domain exponential energy for epileptic EEG signal classification. Neurosci. Lett. 2019, 694, 1–8. [Google Scholar]
  41. Banker, R.; Natarajan, R.; Zhang, D. Two-stage estimation of the impact of contextual variables in stochastic frontier production function models using data envelopment analysis: Second stage OLS versus bootstrap approaches. Eur. J. Oper. Res. 2019, 278, 368–384. [Google Scholar] [CrossRef]
  42. Kumar, D.; Carvalho, P.; Antunes, M.; Paiva, R.; Henriques, J. Heart murmur classification with feature selection. In Proceedings of the 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology, Buenos Aires, Argentina, 31 August–4 September 2010; pp. 4566–4569. [Google Scholar]
  43. Dhindsa, I.S.; Agarwal, R.; Ryait, H.S. Performance evaluation of various classifiers for predicting knee angle from electromyography signals. Expert Syst. 2019, 36, e12381. [Google Scholar] [CrossRef]
  44. Mukhopadhyay, A.K.; Samui, S. An experimental study on upper limb position invariant EMG signal classification based on deep neural network. Biomed. Signal Process. Control. 2020, 55, 101669. [Google Scholar] [CrossRef]
  45. Phinyomark, A.; Phukpattaranont, P.; Limsakul, C. Feature reduction and selection for EMG signal classification. Expert Syst. Appl. 2012, 39, 7420–7431. [Google Scholar] [CrossRef]
  46. Peeters, G. A large set of audio features for sound description (similarity and classification) in the CUIDADO project. Cuid. IST Proj. Rep. 2004, 54, 1–25. [Google Scholar]
  47. Yadav, A.; Singh, A.; Dutta, M.K.; Travieso, C.M. Machine learning-based classification of cardiac diseases from PCG recorded heart sounds. Neural Comput. Appl. 2019, 1–14. [Google Scholar] [CrossRef]
  48. Amezquita-Sanchez, J.P.; Mammone, N.; Morabito, F.C.; Marino, S.; Adeli, H. A novel methodology for automated differential diagnosis of mild cognitive impairment and the Alzheimer’s disease using EEG signals. J. Neurosci. Methods 2019, 322, 88–95. [Google Scholar] [CrossRef]
  49. Peng, J.; Hao, D.; Yang, L.; Du, M.; Song, X.; Jiang, H.; Zhang, Y.; Zheng, D. Evaluation of electrohysterogram measured from different gestational weeks for recognizing preterm delivery: A preliminary study using random Forest. Biocybern. Biomed. Eng. 2020, 40, 352–362. [Google Scholar] [CrossRef]
  50. Birajdar, G.K.; Patil, M.D. Speech/music classification using visual and spectral chromagram features. J. Ambient. Intell. Humaniz. Comput. 2020, 11, 329–347. [Google Scholar] [CrossRef]
  51. Leite, J.P.R.; Moreno, R.L. Heartbeat classification with low computational cost using Hjorth parameters. IET Signal Process. 2017, 12, 431–438. [Google Scholar] [CrossRef]
  52. Dua, M.; Aggarwal, R.K.; Biswas, M. GFCC based discriminatively trained noise robust continuous ASR system for Hindi language. J. Ambient. Intell. Humaniz. Comput. 2019, 10, 2301–2314. [Google Scholar] [CrossRef]
  53. Adnan, S.M.; Irtaza, A.; Aziz, S.; Ullah, M.O.; Javed, A.; Mahmood, M.T. Fall detection through acoustic local ternary patterns. Appl. Acoust. 2018, 140, 296–300. [Google Scholar] [CrossRef]
  54. Guyon, I. Practical feature selection: From correlation to causality. In Proceedings of the NATO Advanced Study Institute on Mining Massive Data Sets for Security (MMDSS 2007), Gazzada, Italy, 10–21 September 2007; pp. 27–43. [Google Scholar]
  55. Li, X.; Ling, S.H.; Su, S. A Hybrid Feature Selection and Extraction Methods for Sleep Apnea Detection Using Bio-Signals. Sensors 2020, 20, 4323. [Google Scholar] [CrossRef]
  56. Zhen, Z.; Zeng, X.; Wang, H.; Han, L. A global evaluation criterion for feature selection in text categorization using Kullback-Leibler divergence. In Proceedings of the 2011 International Conference of Soft Computing and Pattern Recognition (SoCPaR), Dalian, China, 14–16 October 2011; pp. 440–445. [Google Scholar]
  57. Guorong, X.; Peiqi, C.; Minhui, W. Bhattacharyya distance feature selection. In Proceedings of the 13th International Conference on Pattern Recognition, Vienna, Austria, 25–29 August 1996; Volume 2, pp. 195–199. [Google Scholar]
  58. Chowdhury, M.H.; Shuzan, M.N.I.; Chowdhury, M.E.; Mahbub, Z.B.; Uddin, M.M.; Khandakar, A.; Reaz, M.B.I. Estimating Blood Pressure from the Photoplethysmogram Signal and Demographic Features Using Machine Learning Techniques. Sensors 2020, 20, 3127. [Google Scholar] [CrossRef] [PubMed]
  59. Ding, C.; Peng, H. Minimum redundancy feature selection from microarray gene expression data. J. Bioinform. Comput. Biol. 2005, 3, 185–205. [Google Scholar] [CrossRef] [PubMed]
  60. Xu, Y.; Zhao, X.; Chen, Y.; Zhao, W. Research on a mixed gas recognition and concentration detection algorithm based on a metal oxide semiconductor olfactory system sensor array. Sensors 2018, 18, 3264. [Google Scholar] [CrossRef] [PubMed]
  61. Schölkopf, B.; Smola, A.; Müller, K.R. Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput. 1998, 10, 1299–1319. [Google Scholar] [CrossRef] [Green Version]
  62. Khan, M.U.; Saad, M.; Aziz, S.; Ch, J.M.; Naqvi, S.Z.H.; Qasim, M.A. Electrocardiogram based Gender Classification. In Proceedings of the 2020 International Conference on Electrical, Communication, and Computer Engineering (ICECCE), Istanbul, Turkey, 12–13 June 2020; pp. 1–6. [Google Scholar]
  63. Hoffmann, H. Kernel PCA for novelty detection. Pattern Recognit. 2007, 40, 863–874. [Google Scholar] [CrossRef]
  64. Mohamed, M.; Deriche, M. An approach for ECG feature extraction using daubechies 4 (DB4) wavelet. Int. J. Comput. Appl. 2014, 96, 36–41. [Google Scholar]
  65. Acharya, U.R.; Fujita, H.; Adam, M.; Lih, O.S.; Sudarshan, V.K.; Hong, T.J.; Koh, J.E.; Hagiwara, Y.; Chua, C.K.; Poo, C.K.; et al. Automated characterization and classification of coronary artery disease and myocardial infarction by decomposition of ECG signals: A comparative study. Inf. Sci. 2017, 377, 17–29. [Google Scholar] [CrossRef]
  66. Labate, D.; La Foresta, F.; Occhiuto, G.; Morabito, F.C.; Lay-Ekuakille, A.; Vergallo, P. Empirical mode decomposition vs. wavelet decomposition for the extraction of respiratory signal from single-channel ECG: A comparison. IEEE Sens. J. 2013, 13, 2666–2674. [Google Scholar] [CrossRef]
  67. Mills, K.T.; Bundy, J.D.; Kelly, T.N.; Reed, J.E.; Kearney, P.M.; Reynolds, K.; Chen, J.; He, J. Global disparities of hypertension prevalence and control: A systematic analysis of population-based studies from 90 countries. Circulation 2016, 134, 441–450. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Visual comparison of Photoplethysmograph (PPG) and Pulse Plethysmograph (PuPG) signals.
Figure 1. Visual comparison of Photoplethysmograph (PPG) and Pulse Plethysmograph (PuPG) signals.
Sensors 21 00247 g001
Figure 2. Raw PuPG signals of Normal and Hypertension classes.
Figure 2. Raw PuPG signals of Normal and Hypertension classes.
Sensors 21 00247 g002
Figure 3. Overall flow chart of the proposed design methodology for detection of hypertension through pulse plethysmograph signals.
Figure 3. Overall flow chart of the proposed design methodology for detection of hypertension through pulse plethysmograph signals.
Sensors 21 00247 g003
Figure 4. Wavelet-based denoising.
Figure 4. Wavelet-based denoising.
Sensors 21 00247 g004
Figure 5. Wavelet decomposition of raw PuPG signals.
Figure 5. Wavelet decomposition of raw PuPG signals.
Sensors 21 00247 g005
Figure 6. Denoised version of PuPG signal for Normal and Hypertension through DWT.
Figure 6. Denoised version of PuPG signal for Normal and Hypertension through DWT.
Sensors 21 00247 g006
Figure 7. EMD algorithm (flow chart).
Figure 7. EMD algorithm (flow chart).
Sensors 21 00247 g007
Figure 8. EMD decomposition of raw PuPG signals.
Figure 8. EMD decomposition of raw PuPG signals.
Sensors 21 00247 g008
Figure 9. Preprocessed signal using EMD.
Figure 9. Preprocessed signal using EMD.
Sensors 21 00247 g009
Figure 10. Block diagram (feature selection and reduction method).
Figure 10. Block diagram (feature selection and reduction method).
Sensors 21 00247 g010
Figure 11. Performance of accuracy for different feature sets in various classifiers for PuPG signal through method I.
Figure 11. Performance of accuracy for different feature sets in various classifiers for PuPG signal through method I.
Sensors 21 00247 g011
Figure 12. Performance of sensitivity for different feature sets in various classifiers for PuPG signal through method I.
Figure 12. Performance of sensitivity for different feature sets in various classifiers for PuPG signal through method I.
Sensors 21 00247 g012
Figure 13. Performance of specificity for different feature sets in various classifiers for PuPG signal through method I.
Figure 13. Performance of specificity for different feature sets in various classifiers for PuPG signal through method I.
Sensors 21 00247 g013
Figure 14. Confusion matrix for method I.
Figure 14. Confusion matrix for method I.
Sensors 21 00247 g014
Figure 15. Performance of accuracy for different feature sets in various classifiers for PuPG signal through method II.
Figure 15. Performance of accuracy for different feature sets in various classifiers for PuPG signal through method II.
Sensors 21 00247 g015
Figure 16. Performance of sensitivity for different feature sets in various classifiers for PuPG signal through method II.
Figure 16. Performance of sensitivity for different feature sets in various classifiers for PuPG signal through method II.
Sensors 21 00247 g016
Figure 17. Performance of specificity for different feature sets in various classifiers for PuPG signal through method II.
Figure 17. Performance of specificity for different feature sets in various classifiers for PuPG signal through method II.
Sensors 21 00247 g017
Figure 18. Confusion matrix for method II.
Figure 18. Confusion matrix for method II.
Sensors 21 00247 g018
Figure 19. Proposed EHDS block diagram.
Figure 19. Proposed EHDS block diagram.
Sensors 21 00247 g019
Figure 20. Performance of method II in terms of accuracy, sensitivity, and specificity for 1 to 24 transformed features.
Figure 20. Performance of method II in terms of accuracy, sensitivity, and specificity for 1 to 24 transformed features.
Sensors 21 00247 g020
Table 1. Categorization of blood pressure.
Table 1. Categorization of blood pressure.
ClassSystolic (mmHg)Diastolic (mmHg)
OptimalLess than 120Less than 80
Normal120 to 12980 to 84
High Normal130 to 13985 to 89
HypertensionMore than or equal to 140More than or equal to 90
Table 2. Difference between PPG and PuPG data acquisition.
Table 2. Difference between PPG and PuPG data acquisition.
TypePhotoplethysmograph (PPG) SensorPulse Plethysmograph (PuPG) Sensor
Input signalOptical signalPressure changes
PhenomenonBlood volumetric changes are detected by measuring the amount of light transmitted or reflected by the sensor.Blood volumetric changes are detected by the piezoelectric material of the sensor as pressure changes when the blood volume changes.
Noise ImpactLight signal can be easily impacted by any external light changes.
Dirty hand can distort the light intensities.
Piezoelectric material based sensors are normaly temperature sensitive.
Dirty hands or foreign material on hand or fingers does not have significant impact.
Table 3. Summary of the self-collected PuPG data set.
Table 3. Summary of the self-collected PuPG data set.
Data ClassSubjectsAge GroupSamples
HypertensionMale: 29Male: 40–76700
Female: 27Female: 39–59
NormalMale: 35Male: 21–63709
Female: 30Female: 20–59
Overall12120–761409
Table 4. Comparison of mean relative energies and frequency ranges of various decomposition levels for Normal and Hypertension classes.
Table 4. Comparison of mean relative energies and frequency ranges of various decomposition levels for Normal and Hypertension classes.
Decomposition Levels Frequency Range (Hz) Mean Relative Energy (%)
NormalHypertension
D 1 250–5000.07%0.59%
D 2 122–2560.09%0.32%
D 3 61.1–1280.19%0.35%
D 4 30.6–63.90.46%0.49%
D 5 15.3–31.91.93%3.10%
D 6 7.65–1614.77%13.31%
D 7 3.84–7.9731.03%21.49%
D 8 1.94–3.9929.11%19.05%
D 9 1.03–1.9921.11%26.77%
D 10 0.594–0.9580.16%7.65%
A 10 0–0.4311.08%6.88%
A 4 0–31.299.19%98.26%
Table 5. Comparison of mean relative energies and frequency ranges of various intrinsic mode functions (EMD) for Normal and Hypertension classes. Bold font indicates the selected components.
Table 5. Comparison of mean relative energies and frequency ranges of various intrinsic mode functions (EMD) for Normal and Hypertension classes. Bold font indicates the selected components.
ComponentsNormalHypertension
Mean Frequency Range (Hz)Mean Relative Energy (%)Mean Frequency Range (Hz)Mean Relative Energy (%)
I M F 1 103–4830.0086.5–4841.02
I M F 2 11.3–60.20.1440.7–2190.35
I M F 3 3.09–1430.343.3–612.04
I M F 4 2.99–12.24.973.34–23.36.65
I M F 5 1.28–1022.762.98–11.414.95
Residual0.129–5.5541.780.0197–4.3374.99
Table 6. Statistical data of all extracted features for both methods.
Table 6. Statistical data of all extracted features for both methods.
Method IMethod II
Feature NormalHypertensionNormalHypertension
MeanSTDMeanSTDMeanSTDMeanSTD
Mean0.0080.0270.0170.0310.0010.0490.0130.053
Standard Deviation0.2530.0320.2500.0460.2540.0320.2480.045
Skewness−1.9590.522−2.1440.651−1.9970.576−2.2200.641
Kurtosis6.9931.4998.0833.8647.1391.6888.2973.944
Peak to Peak Value1.3800.1991.3980.1401.3770.1781.3750.144
Root Mean Square0.2550.0330.2520.0470.2580.0380.2540.046
Crest Factor1.6590.9391.6350.3811.5870.9911.5230.536
Shape Factor1.4840.1341.4580.1591.5220.1631.4650.167
Impulse Factor2.4351.2972.3820.5932.3711.3762.1710.632
Margin Factor15.2214.6415.127.2715.1615.5613.396.45
Energy389.6209.7437.4207.6393.9207.4448.5222.5
Peak to RMS Value3.8940.6234.0940.9913.9210.6634.1081.102
Root Sum of Squares18.9335.60020.2925.08919.0695.52520.4805.412
Shannon Energy549.7312.3618.2279.7526.6295.8686.5414.4
Log Energy−27,88815,515−34,56918,644−28,50916,289−32,61316,387
Mean Absolute Deviation0.1690.0260.1690.0430.1690.0280.1680.043
Median Absolute Deviation0.0740.0260.0710.0300.0710.0330.0590.024
Average Frequency0.0020.0010.0010.0010.0020.0030.0020.002
Jitter137.1180.985.5150.4159.0248.852.095.8
Spectral Mean3.1445.9280.7711.9608.62315.7002.7646.372
Spectral Standard Deviation3.4014.8331.5493.4637.57111.4983.0316.016
Spectral Skewness2.3611.4963.7631.2950.9961.3601.7971.908
Specral Kurtosis11.3318.23520.1909.5895.5884.94710.5648.751
Spectral Centroid9.4420.2029.9151.4819.7711.57610.7682.216
Spectral Flux0.0080.0020.0080.0020.0080.0020.0080.002
Spectral Roll-off91.6991.01096.5878.38397.03610.371136.25744.372
Spectral Flatness0.0250.0140.0510.0350.0630.0650.1710.138
Spectral Crest0.6420.0120.6210.0400.6310.0300.5930.059
Spectral Decrease−4.3330.228−4.0130.658−4.1690.538−3.6560.830
Spectral Slope−0.0230.003−0.0230.003−0.0240.003−0.0230.005
Spectral Spread18.5050.19919.0291.17819.3281.61723.5215.818
Mean Frequency4.3470.9494.9993.4514.9893.0636.4464.440
Median Frequency3.5740.8032.9720.8803.8113.5732.8460.753
Spurious-free Dynamic Range3.0736.0192.1962.1073.1636.4512.0561.812
Signal to Noise Distortion−0.8856.041−2.0363.101−0.7556.819−2.0973.030
Total Harmonic Distortions−2.3765.452−0.9384.964−3.1446.562−0.3964.110
1st Coeffient of MFCC−44.990.37−44.7160.512−44.840.60−44.450.80
2nd Coeffient of MFCC6.2680.5236.6610.7146.4800.8467.0281.122
3rd Coeffient of MFCC5.9760.4996.3500.6836.1690.8026.6901.066
4th Coeffient of MFCC5.5080.4625.8510.6345.6710.7336.1480.976
1st Coeffient of GFCC−7.1830.430−6.7620.610−7.0270.793−6.2201.266
2nd Coeffient of GFCC1.8440.0631.8690.0711.5220.4191.1190.574
3rd Coeffient of GFCC0.5530.1380.3670.2690.6430.1050.4920.206
4th Coeffient of GFCC0.3010.0240.2660.0330.3920.1090.4080.109
1st Coefficient of Chroma Vector0.3830.2350.7500.5010.6530.5322.1261.930
2nd Coefficient of Chroma Vector0.4160.2580.7730.5180.6630.5462.1301.948
3rd Coefficient of Chroma Vector0.4330.2690.8420.5750.7420.6722.1292.144
4th Coefficient of Chroma Vector0.6230.3781.2970.9420.7000.5682.0112.046
5th Coefficient of Chroma Vector0.5640.3371.2120.8720.6910.5342.0441.935
6th Coefficient of Chroma Vector0.5270.3201.2300.9870.7480.5632.2272.267
7th Coefficient of Chroma Vector0.4830.2961.0690.7600.7050.5242.1071.893
8th Coefficient of Chroma Vector0.4510.2790.9820.6960.6860.5282.0711.863
9th Coefficient of Chroma Vector0.4290.2680.9080.6380.6790.5292.0991.929
10th Coefficient of Chroma Vector0.4000.2510.8780.6090.6510.5372.0871.848
11th Coefficient of Chroma Vector0.3730.2320.7760.5370.6680.5222.1061.954
12th Coefficient of Chroma Vector0.3480.2250.7050.4740.6220.5222.0781.890
Enhanced Mean Absolute Value0.2970.0390.3020.0550.2940.0520.3010.049
Enhanced Wavelength236.4133.5413.7319.8284.3198.6665.7515.0
Wavelength36.8321.9485.5996.9454.4246.94193.47186.80
Slope Sign Change45.186.9508.4549.01039.51657.63463.12792.1
Average Amplitude Change0.0060.0030.0100.0090.0090.0090.0210.018
Difference Absolute Std. Dev.0.0090.0030.0130.0100.0140.0100.0270.021
Log Detector0.1080.0300.1180.0370.1070.0430.1170.036
Modified Mean Absolute Value0.1300.0250.1330.0340.1300.0330.1320.030
Modified Mean Absolute Value 20.0830.0220.0890.0260.0840.0270.0870.020
Pulse Percentage Rate0.9390.0290.9530.0270.9370.0450.9570.035
Simple Square Integral389.6209.7437.4207.6393.9207.4448.5222.5
Willison Amplitude1153.6765.62670.82603.91836.01707.64551.33502.4
Maximum Fractal Length−0.4630.384−0.1360.740−0.1740.5940.4281.121
Root Squared Zero Order Moment2.5920.0322.6000.0262.5930.0312.6010.028
Root Squared 2nd Order Moment2.0680.0662.0360.0621.9840.1151.9130.130
Root Squared 4th Order Moment2.0450.0772.0010.0781.8910.1591.7940.205
Sparseness0.5350.0640.5820.0860.6550.1370.7470.188
Irregularity Factor−0.4640.037−0.4450.047−0.4460.058−0.4060.061
Waveform Length Ratio−0.0650.703−0.3540.230−0.6480.604−0.7210.727
Complexity0.5020.2220.7060.2530.8970.5241.3140.515
Mobility0.0380.0110.0570.0380.0550.0350.1150.079
Higuchi’s Fractal Dimension1.0540.0521.1490.1191.1830.2401.4900.401
Katz Fractal Dimension1.0000.0001.0000.0001.0000.0001.0000.000
Lyapunov Exponent437.449.5394.849.3362.978.8239.192.2
Approximate Entropy0.1040.0490.1550.1340.1270.0800.3160.312
Correlation Dimension1.6870.1681.7330.1911.6760.2021.7310.300
1st Coefficient of LTP259.6128.8291.5122.1259.6128.8291.5122.1
2nd Coefficient of LTP40.79030.94356.80935.99840.79030.94356.80935.998
3rd Coefficient of LTP24.57423.71741.72332.08324.57423.71741.72332.083
4th Coefficient of LTP16.38115.76130.73822.66716.38115.76130.73822.667
5th Coefficient of LTP18.47215.75533.41124.88118.47215.75533.41124.881
6th Coefficient of LTP25.20521.60745.51833.71525.20521.60745.51833.715
7th Coefficient of LTP17.69916.35229.03522.60917.69916.35229.03522.609
8th Coefficient of LTP26.26123.07539.64528.38626.26123.07539.64528.386
9th Coefficient of LTP47.48332.84458.16335.13247.48332.84458.16335.132
10th Coefficient of LTP207.27894.769201.80972.537207.27894.769201.80972.537
11th Coefficient of LTP269.5137.7302.1121.7269.5137.70302.1121.7
12th Coefficient of LTP41.73331.35459.77338.09241.73331.35459.77338.092
13th Coefficient of LTP24.00623.09641.07131.29424.00623.09641.07131.294
14th Coefficient of LTP16.78415.90030.19923.47116.78415.90030.19923.471
15th Coefficient of LTP18.50615.14033.39025.58318.50615.14033.39025.583
16th Coefficient of LTP26.14822.75544.17731.68226.14822.75544.17731.682
17th Coefficient of LTP16.89816.55930.14921.91016.89816.55930.14921.910
18th Coefficient of LTP25.79022.89239.31226.78425.79022.89239.31226.784
19th Coefficient of LTP46.53433.04556.29833.64746.53433.04556.29833.647
20th Coefficient of LTP197.77385.090191.85874.487197.77385.090191.85874.487
Table 7. List of features extracted for method I and sorted with respect to mean rank (MR) value. Bold font indicates the top 24 ranked features.
Table 7. List of features extracted for method I and sorted with respect to mean rank (MR) value. Bold font indicates the top 24 ranked features.
FeatureTTKLDBDROCMWTMRMRRRFMR
3rd Coefficient of LTP12939383321029873.29
6th Coefficient of Chroma Vector3898767978944973.14
Lyapunov Exponent9392927081107072.57
Sparseness7090903062798472.14
Jitter7858587477857472.00
9th Coefficient of LTP5783837582922470.86
Spectral Decrease6228618575938369.57
4th Coefficient of MFCC9641419540729969.14
Irregularity35941005788901368.14
1st Coeffient of MFCC99232310037969767.86
Waveform Length Ratio13100949191006667.57
3rd Coeffient of MFCC7997978439551766.86
1st Coefficient of Chroma Vector7186869826485366.86
Spectral Roll-off8143796656598166.43
Spectral Crest6180602692717566.43
6th Coefficient of Chroma Vector10076993161455266.29
7th Coefficient of Chroma Vector379998196428665.57
Median Frequency8835787110278264.86
2nd Coefficient of Chroma Vector897968941815164.86
Spectral Centroid7679433986695864.29
Difference Absolute Std. Dev. Value8438378179636463.71
Shape Factor5953535472807363.43
Spectral Mean4377779646406162.86
Simple Square Integral472722289899262.86
3rd Coefficient of GFCC9529889434534562.57
4th Coefficient of Chroma Vector4096308785524762.43
Root Mean Square4556567743876961.86
Signal to Noise Distortion9769694774314261.29
9th Coefficient of Chroma Vector7212958827765761.00
Mean Absolute Deviation5848484976826260.43
Root Squared 2nd Order Moment918271859209160.29
Root Squared 4th Order Moment1017182824661460.00
10th Coefficient of Chroma Vector3695852873415659.14
12th Coefficient of Chroma Vector9089895910373858.86
1st Coefficient of GFCC698787616638358.71
Mean8026268080684457.71
Enhanced Mean Absolute Value2173736258546357.71
Root Sum of Squares5349529918587257.29
2nd Coefficient of LTP8270703536129557.14
Katz Fractal Dimension67336790759056.43
Table 8. List of features extracted for method II and sorted with respect to MR value. Bold font indicates the top 24 ranked features.
Table 8. List of features extracted for method II and sorted with respect to MR value. Bold font indicates the top 24 ranked features.
FeatureTTKLDBDROCMWTMRMRRRFMR
7th Coefficient of Chroma Vector10098982696975581.43
4th Coefficient of Chroma Vector8996767385529681.00
Mobility9364646768919577.43
Spectral Centroid6878787745988876.00
Enhanced Mean Absolute Value7195952898568775.71
9th Coefficient of LTP641011016493791974.43
1st Coefficient of GFCC9535979838995473.71
7th Coefficient of LTP57939357101248973.43
Slope Sign Change7473897989891372.29
Maximum Fractal Length372749190907070.00
3rd Coefficient of MFCC38972295102805369.57
6th Coefficient of Chroma Vector6980969027685669.43
8th Coefficient of Chroma Vector6199996966494069.00
Enhanced Wavelength8185856894165068.43
Pulse Percentage Rate4941004273767766.57
Root Squared Zero Order Moment6384843069724664.00
Crest Factor5168688072327263.29
Modified Mean Absolute Value 2429090788646263.29
Spectral Crest8758454047857562.43
2nd Coeffient of MFCC3769698477485162.14
1st Coeffient of MFCC7799100993810262.00
Average Frequency8029544754868261.71
4th Coefficient of LTP3465656542748461.29
Willison Amplitude9174721516778361.14
Spectral Spread994758856096860.86
3rd Coefficient of LTP659191565210160.00
Root Squared 4th Order Moment7066711497623659.43
Lyapunov Exponent3263638628110058.43
2nd Coeffient of GFCC4087878940501458.14
3rd Coeffient of GFCC7322106639969957.86
Correlation Dimension1470701015719157.71
Root Squared 2nd Order Moment10156636311006157.14
5th Coefficient of Chroma Vector207680974368657.00
2nd Coefficient of Chroma Vector754141887873256.86
11th Coefficient of Chroma Vector6639397495473756.71
Log Energy4575525319886056.00
10th Coefficient of Chroma Vector90383819100693856.00
5th Coefficient of LTP882828351013055.86
1st Coefficient of LTP1192922483573155.71
Table 9. Consolidated result analysis of feature sets ( S 1 , S 2 , S 3 ) for method I with various classifiers.
Table 9. Consolidated result analysis of feature sets ( S 1 , S 2 , S 3 ) for method I with various classifiers.
Classifier  S 1 (5 Components) S 2 (7 Components) S 3 (10 Components)
AccSpSenErrAccSpSenErrAccSpSenErr
DT0.8740.890.890.1260.9240.920.930.0760.9340.940.930.066
LD0.6880.310.3120.6880.310.3120.6690.30.970.331
LR0.6880.310.3120.6880.310.3120.6880.310.312
NBG0.6880.310.3120.6880.310.3120.5910.260.41
NBK0.8040.910.720.1960.830.920.760.170.8930.910.880.107
SVM-L0.4790.560.410.5210.5870.110.970.4130.4980.370.60.502
SVM-Q0.5270.650.430.4730.5270.080.890.4730.5460.410.650.454
SVM-C0.470.40.530.530.5550.030.980.4450.52400.940.476
SVM-FG0.6880.310.3120.6880.3010.3120.6880.310.312
SVM-MG0.6880.310.3120.6880.3010.3120.6880.310.312
KNN-F0.9370.90.970.0630.9720.960.980.0280.9840.970.990.016
KNN-M0.7920.680.880.2080.8640.810.910.1360.9050.860.940.095
KNN-Cos0.6850.30.990.3150.6810.30.990.3190.6850.30.990.315
KNN-C0.6720.680.660.3280.8710.830.90.1290.8960.840.940.104
KNN-W0.9210.880.950.0790.9650.960.970.0350.9780.970.980.022
Eboost0.9180.890.940.0820.8640.740.960.1360.555010.445
EBT0.6880.310.3120.9720.950.990.0280.9430.930.950.057
ESD0.940.920.950.060.6880.310.3120.6810.30.990.319
ESKNN0.9150.910.910.0850.9840.980.990.0160.9810.970.990.019
Table 10. Consolidated result analysis of feature sets ( S 4 , S 5 , S 6 ) for method I with various classifiers. Bold font indicates best results.
Table 10. Consolidated result analysis of feature sets ( S 4 , S 5 , S 6 ) for method I with various classifiers. Bold font indicates best results.
Classifier  S 4 (12 Components) S 5 (15 Components) S 6 (17 Components)
AccSpSenErrAccSpSenErrAccSpSenErr
DT0.9590.960.960.0410.9590.940.980.0410.9720.96980.028
LD0.5810.30.990.4190.6620.30.950.3380.6910.320.990.309
LR0.6880.310.3120.6880.310.3120.6750.350.940.325
NBG0.5910.260.410.62510.320.3750.63110.340.369
NBK0.8680.850.890.1320.8770.920.840.1230.880.910.950.12
SVM-L0.5240.260.740.4760.5430.120.880.4570.5360.070.910.464
SVM-Q0.5520.220.820.4480.5360.330.70.4640.5460.020.970.454
SVM-C0.5240.80.880.4760.51700.930.4830.5140.050.890.486
SVM-FG0.6880.310.3120.6880.310.3120.70.3310.3
SVM-MG0.6650.310.3350.6880.310.3120.6940.3110.306
KNN-F0.9810.970.990.0190.9750.970.980.0250.9150.880.940.085
KNN-M0.9180.850.970.0820.9120.850.960.0880.6590.70.630.341
KNN-Cos0.6880.310.3120.6880.310.3120.6850.30.990.315
KNN-C0.9050.850.950.0950.9090.850.950.0910.9090.890.930.091
KNN-W0.9810.980.980.0190.9750.970.980.0250.9780.970.980.022
Eboost0.555010.4450.555010.4450.555010.445
EBT0.9650.940.980.0350.9720.970.970.0280.940.920.950.06
ESD0.6880.310.3120.6660.30.960.3340.6810.30.990.319
ESKNN0.9840.970.990.0160.9750.970.980.0250.9810.990.990.019
Table 11. Validation of the selected scheme of method I.
Table 11. Validation of the selected scheme of method I.
EvaluationClassesAccuracyTrue Positive RateFalse Negative Rate
5-Fold Cross-Validation Healthy0.9830.980.02
Hypertension0.990.01
10-Fold Cross-Validation Healthy0.9840.980.02
Hypertension0.990.01
15-Fold Cross-Validation Healthy0.9840.980.02
Hypertension0.990.01
20-Fold Cross-Validation Healthy0.9840.980.02
Hypertension0.990.01
20% Hold Out Validation Healthy0.97810
Hypertension0.940.06
25% Hold Out Validation Healthy0.9890.980.02
Hypertension10
Table 12. Feature analysis table ( S 1 , S 2 , S 3 ) for method II. Bold font indicates the best results.
Table 12. Feature analysis table ( S 1 , S 2 , S 3 ) for method II. Bold font indicates the best results.
Classifier  S 1 (5 Components) S 2 (7 Components) S 3 (10 Components)
AccSpSenErrAccSpSenErrAccSpSenErr
DT0.9740.980.970.0260.9830.980.990.0170.9890.980.990.011
LD0.6190.2410.3810.6190.2410.3810.5680.240.90.432
LR0.6790.970.390.3210.6790.970.390.3210.6790.970.390.321
NBG0.6190.2410.3810.6190.2410.3810.26810.260.732
NBK0.9460.970.920.0540.9460.980.910.0540.940.970.910.06
SVM-L0.480.490.470.520.4970.590.410.5030.5230.520.530.477
SVM-Q0.510.530.50.490.5110.240.780.4890.4970.460.530.503
SVM-C0.490.30.690.510.4910.220.770.5090.4910.230.760.509
SVM-FG0.66810.340.3320.66210.320.3380.6650.990.340.335
SVM-MG0.6190.2410.3810.6140.510.720.3860.5970.730.460.403
KNN-F0.990.9900.010.8930.980.9840.1070.9910.990.990.009
KNN-M0.9570.930.980.0430.9690.950.990.0310.9720.950.990.028
KNN-Cos0.6310.270.990.3690.6390.30.980.3610.6360.30.980.364
KNN-C0.9570.930.980.0430.9660.940.990.0340.9690.940.990.031
KNN-W0.9940.9920.9960.0060.9860.940.990.0140.9920.990.990.008
Eboost0.4890.390.590.5110.4890.390.590.5110.4890.390.590.511
EBT0.980.970.990.020.9860.980.990.0140.9860.980.990.014
ESD0.6190.2410.3810.6190.2410.3810.5710.240.90.429
ESKNN0.9910.990.990.0090.9830.990.980.0170.9910.990.990.009
Table 13. Feature analysis table ( S 4 , S 5 , S 6 ) for method II.
Table 13. Feature analysis table ( S 4 , S 5 , S 6 ) for method II.
Classifier  S 4 (12 Components) S 5 (15 Components) S 6 (17 Components)
AccSpSenErrAccSpSenErrAccSpSenErr
DT0.9920.990.990.0080.9720.950.990.0280.9830.980.980.017
LD0.5480.320.780.4520.5650.340.80.4350.66510.330.335
LR0.6790.970.390.3210.6790.970.390.3210.6760.970.380.324
NBG0.6360.970.390.3640.66210.320.3380.66510.330.335
NBK0.920.960.880.080.9260.970.890.0740.9090.950.870.091
SVM-L0.5310.270.790.4690.4860.230.740.5140.50.240.760.5
SVM-Q0.5030.190.820.4970.4690.150.790.5310.5140.150.880.486
SVM-C0.47200.940.5280.4720.10.850.5280.48600.970.514
SVM-F0.66210.320.3380.66210.320.3380.69610.390.304
SVM-MG0.66510.330.3350.66210.320.3380.69610.390.304
KNN-F0.9910.990.990.0090.9830.980.980.0170.9890.980.990.011
KNN-M0.9490.940.960.0510.960.940.980.040.940.970.910.06
KNN-Cos0.6390.2810.3610.6280.280.970.3720.6450.30.990.355
KNN-C0.9460.940.950.0540.9630.940.980.0370.940.980.90.06
KNN-W0.9910.990.990.0090.9860.990.980.0140.9930.990.990.007
Eboost0.4890.390.590.5110.5340.480.590.4660.4890.390.590.511
EBT0.9890.980.990.0110.9660.970.960.0340.9860.990.980.014
ESD0.5770.260.890.4230.5630.390.730.4370.66510.330.335
ESKNN0.9910.990.990.0090.9830.980.980.0170.9910.990.990.009
Table 14. Validation of the selected scheme of method II.
Table 14. Validation of the selected scheme of method II.
EvaluationClassesAccuracyTrue Positive RateFalse Negative Rate
5 Fold Cross-Validation Healthy0.9860.990.01
Hypertension0.980.02
10 Fold Cross-Validation Healthy0.9940.990.01
Hypertension>0.99<0.01
15 Fold Cross-Validation Healthy0.9940.990.01
Hypertension>0.99<0.01
20 Fold Cross-Validation Healthy0.9970.990.01
Hypertension10
20% Hold Out Validation Healthy0.98610
Hypertension0.970.03
25% Hold Out Validation Healthy0.9890.980.02
Hypertension10
Table 15. Performance comparison of methods I and II.
Table 15. Performance comparison of methods I and II.
PerformanceMethod IMethod II
Accuracy98.40%99.40%
Sensitivity97.00%99.20%
Specificity99.00%99.60%
Error0.02%0.60%
# of features125
Table 16. Comparison with previous works.
Table 16. Comparison with previous works.
Ref.ModalityPreprocessingFeaturesFeature ReductionClassificationData SetResults
[12]PPGCWTGoogLeNet-GoogLeNetMIMICF1 score: 92.55%
[13]PPG and ECG-PAT and morphological features-KNNMIMICF1 score: 94.84%
[14]HRV-Standard deviation of NN intervals-MILSelf-collected data set Hypertension 24 and Normal: 19Accuracy: 85.47%
[15]ECGSGFEntropy features-SVMSelf-collected data set Hypertension: 61 and Normal: 67Accuracy: 93.33%
[16]HRV-Statistical, spectral, geometrical, wavelet, fractal, and non-linear featuresPCAQDASelf-collected data set Hypertension: 41 Normal: 30Accuracy: 85.5%
[17]ECGOWFBFractal dimension and energy featuresStudent’s t-testDiagnosis indexPhysioNet database High-risk Hypertension: 17 subjects
Low-risk Hypertension: 122 subjects
Total: 139 subjects
100% between low-risk and high-risk classes
[18]ECGEMDEntropy featuresStudent’s t-testKNN classifierMIT BIH Sinus rhythm database, SHAREE
database: Normal: 18 signals
Hypertension: 139 signals
Accuracy: 97.70%
Sensitivity: 98.90%
Specificity: 89.10%
[19]PPGChebyshev IITime and morphological featuresMRMRKNN-WHypertension: 35 Normal: 48 Total: 83Positive Predictive Value: 100% Sensitivity: 85.71% F1-score: 92.31%
[20]BCG Morphological features-CARSelf-collected data set Hypertension: 61 and Normal: 67Accuracy: 84.4%
This studyPuPGEMDTime, frequency, cepstral, fractal, and chaotic featuresHFSRKNN-WSelf-collected data set Hypertension: 56 Normal: 65Accuracy: 99.7%
Sensitivity: 99.2%
Specificity: 99.4%
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Khan, M.U.; Aziz, S.; Akram, T.; Amjad, F.; Iqtidar, K.; Nam, Y.; Khan, M.A. Expert Hypertension Detection System Featuring Pulse Plethysmograph Signals and Hybrid Feature Selection and Reduction Scheme. Sensors 2021, 21, 247. https://doi.org/10.3390/s21010247

AMA Style

Khan MU, Aziz S, Akram T, Amjad F, Iqtidar K, Nam Y, Khan MA. Expert Hypertension Detection System Featuring Pulse Plethysmograph Signals and Hybrid Feature Selection and Reduction Scheme. Sensors. 2021; 21(1):247. https://doi.org/10.3390/s21010247

Chicago/Turabian Style

Khan, Muhammad Umar, Sumair Aziz, Tallha Akram, Fatima Amjad, Khushbakht Iqtidar, Yunyoung Nam, and Muhammad Attique Khan. 2021. "Expert Hypertension Detection System Featuring Pulse Plethysmograph Signals and Hybrid Feature Selection and Reduction Scheme" Sensors 21, no. 1: 247. https://doi.org/10.3390/s21010247

APA Style

Khan, M. U., Aziz, S., Akram, T., Amjad, F., Iqtidar, K., Nam, Y., & Khan, M. A. (2021). Expert Hypertension Detection System Featuring Pulse Plethysmograph Signals and Hybrid Feature Selection and Reduction Scheme. Sensors, 21(1), 247. https://doi.org/10.3390/s21010247

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop