Dual Classification Approach for the Rapid Discrimination of Metabolic Syndrome by FTIR

Tkachenko, Kateryna; Esteban-Díez, Isabel; González-Sáiz, José M.; Pérez-Matute, Patricia; Pizarro, Consuelo

doi:10.3390/bios13010015

Open AccessArticle

Dual Classification Approach for the Rapid Discrimination of Metabolic Syndrome by FTIR

by

Kateryna Tkachenko

¹,

Isabel Esteban-Díez

¹,

José M. González-Sáiz

¹,

Patricia Pérez-Matute

²

and

Consuelo Pizarro

^1,*

¹

Department of Chemistry, University of La Rioja, 26006 Logroño, Spain

²

Infectious Diseases, Microbiota and Metabolism Unit, Infectious Diseases Department, Center for Biomedical Research of La Rioja (CIBIR), 26006 Logroño, Spain

^*

Author to whom correspondence should be addressed.

Biosensors 2023, 13(1), 15; https://doi.org/10.3390/bios13010015

Submission received: 4 November 2022 / Revised: 12 December 2022 / Accepted: 21 December 2022 / Published: 23 December 2022

(This article belongs to the Special Issue Biosensing and Diagnosis)

Download

Browse Figures

Versions Notes

Abstract

:

Metabolic syndrome is a complex of interrelated risk factors for cardiovascular disease and diabetes. Thus, new point-of-care diagnostic tools are essential for unambiguously distinguishing MetS patients, providing results in rapid time. Herein, we evaluated the potential of Fourier transform infrared spectroscopy combined with chemometric tools to detect spectra markers indicative of metabolic syndrome. Around 105 plasma samples were collected and divided into two groups according to the presence of at least three of the five clinical parameters used for MetS diagnosis. A dual classification approach was studied based on selecting the most important spectral variable and classification methods, linear discriminant analysis (LDA) and SIMCA class modelling, respectively. The same classification methods were applied to measured clinical parameters at our disposal. Thus, the classification’s performance on reduced spectra fingerprints and measured clinical parameters were compared. Both approaches achieved excellent discrimination results among groups, providing almost 100% accuracy. Nevertheless, SIMCA class modelling showed higher classification performance between MetS and no MetS for IR-reduced variables compared to clinical variables. We finally discuss the potential of this method to be used as a supportive diagnostic or screening tool in clinical routines.

Keywords:

metabolic syndrome; infrared spectroscopy; point of care; metabolic signatures; chemometrics; classification strategy; health and wellbeing monitoring

1. Introduction

The high prevalence of non-communicable diseases (NCD) in adults is reflected in increased costs for public health systems worldwide [1]. Among these NCD, metabolic syndrome (MetS) plays a significant role. MetS is often associated with an increased risk of diabetes and cardiovascular disease, resulting in increased incidence of morbidity and mortality and reduced quality of life [2,3,4,5,6]. Thus, the commensurate prevalence of metabolic syndrome burdens national health expenditure, representing a significant socio-economic problem, particularly in low- and middle-income countries [7,8,9,10]. However, MetS is a multifactorial disorder accompanied by conflicting opinions on its definition [11,12,13]. In particular, many different definitions have been proposed to describe MetS in adults. The main discrepancies were associated with inclusion and exclusion criteria adopted according to the World Health Organization (WHO), National Cholesterol Education Program (NCEP), Adult Treatment Panel III (ATPIII), and International Diabetes Federation (IDF). Finally, in 2009, the definition for metabolic syndrome was harmonised [14]: MetS is a disease formed by metabolic and vascular abnormalities, namely insulin resistance (IR), visceral adiposity, atherogenic dyslipidaemia, and oxidative and endothelial dysfunction. These risk factors easily predispose hyperglycaemia and hypertension, atherosclerotic vascular diseases and viral infection [15,16,17,18].

Given the complex and intertwined nature of MetS, it would be utopian to think that a single biomarker could define it unambiguously. Thus, parameters concerned around central obesity (waist circumference (WC)), hypertension (blood pressure), atherogenic dyslipidaemia (small low-density lipoprotein (LDL) and levels of high-density lipoprotein (HDL) cholesterol), and insulin resistance (fasting glucose levels) are usually measured to evaluate MetS diagnosis [19]. Due to the heterogeneity of these factors, people affected by metabolic syndrome are three times more likely to suffer acute myocardial infarction, cerebrovascular events, diabetes, or stroke. In addition, they have higher mortality rates [20]. Besides the economic impact, misdiagnosis or tardive diagnosis could lead not only to inefficient treatment outcomes but even to significant dysfunctions such as cancer [21,22]. Thus, early and proper diagnosis plays a crucial role in delaying the pathology’s onset or progression as much as possible and improving a patient’s condition.

Today, MetS diagnosis is based on several steps such as measuring metabolic markers of insulin resistance and other indices of metabolic syndrome (triglycerides, HDL cholesterol levels, and blood glucose) that are obtainable from routine clinical biochemistry laboratories, whereas blood pressure is measured in primary care [23]. The collection and analysis of samples also entails a waiting time for laboratory results and additional time for a new medical consultation. Although the proposed definition of MetS shares some common features, the clinical diagnosis lacks standardisation. On that basis, it was proposed that individuals showing a combination of any three out of these five simple clinical criteria were likely to be characterised by insulin resistance. Prospective analyses have also shown that any combination of these factors was predictive of an increased risk of both type 2 diabetes and cardiovascular disease. First, it is still challenging to identify a unified criteria for MetS applicable across all ethnicities. In addition, the contribution of each parameter seems to have different importance based on the evaluation adopted in each clinical environment (e.g., diagnosis focussed on glucose tolerance instead of obesity cut-offs). Moreover, there is variation in the cut-off values of diagnostic inclusion criteria (≥140/90 mmHg according to WHO vs. ≥130/85 mmHg according to ATP III for blood pressure). The application of MetS diagnosis in clinical practice could also be compromised, since most patient registries have missing data, limiting a study’s accuracy or leading to false-positive results. In addition, measurements such as WC, one of the predominant parameters for defining MetS, are not always feasible in patients because the diagnosis can often be limited by the patient’s inability to perform a complete physical examination.

Given these perspectives, the need for standardised clinical diagnostic tools and protocols becomes imperative in the prevention and diagnosis of MetS. For this reason, analysing global metabolic profiles instead of disparate clinical measurements could be essential in shedding light on MetS disarrangements. A multifactorial and complex pathology such as MetS seems to require an approach from a holistic functional perspective, so an analysis of metabolic profiles reflecting the global clinical status of a patient could represent a suitable alternative.

By now, metabolomics plays a key role as a powerful analytical tool that has been widely applied to investigate plenty of disorders and disarrangements [24,25,26]. Metabolomics analysis has the potential to discover biomarkers and allow for the detection of a wide range of metabolites. In recent years, there has been a great interest in extracting biomarkers from biofluids and, considering that blood is a biofluid containing numerous valuable metabolic information, it seems that it in particular, it appropriately reflects metabolic changes and disarrangements during disease initiation or progression [27,28]. In this context, techniques based on vibrational spectroscopy are particularly suitable as sample preparation is simple, non-invasive, rapid, and low-cost [29]. Therefore, the Fourier transformed infrared spectroscopy (FTIR) technique has been established as a reliable analytical tool in metabolomic-based studies [30,31,32,33,34]. Moreover, another significant advantage resides in the fact that FTIR is ideally suitable for acquose matrices such as blood [35,36]; the instrument requires the collection of only one blood sample, with little or almost null pre-treatment. In this study, we proposed an FTIR-based method that investigates many components at a time, which are registered as spectral signatures. The development of a chemometric strategy capable of extrapolating the most significant infrared (IR) signatures plays a crucial role in this study, since each spectrum is unique for every patient and reflects their metabolic status. Non-targeted metabolomic studies, such as the one presented here, aim to extract the metabolic signatures instead of individual biomarkers with limited potential, and this permits the classification of patients according to their molecular patterns, reflecting clinical/pathological conditions such as MetS or no MetS.

This method could greatly support clinicians, capturing the complexity of the MetS metabolic profile when the clinical indicators are missing or lacking sufficient discriminative power, revealing the globality of physiological disturbances. We do not want to underestimate the importance of clinical diagnosis at any time. Still, our main aim is to propose an alternative analytical strategy that could be of great diagnostic relevance and support, limiting the time and cost of clinical measurements.

2. Materials and Methods

2.1. Study Population

A total of 105 plasma samples from anonymous donors were recruited from Infectious Disease Area, Center for Biomedical Research of La Rioja (Logroño, Spain). This study was approved by the Committee for Ethics in Drug Research in La Rioja (CEImLAR) (23 April 2013, reference number 121) and a written informed consent was achieved from all participants. The patients were evaluated by the NCEP-ATP-III scale and, if eligible, were assigned to a metabolic syndrome category. MetS was defined as the concomitant presence of at least three of the following risk factors: elevated TGL (≥150 mg/dL), low concentrations of the fraction HDL cholesterol (<50 levels mg/dL in women or <40 mg/dL levels in men), increased WC (≥88 cm in women or ≥102 cm in men), elevated blood pressure (>130/85 mmHg), and elevated fasting glucose (>110 mg/dL or diabetes) [37]. Thus, the patients were divided into two groups by the criteria of MetS: 19 patients tested as MetS positive and 86 as MetS negative. The patients enrolled in this study were also characterised by the presence of viral load through serological evidence of HIV or co-infection of HIV/HCV. A correct distribution between patients with and without infection in both categories has been ensured to not introduce bias in future models developed for diagnosing MetS.

2.2. Sample Collection

Once drawn, the venous blood samples were centrifuged at 2200× g for 15 min at 4 °C and the obtained plasma were transferred into a clean Eppendorf tube. Aliquots of 200 μL of each sample were stored at −80 °C until the day of the analysis. Before FTIR measurements, plasma samples were defrosted during the night according to the optimised ultrasound-based protocol for lipidomic analyses developed in our research group [38].

2.3. Method

FTIR spectroscopy measurements were performed by a Spectrum-One ABB Miracle Type MB3000 FT-IR Spectrophotometer using a PerkinElmer liquid cell (Omni Cell, Specac Ltd., Orpington, UK) with CaF₂ windows separated with a 50 μm Mylar spacer. The spectra from 25 μL of each plasma sample were recorded in the mid-IR region (4000–300 cm⁻¹) in triplicate. A mean spectrum was subsequently obtained from the replicates recorded for each plasma sample. The sample temperature was maintained at 23.0 ± 1.0 °C, and a constant N₂ purge was applied for atmospheric water vapour and CO₂ suppression. A resolution of 2 cm⁻¹ was obtained using 32 scans. In order to monitor the stability and reproducibility of the analytical system, quality control (QC) samples were processed similarly to the actual samples and inserted regularly. In addition, the instrument performance was verified at the beginning of each day of data collection using PE-specific reference standards.

2.4. Data Analysis

After data acquisition, the processing and computational analysis of raw metabolic data was performed using Unscrambler (version X 11.0, Camo ASA, Oslo, Norway), V-Parvus (version PARVUS2011, Michele Forina, Genoa, Italy), and Matlab (MATLAB 9.4 R2018a). Two different regions of the mid-IR spectrum were analysed: the first region examined was the biochemical “fingerprint region” at 1500–1050 cm⁻¹, and the second was a higher region at 2950–2700 cm⁻¹. Remaining wavenumber ranges, as they were affected by signal saturation effects caused mainly by strong water absorptions or noise, were removed, and not considered for further analysis. Given the high dimensionality of biological spectral data, many disturbing factors influence the spectral data acquisition, such as random noise, baseline distortions, or light scattering. Thus, the pre-processing step is imperative in analysis to reduce these factors. To compensate for instrumental artefacts and sample to sample variations, different pre-processing methods were evaluated individually or in combination to minimise the adulterant-unrelated variability, namely derivatives (e.g., Savitzky–Golay (S–G) first and second derivatives), standard normal variate (SNV), and extended multiplicative scatter correction (EMSC). Thus, better resolution of overlapping peaks and decreased scatter effects were ensured after applying the combination (S–G) smoothing and SNV.

The entire data set was split into two independent subsets to develop and validate the classifications proposed: a training set with 95 samples (used to optimise and develop the classification rules and models) and a test set with ten samples (never used in the construction of the classification but to evaluate their actual predictive ability). The test set used was the same for all methods applied and classifications developed. As a result, the smoothed and normalised output tables were always centred before additional multivariate analysis and classification algorithms.

3. Results and Discussion

After careful pre-processing, FTIR measurements were submitted for further multivariate analysis. Thus, five measured clinical variables and a total of 838 spectra variables over the wavelength ranges of 1583–1050 cm⁻¹ and 2973–2700 cm⁻¹ collected from 105 patients were included. The two main categories of this study were patients with and without metabolic syndrome, i.e., MetS and no MetS, respectively.

3.1. Descriptive Statistics

Herein, an analysis was performed based on the distribution of five clinical parameters. It should be noted that one of the most critical clinical measurements, waist circumference, was not included in this study because most patients had missing data in the clinical register. Therefore, only parameters that were available for all patients have been used for the further comparative classification step. Thus, the descriptive statistics were calculated to analyse the distribution of clinical data in a box and whisker plot (Figure 1). The plot shows that TGL values seem to have more influence and variability between the two categories of patients; indeed, MetS patients have significantly higher values ranging from a minimum of 33 to 338 (mg/dL). The general distribution trend indicates that MetS patients also have slightly higher diastolic and systolic blood pressure values and glucose levels, whereas HDL values are lower, ranging from 25 to 95 (mg/dL). Table 1 shows the ranges of the collected values with the respective medians between the two categories.

3.2. Exploratory Analysis with PCA

An unsupervised pattern recognition method based on principal component analysis (PCA) was performed for the initial data overview and to investigate any possible clustering of samples based on five collected clinical parameters and 838 spectral variables, respectively.

The PCA score plot of clinical parameters, with 50.46% of explained variance by PC1, displays evident clustering according to known categories, delimitated by the parallel to the bisector of the second quadrant (Figure 2). Whereas PCA performed on pre-treated IR spectra accounted for 83.12% of explained variability on the PC1, evidenced by very subtle clustering between known categories (Figure 3).

In both cases, the first PCs explained most of the data’s variability. The distribution of samples in principal component space suggests that it only seems possible to address subsequent, direct discrimination in the case of analysis of clinical parameters. Thus, parameters such as TGL and GLU majorly contributed to the segregation of no MetS from MetS and the values of HDL contributed to the separation of MetS from no MetS, as was shown in preliminary analysis by descriptive statistics. No evident clustering among the two main categories was observed performing PCA on spectral variables; only a few outliers were determined and excluded from further analysis. The high degree of overlapping features among the two classes was expected, as most blood components are common in all individuals. This also indicates the need to perform a selection of relevant spectral variables, closely related to clinicopathological parameters of prognostic importance in MetS. Therefore, other chemometric strategies were used to investigate and highlight metabolomic differences in metabolic syndrome using IR spectra.

3.3. Supervised Techniques

The selection of variables in tandem with classification methods to extract reduced IR fingerprints that reflect the metabolic profiles of patients for a potential MetS diagnosis was studied. Therefore, a dual approach was applied based on a classification method on the one hand and a class modelling method on the other.

For its part, discriminant techniques focus on the differences between samples belonging to different categories, dividing the multidimensional space into as many subregions as the number of the considered classes. As a result of this work principle, every tested sample would always be assigned to one of the predefined categories, even in the case where an analysed sample truly belongs to a class not considered in the study. Regarding the above, it makes good sense to evaluate the application of a discriminant classification strategy in a two-class (binary) classification problem such as the one addressed in this paper. In particular, linear discriminant analysis (LDA), the most widely used classification algorithm, was used.

On the other hand, in contrast to class discrimination, class modelling approaches exploit similarities among inter-category samples to construct an individual model for every class independently from the others. Consequently, the developed class models may not entirely cover the original multivariate space. This fact opens the door to different assignment scenarios depending on whether a sample falls clearly into a single class region (so that it is assigned to that) or if it falls in overlapping regions (leading to a confusing classification in multiple classes), and, finally, when a sample falls outside every class model constructed (predicted as member of none of the considered categories). Therefore, due to their specific properties, modelling techniques, such as soft independent modelling by class analogy (SIMCA), are suitable for classification problems in which the emphasis is placed on a particular class of interest, as may be the case here with the MetS category.

3.3.1. SELECT

Considering that IR data presents high dimensionality, eliminating the futile features due to noise and identifying the relevant and important variables to be applied in the following classification steps was imperative. Thus, the stepwise orthogonalization of predictors (SELECT) algorithm [39,40] was prioritised among other variable selection techniques since it enabled us to optimise discrimination by simultaneously performing feature selection and classification. Moreover, thanks to its stepwise decorrelation procedure, SELECT also avoids the presence of redundant information in the subset of selected significant predictors. In addition, it has previously demonstrated its accurate prediction ability in selecting the most important variable for the discrimination of pathological status [41,42]. Thus, SELECT was applied to extract the most significant wavenumbers from the IR dataset, providing input features for a further dual-classification approach. Based on the commonly established rule, the number of training objects selected was always at least three times greater than the number of finally selected wavenumbers. An in-depth study of the literature is encouraged to understand the algorithm’s rules [43].

3.3.2. LDA on Clinical Parameters

LDA is a well-known and extensively applied powerful supervised chemometric classification technique [44]. Based on LDA classification rules, the objects are always classified in one of the predefined classes.

LDA of five clinical parameters, built by leave one out (LOO) cross-validation, was performed to evaluate the feasibility of this classification methodology to differentiate between MetS and no MetS patients. Excellent discrimination among categories was achieved, providing a 100% level of correctly classified samples for no MetS subjects and patients with metabolic syndrome, respectively. Satisfactory external prediction performances ranging from 98.73% to 100% were achieved for both categories (within one no MetS subject classified as MetS), respectively (Table 2). Furthermore, a clear interclass separation achieved between these main categories can also be visually appreciated in the corresponding discriminative histogram (Figure 4). This classification performance was almost predictable since the PCA results already showed a clear clustering between the two groups.

The object belonging to the category MetS which was classified as no MetS was characterised by the following clinical parameters: 213 mg/mL of TGL, 76 mg/mL of HDL, 139 mmHg of SP, 83 mmHg of DP, and 102 mg/mL of GLU. As we can see, two out of five parameters have increased values, and the DP parameter is very close to the cut-off value, which is 85 mmHg based on the NCEP-ATP-III scale. Thus, this patient might instead be classified as MetS positive, presenting almost three out of five clinical parameters with augmented values. In addition, as we said above, the TGL parameter has a major contribution, among other parameters, to MetS classification. Thus, the plausible explanation could be that this subject, who has greater values of TGL, is more likely to be classified as MetS by LDA rather than no MetS. However, as we highlighted before, the eligibility criteria can be very insidious and create confusion and misassignment, worsening and delaying the patients’ well-being.

3.3.3. SELECT-LDA on IR Wavenumbers

Likewise, LDA on the IR dataset, containing 838 wavenumbers, was also performed. Before LDA analysis, as explained above, SELECT was applied to extract those predictor variables correlated with the discrimination between categories here considered. Therefore, based on the SELECT rules, 20 selected spectra variables were decorrelated from other signals and used for LDA. The 20 selected features showed an outstanding classification performance and the results were higher in performance than LDA results on clinical parameters, achieving 100% in classification and external prediction, respectively. The results of the SELECT LDA performance are displayed in Table 3. The suitability of the classification strategy applied to reduced IR plasma signatures can be visually appreciated in Figure 5. A discriminative histogram shows a clear group separation on the first canonical variable.

3.3.4. SIMCA

In an attempt to go one step further in this classification strategy, it was decided to build optimised class models based on clinical parameters and the subset of reduced IR signatures selected by SELECT. SIMCA often outperforms other classification methods, where a new sample will always be classified in one of the predefined categories. Classification methods such as LDA are based on the development of classification rules and delimiters between classes, whereas in class models, significance limits are built for the specified classes. These limits define the membership parameters for each class; thus, an unknown sample can be classified as not belonging to any defined categories because it is not included in any of its class spaces. SIMCA class modelling uses the number of true/false positives and negatives and statistics, showing the ability of a classification model to recognise class members (sensitivity or true positive rate) and showing how good the model is for identifying strangers (specificity or true negative rate). Moreover, SIMCA class modelling is often used to describe the class structure of the data set, requiring little or no prior assumptions to build the model.

On applying SIMCA, independent PCA modelling is performed for each class; each sample is fitted in a PCA model to check the separation between classes [45]. This model uses the optimal number of principal components that best describes and groups an individual class. This model can then be used to classify new samples whose class is unknown. The principal components are obtained usually using the NIPALS (non-iterative partial least squares) algorithm after separate autoscaling of the data. Finally, the models built for the different classes are compared by studying their differences and analogies [46]. Each class is modelled independently; thus, it is sensitive to the quality of the data used to generate the principal component models for each class in the training set (at a 5% significance level).

SIMCA on Clinical Parameters

Herein, SIMCA modelling was performed on five clinical parameters (Table 4). A class modelling of five clinical parameters of MetS was built using 4PCs for the inner space of classes, achieving satisfactory results in both internal prediction (LOO) and external prediction 98.95%. SIMCA builds a mathematical model of the category with its principal components and a sample is accepted by the specific category if its distance to the model is not significantly different from the class residual standard deviation. The results of SIMCA modelling can be visually appreciated by a Cooman’s Plot, representing the samples’ distances against each of the two models. The Cooman’s plots were built considering a 95% confidence level to define the class space and the unweighted augmented distance. This diagram is an effective visual representation that directly indicates the quality of the model constructed with the magnitude of the distance between categories. Thus, the distances to the principal component models and SIMCA approximation in a two-class problem for the class of MetS and no MetS are plotted in Figure 6. No clear outliers were observed, but several samples that fall into the joint space of both categories belong mainly to the MetS category. This relatively large number of samples plotted in the class-space common (overlapping) to the two models representing MetS and no MetS patients, as well as the considerable amount of no MetS samples located near their class boundary, suggest potential specificity problems associated with this classification approach based on clinical parameters. Therefore, the distribution of some samples from the MetS category in the area of relative indecision (small left quadrant) could be due to the unequivocal diagnostic parameters defining metabolic syndrome. In fact, these patients have three out of five altered parameters not necessarily similar. In addition, some parameters may be much less marked than others, confounding the decision about their location inside the model.

The data modelling power (MP) and discriminatory power (DP) of the SIMCA class modelling of clinical parameters are presented in Table 4. The MP describes how well a variable helps each principal component to model variation in the data, and discriminatory power (DP) describes how well a variable helps each principal component model to classify samples in a training set. The first detail that can be noticed is that, comparably, the MP in no MetS is consistently higher for all parameter pairs. This was expected as the distribution of the values of clinical parameters for each class of patients was significantly different. Nevertheless, the values of TGL have the highest modelling power in both MetS and no MetS categories, with values of 0.94 and 0.96, respectively. This ability of TGL to discriminate between the two groups is justified by previous studies, as metabolic syndrome patients should have significantly higher TGL values. This difference in modelling power is especially remarkable by the measured glucose (0.97 vs. 0.84) and HDL (0.94 vs. 0.79). In addition, clinical parameters such as glucose and HDL also showed significant discriminant power, with values of 2.63 and 2.58, respectively. These two parameters are also perfectly in line with the data collected from our patients. The MetS group is characterised by high glucose and low HDL values. These same parameters are often responsible for the presence or future development of comorbidities in patients such as diabetes, cardiac disease, and obesity. Other clinical parameters seem to contribute less to the principal component models; indeed, no significant difference was observed in the values distribution of SP or DP between the two categories.

SELECT-SIMCA on IR Wavenumbers

The best recognition ability (percentage of the samples in training set correctly classified during the modelling step) afforded by SIMCA was achieved by only ten of 20 previously selected wavenumbers by SELECT, providing 98.94% in classification and 95.79% in external prediction, respectively. Interestingly, eight out of ten selected wavenumbers belong to the ‘’fingerprint region’’, which reflects the production of characteristic perturbations in the metabolome and other such variations. The absorption pattern in this area is highly complex; that same inherent complexity makes it unique for each sample and reflects its pathophysiological status. Thus, eight of the selected IR spectral wavenumbers may reflect the current status of the organism and could be directly correlated with the presence or absence of the disease. The results of SIMCA performance applied to clinical variables and to reduced number of IR spectral variables are summarised in Table 5.

A Cooman’s plot is presented to show discrimination between the two MetS categories of IR variables (Figure 7), where the distance to the PC models for MetS and no MetS are displayed. Compared to the Cooman’s plot of clinical parameters, it is observed that there is better separation and discrimination between categories. The Cooman’s plot showed a high degree of interclass specificity and a patently clear separation between class models, with a significant improvement from the models constructed from available clinical parameters to those constructed from IR variables. The no MetS patients appear evidently segregated and concentrated forming a dense cluster at large distances from the model of MetS class. Likewise, the vast majority of MetS samples fall clearly and univocally into their class region, far from the class limit for the no MetS model. Furthermore, the single MetS sample located in the inconclusive classification region is virtually placed above the membership threshold.

From ten selected wavenumbers, the highest discriminant power (5.87) was obtained by the 1133.09 cm⁻¹ spectra variable from the ‘’fingerprint region’’ (Table 6), followed by 4.31 for 1557.40 cm⁻¹ and 4.29 for 2948.94 cm⁻¹ from the higher spectral region. The average discriminant power for IR variables is higher compared to DP values obtained with SIMCA modelling of clinical parameters, indicating the increased suitability of the method compared to those using values obtained from clinical measurements. Likewise, the contribution of IR variables to the model variation was of major strength compared to clinical parameters. Thus, all the selected variables contributed equally to marking the difference between MetS and no MetS with an MP equal to 1.00. Furthermore, the distance between classes was 5.19, significantly higher than in the case of SIMCA class modelling applied to clinical parameters (4.26). These results highlight that the proposed method outperformed in accuracy and specificity of the evaluation parameters used in clinical practice. Since the clinical diagnosis of metabolic syndrome lacks standardisation, the results of the obtained model capacity could greatly support clinical decisions, for example, in terms of exclusion and inclusion evaluation criteria for MetS discrimination.

Our principal aim was to obtain optimal segregation between patients without additional clinical, physical, or ethnic data, and this goal was achieved.

3.3.5. Biochemical Reasoning of Ten Extracted Signals

Herein, we presented a simple, non-invasive, low-cost FTIR-based method for rapid discrimination between MetS and no MetS patients. The use of FTIR spectroscopy is gaining momentum for diagnosis of multiple disorders, from infectious diseases such as hepatitis C and B viruses or malaria to cancers [47,48,49,50,51,52,53]. Due to its ease of use and portability, the potential for using FTIR techniques in clinical environments is within reach. Our strategy extracted the metabolic signatures, instead of individual biomarkers with limited potential, that permit the classification of patients according to molecular patterns. Thus, the FTIR technique provided an overview of spectral changes associated with lipid, protein, or carbohydrate metabolisms.

Ten out of twenty previously selected wavenumbers showed higher discriminant power than clinical parameters. Thus, among these, influential bands at 1578.61, 1562.22, and 1557.40 cm⁻¹ could be assigned to [δ (N-H) + ν (C-H)] of the amide II region of proteins. These discriminative signals may suggest some link with HDL lipoproteins, which showed significant influence among five clinical factors for the classification of MetS and no MetS subjects. Likewise, the higher absorbance in peaks at 2860.22 cm⁻¹ and 2948.94 cm⁻¹ could be attributed to CH3 and CH2 sym. stretching of lipids or carbohydrates, which is perfectly congruent with the formulated theories about MetS impairments and their possible implication in the disease. Moreover, as discussed above, TGL and GLU levels seemed to have more influence and variability between the two categories of patients; thus, these attempted assignments properly reflect the actual situation of the patient’s metabolism. In addition, the variable at 1133.09 cm⁻¹ could be associated with stretching C-O/C-O(H) of carbohydrates or proteins, since it was already shown that the parameters such as glucose or HDL have remarkable modelling and discriminant powers compared to other measured factors.

In this study, the selected spectral biomarkers perfectly reflect the clinical reality of the patient’s metabolic profile. Thus, the explanation of the most significant spectral bands confirms the potential of FTIR spectroscopy to deal with such a complex disorder as MetS.

4. Conclusions

We firmly believe that this alternative analytical strategy could be of great diagnostic relevance and support for clinicians, limiting the time and cost of MetS diagnosis. Moreover, the evaluation of the metabolic profile captures the globality of physiological disturbances, whereas clinical indicators often lack sufficient discriminative power. The results indicate the possibility of rapid application of this strategy to screen for patients with metabolic syndrome. The LDA classifications and SIMCA developed models demonstrated that the spectral variables could provide the same discriminative results as measured clinical parameters. Therefore, why take five measurements when one measurement could provide the same classification ability, greatly stratifying categories of patients? The proposed FTIR method is quick, simple, and non-invasive, and it could be perfectly implemented for large scale-analysis in clinical routines. The principal limitation of this study resides in the relatively tiny sample size at our disposal. In addition, this is a cross-sectional study; therefore, no data on confounding factors (such as gender, age, or diet) were routinely included. The results of a more extensive data set would be required to strengthen the validity of the adopted classification strategy and lead to a firmer conclusion.

Author Contributions

Investigation and writing—original draft preparation, K.T.; methodology, J.M.G.-S.; data curation, I.E.-D.; resources, P.P.-M.; project administration, C.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the European Union’s H2020 research grant (N· 801586) and Ministry of Science and Innovation (CTQ2011-26603).

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Ethics Committee of San Pedro Hospital of La Rioja Province (CEImLAR, 23 April 2013, reference number 121).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Saklayen, M.G. The Global Epidemic of the Metabolic Syndrome. Curr. Hypertens. Rep. 2018, 20, 12. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Esposito, K.; Chiodini, P.; Capuano, A.; Bellastella, G.; Maiorino, M.I.; Giugliano, D. Metabolic Syndrome and Endometrial Cancer: A Meta-Analysis. Endocrine 2014, 45, 28–36. [Google Scholar] [CrossRef] [PubMed]
Mili, N.; Paschou, S.A.; Goulis, D.G.; Dimopoulos, M.-A.; Lambrinoudaki, I.; Psaltopoulou, T. Obesity, Metabolic Syndrome, and Cancer: Pathophysiological and Therapeutic Associations. Endocrine 2021, 74, 478–497. [Google Scholar] [CrossRef] [PubMed]
Esposito, K.; Chiodini, P.; Colao, A.; Lenzi, A.; Giugliano, D. Metabolic Syndrome and Risk of Cancer: A Systematic Review and Meta-Analysis. Diabetes Care 2012, 35, 2402–2411. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Alexandra, K.; Konstantinos, I.; Konstantinos, S.; Alexandros, S.; Michalis, D.; Vasilios, A.; Katsimardou, A.; Imprialos, K.; Stavropoulos, K.; Sachinidis, A.; et al. Hypertension in Metabolic Syndrome: Novel Insights. Curr. Hypertens. Rev. 2019, 16, 12–18. [Google Scholar] [CrossRef]
Isomaa, B.; Almgren, P.; Tuomi, T.; Forsén, B.; Lahti, K.; Nissén, M.; Taskinen, M.R.; Groop, L. Cardiovascular Morbidity and Mortality Associated with the Metabolic Syndrome. Diabetes Care 2001, 24, 683–689. [Google Scholar] [CrossRef] [Green Version]
Federspil, G.; Nisoli, E.; Vettor, R. A Critical Reflection on the Definition of Metabolic Syndrome. Pharmacol. Res. 2006, 53, 449–456. [Google Scholar] [CrossRef]
Abebe, S.M.; Demisse, A.G.; Alemu, S.; Abebe, B.; Mesfin, N. Magnitude of Metabolic Syndrome in Gondar Town, Northwest Ethiopia: A Community-Based Cross-Sectional Study. PLoS ONE 2021, 16, e0257306. [Google Scholar] [CrossRef]
Motuma, A.; Gobena, T.; Roba, K.T.; Berhane, Y.; Worku, A. Metabolic Syndrome Among Working Adults in Eastern Ethiopia. Diabetes Metab. Syndr. Obes. Targets Ther. 2020, 13, 4941–4951. [Google Scholar] [CrossRef]
Misra, A.; Khurana, L. The Metabolic Syndrome in South Asians: Epidemiology, Determinants, and Prevention. Metab. Syndr. Relat. Disord. 2009, 7, 497–514. [Google Scholar] [CrossRef]
Huang, P.L. A Comprehensive Definition for Metabolic Syndrome. Dis. Models Mech. 2009, 2, 231–237. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Punthakee, Z.; Goldenberg, R.; Katz, P. Definition, Classification and Diagnosis of Diabetes, Prediabetes and Metabolic Syndrome. Can. J. Diabetes 2018, 42 (Suppl. 1), S10–S15. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Alberti, K.G.M.M.; Zimmet, P.; Shaw, J. Metabolic Syndrome—A New World-Wide Definition. A Consensus Statement from the International Diabetes Federation. Diabet. Med. 2006, 23, 469–480. [Google Scholar] [CrossRef] [PubMed]
KG Alberti, R.E.S.G.P.Z.J.C.K.D.J.F.W.J.C.L.S.S. Harmonizing the Metabolic Syndrome: A Joint Interim Statement of the International Diabetes Federation Task Force on Epidemiology and Prevention. Circulation 2009, 120, 1640–1645. [Google Scholar] [CrossRef] [Green Version]
Reddy, P.; Leong, J.; Jialal, I. Amino Acid Levels in Nascent Metabolic Syndrome: A Contributor to the pro-Inflammatory Burden. J. Diabetes Complicat. 2018, 32, 465–469. [Google Scholar] [CrossRef] [PubMed]
Smith, M.; Honce, R.; Schultz-Cherry, S. Metabolic Syndrome and Viral Pathogenesis: Lessons from Influenza and Coronaviruses. J. Virol. 2020, 94, e00665-20. [Google Scholar] [CrossRef]
O’Neill, S.; O’Driscoll, L. Metabolic Syndrome: A Closer Look at the Growing Epidemic and Its Associated Pathologies. Obes. Rev. 2015, 16, 1–12. [Google Scholar] [CrossRef] [Green Version]
Lee, Y.H.; Pratley, R.E. The Evolving Role of Inflammation in Obesity and the Metabolic Syndrome. Curr. Diabetes Rep. 2005, 5, 70–75. [Google Scholar] [CrossRef]
Bovolini, A.; Garcia, J.; Andrade, M.A.; Duarte, J.A. Metabolic Syndrome Pathophysiology and Predisposing Factors. Int. J. Sports Med. 2021, 42, 199–214. [Google Scholar] [CrossRef]
Fanta, K.; Daba, F.B.; Asefa, E.T.; Chelkeba, L.; Melaku, T. Prevalence and Impact of Metabolic Syndrome on Short-Term Prognosis in Patients with Acute Coronary Syndrome: Prospective Cohort Study. Diabetes Metab. Syndr. Obes. 2021, 14, 3253–3262. [Google Scholar] [CrossRef]
Wiklund, P.K.; Pekkala, S.; Autio, R.; Munukka, E.; Xu, L.; Saltevo, J.; Cheng, S.; Kujala, U.M.; Alen, M.; Cheng, S. Serum Metabolic Profiles in Overweight and Obese Women with and without Metabolic Syndrome. Diabetol. Metab. Syndr. 2014, 6, 40. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Esposito, K.; Chiodini, P.; Capuano, A.; Bellastella, G.; Maiorino, M.I.; Rafaniello, C.; Panagiotakos, D.B.; Giugliano, D. Colorectal Cancer Association with Metabolic Syndrome and Its Components: A Systematic Review with Meta-Analysis. Endocrine 2013, 44, 634–647. [Google Scholar] [CrossRef]
Lemieux, I.; Després, J.P. Metabolic Syndrome: Past, Present and Future. Nutrients 2020, 12, 3501. [Google Scholar] [CrossRef]
Shao, Y.; Le, W. Recent Advances and Perspectives of Metabolomics-Based Investigations in Parkinson’s Disease. Mol. Neurodegener. 2019, 14, 3. [Google Scholar] [CrossRef] [PubMed] [Green Version]
González-Domínguez, R.; García-Barrera, T.; Gómez-Ariza, J.L. Combination of Metabolomic and Phospholipid-Profiling Approaches for the Study of Alzheimer’s Disease. J. Proteom. 2014, 104, 37–47. [Google Scholar] [CrossRef] [PubMed]
Alonso, A.; Marsal, S.; Julià, A. Analytical Methods in Untargeted Metabolomics: State of the Art in 2015. Front. Bioeng. Biotechnol. 2015, 3, 23. [Google Scholar] [CrossRef] [Green Version]
Spalding, K.; Bonnier, F.; Bruno, C.; Blasco, H.; Board, R.; Benz-de Bretagne, I.; Byrne, H.J.; Butler, H.J.; Chourpa, I.; Radhakrishnan, P.; et al. Enabling Quantification of Protein Concentration in Human Serum Biopsies Using Attenuated Total Reflectance–Fourier Transform Infrared (ATR-FTIR) Spectroscopy. Vib. Spectrosc. 2018, 99, 50–58. [Google Scholar] [CrossRef] [Green Version]
Gika, H.G.; Wilson, I.D. Global Metabolic Profiling for the Study of Alcohol-Related Disorders. Bioanalysis 2014, 6, 59–77. [Google Scholar] [CrossRef]
Serkova, N.J.; Standiford, T.J.; Stringer, K.A. The Emerging Field of Quantitative Blood Metabolomics for Biomarker Discovery in Critical Illnesses. Am. J. Respir. Crit. Care Med. 2011, 184, 647–655. [Google Scholar] [CrossRef] [Green Version]
Finlayson, D.; Rinaldi, C.; Baker, M.J. Is Infrared Spectroscopy Ready for the Clinic? Anal. Chem. 2019, 19, 12117–12128. [Google Scholar] [CrossRef]
Lovergne, L.; Lovergne, J.; Bouzy, P.; Untereiner, V.; Offroy, M.; Garnotel, R.; Thiéfin, G.; Baker, M.J.; Sockalingum, G.D. Investigating Pre-Analytical Requirements for Serum and Plasma Based Infrared Spectro-Diagnostic. J. Biophotonics 2019, 12, e201900177. [Google Scholar] [CrossRef] [PubMed]
Maitra, I.; Morais, C.L.M.; Lima, K.M.G.; Ashton, K.M.; Date, R.S.; Martin, F.L. Attenuated Total Reflection Fourier-Transform Infrared Spectral Discrimination in Human Bodily Fluids of Oesophageal Transformation to Adenocarcinoma. Analyst 2019, 144, 7447–7456. [Google Scholar] [CrossRef] [PubMed]
Roy, S.; Perez-Guaita, D.; Bowden, S.; Heraud, P.; Wood, B.R. Spectroscopy Goes Viral: Diagnosis of Hepatitis B and C Virus Infection from Human Sera Using ATR-FTIR Spectroscopy. Clin. Spectrosc. 2019, 1, 100001. [Google Scholar] [CrossRef]
Kaznowska, E.; Depciuch, J.; Łach, K.; Kołodziej, M.; Koziorowska, A.; Vongsvivut, J.; Zawlik, I.; Cholewa, M.; Cebulski, J. The Classification of Lung Cancers and Their Degree of Malignancy by FTIR, PCA-LDA Analysis, and a Physics-Based Computational Model. Talanta 2018, 186, 337–345. [Google Scholar] [CrossRef]
Perez-Guaita, D.; Garrigues, S.; de la Miguel, G. Infrared-Based Quantification of Clinical Parameters. TrAC Trends Anal. Chem. 2014, 62, 93–105. [Google Scholar] [CrossRef]
Wang, X.; Wu, Q.; Li, C.; Zhou, Y.; Xu, F.; Zong, L.; Ge, S. A Study of Parkinson’s Disease Patients’ Serum Using FTIR Spectroscopy. Infrared Phys. Technol. 2020, 106, 103279. [Google Scholar] [CrossRef]
Baioumi, A.Y.A.A. Comparing Measures of Obesity: Waist Circumference, Waist-Hip, and Waist-Height Ratios. In Nutrition in the Prevention and Treatment of Abdominal Obesity; Elsevier: Amsterdam, The Netherlands, 2019; pp. 29–40. [Google Scholar]
Pizarro, C.; Arenzana-Rámila, I.; Pérez-del-Notario, N.; Pérez-Matute, P.; González-Sáiz, J.M. Thawing as a Critical Pre-Analytical Step in the Lipidomic Profiling of Plasma Samples: New Standardized Protocol. Anal. Chim. Acta 2016, 912, 1–9. [Google Scholar] [CrossRef]
Forina, M.; Lanteri, S.; Oliveros, M.C.C.; Millan, C.P. Selection of Useful Predictors in Multivariate Calibration. Anal. Bioanal. Chem. 2004, 380, 397–418. [Google Scholar] [CrossRef]
Pizarro, C.; Esteban-Díez, I.; Arenzana-Rámila, I.; González-Sáiz, J.M. Discrimination of Patients with Different Serological Evolution of HIV and Co-Infection with HCV Using Metabolic Fingerprinting Based on Fourier Transform Infrared. J. Biophotonics 2018, 11, e201700035. [Google Scholar] [CrossRef]
Pizarro, C.; Esteban-Díez, I.; Espinosa, M.; Rodríguez-Royo, F.; González-Sáiz, J.M. An NMR-Based Lipidomic Approach to Identify Parkinson’s Disease-Stage Specific Lipoprotein-Lipid Signatures in Plasma. Analyst 2019, 144, 1334–1344. [Google Scholar] [CrossRef]
Tkachenko, K.; Espinosa, M.; Esteban-Díez, I.; González-Sáiz, J.M.; Pizarro, C. Extraction of Reduced Infrared Biomarker Signatures for the Stratification of Patients Affected by Parkinson’s Disease: An Untargeted Metabolomic Approach. Chemosensors 2022, 10, 229. [Google Scholar] [CrossRef]
Cocchi, M.; Biancolillo, A.; Marini, F. Chemometric Methods for Classification and Feature Selection. In Comprehensive Analytical Chemistry; Elsevier B.V.: Amsterdam, The Netherlands, 2018; Volume 82, pp. 265–299. ISBN 9780444640444. [Google Scholar]
Forina, M.; Lanteri, S.; Armanino, C.; Oliveros, M.C.C.; Casolino, C. V-PARVUS. An Extendable Package of Programs for Explorative Data Analysis, Classification and Regression Analysis. Dip.Chimica e Tecnologie Farmaceutiche ed Alimentari, University of Genova, Genova (Italy) 2011. Available online: https://iris.unige.it/handle/11567/202703 (accessed on 3 November 2022).
Forina, M.; Oliveri, P.; Casale, M. Complete Validation for Classification and Class Modeling Procedures with Selection of Variables and/or with Additional Computed Variables. Chemom. Intell. Lab. Syst. 2010, 102, 110–122. [Google Scholar] [CrossRef]
Brown, S.; Tauler, R.; Walczak, B. Comprehensive Chemometrics; Elsevier: Amsterdam, The Netherlands, 2010; ISBN 9780444527011. [Google Scholar]
van der Greef, J.; Smilde, A.K. Symbiosis of Chemometrics and Metabolomics: Past, Present, and Future. J. Chemom. 2005, 19, 376–386. [Google Scholar] [CrossRef]
Martin, M.; Perez-Guaita, D.; Andrew, D.W.; Richards, J.S.; Wood, B.R.; Heraud, P. The Effect of Common Anticoagulants in Detection and Quantification of Malaria Parasitemia in Human Red Blood Cells by ATR-FTIR Spectroscopy. Analyst 2017, 142, 1192–1199. [Google Scholar] [CrossRef] [PubMed]
Tomasid, R.C.; Sayat, A.J.; Atienza, A.N.; Danganan, J.L.; Ramos, M.R.; Fellizar, A.; Notarteid, K.I.; Angeles, L.M.; Bangaoilid, R.; Santillan, A.; et al. Detection of Breast Cancer by ATR-FTIR Spectroscopy Using Artificial Neural Networks. PLoS ONE 2022, 17, e0262489. [Google Scholar] [CrossRef]
Sitnikova, V.E.; Kotkova, M.A.; Nosenko, T.N.; Kotkova, T.N.; Martynova, D.M.; Uspenskaya, M.v. Breast Cancer Detection by ATR-FTIR Spectroscopy of Blood Serum and Multivariate Data-Analysis. Talanta 2020, 214, 120857. [Google Scholar] [CrossRef]
Theophilou, G.; Lima, K.M.G.; Martin-Hirsch, P.L.; Stringfellow, H.F.; Martin, F.L. ATR-FTIR Spectroscopy Coupled with Chemometric Analysis Discriminates Normal, Borderline and Malignant Ovarian Tissue: Classifying Subtypes of Human Cancer. Analyst 2016, 141, 585–594. [Google Scholar] [CrossRef] [Green Version]
Banerjee, A.; Gokhale, A.; Bankar, R.; Palanivel, V.; Salkar, A.; Robinson, H.; Shastri, J.S.; Agrawal, S.; Hartel, G.; Hill, M.M.; et al. Rapid Classification of COVID-19 Severity by ATR-FTIR Spectroscopy of Plasma Samples. Anal. Chem 2021, 93, 10391–10396. [Google Scholar] [CrossRef]
el Khoury, Y.; Collongues, N.; de Sèze, J.; Gulsari, V.; Patte-Mensah, C.; Marcou, G.; Varnek, A.; Mensah-Nyagan, A.G.; Hellwig, P. Serum-Based Differentiation between Multiple Sclerosis and Amyotrophic Lateral Sclerosis by Random Forest Classification of FTIR Spectra. Analyst 2019, 144, 4647–4652. [Google Scholar] [CrossRef]

Figure 1. Box and whisker plot showing the distribution of clinical values levels in patients with MetS and no MetS. The line located in the middle of the box represents the median and is used to better visualise the differences between clinical parameters: triglycerides (TGL) levels are displayed in orange ( Biosensors 13 00015 i001

); high density lipoprotein (HDL) in violet ( Biosensors 13 00015 i002

); systolic pressure (SP) in yellow ( Biosensors 13 00015 i003

); diastolic pressure (DP) in green ( Biosensors 13 00015 i004

); and glucose (GLU) in blue ( Biosensors 13 00015 i005

).

Figure 1. Box and whisker plot showing the distribution of clinical values levels in patients with MetS and no MetS. The line located in the middle of the box represents the median and is used to better visualise the differences between clinical parameters: triglycerides (TGL) levels are displayed in orange ( Biosensors 13 00015 i001

); high density lipoprotein (HDL) in violet ( Biosensors 13 00015 i002

); systolic pressure (SP) in yellow ( Biosensors 13 00015 i003

); diastolic pressure (DP) in green ( Biosensors 13 00015 i004

); and glucose (GLU) in blue ( Biosensors 13 00015 i005

).

Figure 2. Scores for the plasma samples on the first two principal components explaining the variability in the dataset of five measured clinal parameters. The samples are labelled according to their specific pathology: no MetS ( Biosensors 13 00015 i006

), MetS (

), and external test samples ( Biosensors 13 00015 i008

).

Figure 2. Scores for the plasma samples on the first two principal components explaining the variability in the dataset of five measured clinal parameters. The samples are labelled according to their specific pathology: no MetS ( Biosensors 13 00015 i006

), MetS (

), and external test samples ( Biosensors 13 00015 i008

).

Figure 3. Scores for the plasma samples on the first two principal components explaining the variability in the IR spectral dataset. The samples are labelled according to their specific pathology: no MetS ( Biosensors 13 00015 i006

), MetS (

), and external test samples ( Biosensors 13 00015 i008

).

Figure 3. Scores for the plasma samples on the first two principal components explaining the variability in the IR spectral dataset. The samples are labelled according to their specific pathology: no MetS ( Biosensors 13 00015 i006

), MetS (

), and external test samples ( Biosensors 13 00015 i008

).

Figure 4. Histogram of the first canonical variable for the discrimination of MetS ( Biosensors 13 00015 i009

) and no MetS ( Biosensors 13 00015 i010

) patients within included ( Biosensors 13 00015 i011

) test set, after performing LDA in the stratification approach based on clinical parameters (y-axis indicates the maximum discrimination power between categories).

Figure 4. Histogram of the first canonical variable for the discrimination of MetS ( Biosensors 13 00015 i009

) and no MetS ( Biosensors 13 00015 i010

) patients within included ( Biosensors 13 00015 i011

) test set, after performing LDA in the stratification approach based on clinical parameters (y-axis indicates the maximum discrimination power between categories).

Figure 5. Histogram of the first canonical variable for the discrimination of MetS ( Biosensors 13 00015 i009

) and no MetS ( Biosensors 13 00015 i010

) patients within the included ( Biosensors 13 00015 i011

) test set, after performing SELECT-LDA in the stratification approach based on 20 IR variables (y-axis indicates the maximum discrimination power between categories).

Figure 5. Histogram of the first canonical variable for the discrimination of MetS ( Biosensors 13 00015 i009

) and no MetS ( Biosensors 13 00015 i010

) patients within the included ( Biosensors 13 00015 i011

) test set, after performing SELECT-LDA in the stratification approach based on 20 IR variables (y-axis indicates the maximum discrimination power between categories).

Figure 6. Cooman’s plot displaying the results obtained by applying SIMCA class-modelling to clinical parameters: MetS ( Biosensors 13 00015 i012

) and no MetS ( Biosensors 13 00015 i013

) patients within the included ( Biosensors 13 00015 i014

) test set. The red solid line indicates a confidence level for class space at 95%. The red dashed line indicates equal class distance.

Figure 6. Cooman’s plot displaying the results obtained by applying SIMCA class-modelling to clinical parameters: MetS ( Biosensors 13 00015 i012

) and no MetS ( Biosensors 13 00015 i013

) patients within the included ( Biosensors 13 00015 i014

) test set. The red solid line indicates a confidence level for class space at 95%. The red dashed line indicates equal class distance.

Figure 7. Cooman’s plot displaying the results obtained by applying the SELECT-SIMCA class-modelling to ten selected IR signals: MetS ( Biosensors 13 00015 i012

) and no MetS ( Biosensors 13 00015 i013

) patients within included ( Biosensors 13 00015 i014

) test set. The red solid line indicates a confidence level for class space at 95%. The red dashed line indicates equal class distance.

Figure 7. Cooman’s plot displaying the results obtained by applying the SELECT-SIMCA class-modelling to ten selected IR signals: MetS ( Biosensors 13 00015 i012

) and no MetS ( Biosensors 13 00015 i013

) patients within included ( Biosensors 13 00015 i014

) test set. The red solid line indicates a confidence level for class space at 95%. The red dashed line indicates equal class distance.

Table 1. The distribution of the clinically measured parameters in MetS and no MetS patients expressed in mg/dL and in mmHg.

Category		MetS		No MetS
Clinical Parameters	Max	Min	Mean	Max	Min	Mean
Systolic blood pressure	174	120	136	178	94	126
Diastolic blood pressure	109	75	87	115	61	79
Triglycerides	338	88	242	215	33	109
HDL	58	25	37	95	29	55
Glucose	164	82	114	123	63	91

Table 2. Results of LDA classification performance on clinical parameters.

Clinical Parameters	Classification (%)	External Prediction (%)	Total Rate (%)
MetS	100	100	100
No MetS	100	98.73 (1)¹	99.36
Total rate	100	98.94	99.47

¹ The one corresponds to one misclassified subject in cross-validation.

Table 3. Results of SELECT LDA classification performance on 20 IR selected spectral variables.

Clinical Parameters	Classification (%)	External Prediction (%)	Total Rate (%)
MetS	100	100	100
No MetS	100	100	100
Total rate	100	100	100

Table 4. The values of discriminant and modelling powers of clinical parameters after SIMCA class-modelling.

Clinical Parameters	Discriminant Power	Modelling Power
Clinical Parameters	Discriminant Power	Category MetS	Category No MetS
Systolic blood pressure	1.99	0.70	0.73
Diastolic blood pressure	2.01	0.70	0.73
Triglycerides	2.18	0.94	0.96
HDL	2.34	0.79	0.94
Glucose	2.36	0.84	0.97

Table 5. The results of SIMCA class-modelling performance on clinical parameters and ten selected IR spectral variables.

Variables	Classification (%)	LOO (%)	CV Efficiency (%)	Efficiency Forced Model (%)	Total Rate (%)
5 clinical measurements	98.59	97.18	87.05	95.68	100
10 IR selected wavenumbers	97.18	94.37	87.92	97.86	100

Table 6. Discriminative and modelling powers of ten selected spectra variables after SELECT-SIMCA class modelling.

Wavenumber (cm⁻¹)	Discriminant Power	Modelling Power
Wavenumber (cm⁻¹)	Discriminant Power	Category MetS	Category No MetS
2860.22	3.77	1.00	1.00
1423.36	4.23
1562.22	3.66
1578.61	3.75
1108.98	3.70
1316.32	3.64
2948.94	4.29
1557.40	4.31
1133.09	5.86
1247.85	3.58

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Tkachenko, K.; Esteban-Díez, I.; González-Sáiz, J.M.; Pérez-Matute, P.; Pizarro, C. Dual Classification Approach for the Rapid Discrimination of Metabolic Syndrome by FTIR. Biosensors 2023, 13, 15. https://doi.org/10.3390/bios13010015

AMA Style

Tkachenko K, Esteban-Díez I, González-Sáiz JM, Pérez-Matute P, Pizarro C. Dual Classification Approach for the Rapid Discrimination of Metabolic Syndrome by FTIR. Biosensors. 2023; 13(1):15. https://doi.org/10.3390/bios13010015

Chicago/Turabian Style

Tkachenko, Kateryna, Isabel Esteban-Díez, José M. González-Sáiz, Patricia Pérez-Matute, and Consuelo Pizarro. 2023. "Dual Classification Approach for the Rapid Discrimination of Metabolic Syndrome by FTIR" Biosensors 13, no. 1: 15. https://doi.org/10.3390/bios13010015

APA Style

Tkachenko, K., Esteban-Díez, I., González-Sáiz, J. M., Pérez-Matute, P., & Pizarro, C. (2023). Dual Classification Approach for the Rapid Discrimination of Metabolic Syndrome by FTIR. Biosensors, 13(1), 15. https://doi.org/10.3390/bios13010015

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Dual Classification Approach for the Rapid Discrimination of Metabolic Syndrome by FTIR

Abstract

1. Introduction

2. Materials and Methods

2.1. Study Population

2.2. Sample Collection

2.3. Method

2.4. Data Analysis

3. Results and Discussion

3.1. Descriptive Statistics

3.2. Exploratory Analysis with PCA

3.3. Supervised Techniques

3.3.1. SELECT

3.3.2. LDA on Clinical Parameters

3.3.3. SELECT-LDA on IR Wavenumbers

3.3.4. SIMCA

SIMCA on Clinical Parameters

SELECT-SIMCA on IR Wavenumbers

3.3.5. Biochemical Reasoning of Ten Extracted Signals

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI