Next Article in Journal
The Furin Protease Dependence and Antiviral GBP2 Sensitivity of Murine Leukemia Virus Infection Are Determined by the Amino Acid Sequence at the Envelope Glycoprotein Cleavage Site
Previous Article in Journal
The Influence of N-Acetylcysteine-Enriched Hydrogels on Wound Healing in a Murine Model of Type II Diabetes Mellitus
Previous Article in Special Issue
Lectin-Based Fluorescent Comparison of Glycan Profile—FDA Validation to Expedite Approval of Biosimilars
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predicting Biochemical and Physiological Parameters: Deep Learning from IgG Glycome Composition

1
Department of Biology, Faculty of Science, University of Zagreb, 10000 Zagreb, Croatia
2
Genos Glycoscience Research Laboratory, 10000 Zagreb, Croatia
3
Faculty of Pharmacy and Biochemistry, University of Zagreb, 10000 Zagreb, Croatia
4
Department of Public Health, University of Split School of Medicine, 21000 Split, Croatia
5
Croatian Science Foundation, 10000 Zagreb, Croatia
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2024, 25(18), 9988; https://doi.org/10.3390/ijms25189988
Submission received: 17 July 2024 / Revised: 29 August 2024 / Accepted: 11 September 2024 / Published: 16 September 2024

Abstract

:
In immunoglobulin G (IgG), N-glycosylation plays a pivotal role in structure and function. It is often altered in different diseases, suggesting that it could be a promising health biomarker. Studies indicate that IgG glycosylation not only associates with various diseases but also has predictive capabilities. Additionally, changes in IgG glycosylation correlate with physiological and biochemical traits known to reflect overall health state. This study aimed to investigate the power of IgG glycans to predict physiological and biochemical parameters. We developed two models using IgG N-glycan data as an input: a regression model using elastic net and a machine learning model using deep learning. Data were obtained from the Korčula and Vis cohorts. The Korčula cohort data were used to train both models, while the Vis cohort was used exclusively for validation. Our results demonstrated that IgG glycome composition effectively predicts several biochemical and physiological parameters, especially those related to lipid and glucose metabolism and cardiovascular events. Both models performed similarly on the Korčula cohort; however, the deep learning model showed a higher potential for generalization when validated on the Vis cohort. This study reinforces the idea that IgG glycosylation reflects individuals’ health state and brings us one step closer to implementing glycan-based diagnostics in personalized medicine. Additionally, it shows that the predictive power of IgG glycans can be used for imputing missing covariate data in deep learning frameworks.

1. Introduction

Immunoglobulin G (IgG) is one of the best-studied glycoproteins in humans. Both functional domains of IgG, the fragment antigen-binding (Fab) region and the fragment crystallizable (Fc) region, undergo glycosylation which significantly impacts IgG function. Glycosylation of a conserved N-glycosylation site Asn-297, present at both heavy chains of the Fc domain, influences the antibody’s conformation, stability, and interaction with Fcγ receptors, thereby acting as a switch between pro- and anti-inflammatory responses [1,2,3,4,5]. The Fab region lacks a conserved glycosylation site; however, 15–30% of IgG molecules contain one or more N-glycosylation sites at this region introduced by somatic hypermutation [6,7,8,9]. Fab glycosylation impacts antigen-binding affinity and serum half-life [1,7,8]. Although IgG glycome demonstrates low variability within an individual under homeostatic conditions, disruptions to this equilibrium, such as those occurring during disease or aging, prompt striking changes in the IgG glycome profile [1,6]. However, the IgG glycome exhibits substantial inter-individual variability that is driven by the intricate interplay between an individual’s genetic makeup, epigenetic regulation, and environmental influences [10,11,12].
Studies indicate that changes in the IgG glycosylation profile, particularly a decrease in IgG galactosylation, reflect the underlying inflammatory processes that take place in numerous diseases [6]. IgG glycan changes in diseases like galactosemia and congenital disorders of glycosylation (CDG) are directly linked to known defects in carbohydrate metabolism, glycan synthesis, or glycan modification pathways [13,14]. However, for most diseases, especially multifactorial ones, the relationship between changes in IgG glycosylation and disease is complex and influenced by numerous factors. In complex diseases, genetics, environmental influences, and metabolic influences interact at multiple levels, and due to their plasticity, changes in IgG glycans reflect these influences [6]. It remains unclear whether changes in the IgG glycome are involved in disease onset or occur as a response to pathological processes within the organism. Nevertheless, the fact that the IgG glycome is affected by various physiological and pathological states makes it a promising biomarker for a person’s overall health status [1]. Observation that IgG glycans can estimate biological age led to the development of the glycan clock, a biological age biomarker based solely on IgG glycosylation [15]. Integration of clinical traits further enhanced age prediction models [15,16]. Numerous studies have now confirmed the association between the IgG glycome and various clinical traits associated with disease states [17,18,19,20,21,22,23,24,25,26,27]. These studies have shown correlations between specific IgG glycan traits and lipid metabolism, with specific glycans linked to levels of high-density lipoprotein (HDL) cholesterol, low-density lipoprotein (LDL) cholesterol, total cholesterol, and triglycerides [21,24]. Beyond lipids, accumulating evidence suggests a role for IgG glycosylation in blood pressure regulation. Studies have identified associations between altered IgG glycosylation and prehypertension, established hypertension, and both systolic and diastolic blood pressure (SBP and DBP). Additionally, specific glycan features have been shown to predict incident hypertension [18,20,25,27]. The repertoire of biochemical and physiological traits associated with IgG glycome is continually expanding through ongoing research. In addition to previously established associations, recent studies have revealed connections with serum levels of glucose, insulin [21], hemoglobin A1c (HbA1c), fibrinogen, D-dimer [15], uric acid [17], and creatinine [28]. However, a question remains: do changes in IgG glycosylation represent a cause or consequence of these metabolic imbalances? To address this, Meng et al. employed a bidirectional Mendelian Randomization (MR) analysis. By integrating GWAS data for metabolic traits and IgG glycans, along with IgG N-glycan-QTLs and metabolic-QTLs, their study confirmed the existence of bidirectional causality between IgG glycosylation and various metabolic traits [26].
The IgG glycome exhibits substantial biomarker potential as evidenced by numerous studies leveraging its attributes to construct predictive models for various diseases [1]. Given that studies demonstrated both association and bidirectional causality between IgG glycans and metabolic traits, we hypothesize that IgG glycan data holds potential for predicting different biochemical but also physiological parameters. This study used data from the Korčula cohort to build predictive models for biochemical and physiological parameters using IgG glycans as the input feature and data from an additional cohort (the Vis cohort) for validation of the results. We employed two distinct methods, elastic net and deep learning, to evaluate the predictive power of IgG glycans. Establishing this link would support the notion that the IgG glycome profile reflects an individual’s metabolic health status. Furthermore, the ability to predict biochemical and physiological parameters from IgG glycans holds additional value, as it could facilitate imputation of missing biochemical and physiological data in datasets intended for further analysis. In this study, we assessed the predictive capacity of IgG glycans for imputing missing covariate values, including selected physiological and biochemical parameters with a focus on cardiometabolic health.

2. Results

2.1. Population Characteristics

Baseline information of all study subjects in both the Korčula and Vis cohorts, including information regarding biochemical and physiological traits used in this study, is summarized in Table 1.

2.2. Glycomics Data Correlation

The Glycomics data show strong correlations within several clusters of glycan peaks (GP) and a weak correlation with biological sex (Figure 1). Positively correlated glycan peaks are mostly located on the same part of the IgG chromatogram, such as peaks GP3 and GP4 or GP14 and GP15. Earlier eluting peaks harbor less complex glycan structures that mostly lack galactose while later eluting peaks exhibit more complex glycan structures containing one or two galactose residues capped with sialic acid. Later glycan peaks show negative correlations with peaks at the beginning of the chromatogram as seen between peaks GP6 and GP23. This is likely because the increase in abundance of complex glycan structures coincides with a decrease in the number of less complex structures, and vice versa. Correlation analysis of glycomics data demonstrates the need for using regression and machine learning models robust to this type of data and led to the selection of elastic net [29] as a regression model and deep learning [30,31], ultimately linked to the TensorFlow library, as a machine learning model. In contrast, other methods, such as random forests and simple regression, would not be appropriate for generalization and building reliable predictive models on such correlated data.

2.3. Prediction of Biochemical and Physiological Parameters from the IgG Glycome Composition

IgG glycosylation data, along with sex, have substantial predictive power for certain biochemical and physiological parameters, which can be seen in Figure 2. The performance is variable across the parameters, yet some physiologically relevant values can be inferred from the glycosylation data. These include parameters involved in cardiovascular events: pulse wave velocity, central SBP, central DBP, heart rate, serum creatinine, serum uric acid; parameters involved in lipid metabolism: serum triglycerides, serum cholesterol, serum HDL, serum LDL, serum VLDL; and parameters involved in glucose metabolism encompassing serum glucose and serum HbA1c.
Performance on both the Korčula and the Vis cohorts was used to select hyperparameters for deep learning. Results on the Korčula cohort with 16% observations set aside for testing and using the Vis cohort for testing were in agreement for all the combinations of hyperparameters tested. Adding or removing hidden layers, changing the number of units per layer, changing the activation function, and changing the number of epochs gave inferior results compared to the selected hyperparameters. This architecture of the deep learning network is therefore close to the optimum for the information contained in the glycans. To compare the performance of deep learning and elastic net models, we normalized the RMSE of predicted biochemical and physiological parameters by the mean of each respective parameter and plotted the relative RMSE values on graph (Figure 2 and Figure 3). Although the performance of the deep learning model is similar to the elastic net regression when validated on the data from the same cohort (Figure 2), it shows superior performance when compared to elastic net regression in the case of validation on the Vis cohort (Figure 3).
While validation on a subset of the same cohort, Korčula cohort, containing a larger set of physiological and biochemical parameters, gave valuable insight into the performance of the model and enabled optimization, a better insight into the power of a model to generalize was given when it was applied on the Vis cohort, where the same sample collection and glycan analysis protocols enabled direct comparison, yet the dataset was collected from a different population and at a different time (Figure 2 and Figure 3).
When we observe each parameter separately (Figure 2 and Figure 3) we can see that IgG glycan composition offered the strongest predictive power for serum albumin levels, followed by serum uric acid levels, in both cohorts. In the Korčula cohort, our glycan-based models also demonstrated superior performance in predicting central DBP and SBP. Conversely, IgG glycan composition exhibited the weakest predictive ability for serum triglycerides in both cohorts and serum VLDL levels in the Korčula cohort only.
Prediction of an individual’s sex, which was used as an input variable, was an additional “sanity check” which any considered model needed to pass. All models we discussed correctly classified all samples by sex.

3. Discussion

In this study, we demonstrated the potential of IgG glycosylation data in predicting biochemical and physiological parameters. While the models exhibited some variability across different parameters, specific biochemical and physiological traits were successfully inferred. These included those related to lipid and glucose metabolism, as well as cardiovascular disease risk. This finding aligns with previous research demonstrating associations between IgG glycosylation patterns and various metabolic disorders, traits, and cardiovascular events [21,24,25,33,34]. Our IgG glycan-based models achieved strong predictive performance for serum albumin levels in both cohorts. This is particularly interesting because low serum albumin (hypoalbuminemia) is recognized as a risk factor for all-cause, cancer, cardiovascular, and respiratory mortalities [35,36,37,38]. Additionally, decreasing albumin levels have been associated with increasing age [39]. Intriguingly, IgG glycosylation, particularly Fc monogalactosylation, has also been linked to life expectancies in both males and females [40]. Furthermore, several IgG glycans have been reported to change with chronological and biological age, with most extensive changes including galactosylation [15]. The fact that our IgG-glycan-based models are able to predict serum albumin levels, a well-known mortality risk predictor, suggests that IgG glycosylation itself may hold promise as a potential prognostic factor in an individual’s mortality risk.
Plomp et al. [21] demonstrated that for all IgG subclasses, reduced galactosylation alongside increased core fucosylation are associated with increased inflammation, assessed by the levels of inflammatory marker CRP, low serum HDL cholesterol, and high triglycerides, which are all indicators of poor metabolic health [21]. Further analysis of IgG glycan structures revealed significant correlations between specific glycan peaks and total cholesterol, triglycerides, and LDL cholesterol levels, with nine glycan peaks, including GP1, GP4, GP5, GP6, GP11, GP14, GP18, GP20, and GP21, used to build a model capable of differentiating individuals with dyslipidemia from healthy controls [24]. In our study, serum levels of HDL, LDL, and total cholesterol were successfully inferred from IgG glycosylation data in both cohorts, further supporting the potential link between IgG glycosylation and lipid metabolism.
In addition to lipids, our models exhibited the ability to predict serum glucose and HbA1c levels in both the Vis and Korčula cohorts. Alterations in IgG glycosylation have been linked to impaired glucose metabolism in both type 1 (T1DM) [41] and type 2 (T2DM) diabetes mellitus [34]. Subclass-specific IgG Fc glycan changes, including increased monogalactosylation, agalactosylation, and decreased sialylation and bisecting GlcNAc in IgG4; increased bisecting GlcNAc and decreased digalactosylation and total galactosylation in IgG2; and increased monogalactosylation in IgG1, were associated with T2DM [34]. Another subclass-specific study revealed negative associations between insulin levels and IgG1 fucosylation, IgG4 galactosylation, and sialylation, while IgG4 bisection correlated positively with glucose levels. Both insulin and glucose are recognized markers of T2DM [21]. In pregnant women, metabolic parameters related to insulin levels and insulin resistance were significantly associated with core fucosylated, bisected (FA2B), and afucosylated, disialylated (A2G2S2), glycan structures [19].
Hypertension, along with T2DM, is a risk factor for cardiovascular disease. Several studies showed that incorporating IgG glycan data into prediction models improves risk assessment for hypertension compared to traditional models based on age, gender, and BMI [18,25,27]. For instance, in the Kazakh population, decreased galactosylation and increased fucosylated agalactobiantennary Fc glycans in all IgG subclasses correlated with higher SBP and DBP. A model incorporating nine specific IgG glycan traits successfully discriminated between hypertensive and healthy individuals [25]. Additionally, study on Uygur, Kazak, Kirgiz, and Tajik populations revealed ten subclass-specific IgG Fc glycan traits, primarily reflecting decreased galactosylation and sialylation, significantly associated with hypertension. A glycan-based model with five selected glycan traits showed moderate improvement in prediction [18]. Moreover, Kifer et al. identified four specific IgG glycan traits, including bisecting GlcNAc, GP4, GP9, and GP21, predictive of incident hypertension. A linear combination of these traits correlated with both incident hypertension and both SBP and DBP [27]. Central SBP and DBP are among the parameters for which IgG glycan composition showed the best predictive potential in our study. Emerging evidence suggests central blood pressure is more predictive of future cardiovascular events than brachial pressure [42].
Hyperuricemia, a metabolic condition marked by elevated serum uric acid levels, contributes to occurrence of both hypertension [43] and T2DM [44]. Interestingly, one study identified six specific IgG glycan traits positively and another six negatively correlating with serum uric acid levels. Furthermore, including two specific IgG glycan traits in a model alongside BMI and gender distinguished individuals with hyperuricemia from healthy controls [17]. Notably, our IgG glycan-based models achieved strong predictive performance for serum uric acid levels in both the Korčula and Vis cohorts. Barrios et al. [45] proposed that IgG glycans also have a role in kidney function. They identified 14 IgG glycan traits associated with renal function, encompassing galactosylation, sialylation, and bisecting N-acetylglucosamine levels [45]. Moreover, a study using a lectin microarray for IgG glycosylation profiling in peripheral artery disease found a positive association between creatinine levels and IgG fucosylation and galactosylation [28]. Elevated serum creatinine indicates impaired kidney function. In our study, models incorporating IgG glycan data successfully predicted serum creatinine levels in both cohorts, further supporting the potential role of IgG glycosylation in kidney function.
Both IgG glycan-based models developed in this study demonstrated their predictive potential for various biochemical and physiological traits known as risk factors for cardiometabolic diseases. Our study further emphasizes the utility of IgG glycosylation as a valuable marker of an individual’s overall health status. This work also shows that analyzing IgG glycosylation in cohorts with other response variables like diseases or medical conditions may be useful as a step toward glycan-guided diagnostics, using protein glycosylation, a powerful but often overlooked tool, as a substantial upgrade to today’s state of the art in personalized medicine. In those more complex scenarios, we envision that deep learning may significantly outperform classical regression methods due to its ability to capture the complexity of the glycosylation process and the intricate interactions in health and disease. Current study clearly demonstrates that salient information about an individual’s health status is encoded in the protein glycosylation pattern, though extracting it and utilizing its full potential requires large cohorts and sophisticated data analysis methods.
Highly correlated data, such as glycans, have to be analyzed carefully using models that are relatively robust to correlated data. We decided to use elastic net regression, a hybrid between the LASSO and Ridge regression, as a benchmark representing established methodology with robustness to correlated data [29]. In an attempt to build a model that performs at least at the same level and possesses the robustness to correlated data yet is extensible and easily adjusted to other response variables [46], such as disease state, we turned to deep learning using the “keras” library, which taps into the TensorFlow library [47]. The deep learning model demonstrated its superiority when presented with a different cohort for validation (Figure 3). This clearly demonstrates that a deep learning network generalizes better than an elastic net model. A deep learning model, having demonstrated its usefulness in extracting information from the IgG glycosylation profile data, can be easily modified to detect disease status when presented with a dataset containing the appropriate response variable, i.e., the disease state or stage. This is important when looking beyond inferring simple biochemical or physiological parameters. The key useful property of the deep learning model is its ability to capture the complexity of the glycomics data and build accurate predictive models if the response variable shares significant information with the IgG glycosylation pattern, which is often the case with a large number of diseases and conditions [1].
Missing data is a prevalent issue across diverse research fields, such as clinical studies, microarray and spectrometric datasets, and OMICS studies, including glycomics, epigenomics, and genomics [48,49,50,51]. If not handled properly, missing data can significantly compromise studies, reducing statistical power, introducing bias in parameter estimates, and undermining sample representativeness. When managing missing data, imputation methods are favored over complete case analysis or listwise deletion [52]. Deep learning-based models have emerged as powerful tools for missing value imputation. Due to their enhanced imputation accuracy and ability to handle intricate missing patterns and data structures, deep learning-based models have demonstrated superiority over other established imputation methods [53]. This study suggests that the predictive capacity of IgG glycans could be leveraged by researchers to integrate IgG glycome data into deep learning frameworks for imputing missing covariate values, including biochemical and physiological traits themselves. The model can easily be extended to extract the data from IgG glycosylation profiles and leverage that data for detecting the disease status in subsequent research.

4. Materials and Methods

4.1. Cohorts

This study was performed using previously published data obtained from two cohorts—the Korčula cohort (950 individuals) [54,55] and the Vis cohort (888 individuals) [55,56,57]. The Korčula cohort belongs to an adult population of the island of Korčula, Croatia, where all subjects were aged 18 and over [54,55]. All respondents gave informed consent before participating and the study followed all relevant ethical regulations. The Vis cohort belongs to an adult population (also 18 and over) of the Croatian island Vis who were recruited within a larger genetic epidemiology program with the goal to investigate genetic variability and map genes influencing common complex diseases and disease traits in genetically isolated populations [55,56,57]. As for the Korčula cohort, this study also followed all relevant ethical regulations and all respondents signed an informed consent form before participating. Studies in which data from the Korčula and Vis cohort were published received ethical approval from the University of Split, School of Medicine, and the South East Scotland Research Ethics Committee.

4.2. Biochemical and Physiological Parameters

For this study we used previously collected data for physiological and biochemical parameters from individuals in the Korčula [54,55] and Vis cohorts [55,56,57]. The parameters measured in both cohorts included serum creatinine, serum uric acid, serum glucose, serum cholesterol, serum triglycerides, serum HDL, serum LDL, serum calcium, serum albumin, serum HbA1c, and fibrinogen. Additional parameters were assessed only in the Korčula cohort due to their unavailability in the Vis cohort. These included pulse wave velocity, central SBP, central DBP, heart rate, and serum VLDL.

4.3. IgG Isolation and N-Glycan Analyses

For the prediction of physiological and biochemical parameters, we used existing IgG glycosylation data from the Korčula and Vis cohorts published in Trbojević-Akmačić et al. [58] for the Korčula cohort and Pučić et al. [32] for the Vis cohort. Briefly, IgG was isolated from human plasma by CIM® r-Protein G LLD 0.2 mL Monolithic 96-well Plate (2 µm channels) (BIA Separations, a Sartorius company, Ajdovščina, Slovenia). Plasma samples were diluted with 1xPBS (pH 7.4) in a 1:7 ratio for the Korčula cohort and 1:9 for the Vis cohort. Diluted samples were applied to preconditioned protein G plates. To remove unbound proteins, plates were washed with 1xPBS (pH = 7.4). Bound IgG was eluted from the protein G plates with 0.1 M formic acid (pH = 2.5) and immediately neutralized to pH 7.0 using 1 M ammonium bicarbonate. Dried IgG eluates were denatured with 1.33% sodium dodecyl sulfate (SDS) and glycans were released by digestion with recombinant N-glycosidase F (PNGase F). Released glycans were labeled with 2-aminobenzamide (2-AB). Fluorescently labeled IgG glycans were analyzed using ultra-high-performance liquid chromatography based on hydrophilic interactions and fluorescence detection (HILIC-UHPLC-FLD). Data was processed using an automatic processing method with a traditional integration algorithm. The IgG glycans’ chromatograms were integrated into 24 peaks where the amount of glycans in each peak was expressed as a percentage of the total integrated area (% area). Glycan structures corresponding to each peak were determined by exoglycosidase digestion and mass spectrometry as described in Pučić et al. [32].

4.4. Building of Predictive Models Using IgG Glycan Data

A cohort of 950 individuals with complete sets of observations for IgG glycans was used to predict selected biochemical and physiological parameters (the Korčula cohort). An additional cohort of 888 complete observations (the Vis cohort) was used for validation of a subset of biochemical and physiological parameters present in both cohorts. The area (expressed numerically as a percentage, range 0 to 100) under each of the 24 glycan peaks (composition determined by mass spectrometry) and the sex of the individual were used as inputs. Glycomics data were transformed by taking the natural logarithm.
Analysis was performed in an R statistical environment [59]. Package “corrplot” [60] was used for visualization of correlations in the glycomics data. Elastic net regression was performed using the “glmnet” package [61]. For deep learning, we used the “keras” library [62]. The deep learning network consisted of an input layer, three hidden layers of 24 units and an output layer (single unit), with the “relu” activation function; training was performed over 24 epochs with 1/4 of the data set aside for the validation set. Before training, 16% of all observations were set aside as the testing set, on which all performance measurements were conducted, except when the Vis cohort was used for testing/validation of the biochemical and physiological parameters present in both cohorts (the complete Korčula cohort was used for training in that case). Hyperparameters for deep learning were selected manually [63] by systematically varying the number of hidden layers (two to six), number of units in each layer (8, 16, 24, 32, 64), activation function (“relu”, “selu” and combinations thereof), and the number of epochs (10, 24, 50, 100, 500).

Author Contributions

Conceptualization, A.V. (Aleksandar Vojta), A.V. (Ana Vujić) and G.L.; methodology, A.V. (Aleksandar Vojta), A.V. (Ana Vujić) and M.K.; software, A.V. (Aleksandar Vojta); validation, A.V. (Aleksandar Vojta) and A.V. (Ana Vujić); formal analysis, A.V. (Aleksandar Vojta); investigation, A.V. (Aleksandar Vojta) and A.V. (Ana Vujić); resources, G.L. and O.P.; data curation, A.V. (Aleksandar Vojta); writing—original draft preparation, A.V. (Ana Vujić), M.K. and A.V. (Aleksandar Vojta); writing—review and editing, A.V. (Ana Vujić), M.K., A.V. (Aleksandar Vojta), V.Z., G.L. and O.P.; visualization, A.V. (Aleksandar Vojta); supervision, A.V. (Aleksandar Vojta) and V.Z.; project administration, V.Z.; funding acquisition, V.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Croatian Science Foundation grant IgGeneNET (IP-2022-10-1358).

Institutional Review Board Statement

This study uses already published data from studies that were conducted in accordance with the Declaration of Helsinki, and were approved by the Ethics Committee of the University of Split, School of Medicine (protocol code 2181-198-03-04/10-11-0008, approved on 8 March 2011) and the South East Scotland Research Ethics Committees (protocol code 11/AL/0222, approved on 6 May 2011).

Informed Consent Statement

The studies in which the used data was published obtained informed consent from all involved subjects.

Data Availability Statement

This study uses already published data with given studies referenced in the Materials and Methods. The used data is not publicly available but is instead available from the corresponding authors of given studies on reasonable request and in line with the consent given by participants.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Shkunnikova, S.; Mijakovac, A.; Sironic, L.; Hanic, M.; Lauc, G.; Kavur, M.M. IgG Glycans in Health and Disease: Prediction, Intervention, Prognosis, and Therapy. Biotechnol. Adv. 2023, 67, 108169. [Google Scholar] [CrossRef] [PubMed]
  2. Yagi, H.; Yanaka, S.; Kato, K. Structure and Dynamics of Immunoglobulin G Glycoproteins. Adv. Exp. Med. Biol. 2018, 1104, 219–235. [Google Scholar] [CrossRef] [PubMed]
  3. Krapp, S.; Mimura, Y.; Jefferis, R.; Huber, R.; Sondermann, P. Structural Analysis of Human IgG-Fc Glycoforms Reveals a Correlation between Glycosylation and Structural Integrity. J. Mol. Biol. 2003, 325, 979–989. [Google Scholar] [CrossRef]
  4. Dekkers, G.; Treffers, L.; Plomp, R.; Bentlage, A.E.H.; de Boer, M.; Koeleman, C.A.M.; Lissenberg-Thunnissen, S.N.; Visser, R.; Brouwer, M.; Mok, J.Y.; et al. Decoding the Human Immunoglobulin G-Glycan Repertoire Reveals a Spectrum of Fc-Receptor- and Complement-Mediated-Effector Activities. Front. Immunol. 2017, 8, 877. [Google Scholar] [CrossRef]
  5. Ahmed, A.A.; Giddens, J.; Pincetic, A.; Lomino, J.V.; Ravetch, J.V.; Wang, L.-X.; Bjorkman, P.J. Structural Characterization of Anti-Inflammatory Immunoglobulin G Fc Proteins. J. Mol. Biol. 2014, 426, 3166–3179. [Google Scholar] [CrossRef] [PubMed]
  6. Gudelj, I.; Lauc, G.; Pezer, M. Immunoglobulin G Glycosylation in Aging and Diseases. Cell. Immunol. 2018, 333, 65–79. [Google Scholar] [CrossRef]
  7. Anumula, K.R. Quantitative Glycan Profiling of Normal Human Plasma Derived Immunoglobulin and Its Fragments Fab and Fc. J. Immunol. Methods 2012, 382, 167–176. [Google Scholar] [CrossRef]
  8. Zhu, D.; McCarthy, H.; Ottensmeier, C.H.; Johnson, P.; Hamblin, T.J.; Stevenson, F.K. Acquisition of Potential N-Glycosylation Sites in the Immunoglobulin Variable Region by Somatic Mutation is a Distinctive Feature of Follicular Lymphoma. Blood 2002, 99, 2562–2568. [Google Scholar] [CrossRef]
  9. Dunn-Walters, D.; Boursier, L.; Spencer, J. Effect of Somatic Hypermutation on Potential N-Glycosylation Sites in Human Immunoglobulin Heavy Chain Variable Regions. Mol. Immunol. 2000, 37, 107–113. [Google Scholar] [CrossRef]
  10. Zoldoš, V.; Horvat, T.; Lauc, G. Glycomics Meets Genomics, Epigenomics and Other High Throughput Omics for System Biology Studies. Curr. Opin. Chem. Biol. 2013, 17, 34–40. [Google Scholar] [CrossRef]
  11. Lauc, G.; Vojta, A.; Zoldoš, V. Epigenetic Regulation of Glycosylation Is the Quantum Mechanics of Biology. Biochim. Biophys. Acta (BBA) Gen. Subj. 2014, 1840, 65–70. [Google Scholar] [CrossRef] [PubMed]
  12. Klarić, L.; Tsepilov, Y.A.; Stanton, C.M.; Mangino, M.; Sikka, T.T.; Esko, T.; Pakhomov, E.; Salo, P.; Deelen, J.; McGurnaghan, S.J.; et al. Glycosylation of Immunoglobulin G is Regulated by a Large Network of Genes Pleiotropic with Inflammatory Diseases. Sci. Adv. 2020, 6, eaax0301. [Google Scholar] [CrossRef] [PubMed]
  13. Garcia, D.F.; Camelo, J.S.; Molfetta, G.A.; Turcato, M.; Souza, C.F.M.; Porta, G.; Steiner, C.E.; Silva, W.A. Clinical Profile and Molecular Characterization of Galactosemia in Brazil: Identification of Seven Novel Mutations. BMC Med. Genet. 2016, 17, 39. [Google Scholar] [CrossRef] [PubMed]
  14. Chang, I.J.; He, M.; Lam, C.T. Congenital Disorders of Glycosylation. Ann. Transl. Med. 2018, 6, 477. [Google Scholar] [CrossRef] [PubMed]
  15. Krištić, J.; Vučković, F.; Menni, C.; Klarić, L.; Keser, T.; Beceheli, I.; Pučić-Baković, M.; Novokmet, M.; Mangino, M.; Thaqi, K.; et al. Glycans Are a Novel Biomarker of Chronological and Biological Ages. J. Gerontol. Ser. A 2013, 69, 779–789. [Google Scholar] [CrossRef]
  16. Yu, X.; Wang, Y.; Kristic, J.; Dong, J.; Chu, X.; Ge, S.; Wang, H.; Fang, H.; Gao, Q.; Liu, D.; et al. Profiling IgG N-Glycans as Potential Biomarker of Chronological and Biological Ages: A Community-Based Study in a Han Chinese Population. Medicine 2016, 95, e4112. [Google Scholar] [CrossRef]
  17. Hou, H.; Xu, X.; Sun, F.; Zhang, X.; Dong, H.; Wang, L.; Ge, S.; An, K.; Sun, Q.; Li, Y.; et al. Hyperuricemia is associated with immunoglobulin G N-Glycosylation: A Community-Based Study of Glycan Biomarkers. OMICS J. Integr. Biol. 2019, 23, 660–667. [Google Scholar] [CrossRef]
  18. Liu, J.N.; Dolikun, M.; Stambuk, J.; Trbojevic-Akmacic, I.; Zhang, J.; Wang, H.; Zheng, D.Q.; Zhang, X.Y.; Peng, H.L.; Zhao, Z.Y.; et al. The Association Between Subclass-Specific IgG Fc N-Glycosylation Profiles and Hypertension in the Uygur, Kazak, Kirgiz, and Tajik Populations. J. Hum. Hypertens. 2018, 32, 555–563. [Google Scholar] [CrossRef]
  19. Štambuk, T.; Kifer, D.; Smirčić-Duvnjak, L.; Lovrenčić, M.V.; Gornik, O. Associations between Plasma Protein, IgG and IgA N-Glycosylation and Metabolic Health Markers in Pregnancy and Gestational Diabetes. PLoS ONE 2023, 18, e0284838. [Google Scholar] [CrossRef]
  20. Wang, Y.; Klarić, L.; Yu, X.; Thaqi, K.; Dong, J.; Novokmet, M.; Wilson, J.; Polasek, O.; Liu, Y.; Krištić, J.; et al. The Association between Glycosylation of Immunoglobulin G and Hypertension: A Multiple Ethnic Cross-Sectional Study. Medicine 2016, 95, e3379. [Google Scholar] [CrossRef]
  21. Plomp, R.; Ruhaak, L.R.; Uh, H.-W.; Reiding, K.R.; Selman, M.; Houwing-Duistermaat, J.J.; Slagboom, P.E.; Beekman, M.; Wuhrer, M. Subclass-Specific IgG Glycosylation is Associated with Markers of Inflammation and Metabolic Health. Sci. Rep. 2017, 7, 12325. [Google Scholar] [CrossRef] [PubMed]
  22. Perkovic, M.N.; Bakovic, M.P.; Kristic, J.; Novokmet, M.; Huffman, J.E.; Vitart, V.; Hayward, C.; Rudan, I.; Wilson, J.F.; Campbell, H.; et al. The Association Between Galactosylation of Immunoglobulin G and Body Mass Index. Prog. Neuro-Psychopharmacol. Biol. Psychiatry 2014, 48, 20–25. [Google Scholar] [CrossRef] [PubMed]
  23. Liu, D.; Li, Q.; Dong, J.; Li, D.; Xu, X.; Xing, W.; Zhang, X.; Cao, W.; Hou, H.; Wang, H.; et al. The Association between Normal BMI with Central Adiposity and Proinflammatory Potential Immunoglobulin G N-Glycosylation. Diabetes Metab. Syndr. Obes. Targets Ther. 2019, 12, 2373–2385. [Google Scholar] [CrossRef] [PubMed]
  24. Liu, D.; Chu, X.; Wang, H.; Dong, J.; Ge, S.-Q.; Zhao, Z.-Y.; Peng, H.-L.; Sun, M.; Wu, L.-J.; Song, M.-S.; et al. The Changes of Immunoglobulin G N-Glycosylation in Blood Lipids and Dyslipidaemia. J. Transl. Med. 2018, 16, 235. [Google Scholar] [CrossRef]
  25. Gao, Q.; Dolikun, M.; Štambuk, J.; Wang, H.; Zhao, F.; Yiliham, N.; Wang, Y.; Trbojević-Akmačić, I.; Zhang, J.; Fang, H.; et al. Immunoglobulin G N-Glycans as Potential Postgenomic Biomarkers for Hypertension in the Kazakh Population. OMICS J. Integr. Biol. 2017, 21, 380–389. [Google Scholar] [CrossRef]
  26. Meng, X.; Cao, W.; Liu, D.; Elijah, I.M.; Xing, W.; Hou, H.; Xu, X.; Song, M.; Wang, Y. Bidirectional Causality Between Immunoglobulin G N-Glycosylation and Metabolic Traits: A Mendelian Randomization Study. Engineering 2023, 26, 74–88. [Google Scholar] [CrossRef]
  27. Kifer, D.; Louca, P.; Cvetko, A.; Deriš, H.; Cindrić, A.; Grallert, H.; Peters, A.; Polašek, O.; Gornik, O.; Mangino, M.; et al. N-Glycosylation of Immunoglobulin G Predicts Incident Hypertension. J. Hypertens. 2021, 39, 2527–2533. [Google Scholar] [CrossRef]
  28. Li, S.; Meng, J.; Xu, F.; Wang, Q.; Tian, X.; Li, M.; Zeng, X.; Hu, C.; Zheng, Y. IgG Glycosylation Profiling of Peripheral Artery Diseases with Lectin Microarray. J. Clin. Med. 2022, 11, 5727. [Google Scholar] [CrossRef]
  29. Zou, H.; Hastie, T. Regularization and Variable Selection via the Elastic Net. J. R. Stat. Soc. Stat. Methodol. Ser. B 2005, 67, 301–320. [Google Scholar] [CrossRef]
  30. Chen, R.J.; Lu, M.Y.; Williamson, D.F.; Chen, T.Y.; Lipkova, J.; Noor, Z.; Shaban, M.; Shady, M.; Williams, M.; Joo, B.; et al. Pan-Cancer Integrative Histology-Genomic Analysis via Multimodal Deep Learning. Cancer Cell 2022, 40, 865–878.e6. [Google Scholar] [CrossRef]
  31. Din, N.M.U.; Dar, R.A.; Rasool, M.; Assad, A. Breast Cancer Detection Using Deep Learning: Datasets, Methods, and Challenges Ahead. Comput. Biol. Med. 2022, 149, 106073. [Google Scholar] [CrossRef] [PubMed]
  32. Pučić, M.; Knežević, A.; Vidič, J.; Adamczyk, B.; Novokmet, M.; Polašek, O.; Gornik, O.; Šupraha-Goreta, S.; Wormald, M.R.; Redžić, I.; et al. High Throughput Isolation and Glycosylation Analysis of IgG-Variability and Heritability of the IgG Glycome in Three Isolated Human Populations. Mol. Cell. Proteom. 2011, 10, M111.010090. [Google Scholar] [CrossRef] [PubMed]
  33. Menni, C.; Gudelj, I.; Macdonald-Dunlop, E.; Mangino, M.; Zierer, J.; Bešić, E.; Joshi, P.K.; Trbojević-Akmačić, I.; Chowienczyk, P.J.; Spector, T.D.; et al. Glycosylation Profile of Immunoglobulin G Is Cross-Sectionally Associated with Cardiovascular Disease Risk Score and Subclinical Atherosclerosis in Two Independent Cohorts. Circ. Res. 2018, 122, 1555–1564. [Google Scholar] [CrossRef] [PubMed]
  34. Liu, J.; Dolikun, M.; Štambuk, J.; Trbojević-Akmačić, I.; Zhang, J.; Zhang, J.; Wang, H.; Meng, X.; Razdorov, G.; Menon, D.; et al. Glycomics for Type 2 Diabetes Biomarker Discovery: Promise of Immunoglobulin G Subclass-Specific Fragment Crystallizable N-Glycosylation in the Uyghur Population. OMICS J. Integr. Biol. 2019, 23, 640–648. [Google Scholar] [CrossRef] [PubMed]
  35. Gupta, D.; Lis, C.G. Pretreatment Serum Albumin as a Predictor of Cancer Survival: A Systematic Review of the Epidemiological Literature. Nutr. J. 2010, 9, 69. [Google Scholar] [CrossRef]
  36. Phillips, A.; Shaper, A.G.; Whincup, P. Association Between Serum Albumin and Mortality From Cardiovascular Disease, Cancer, and Other Causes. Lancet 1989, 334, 1434–1436. [Google Scholar] [CrossRef]
  37. Djoussé, L.; Rothman, K.J.; Cupples, L.A.; Levy, D.; Ellison, R.C. Serum Albumin and Risk of Myocardial Infarction and All-Cause Mortality in the Framingham Offspring Study. Circulation 2002, 106, 2919–2924. [Google Scholar] [CrossRef]
  38. Wu, C.-Y.; Hu, H.-Y.; Huang, N.; Chou, Y.-C.; Li, C.-P.; Chou, Y.-J. Albumin Levels and Cause-Specific Mortality in Community-Dwelling Older Adults. Prev. Med. 2018, 112, 145–151. [Google Scholar] [CrossRef]
  39. Klonoff-Cohen, H.; Barrett-Connor, E.L.; Edelstein, S.L. Albumin Levels as a Predictor of Mortality in the Healthy Elderly. J. Clin. Epidemiol. 1992, 45, 207–212. [Google Scholar] [CrossRef]
  40. Štambuk, J.; Nakić, N.; Vučković, F.; Pučić-Baković, M.; Razdorov, G.; Trbojević-Akmačić, I.; Novokmet, M.; Keser, T.; Vilaj, M.; Štambuk, T.; et al. Global Variability of the Human IgG Glycome. Aging 2020, 12, 15222–15259. [Google Scholar] [CrossRef]
  41. Rudman, N.; Kifer, D.; Kaur, S.; Simunović, V.; Cvetko, A.; Pociot, F.; Morahan, G.; Gornik, O. Children at Onset of Type 1 Diabetes Show Altered N-Glycosylation of Plasma Proteins and IgG. Diabetologia 2022, 65, 1315–1327. [Google Scholar] [CrossRef] [PubMed]
  42. McEniery, C.M.; Cockcroft, J.R.; Roman, M.J.; Franklin, S.S.; Wilkinson, I.B. Central Blood Pressure: Current Evidence and Clinical Importance. Eur. Heart J. 2014, 35, 1719–1725. [Google Scholar] [CrossRef]
  43. Chen, J.; Chuang, S.; Chen, H.; Yeh, W.; Pan, W. Serum Uric Acid Level as an Independent Risk Factor for All-Cause, Cardiovascular, and Ischemic Stroke Mortality: A Chinese Cohort Study. Arthritis Care Res. 2009, 61, 225–232. [Google Scholar] [CrossRef]
  44. Dehghan, A.; van Hoek, M.; Sijbrands, E.J.; Hofman, A.; Witteman, J.C. High Serum Uric Acid as a Novel Risk Factor for Type 2 Diabetes. Diabetes Care 2008, 31, 361–362. [Google Scholar] [CrossRef]
  45. Barrios, C.; Zierer, J.; Gudelj, I.; Štambuk, J.; Ugrina, I.; Rodríguez, E.; Soler, M.J.; Pavić, T.; Šimurina, M.; Keser, T.; et al. Glycosylation Profile of IgG in Moderate Kidney Dysfunction. J. Am. Soc. Nephrol. 2016, 27, 933–941. [Google Scholar] [CrossRef] [PubMed]
  46. Sun, S. A Survey of Multi-View Machine Learning. Neural Comput. Appl. 2013, 23, 2031–2038. [Google Scholar] [CrossRef]
  47. Erickson, B.J.; Korfiatis, P.; Akkus, Z.; Kline, T.; Philbrick, K. Toolkits and Libraries for Deep Learning. J. Digit. Imaging 2017, 30, 400–405. [Google Scholar] [CrossRef]
  48. Liu, M.; Li, S.; Yuan, H.; Ong, M.E.H.; Ning, Y.; Xie, F.; Saffari, S.E.; Shang, Y.; Volovici, V.; Chakraborty, B.; et al. Handling Missing Values in Healthcare Data: A Systematic Review of Deep Learning-Based Imputation Techniques. Artif. Intell. Med. 2023, 142, 102587. [Google Scholar] [CrossRef]
  49. Browning, S.R. Missing Data Imputation and Haplotype Phase Inference for Genome-Wide Association Studies. Hum. Genet. 2008, 124, 439–450. [Google Scholar] [CrossRef]
  50. Wang, A.; Yang, J.; An, N. Regularized Sparse Modelling for Microarray Missing Value Estimation. IEEE Access 2021, 9, 16899–16913. [Google Scholar] [CrossRef]
  51. Taylor, S.L.; Ruhaak, L.R.; Kelly, K.; Weiss, R.H.; Kim, K. Effects of Imputation on Correlation: Implications for Analysis of Mass Spectrometry Data from Multiple Biological Matrices. Brief. Bioinform. 2016, 18, 312–320. [Google Scholar] [CrossRef] [PubMed]
  52. Kang, H. The Prevention and Handling of the Missing Data. Korean J. Anesthesiol. 2013, 64, 402–406. [Google Scholar] [CrossRef] [PubMed]
  53. Lin, W.-C.; Tsai, C.-F.; Zhong, J.R. Deep Learning for Missing Value Imputation of Continuous Data and the Effect of Data Discretization. Knowl.-Based Syst. 2022, 239, 108079. [Google Scholar] [CrossRef]
  54. Zemunik, T.; Boban, M.; Lauc, G.; Janković, S.; Rotim, K.; Vatavuk, Z.; Benčić, G.; Đogaš, Z.; Boraska, V.; Torlak, V.; et al. Genome-Wide Association Study of Biochemical Traits in Korcula Island, Croatia. Croat. Med. J. 2009, 50, 23–33. [Google Scholar] [CrossRef]
  55. Rudan, I.; Marušić, A.; Janković, S.; Rotim, K.; Boban, M.; Lauc, G.; Grković, I.; Đogaš, Z.; Zemunik, T.; Vatavuk, Z.; et al. “10 001 Dalmatians:” Croatia Launches Its National Biobank. Croat. Med. J. 2009, 50, 4–6. [Google Scholar] [CrossRef]
  56. Rudan, I.; Campbell, H.; Rudan, P. Genetic Epidemiological Studies of Eastern Adriatic Island Isolates, Croatia: Objectives and Strategies. Coll. Antropol. 1999, 23, 531–546. [Google Scholar]
  57. Rudan, I.; Biloglav, Z.; Vorko-Jović, A.; Kujundžić-Tiljak, M.; Stevanović, R.; Ropac, D.; Puntarić, D.; Čučević, B.; Salzer, B.; Campbell, H. Effects of Inbreeding, Endogamy, Genetic Admixture, and Outbreeding on Human Health: A “1001 Dalmatians” Study. Croat. Med. J. 2006, 47, 601–610. [Google Scholar]
  58. Trbojević-Akmačić, I.; Vučković, F.; Pribić, T.; Vilaj, M.; Černigoj, U.; Vidič, J.; Šimunović, J.; Kępka, A.; Kolčić, I.; Klarić, L.; et al. Comparative Analysis of Transferrin and IgG N-Glycosylation in Two Human Populations. Commun. Biol. 2023, 6, 312. [Google Scholar] [CrossRef]
  59. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2023; Available online: https://www.r-project.org/ (accessed on 14 July 2024).
  60. Wei, T.; Simko, V. R Package ‘corrplot’: Visualization of a Correlation Matrix (Version 0.92). 2021. Available online: https://github.com/taiyun/corrplot (accessed on 14 July 2024).
  61. Friedman, J.H.; Hastie, T.; Tibshirani, R. Regularization Paths for Generalized Linear Models via Coordinate Descent. J. Stat. Softw. 2010, 33, 1–22. [Google Scholar] [CrossRef]
  62. Allaire, J.; Chollet, F. keras: R Interface to ‘Keras’ [R Package Version 2.13.0]. Available online: https://cran.r-project.org/web/packages/keras/index.html (accessed on 14 July 2024).
  63. Nematzadeh, S.; Kiani, F.; Torkamanian-Afshar, M.; Aydin, N. Tuning Hyperparameters of Machine Learning Algorithms and Deep Neural Networks Using Metaheuristics: A Bioinformatics Study on Biomedical and Biological Cases. Comput. Biol. Chem. 2021, 97, 107619. [Google Scholar] [CrossRef]
Figure 1. Correlation between the glycan peaks in the glycomics data. The dataset shows high correlation between glycans, which necessitates appropriate predictive models that are immune or at least resistant to such correlated data. Both the elastic net and deep learning have that required property. Glycan composition of each peak within the human IgG glycome has been previously established [32]. A representative chromatogram of the human IgG glycome and a detailed characterization of the glycan composition for each peak can be found in Pučić et al. [32] and Krištić et al. [15].
Figure 1. Correlation between the glycan peaks in the glycomics data. The dataset shows high correlation between glycans, which necessitates appropriate predictive models that are immune or at least resistant to such correlated data. Both the elastic net and deep learning have that required property. Glycan composition of each peak within the human IgG glycome has been previously established [32]. A representative chromatogram of the human IgG glycome and a detailed characterization of the glycan composition for each peak can be found in Pučić et al. [32] and Krištić et al. [15].
Ijms 25 09988 g001
Figure 2. Relative RMSE (root mean square error) normalized to the mean of the dataset for prediction of selected biochemical and physiological parameters based on IgG glycosylation in the Korčula cohort. In addition to the relative area under each of the 24 glycan peaks, only sex was used as an input variable. It is predicted with 100% accuracy, which was used as a “positive control”. Other biochemical and physiological parameters can be predicted from an IgG glycosylation pattern with varying degrees of accuracy. A regression model (glmnet) and a machine learning approach (tensorflow) show a similar degree of predictive power.
Figure 2. Relative RMSE (root mean square error) normalized to the mean of the dataset for prediction of selected biochemical and physiological parameters based on IgG glycosylation in the Korčula cohort. In addition to the relative area under each of the 24 glycan peaks, only sex was used as an input variable. It is predicted with 100% accuracy, which was used as a “positive control”. Other biochemical and physiological parameters can be predicted from an IgG glycosylation pattern with varying degrees of accuracy. A regression model (glmnet) and a machine learning approach (tensorflow) show a similar degree of predictive power.
Ijms 25 09988 g002
Figure 3. Validation of the model trained on the Korčula cohort in the Vis cohort shows that the model performs reasonably well with the data from an unrelated sampling event that encompasses a different population, time, and batch. Only the parameters measured in both cohorts are reported. Again, relative RMSE normalized to the mean of the dataset is shown as a metric of the success of prediction for selected biochemical and physiological parameters (lower RMSE means better prediction), based on IgG glycosylation as input. The machine learning approach (tensorflow) performs significantly better for most parameters than the regression model (glmnet). The machine learning approach has the additional advantage of being easily extensible.
Figure 3. Validation of the model trained on the Korčula cohort in the Vis cohort shows that the model performs reasonably well with the data from an unrelated sampling event that encompasses a different population, time, and batch. Only the parameters measured in both cohorts are reported. Again, relative RMSE normalized to the mean of the dataset is shown as a metric of the success of prediction for selected biochemical and physiological parameters (lower RMSE means better prediction), based on IgG glycosylation as input. The machine learning approach (tensorflow) performs significantly better for most parameters than the regression model (glmnet). The machine learning approach has the additional advantage of being easily extensible.
Ijms 25 09988 g003
Table 1. Demographic characteristics, biochemical and physiological parameters measured in the Korčula and Vis cohorts. The table presents the number of participants categorized by sex, along with the mean and standard deviation (SD) for each parameter. Missing data points for the Vis cohort are denoted by “/”. Nonparametric statistics are used to compare data from two cohorts. HDL (High-density lipoprotein), LDL (Low-density lipoprotein), VLDL (Very-low-density lipoprotein), and HbA1c (Hemoglobin A1c).
Table 1. Demographic characteristics, biochemical and physiological parameters measured in the Korčula and Vis cohorts. The table presents the number of participants categorized by sex, along with the mean and standard deviation (SD) for each parameter. Missing data points for the Vis cohort are denoted by “/”. Nonparametric statistics are used to compare data from two cohorts. HDL (High-density lipoprotein), LDL (Low-density lipoprotein), VLDL (Very-low-density lipoprotein), and HbA1c (Hemoglobin A1c).
Korčula Cohort (N = 950)Vis Cohort (N = 888)p Value
Female615 (64.7%)521 (58.7%)
Male335 (35.3%)367 (41.3%)
Age (years)53.76 ± 16.9756.30 ± 15.74
Pulse wave velocity8.64 ± 2.24/
Central systolic pressure120.16 ± 16.72/
Central diastolic pressure81.51 ± 9.20/
Heart rate65.64 ± 10.34/
Serum creatinine82.70 ± 17.3787.73 ± 28.372.064 × 10−6
Serum uric acid289.28 ± 79.20310.83 ± 93.195.564 × 10−7
Serum glucose5.68 ± 1.485.73 ± 1.554.201 × 10−1
Serum cholesterol5.81 ± 1.226.06 ± 1.144.405 × 10−6
Serum triglycerides1.39 ± 0.861.59 ± 1.017.655 × 10−7
Serum HDL1.49 ± 0.361.60 ± 0.351.031 × 10−10
Serum LDL3.70 ± 1.053.81 ± 1.052.736 × 10−2
Serum VLDL0.63 ± 0.36/
Serum calcium2.43 ± 0.102.33 ± 0.162.051 × 10−83
Serum albumin45.04 ± 2.9145.01 ± 3.642.660 × 10−1
Serum HbA1c5.45 ± 0.775.46 ± 0.876.659 × 10−1
Serum HbA1c, IFCC protocol36.14 ± 8.42/
Fibrinogen2.79 ± 0.893.73 ± 0.991.921 × 10−89
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Vujić, A.; Klasić, M.; Lauc, G.; Polašek, O.; Zoldoš, V.; Vojta, A. Predicting Biochemical and Physiological Parameters: Deep Learning from IgG Glycome Composition. Int. J. Mol. Sci. 2024, 25, 9988. https://doi.org/10.3390/ijms25189988

AMA Style

Vujić A, Klasić M, Lauc G, Polašek O, Zoldoš V, Vojta A. Predicting Biochemical and Physiological Parameters: Deep Learning from IgG Glycome Composition. International Journal of Molecular Sciences. 2024; 25(18):9988. https://doi.org/10.3390/ijms25189988

Chicago/Turabian Style

Vujić, Ana, Marija Klasić, Gordan Lauc, Ozren Polašek, Vlatka Zoldoš, and Aleksandar Vojta. 2024. "Predicting Biochemical and Physiological Parameters: Deep Learning from IgG Glycome Composition" International Journal of Molecular Sciences 25, no. 18: 9988. https://doi.org/10.3390/ijms25189988

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop