Next Article in Journal
Antennal Transcriptome Screening and Identification of Chemosensory Proteins in the Double-Spine European Spruce Bark Beetle, Ips duplicatus (Coleoptera: Scolytinae)
Previous Article in Journal
Exploring the Relationship of rs2802292 with Diabetes and NAFLD in a Southern Italian Cohort—Nutrihep Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction of Multiple Degenerative Diseases Based on DNA Methylation in a Co-Physiology Mechanisms Perspective

1
College of Computer Science and Engineering, Changchun University of Technology, Changchun 130051, China
2
School of Information Science and Technology, Institute of Computational Biology, Northeast Normal University, Changchun 130117, China
3
School of Cyber Security, School of Computer Science and Technology, Changchun University, Changchun 130022, China
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2024, 25(17), 9514; https://doi.org/10.3390/ijms25179514 (registering DOI)
Submission received: 18 July 2024 / Revised: 26 August 2024 / Accepted: 27 August 2024 / Published: 1 September 2024
(This article belongs to the Section Molecular Genetics and Genomics)

Abstract

:
Degenerative diseases oftentimes occur within the continuous process of aging, and the corresponding clinical manifestations may be neurodegeneration, neoplastic diseases, or various human complex diseases. DNA methylation provides the opportunity to explore aging and degenerative diseases as epigenetic traits. It has already been applied to age prediction and disease diagnosis. It has been shown that various degenerative diseases share co-physiology mechanisms with each other, clues of which may be gained from studying the aging process. Here, we endeavor to predict the risk of degenerative diseases in an aging-relevant comorbid mechanism perspective. Firstly, an epigenetic clock method was implemented based on a multi-scale convolutional neural network, and a Shapley feature attribution analysis was applied to discover the aging-related CpG sites. Then, these sites were further screened to a smaller subset composed of 196 sites by using biomics analysis according to their biological functions and mechanisms. Finally, we constructed a multilayer perceptron (MLP)-based degenerative disease risk prediction model, Mlp-DDR, which was well trained and tested to accurately classify nine degenerative diseases. Recent studies also suggest that DNA methylation plays a significant role in conditions like osteoporosis and osteoarthritis, broadening the potential applications of our model. This approach significantly advances the ability to understand degenerative diseases and represents a substantial shift from traditional diagnostic methods. Despite the promising results, limitations regarding model complexity and dataset diversity suggest directions for future research, including the development of tissue-specific epigenetic clocks and the inclusion of a wider range of diseases.

1. Introduction

In current aging populations, the prevalence of degenerative diseases such as osteoarthritis, cardiovascular diseases, and Alzheimer’s disease is rising, presenting significant social and economic challenges [1,2,3]. Researchers have made significant progress in understanding degenerative diseases, but there is still much to be learned regarding their precise causations and mechanisms. Recent studies have highlighted DNA methylation, an epigenetic modification, as playing a crucial role in cellular aging, potentially serving as a novel biomarker for these conditions [4]. Epigenetic clocks, which utilize DNA methylation patterns, offer a method to measure biological age and identify key methylation sites associated with aging. These insights are invaluable for understanding the mechanisms behind degenerative diseases.
Many previous efforts have been undertaken based on DNA methylation analysis, where primarily sodium bisulfite sequencing was widely used to identify DNA methylations by converting unmethylated cytosine to uracil for detection via sequencing. Correspondingly, bisulfite pyrosequencing was used to analyze DNA methylation at two ataxia telangiectasia mutated (ATM) gene loci in leukocytes, suggesting that ATM methylation could be a marker for breast cancer risk [5]. Whole-genome bisulfite sequencing was employed to assess genome-wide DNA methylation in peripheral blood samples, concluding that hypomethylation in pre-diagnosis samples could predict breast cancer risk [6]. Despite their utility, these methods are costly, time consuming, and lack precision in predicting disease risk, underscoring the need for more effective techniques to utilize DNA methylation in diagnosing degenerative diseases.
Recent studies have further emphasized the role of DNA methylation in bone and joint diseases. For example, the m6A modification has been shown to regulate bone homeostasis, offering new potential approaches for osteoporosis treatment [7]. In osteoarthritis, genome-wide association studies (GWAS) and multi-omic analyses have identified key loci associated with disease progression, highlighting the significance of DNA methylation in regulating joint diseases [8]. These findings underscore the broader relevance of DNA methylation beyond traditional age-related diseases and suggest that a more comprehensive approach to modeling these epigenetic changes could enhance diagnostic capabilities across a wider range of conditions.
As computational capabilities progress and epigenetic data grow, the precision of disease risk prediction has improved significantly. Machine learning, particularly through the development of epigenetic clocks [9,10], excels at handling complex clinical data. These models are designed to accurately predict biological age across various tissues and have been a focal point in optimizing structures to refine age acceleration estimates. Horvath’s multi-tissue clock [11], for instance, utilized an Elastic Net regression model to achieve high precision, although it has not yet been directly applied to specific age-related diseases. The Skin and Blood clock [12] and Zhang’s predictors [13] have advanced our understanding of cellular aging processes [14,15]. However, these models often overlook critical interactions among methylation sites that could be crucial for identifying biomarkers for degenerative diseases. Despite ongoing research into the methylation status of disease-associated genes, the focus has predominantly been on cancer, with minimal attention given to degenerative conditions. There is an urgent need for models that integrate a broader range of biological effects and specifically address degenerative diseases to enhance the diagnostic potential of DNA methylation.
The advent of big data has shifted the focus from traditional machine learning to deep learning, owing to its superior capability to model high-dimensional and structurally complex data [16]. Deep learning algorithms excel at automatically extracting rich and abstract features from raw data, offering significant advantages for handling large datasets. Recent studies demonstrate the efficacy of deep neural networks (DNNs) and convolutional neural networks (CNNs) in predicting medical conditions like pediatric stenosis, concussions, and various cancers using DNA methylation data [17,18,19]. These models outperform traditional machine learning techniques like random forests and support vector machines by integrating multiple approaches for enhanced prediction accuracy. While deep learning models have shown promise, ongoing research focuses on enhancing their interpretability and understanding the biological significance of DNA methylation sites. Further validation on diverse and larger datasets is necessary to ensure their predictive reliability and biological insights, advancing the field beyond current limitations.
In this study, we explored a practical approach to predict the risk of degenerative diseases by leveraging the relationship between aging and these diseases through DNA methylation data. We implemented an epigenetic clock based on a multi-scale convolutional neural network (MSCNN), which accurately predicts biological age by capturing both local and global features of the methylation data. By using Shapley additive explanations (SHAPs), we identified key CpG sites that significantly contribute to the age prediction model, providing insights into the most relevant methylation sites associated with aging. Then, we conducted a proteomics analysis based on protein–protein interaction networks, where the functional enrichment of subnetworks was investigated to confirm 196 marker CpG sites. Finally, a multilayer perceptron (MLP)-based degenerative disease risk prediction model (Mlp-DDR) was well trained and tested for nine classical degenerative diseases.

2. Results

2.1. Validation and Performance Enhancement with Selected CpG Sites

To validate the DNA methylation sites linked to degenerative diseases, further experiments were conducted. We reintegrated 196 CpG sites identified from key subnetworks into the MSCAP model and tested it on five disease datasets. Figure 1 shows the performance comparison between the original MSCAP model with 25,789 sites and the revised model with 196 sites. The updated model consistently outperformed the original, significantly enhancing the predictive accuracy for degenerative diseases. This confirms the relevance of these CpG sites in disease mechanisms, supporting further investigation into their potential for disease prediction.

2.2. Model Performance Analysis

This study developed Mlp-DDR, a degenerative disease risk prediction model, based on a diverse array of diseases: Alzheimer’s disease, Parkinson’s disease, breast cancer, colon cancer, lung cancer, liver cancer, kidney cancer, osteoporosis, and brain cancer. Four commonly used metrics were selected for assessment: AUC, ACC, precision, and recall, with the AUC being the primary metric for evaluating model performance. The predictive performance of Mlp-DDR across these diseases is detailed in Table 1. Notably, Mlp-DDR achieved the highest predictive accuracy for Alzheimer’s disease, with an AUC of 0.96, closely followed by Parkinson’s disease and breast cancer.

3. Discussions

3.1. Comparative Analysis of Machine Learning Models

In addition to the Mlp-DDR model, this study evaluated the performance of six other machine learning algorithms: K-nearest neighbors (KNN) [20], logistic regression (LR), naive Bayes (NB) [21], random forest (RF) [22], support vector machine (SVM) [23], and extreme gradient boosting (XGBoost) [24]. The predictive efficacy of these classifiers was assessed using five-fold cross-validation and four performance metrics. Table 2 provides a comparative overview of the predictive performance of these seven models across nine degenerative disease datasets, showcasing Mlp-DDR’s superior performance across all conditions assessed.
The establishment of the other six classification models involved an initial division of the training and test sets using five-fold cross-validation. Within the training sets, a grid search coupled with three-fold cross-validation was utilized to optimize the models’ parameters for the best possible outcomes. The final classification results were determined by averaging the outcomes of five experiments. Except for the Mlp-DDR model, the naive Bayes model performed the bets across four diseases, while the remaining models demonstrated good performance on two to three diseases. Mlp-DDR was particularly notable for its exceptional predictive accuracy in Alzheimer’s disease, demonstrating its efficacy with an AUC of 0.96, as further evidenced by the ROC curves shown in Figure 2.

3.2. Performance Variability among Different Diseases

The variability in the performance of the Mlp-DDR model across different degenerative diseases underscores the impact of dataset characteristics and disease complexity on predictive accuracy. The model achieved high predictive accuracy for Alzheimer’s disease, Parkinson’s disease, and breast cancer, which can be attributed to the availability of ample and high-quality data samples. These diseases had well-balanced datasets including both positive and negative samples, which helps avoid prediction biases caused by data imbalance. Additionally, the selection of DNA methylation sites closely related to these diseases allowed the model to better capture their unique characteristics and patterns, enhancing predictive reliability.
In contrast, the model exhibited lower accuracy in predicting liver cancer, lung cancer, kidney cancer, brain cancer, and osteoporosis. This reduced performance may be attributed to the inherent heterogeneity of these cancers, which includes varying subtypes, stages, and environmental factors that add layers of complexity to the disease profiles. Moreover, the datasets for these conditions were often smaller and less balanced compared to those for neurodegenerative diseases and breast cancer, posing significant challenges for the predictive model. These complexities may hinder the model’s ability to accurately discern underlying patterns, affecting its overall effectiveness in these conditions.

3.3. Insights from DNA Methylation Data Analysis

The Mlp-DDR model effectively captures the complex relationships within DNA methylation data through its neural network nodes, thereby achieving accurate predictions for degenerative diseases. The DNA methylation sites associated with these diseases enable the model to better learn and understand the diverse patterns of DNA methylation across different disease states. These insights are not only of practical significance for the early diagnosis of diseases but also provide a powerful tool for the in-depth exploration of the degenerative mechanisms of DNA methylation.
By identifying specific DNA methylation sites that are closely linked to disease mechanisms, the model can guide early intervention strategies, allowing for more personalized and timely treatment plans. Furthermore, understanding these methylation patterns could lead to the development of novel biomarkers, improving the precision of diagnostic tests and potentially leading to targeted therapies that address the underlying causes of degenerative diseases. By enhancing our understanding of disease pathogenesis, this approach offers potential for improving the methods used for disease treatment and management, ultimately contributing to better patient outcomes and quality of life.

3.4. Limitations and Future Work

Despite the significant progress made in developing the deep-learning-based epigenetic clock and disease risk prediction model, several limitations remain. The high capacity and complexity of the model, while improving accuracy, result in increased time complexity and demand for advanced experimental environments. Future work could focus on optimizing the model architecture and employing distributed training strategies to enhance efficiency.
Recent research highlights the importance of DNA methylation in bone and joint diseases, such as osteoporosis and osteoarthritis. For example, m6A modification and key loci identified through GWAS and multi-omic analyses demonstrate the critical role of DNA methylation in these conditions [7,8]. Incorporating these insights could extend our model’s applicability, offering more targeted strategies for treating a broader range of degenerative diseases.
Additionally, creating tissue-specific epigenetic clocks could improve model specificity and allow for more precise disease prediction within particular tissues. Expanding the dataset to include a wider range of diseases, such as tumors or adolescent diseases, would further enhance the model’s generalizability and applicability.
These efforts will enhance the model’s practicality and scalability, supporting its broader application in clinical settings and disease prediction tasks.

4. Materials and Methods

4.1. Datasets

DNA methylation data and clinical information of nine degenerative diseases were collected from the GEO database [25]: Alzheimer’s disease, Parkinson’s disease, breast cancer, colon cancer, lung cancer, liver cancer, kidney cancer, osteoporosis, and brain cancer. Only those samples with more than 20 individuals and no missing labels were included, resulting in a total of 2248 individual samples. The corresponding disease type distribution is shown in Figure 3. The involved diseases cover multiple organ systems such as the nervous system, skeletal system, liver, respiratory system, and digestive system, encompassing neurodegenerative diseases leading to memory loss, movement disorders, and cognitive decline, various cancers with progressive tissue damage, and skeletal system diseases.
Parkinson’s disease and Alzheimer’s disease are both neurodegenerative conditions affecting the nervous system, yet they have different stages of disease progression. In the early stages of Parkinson’s disease, patients may exhibit mild symptoms such as hand tremors, muscle stiffness, and reduced mobility. As the disease progresses to the intermediate stage, symptoms typically worsen, and patients may experience more severe motor impairments, including unstable gait, bradykinesia (slowness of movement), and increased muscle rigidity. In the advanced stages of Parkinson’s disease, severe motor disabilities can occur, potentially impacting every aspect of daily life. Patients at this stage may require constant care and support.
The staging of Alzheimer’s disease often utilizes clinical dementia ratings to assess the extent of cognitive impairment. In the early preclinical stage, patients may not show significant cognitive problems, although they might experience mild memory issues. As the disease progresses to mild cognitive impairment, patients may experience mild memory loss and a decline in cognitive function but are still capable of carrying out daily activities. In the moderate cognitive impairment stage, symptoms intensify, and patients may encounter more significant difficulties in performing daily tasks, often requiring assistance. In the later severe cognitive impairment or dementia stage, patients may completely lose the ability to care for themselves, with almost total loss of recognition of their environment and the people around them.
For the clinical data, only the “Pathological Staging” information of the patients was retained. Any sample lacking this information was removed. The staging information of Parkinson’s disease can be classified as “early Parkinson’s” and the staging information of Alzheimer’s disease can be classified as “asymptomatic” and “mild cognitive impairment” as the early stages of these degenerative diseases. Samples classified as “mid-stage Parkinson’s”, “moderate cognitive impairment”, “late-stage Parkinson’s”, and “severe cognitive impairment (dementia)” are defined as the late stages of these degenerative diseases. Table 3 presents the staging information for the nine degenerative diseases. In the clinical data labels, “0” represents the early stage and “1” represents the late stage.

4.2. Model Construction

The entire prediction process is composed of three major components: (1) an epigenetic clock model was used to identify DNA methylation sites highly correlated with age; (2) a biomics analysis module was used to screen the degenerative disease CpG marker sites; (3) a multilayer perceptron module was used to classify the nine types of degenerative diseases.
The epigenetic clock model was constructed based on a multi-scale convolutional neural network (MSCNN), termed MSCAP [26]. Since degenerative diseases are directly correlated with biological age, the MSCAP model, which accurately predicts age, can be used to identify DNA methylation sites highly correlated with age. Next, we utilized the SHAP tool, which calculates the contribution of DNA methylation sites to the model, for weight analysis of the MSCAP model. This analysis screened 1071 key DNA methylation sites critical for the model’s age prediction. Subsequently, we employed bioinformatics tools such as a protein interaction network analysis and KEGG pathway analysis to select 196 DNA methylation sites associated with degenerative diseases. Finally, a dataset containing 196 key DNA methylation sites from 2248 disease samples was fed into a multilayer perceptron module. This module processes the data through three hidden layers, each comprising 64 nodes. The model’s output consists of two labels: 0 (early-stage disease) and 1 (late-stage disease). Figure 4 illustrates the flowchart of the degenerative disease risk prediction model.

4.3. Epigenetic Clock Based on Deep Learning

The multi-scale convolutional age prediction (MSCAP) model employs a multi-scale convolutional neural network (MSCNN) architecture structured into four main components: data input, two fully connected layers, a multi-scale convolutional neural network module, and a final regression output layer. Initially, DNA methylation data from 25,789 CpG sites shared between the Illumina 27K [27] and 450K platforms [28] are input into the model. The first fully connected layer processes these inputs nonlinearly, using a sigmoid activation function to derive correlations between pairs of CpG sites. Subsequently, the MSCNN module, incorporating blocks with varying convolutional kernel sizes (1, 8, 16, and 32), processes the feature vectors to extract both local and global contextual features effectively. This design allows for a robust capture of complex interrelationships within the data, significantly enhancing predictive performance. The final output is generated through a second fully connected layer, which provides the predicted DNA methylation age based on the learned features.
The MSCAP model was trained using DNA methylation data from a diverse set of samples and compared with existing epigenetic clock models, consistently demonstrating enhanced prediction accuracy across multiple independent test datasets. This approach underscores the model’s capability to handle complex, high-dimensional datasets, thereby providing a solid foundation for the further development of predictive models for degenerative diseases.

4.4. Identification of Age-Related CpG Sites

Shapley additive explanations (SHAPs) [29,30], derived from the cooperative game theory, quantify the impact of each feature on a model’s output using Shapley values [31], providing a consistent and interpretable measure of feature importance. This analysis was utilized to determine the contribution of individual CpG sites to the predictive accuracy of the MSCAP model. By analyzing SHAP values, 1071 key CpG sites were identified based on their substantial influence. These sites were then mapped to their corresponding genes using genome annotation databases like the UCSC Genome Browser [32] and Ensembl [33], as well as bioinformatics tools such as BEDTools [34].
This mapping process involved matching coordinates and resulted in these CpG sites being associated with 1033 distinct genes, with some CpG sites mapping to the same gene. Table 4 summarizes part of the mapping results, listing CpG IDs along with their SHAP values, associated gene names, and chromosomal positions. This mapping is crucial for understanding how these CpG sites contribute to age prediction and their potential roles in biological pathways related to aging and degenerative diseases. The selected CpG sites enable a deeper biological interpretation of the MSCAP model’s outcomes, thereby enhancing the understanding of DNA methylation patterns associated with age prediction.

4.5. Enrichment Analysis

We leveraged the STRING database [35] and Cytoscape platform [36] to analyze protein interactions and construct biological networks [37,38], aiming to identify key molecular interactions related to degenerative diseases. STRING, integrating data from comprehensive sources including UniProt and KEGG [39], enabled the mapping of a vast protein interaction network using 1033 genes linked to significant CpG sites [40,41]. We then employed Cytoscape, enhanced with the Molecular Complex Detection (MCODE) plugin, to delve deeper into this network. This analysis identified densely connected regions, or subnetworks, which were prioritized based on their connectivity scores. Ultimately, the top four subnetworks with the highest scores were selected as key subnetworks, and the top four highest-scored subnetworks are depicted in Figure 5.
We subsequently utilized the Metascape platform [42] for a functional enrichment analysis [43,44] of the genes within these critical subnetworks, setting the significance threshold at a p-value of 0.05. The second-highest-scoring subnetwork predominantly involves pathways related to degenerative diseases, suggesting that the genes it includes could play crucial roles in regulating biological processes associated with these diseases. We consider the genes in the Top2 subnetwork as key to understanding the pathogenesis of degenerative diseases. By linking these genes to their corresponding DNA methylation sites, we identified key CpG sites associated with degenerative diseases, involving 83 genes corresponding to 196 DNA methylation sites. Figure 6 displays the KEGG pathway enrichment analysis heatmap for the Top2 subnetwork.

4.6. Hyperparameter Tuning and Evaluation

The optimization of model hyperparameters is principally aimed at identifying the set of hyperparameters that enables the algorithm to achieve optimal performance on a verifiable dataset. However, manual tuning often depends on the researcher’s practical experience and is time consuming and labor intensive; thus, automated hyperparameter optimization is indispensable. This paper employs a grid search approach for hyperparameter tuning coupled with a five-fold cross-validation method to evaluate the optimized model’s classification effectiveness.
Model optimization is a critical step in ensuring the best performance of a model. To select the optimal neural network architecture, this study employs grid search to systematically explore the impact of multiple key hyperparameters on the model’s outcomes, including the number of hidden layers (2, 5, or 8), the number of neurons in each hidden layer (16, 32, or 48), the dropout rate (0 or 0.1), the application of the batch normalization technique, regularization methods (L1 regularization, L2 regularization, or Elastic Net regularization), and the choice of activation functions (ReLU or SeLU). Additionally, this study considers the setting of the learning rate, experimenting with three different values: 0.0002, 0.0005, and 0.001. To ensure the stability and validity of the experiments, certain parameters are fixed, including the use of the Adam optimizer, a batch size set to 128, a fixed training period of 200 epochs, and the use of binary cross-entropy as the loss function. A learning rate decay mechanism is also introduced; if the training loss remains stable for 30 consecutive epochs, the learning rate is decayed by a factor of 0.2 to promote model convergence.
The grid search was utilized to exhaustively search through all possible hyperparameter combinations, training multiple neural network models and evaluating each model’s performance through five-fold cross-validation. The dataset was divided into five exclusive subsets. In each round of cross-validation, one subset is chosen as the validation set, while the remaining four serve as the training set, and the Area Under the ROC Curve (AUC) is used as the performance metric. For each validation set, the model’s AUC value is computed as the performance measure. Finally, the average AUC value obtained from the five cross-validation folds is taken as the ultimate metric for assessing model performance. By employing five-fold cross-validation, the model’s generalization ability and stability are more comprehensively evaluated, ensuring the selected model’s adaptability to unknown data.
After the grid search and cross-validation, the best-performing hyperparameter combination on the validation set was selected. The network consists of three hidden layers, each with 64 neurons, without dropout, and includes batch normalization. The SeLU activation function and Elastic Net regularization were applied, with the learning rate set at 0.001. The model displayed good fitting ability on the training set and also demonstrated high classification accuracy on the test set.

5. Conclusions

In this study, we developed and validated Mlp-DDR, a multi-layer-perceptron-based model for predicting the risk of nine degenerative diseases using DNA methylation data. Our approach involved constructing an epigenetic clock based on a multi-scale convolutional neural network to identify key CpG sites associated with aging, followed by the selection of 196 marker CpG sites through a proteomics and bioinformatics analysis. These selected CpG sites were then used to train the Mlp-DDR model for accurate disease risk prediction.
The Mlp-DDR model demonstrated superior predictive performance across the nine disease types, achieving the highest accuracy for Alzheimer’s disease with an AUC of 0.96. This model outperformed six other machine learning algorithms (K-nearest neighbors, logistic regression, naive Bayes, random forest, support vector machine, and extreme gradient boosting) in terms of key performance metrics such as AUC, accuracy, precision, and recall. The results underscore the potential of deep learning models in handling complex, high-dimensional biological data, thereby offering a significant advancement in the early prediction of degenerative diseases.
In future works, efforts should be focused on enhancing the interpretability of these models and validating their performance across larger and more diverse datasets. Additionally, exploring the integration of multi-omics data and longitudinal studies could further improve the predictive power and clinical applicability of these models, ultimately contributing to better disease management and patient outcomes.

Supplementary Materials

The source code developed for this study, along with the documentation, is available at https://github.com/NENUBioCompute/Mlp-DDR.

Author Contributions

Conceptualization, L.Z., R.C. and H.W.; formal analysis, R.C., C.W. and J.L.; funding acquisition, Z.K. and H.W.; investigation, R.C., C.W. and J.L.; methodology, L.Z., R.C., Z.K. and H.W.; project administration, Z.K.; software, R.C., C.W. and J.L.; supervision, H.W.; validation, R.C., C.W. and J.L.; writing—original draft, R.C. and C.W.; writing—review and editing, R.C., C.W., Z.K. and H.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (No: 62372099); the Jilin Scientific and Technological Development Program (No. 20230401092YY); and the Fundamental Research Funds for the Central Universities (2412023YQ002).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article or Supplementary Materials.

Acknowledgments

We would like to thank the teachers Li Zhang, Han Wang, and Zhejun Kuang for their dedication, as well as other teachers and students from the Institute of Computational Biology, Northeast Normal University, and Changchun University of Technology for their help.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Chaudhary, M.R.; Chaudhary, S.; Sharma, Y.; Singh, T.A.; Mishra, A.K.; Sharma, S.; Mehdi, M.M. Aging, oxidative stress and degenerative diseases: Mechanisms, complications and emerging therapeutic strategies. Biogerontology 2023, 24, 609–662. [Google Scholar] [CrossRef]
  2. Self, W.K.; Holtzman, D.M. Emerging diagnostics and therapeutics for Alzheimer disease. Nat. Med. 2023, 29, 2187–2199. [Google Scholar] [CrossRef] [PubMed]
  3. Tao, C.-W.W.; Shafieizadeh, K.; Sung, K.S.; Kwon, J. Influential factors toward the quality of dining services at continuing care retirement communities: Manager’s perspective. J. Hosp. Tour. Manag. 2023, 56, 60–68. [Google Scholar] [CrossRef]
  4. Simpson, D.J.; Chandra, T. Epigenetic age prediction. Aging Cell 2021, 20, 20. [Google Scholar] [CrossRef] [PubMed]
  5. Brennan, K.; Garcia-Closas, M.; Orr, N.; Fletcher, O.; Jones, M.; Ashworth, A.; Swerdlow, A.; Thorne, H.; Riboli, E.; Vineis, P. Intragenic ATM methylation in peripheral blood DNA as a biomarker of breast cancer risk. Cancer Res. 2012, 72, 2304–2313. [Google Scholar] [CrossRef]
  6. van Veldhoven, K.; Polidoro, S.; Baglietto, L.; Severi, G.; Sacerdote, C.; Panico, S.; Mattiello, A.; Palli, D.; Masala, G.; Krogh, V. Epigenome-wide association study reveals decreased average methylation levels years before breast cancer diagnosis. Clin. Epigenet. 2015, 7, 67. [Google Scholar] [CrossRef]
  7. Huang, M.; Xu, S.; Liu, L.; Zhang, M.; Guo, J.; Yuan, Y.; Xu, J.; Chen, X.; Zou, J. m6A methylation regulates osteoblastic differentiation and bone remodeling. Front. Cell Dev. Biol. 2021, 9, 783322. [Google Scholar] [CrossRef]
  8. Kenny, J.; Mullin, B.H.; Tomlinson, W.; Robertson, B.; Yuan, J.; Chen, W.; Zhao, J.; Pavlos, N.J.; Walsh, J.P.; Wilson, S.G. Age-dependent genetic regulation of osteoarthritis: Independent effects of immune system genes. Arthritis Res. Ther. 2023, 25, 232. [Google Scholar] [CrossRef]
  9. Levine, M.E.; Lu, A.T.; Quach, A.; Chen, B.H.; Assimes, T.L.; Bandinelli, S.; Hou, L.; Baccarelli, A.A.; Stewart, J.D.; Li, Y.; et al. An epigenetic biomarker of aging for lifespan and healthspan. Aging 2018, 10, 573–591. [Google Scholar] [CrossRef]
  10. Hannum, G.; Guinney, J.; Zhao, L.; Zhang, L.; Hughes, G.; Sadda, S.; Klotzle, B.; Bibikova, M.; Fan, J.-B.; Gao, Y. Genome-wide methylation profiles reveal quantitative views of human aging rates. Mol. Cell 2013, 49, 359–367. [Google Scholar] [CrossRef]
  11. Horvath, S. DNA methylation age of human tissues and cell types. Genome Biol. 2013, 14, R115. [Google Scholar] [CrossRef]
  12. Horvath, S.; Oshima, J.; Martin, G.M.; Lu, A.T.; Quach, A.; Cohen, H.; Felton, S.; Matsuyama, M.; Lowe, D.; Kabacik, S. Epigenetic clock for skin and blood cells applied to Hutchinson Gilford Progeria Syndrome and ex vivo studies. Aging 2018, 10, 1758. [Google Scholar] [CrossRef] [PubMed]
  13. Zhang, Q.; Vallerga, C.L.; Walker, R.M.; Lin, T.; Henders, A.K.; Montgomery, G.W.; He, J.; Fan, D.S.; Fowdar, J.; Kennedy, M.; et al. Improved precision of epigenetic clock estimates across tissues and its implication for biological ageing. Genome Med. 2019, 11, 11. [Google Scholar] [CrossRef] [PubMed]
  14. Shi, L.; Hai, B.; Kuang, Z.; Wang, H.; Zhao, J. ResnetAge: A Resnet-Based DNA Methylation Age Prediction Method. Bioengineering 2023, 11, 34. [Google Scholar] [CrossRef] [PubMed]
  15. Zhao, J.; Li, H.; Qu, J.; Zong, X.; Liu, Y.; Kuang, Z.; Wang, H. A multi-organization epigenetic age prediction based on a channel attention perceptron networks. Front. Genet. 2024, 15, 1393856. [Google Scholar] [CrossRef] [PubMed]
  16. Zhang, Z.Q.; Zhao, Y.; Liao, X.K.; Shi, W.Q.; Li, K.L.; Zou, Q.; Peng, S.L. Deep learning in omics: A survey and guideline. Brief. Funct. Genom. 2019, 18, 41–57. [Google Scholar] [CrossRef] [PubMed]
  17. Bahado-Singh, R.O.; Vishweswaraiah, S.; Aydas, B.; Yilmaz, A.; Saiyed, N.M.; Mishra, N.K.; Guda, C.; Radhakrishna, U. Precision cardiovascular medicine: Artificial intelligence and epigenetics for the pathogenesis and prediction of coarctation in neonates. J. Matern.-Fetal Neonatal Med. 2022, 35, 457–464. [Google Scholar] [CrossRef]
  18. Bahado-Singh, R.O.; Vishweswaraiah, S.; Er, A.; Aydas, B.; Turkoglu, O.; Taskin, B.D.; Duman, M.; Yilmaz, D.; Radhakrishna, U. Artificial intelligence and the detection of pediatric concussion using epigenomic analysis. Brain Res. 2020, 1726, 146510. [Google Scholar] [CrossRef] [PubMed]
  19. Xia, C.; Xiao, Y.; Wu, J.; Zhao, X.; Li, H. A convolutional neural network based ensemble method for cancer prediction using DNA methylation data. In Proceedings of the 2019 11th International Conference on Machine Learning and Computing, Zhuhai China, 22–24 February 2019; pp. 191–196. [Google Scholar]
  20. Li, H. K-Nearest Neighbor. In Machine Learning Methods; Springer: Berlin/Heidelberg, Germany, 2023; pp. 55–66. [Google Scholar]
  21. Reddy, E.M.K.; Gurrala, A.; Hasitha, V.B.; Kumar, K.V.R. Introduction to Naive Bayes and a review on its subtypes with applications. In Bayesian Reasoning and Gaussian Processes for Machine Learning Applications; Chapman and Hall/CRC: Boca Raton, FL, USA, 2022; pp. 1–14. [Google Scholar]
  22. Rigatti, S.J. Random forest. J. Insur. Med. 2017, 47, 31–39. [Google Scholar] [CrossRef]
  23. Suthaharan, S. Machine Learning Models and Algorithms for Big Data Classification; Springer: Berlin/Heidelberg, Germany, 2016; Volume 36, pp. 1–12. [Google Scholar]
  24. Chen, T.; Guestrin, C. In Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM Sigkdd International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
  25. Chicco, D. geneExpressionFromGEO: An R package to facilitate data reading from gene expression Omnibus (GEO). In Microarray Data Analysis; R Company: New York, NY, USA, 2022; pp. 187–194. [Google Scholar]
  26. Wang, H.; Cai, R.; Zong, X.; He, Z.; Zhang, L. MSCAP: DNA Methylation Age Predictor based on Multiscale Convolutional Neural Network. In Proceedings of the 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Istanbul, Turkiye, 5–8 December 2023; IEEE: New York, NY, USA, 2023; pp. 3582–3586. [Google Scholar]
  27. Chen, Y.-a.; Choufani, S.; Ferreira, J.C.; Grafodatskaya, D.; Butcher, D.T.; Weksberg, R. Sequence overlap between autosomal and sex-linked probes on the Illumina HumanMethylation27 microarray. Genomics 2011, 97, 214–222. [Google Scholar] [CrossRef]
  28. Vanderlinden, L.A.; Johnson, R.K.; Carry, P.M.; Dong, F.; DeMeo, D.L.; Yang, I.V.; Norris, J.M.; Kechris, K. An effective processing pipeline for harmonizing DNA methylation data from Illumina’s 450K and EPIC platforms for epidemiological studies. BMC Res. Notes 2021, 14, 352. [Google Scholar] [CrossRef]
  29. Chen, H.; Lundberg, S.M.; Lee, S.I. Explaining a series of models by propagating Shapley values. Nat. Commun. 2022, 13, 15. [Google Scholar] [CrossRef] [PubMed]
  30. Van den Broeck, G.; Lykov, A.; Schleich, M.; Suciu, D. On the Tractability of SHAP Explanations. J. Artif. Intell. Res. 2022, 74, 851–886. [Google Scholar] [CrossRef]
  31. Marcílio, W.E.; Eler, D.M. From explanations to feature selection: Assessing SHAP values as feature selection mechanism. In Proceedings of the 2020 33rd SIBGRAPI conference on Graphics, Patterns and Images (SIBGRAPI), Porto de Galinhas, Brazil, 7–10 November 2020; IEEE: New York, NY, USA, 2020; pp. 340–347. [Google Scholar]
  32. Karolchik, D.; Baertsch, R.; Diekhans, M.; Furey, T.S.; Hinrichs, A.; Lu, Y.T.; Roskin, K.M.; Schwartz, M.; Sugnet, C.W.; Thomas, D.J.; et al. The UCSC Genome Browser Database. Nucleic Acids Res. 2003, 31, 51–54. [Google Scholar] [CrossRef] [PubMed]
  33. Howe, K.L.; Achuthan, P.; Allen, J.; Allen, J.; Alvarez-Jarreta, J.; Amode, M.R.; Armean, I.M.; Azov, A.G.; Bennett, R.; Bhai, J.; et al. Ensembl 2021. Nucleic Acids Res. 2020, 49, D884–D891. [Google Scholar] [CrossRef]
  34. Quinlan, A.R.; Hall, I.M. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 2010, 26, 841–842. [Google Scholar] [CrossRef]
  35. Szklarczyk, D.; Kirsch, R.; Koutrouli, M.; Nastou, K.; Mehryary, F.; Hachilif, R.; Gable, A.L.; Fang, T.; Doncheva, N.T.; Pyysalo, S.; et al. The STRING database in 2023: Protein-protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res. 2023, 51, D638–D646. [Google Scholar] [CrossRef] [PubMed]
  36. Legeay, M.; Doncheva, N.T.; Morris, J.H.; Jensen, L.J. Visualize omics data on networks with Omics Visualizer, a Cytoscape App. F1000Research 2020, 9, 157. [Google Scholar] [CrossRef]
  37. Jansen, I.E.; van der Lee, S.J.; Gomez-Fonseca, D.; De Rojas, I.; Dalmasso, M.C.; Grenier-Boley, B.; Zettergren, A.; Mishra, A.; Ali, M.; Andrade, V. Genome-wide meta-analysis for Alzheimer’s disease cerebrospinal fluid biomarkers. Acta Neuropathol. 2022, 144, 821–842. [Google Scholar] [CrossRef]
  38. Juan, H.F.; Lin, J.Y.C.; Chang, W.H.; Wu, C.Y.; Pan, T.L.; Tseng, M.J.; Khoo, K.H.; Chen, S.T. Biomic study of human myeloid leukemia cells differentiation to macrophages using DNA array, proteomic, and bioinformatic analytical methods. Electrophoresis 2002, 23, 2490–2504. [Google Scholar] [CrossRef]
  39. Huang, F.; Fu, M.; Li, J.; Chen, L.; Feng, K.; Huang, T.; Cai, Y.-D. Analysis and prediction of protein stability based on interaction network, gene ontology, and KEGG pathway enrichment scores. Biochim. Biophys. Acta (BBA)-Proteins Proteom. 2023, 1871, 140889. [Google Scholar] [CrossRef] [PubMed]
  40. Safari-Alighiarloo, N.; Taghizadeh, M.; Rezaei-Tavirani, M.; Goliaei, B.; Peyvandi, A.A. Protein-protein interaction networks (PPI) and complex diseases. Gastroenterol. Hepatol. Bed Bench 2014, 7, 17. [Google Scholar] [PubMed]
  41. Wang, S.; Wu, R.X.; Lu, J.Q.; Jiang, Y.J.; Huang, T.; Cai, Y.D. Protein-protein interaction networks as miners of biological discovery. Proteomics 2022, 22, 20. [Google Scholar] [CrossRef] [PubMed]
  42. Zhou, Y.Y.; Zhou, B.; Pache, L.; Chang, M.; Khodabakhshi, A.H.; Tanaseichuk, O.; Benner, C.; Chanda, S.K. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat. Commun. 2019, 10, 10. [Google Scholar] [CrossRef]
  43. Korotkevich, G.; Sukhov, V.; Budin, N.; Shpak, B.; Artyomov, M.N.; Sergushichev, A. Fast gene set enrichment analysis. biorxiv 2016, 060012. [Google Scholar] [CrossRef]
  44. Tipney, H.; Hunter, L. An introduction to effective use of enrichment analysis software. Hum. Genom. 2010, 4, 202–206. [Google Scholar] [CrossRef]
Figure 1. Comparison of the results before and after retraining the MSCAP model using selected CpG sites. This figure displays the performance comparison of the MSCAP model using the original 25,789 sites (‘Before’) and after retraining with the 196 selected methylation sites (‘After’) across five degenerative disease datasets: GSE111223, GSE80970, GSE99624, GSE43414, and GSE59685. The graph shows that the retrained model with fewer sites significantly outperforms the original, indicating that these sites are highly predictive of degenerative diseases. The Mean Absolute Deviation (MAD) scores are lower in the ‘After’ scenario, demonstrating improved model accuracy.
Figure 1. Comparison of the results before and after retraining the MSCAP model using selected CpG sites. This figure displays the performance comparison of the MSCAP model using the original 25,789 sites (‘Before’) and after retraining with the 196 selected methylation sites (‘After’) across five degenerative disease datasets: GSE111223, GSE80970, GSE99624, GSE43414, and GSE59685. The graph shows that the retrained model with fewer sites significantly outperforms the original, indicating that these sites are highly predictive of degenerative diseases. The Mean Absolute Deviation (MAD) scores are lower in the ‘After’ scenario, demonstrating improved model accuracy.
Ijms 25 09514 g001
Figure 2. ROC curves of seven models for nine degenerative diseases. This figure illustrates the ROC curves for various models tested across nine degenerative diseases: Alzheimer’s disease, Parkinson’s disease, breast cancer, colon cancer, lung cancer, liver cancer, kidney cancer, osteoporosis, and brain cancer. Each curve represents the performance of a different predictive model (MLP, logistic regression, naive Bayes, random forest, SVM, and XGBoost) across these diseases. The Area Under the Curve (AUC) for each model is indicated in the legend, reflecting the models’ ability to distinguish between disease and non-disease states effectively. The Alzheimer’s disease model, particularly MLP-DDR, achieved the highest AUC of 0.96, demonstrating its superior predictive accuracy. The data used for this analysis were collected from the GEO database, encompassing DNA methylation data and clinical information for the nine degenerative diseases.
Figure 2. ROC curves of seven models for nine degenerative diseases. This figure illustrates the ROC curves for various models tested across nine degenerative diseases: Alzheimer’s disease, Parkinson’s disease, breast cancer, colon cancer, lung cancer, liver cancer, kidney cancer, osteoporosis, and brain cancer. Each curve represents the performance of a different predictive model (MLP, logistic regression, naive Bayes, random forest, SVM, and XGBoost) across these diseases. The Area Under the Curve (AUC) for each model is indicated in the legend, reflecting the models’ ability to distinguish between disease and non-disease states effectively. The Alzheimer’s disease model, particularly MLP-DDR, achieved the highest AUC of 0.96, demonstrating its superior predictive accuracy. The data used for this analysis were collected from the GEO database, encompassing DNA methylation data and clinical information for the nine degenerative diseases.
Ijms 25 09514 g002
Figure 3. Pie chart of the dataset distribution of degenerative diseases. This pie chart clearly delineates the distribution of the dataset across nine different types of degenerative diseases. Alzheimer’s disease comprises the largest segment at 32%, followed by Parkinson’s disease at 21%, emphasizing their prominent representation in the dataset. Both breast cancer and colon cancer are significant portions of the dataset, with breast cancer accounting for 13% and colon cancer for 12%. The remaining five diseases are sorted in order of their share. The data used for this analysis were sourced from the GEO database.
Figure 3. Pie chart of the dataset distribution of degenerative diseases. This pie chart clearly delineates the distribution of the dataset across nine different types of degenerative diseases. Alzheimer’s disease comprises the largest segment at 32%, followed by Parkinson’s disease at 21%, emphasizing their prominent representation in the dataset. Both breast cancer and colon cancer are significant portions of the dataset, with breast cancer accounting for 13% and colon cancer for 12%. The remaining five diseases are sorted in order of their share. The data used for this analysis were sourced from the GEO database.
Ijms 25 09514 g003
Figure 4. Flow chart of risk prediction for degenerative diseases. This figure illustrates the structured flowchart of the degenerative disease risk prediction model, divided into three phases. First, an epigenetic clock model, MSCAP, is constructed using a multi-scale convolutional neural network (MSCNN) to identify DNA methylation sites with significant degenerative patterns. Next, the SHAP tool analyzes the MSCAP model’s weights, identifying 1071 significant DNA methylation sites. Through proteomics analysis, 196 sites associated with degenerative disease mechanisms are selected. Finally, 2248 disease samples, including the 196 key sites, are input into a multilayer perceptron (MLP) module with three hidden layers of 64 nodes each, outputting two labels: 0 (early-stage disease) and 1 (late-stage disease).
Figure 4. Flow chart of risk prediction for degenerative diseases. This figure illustrates the structured flowchart of the degenerative disease risk prediction model, divided into three phases. First, an epigenetic clock model, MSCAP, is constructed using a multi-scale convolutional neural network (MSCNN) to identify DNA methylation sites with significant degenerative patterns. Next, the SHAP tool analyzes the MSCAP model’s weights, identifying 1071 significant DNA methylation sites. Through proteomics analysis, 196 sites associated with degenerative disease mechanisms are selected. Finally, 2248 disease samples, including the 196 key sites, are input into a multilayer perceptron (MLP) module with three hidden layers of 64 nodes each, outputting two labels: 0 (early-stage disease) and 1 (late-stage disease).
Ijms 25 09514 g004
Figure 5. Four key subnetworks. This figure displays the top four subnetworks identified using STRING and Cytoscape with the MCODE plugin, which highlight crucial gene interactions linked to degenerative diseases. Node size and color indicate the gene’s importance and connectivity; larger and darker nodes signify higher significance.
Figure 5. Four key subnetworks. This figure displays the top four subnetworks identified using STRING and Cytoscape with the MCODE plugin, which highlight crucial gene interactions linked to degenerative diseases. Node size and color indicate the gene’s importance and connectivity; larger and darker nodes signify higher significance.
Ijms 25 09514 g005
Figure 6. KEGG pathway analysis of the Top2 subnetwork. This figure displays a heatmap of the KEGG pathway enrichment analysis for the Top2 subnetwork, derived from our research using the Metascape platform. Each bar in the heatmap represents a specific pathway, with the color intensity reflecting the −log10(P) value, which quantifies the statistical significance of the enrichment. Higher values, indicated by darker colors, suggest stronger associations between these pathways and the genes within the Top2 subnetwork, underscoring their potential impact on degenerative disease mechanisms.
Figure 6. KEGG pathway analysis of the Top2 subnetwork. This figure displays a heatmap of the KEGG pathway enrichment analysis for the Top2 subnetwork, derived from our research using the Metascape platform. Each bar in the heatmap represents a specific pathway, with the color intensity reflecting the −log10(P) value, which quantifies the statistical significance of the enrichment. Higher values, indicated by darker colors, suggest stronger associations between these pathways and the genes within the Top2 subnetwork, underscoring their potential impact on degenerative disease mechanisms.
Ijms 25 09514 g006
Table 1. Predictive performance of Mlp-DDR on nine degenerative diseases.
Table 1. Predictive performance of Mlp-DDR on nine degenerative diseases.
Degenerative Disease TypeAUCACCRecallPrecision
Alzheimer’s Disease0.960.920.970.92
Parkinson’s Disease0.920.900.960.87
Breast Cancer0.930.910.970.91
Colon Cancer0.850.830.970.79
Lung Cancer0.850.770.890.77
Liver Cancer0.830.730.620.83
Kidney Cancer0.810.830.940.83
Osteoporosis0.710.630.820.50
Brain Cancer0.800.711.000.63
Table 2. (a) Comparative predictive performance of seven models on nine degenerative disease datasets. (b) Comparative predictive performance of seven models across nine degenerative disease datasets. (c) Comparative predictive performance of seven models across nine degenerative disease datasets.
Table 2. (a) Comparative predictive performance of seven models on nine degenerative disease datasets. (b) Comparative predictive performance of seven models across nine degenerative disease datasets. (c) Comparative predictive performance of seven models across nine degenerative disease datasets.
(a)
Disease TypeModelAUCACCPrecisionRecall
Alzheimer’s DiseaseMLP0.960.920.970.92
LR0.930.930.990.91
SVM0.920.930.990.91
RF0.910.930.990.91
XGBoost0.900.930.990.91
KNN0.910.930.970.93
NB0.900.930.980.92
Parkinson’s DiseaseMLP0.920.900.960.87
LR0.890.900.980.86
SVM0.870.911.000.86
RF0.870.900.980.86
XGBoost0.880.881.000.82
KNN0.850.850.890.84
NB0.900.911.000.86
Breast CancerMLP0.930.910.970.91
LR0.840.791.000.77
SVM0.850.861.000.84
RF0.820.740.790.85
(b)
Disease TypeModelAUCACCPrecisionRecall
Breast CancerXGBoost0.650.700.760.82
KNN0.840.890.960.90
NB0.810.720.690.91
Colon CancerMLP0.850.830.970.79
LR0.780.591.000.59
SVM0.790.591.000.59
RF0.760.791.000.74
XGBoost0.750.791.000.74
KNN0.820.830.880.83
NB0.770.780.910.76
Lung CancerMLP0.850.770.890.77
LR0.780.631.000.63
SVM0.800.770.890.77
RF0.750.831.000.79
XGBoost0.770.831.000.79
KNN0.690.700.740.78
NB0.750.831.000.79
Liver CancerMLP0.830.730.620.83
LR0.640.551.000.55
SVM0.690.640.790.63
RF0.660.610.710.63
XGBoost0.700.730.960.68
KNN0.820.820.830.83
NB0.690.700.830.69
Kidney CancerMLP0.810.830.940.83
LR0.770.671.000.67
SVM0.480.671.000.67
RF0.720.790.940.79
XGBoost0.690.750.880.78
KNN0.590.580.560.75
NB0.770.790.940.79
OsteoporosisMLP0.710.630.820.50
LR0.680.371.000.37
SVM0.680.371.000.37
RF0.520.570.360.40
XGBoost0.690.630.910.50
KNN0.610.600.640.47
NB0.700.601.000.48
(c)
Disease TypeModelAUCACCPrecisionRecall
Brain CancerMLP0.800.711.000.63
LR0.660.501.000.50
SVM0.760.501.000.50
RF0.670.671.000.60
XGBoost0.580.580.750.56
KNN0.750.751.000.67
NB0.760.750.830.71
Table 3. Staging information of degenerative diseases.
Table 3. Staging information of degenerative diseases.
DiseaseStageSample Count
Alzheimer’s DiseaseEarly401
Late321
Parkinson’s DiseaseEarly229
Late238
Breast CancerEarly148
Late134
Colon CancerEarly146
Late121
Lung CancerEarly84
Late104
Liver CancerEarly68
Late46
Kidney CancerEarly37
Late39
OsteoporosisEarly38
Late34
Brain CancerEarly33
Late27
Table 4. Coordinate information of partial CpG sites.
Table 4. Coordinate information of partial CpG sites.
CpG IDSHAP ValuesGene NameChromosome Position
cg00008493−3.31 × 10−5COX8C14
cg00024812−5.67 × 10−5CPSF32
cg00033773−4.54 × 10−5MORG119
cg00094319−2.96 × 10−5PABPC313
cg00103783−8.55 × 10−5MPDU117
cg00136477−7.97 × 10−5C1QC1
cg00187686−2.08 × 10−5TCN111
cg00234961−2.22 × 10−5ZBED422
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, L.; Cai, R.; Wang, C.; Liu, J.; Kuang, Z.; Wang, H. Prediction of Multiple Degenerative Diseases Based on DNA Methylation in a Co-Physiology Mechanisms Perspective. Int. J. Mol. Sci. 2024, 25, 9514. https://doi.org/10.3390/ijms25179514

AMA Style

Zhang L, Cai R, Wang C, Liu J, Kuang Z, Wang H. Prediction of Multiple Degenerative Diseases Based on DNA Methylation in a Co-Physiology Mechanisms Perspective. International Journal of Molecular Sciences. 2024; 25(17):9514. https://doi.org/10.3390/ijms25179514

Chicago/Turabian Style

Zhang, Li, Ruirui Cai, Chencai Wang, Jialong Liu, Zhejun Kuang, and Han Wang. 2024. "Prediction of Multiple Degenerative Diseases Based on DNA Methylation in a Co-Physiology Mechanisms Perspective" International Journal of Molecular Sciences 25, no. 17: 9514. https://doi.org/10.3390/ijms25179514

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop