Next Article in Journal
Recent Strategies for Ni3S2-Based Electrocatalysts with Enhanced Hydrogen Evolution Performance: A Tutorial Review
Next Article in Special Issue
Measurement of Disease Comorbidity Using Semantic Profiling of Disease Genes
Previous Article in Journal
NK Cells Modulate Dendritic Cell (DC) Signaling Pathways and DC Recruitment in Chlamydial Infection
Previous Article in Special Issue
One Health Approach to the Computational Design of a Lipoprotein-Based Multi-Epitope Vaccine Against Human and Livestock Tuberculosis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

In Silico Analysis Identified Putative Pathogenic Missense Single Nucleotide Polymorphisms (SNPs) in the Human HNF1A Gene

by
Hitham Aldharee
1,2,* and
Hamdan Z. Hamdan
1
1
Department of Pathology, College of Medicine, Qassim University, Buraidah 51452, Saudi Arabia
2
Unit of Genetic Diabetes, Abdullah Al Othaim Diabetes Center, Medical City, Qassim University, Buraidah 51452, Saudi Arabia
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2025, 26(8), 3768; https://doi.org/10.3390/ijms26083768
Submission received: 5 March 2025 / Revised: 29 March 2025 / Accepted: 15 April 2025 / Published: 16 April 2025

Abstract

:
Maturity-onset diabetes of the young (MODY) is a rare genetic condition that affects children, adolescents, and adults. Studies have shown that genetic changes in the HNF1A gene are associated with MODY-3. However, most of the causative variants and the molecular mechanisms remain underexplored. This study aims to better understand MODY-3 by investigating HNF1A-missense variants with clinical uncertainty. Various bioinformatics tools were utilised to address the clinical uncertainty of missense variants in the HNF1A gene that have not been linked with HNF1A-related conditions, sourced from the Genome Aggregation Database (GnomAD v4.1.0). Among the clinically uncertain 2444 variants, only 138 were classified as missense with clinically uncertain significance. Results show that four variants (Arg168Cys, Glu275Ala, Gly375Asp and Val411Phe) were consistently predicted as pathogenic by all tools. The allele frequency (AF) of the commonly predicted disease-causing variants was very low in the global population. The assessment of the secondary structure of filtered variants indicates that variants (Arg168Cys and Glu275Ala) are located in the helical region of the HNF1A protein. At the same time (Gly375Asp and Val411Phe) are found in the protein’s coil, suggesting structural changes at the site of variations. The prediction of protein stability was conducted using I-Mutant and MuPro. Both tools collectively indicate decreased protein stability for the variants (Arg168Cys, Glu275Ala, Gly375Asp and Val411Phe). Predicting the protein’s 3D structure for the HNF1A wild-type and mutants indicates potential structural damages in Arg168Cys and Gly375Asp. Additionally, results show that the amino acids at the variation sites of the variants (Arg168Cys, Glu275Ala, Gly375Asp and Val411Phe) were highly conserved. To conclude, 4 out of the 138 missense variants labelled as uncertain significance were found to be consistently pathogenic using in silico tools in this study. Our findings aim to support variant interpretation, understand the genotype–phenotype association of diabetes, and provide better healthcare services for patients with diabetes.

1. Introduction

Maturity-onset diabetes of the young (MODY; MIM #606391) is a rare form of genetic diabetes that was first described in 1974 [1,2]. MODY comprises hyperglycaemia in young people (under 25 years of age) caused by monogenic alterations of genes responsible for beta-cell function and insulin action, representing 1–5% of diabetic cases [2,3,4]. It is categorised into subtypes (MODY1-14), each linked with a defect in a specific gene and mostly inherited in an autosomal dominant pattern [1,5]. MODY-3 is the most prevalent subtype, caused by variants in the hepatocyte nuclear factor 1-alpha gene (HNF1A; OMIM #142410), which is responsible for producing the HNF1A protein that plays crucial roles in regulating the activity of genes involved in blood glucose level control.
The HNF1A is mapped to (12q24.31) and comprises 10 exons, which encode 631 amino acids and the HNF1A protein [5,6,7].
According to published data by the National Human Genome Research Institute (NHGRI), an individual genome is estimated to contain thousands of single nucleotide polymorphisms (SNPs), insertion/deletion, and structural variants. These variations may be pathogenic, linked to some abnormal conditions, present benign effects, or are uncertain. Regarding MODY-3, a list of variants have been identified within the HNF1A gene: missense, Indels, frameshift, inframe deletion, inframe insertion, and splice sites [5,8]. Although most genetic changes are characterised as SNPs, whether the reported variants alter the normal expression and function of the HNF1A protein is still under-studied. Furthermore, missense variants are known for their effects on the translated messenger RNA (mRNA) and expressed protein. According to the Genome Aggregation Database (GnomAD v4.1.0), hundreds of missense variants have been reported in the HNF1A gene; however, some are classified as pathogenic, likely pathogenic, or benign, while the remaining variants are under investigation and classified as uncertain in terms of their clinical effects.
Identifying pathogenic SNPs involved in the causation of rare diseases such as MODY is complex and challenging due to the requirement of many cases, multiple genetic testing, and SNPs prioritisation. An alternative, efficient, and reliable approach is recommended to support the discrimination of unclassified variants and determine whether they are causative and affect the disease’s prognosis. Clinically, computational-based evidence is approved as a criterion for classifying variants by the American College of Medicine (ACMG) and the Association of Molecular Pathology (ACP) [9]. Moreover, it has been reported that most algorithms used for missense variants show an accuracy of 65–80% in examining disease-causative variants [10]. Therefore, in silico analysis through bioinformatics tools could prove to be a promising strategy to address this limitation in MODY, given the capabilities of such tools in differentiating disease-causative from non-causative variants.
This study aims to enhance the genetic understanding of MODY-3 by detailing the potential effects of missense variants with uncertain clinical significance reported by the GnomAD v4.1.0 database on the normal expression and function of the HNF1A protein.

2. Results

2.1. Variant Recruitment and Selection

Searching the GnomAD v4.1.0 databases revealed 2442 variants in the HNF1A gene. Among these, 817 were missense variants, with the remaining distributed across splice site, 5′ UTR, 3′ UTR, frameshift, in-frame insertion and deletion, intronic, start-lost, stop-gained, stop-retained, and synonymous variants. We selected a total of 138 missense variants, which cause changes in amino acids and are categorised as uncertain significance (US), for further bioinformatic analysis.

2.2. Predicting Pathogenicity of Variants

The selected missense–US variants were analysed using the following bioinformatics tools to assess their potential pathogenicity: SNPs&GO, Meta-SNP, Polyphen2, PANTHER, SIFT, PhD-SNP, and SNAP2. The results indicate that all tools collectively predicted only four out of 138 variants as pathogenic (Table 1). Individual reports show that SNPs&GO predicted 135 variants as disease-causing, Meta-SNP identified 24 as disease-causing, Polyphen2 indicated 86 as probably damaging, PANTHER found 40 as disease-causing, SIFT identified 46 as disease-causing, PhD-SNP reported 22 as disease-causing, and SNAP2 indicated 79 as disease-causing. Ultimately, the variants commonly predicted to be pathogenic were selected for further analysis.
Further analysis of the commonly predicted variants using AlphaMissense (https://www.science.org/doi/10.1126/science.adg7492, accessed on 24 January 2025), an AI-based tool, yielded the following scores: p.Arg168Cys (0.97, Pathogenic), p.Glu275Ala (0.85, Pathogenic), p.Gly375Asp (0.92, Pathogenic), and p.Val411Ala (0.78, likely pathogenic). These high-confidence predictions support our earlier findings and reinforce the deleterious nature of these missense mutations.

2.3. Determining Variant Frequency

The frequency of the commonly predicted disease-causing variants was assessed using dbSNP (https://www.ncbi.nlm.nih.gov/snp/, accessed on 24 January 2025). The results indicate that the frequency of variant I among the global population was (0.00006), variant II (0.00004), variant III (0.00004), and variant IV (0.00004). The frequency of all variants is very low, suggesting a high probability of pathogenicity.

2.4. Predicting Protein Secondary Structure

Quick2D, a comprehensive tool, was used to predict secondary structures. The results indicate that Arg168Cys and Val411Phe reside within the alpha helix region, which may disrupt protein integrity and stability. Conversely, Glu275Ala is located in the coil region, while Gly375Asp is located in the strand region, which could influence beta-sheet formation and protein–protein interaction.

2.5. Predicting Protein Stability

The stability of proteins associated with common pathogenic variants was evaluated using the MUpro and I-Mutant bioinformatics tools. The results indicated that variants I, II, III, and IV were typically predicted to reduce protein stability, suggesting a loss of function in the HNF1A protein (Table 2).

2.6. Protein–Protein Interaction

The interactor proteins of HNF1A were examined using the STRING bioinformatics tool. The findings indicate that the HNF1A protein interacts with PCBD1, PCBD2, HNF1B, HNF4G, FOXA2, FOXA3, GCK, HNF4A, ONECUT1, and SLC2A2 (Figure 1). Among these, PCBD1 displays the closest interaction, directly interacting with HNF1A.

2.7. Predicting Protein 3D Structure

The 3D structural analysis of the HNF1A wild-type protein and the predicted pathogenic variants was carried out using I-TASSER and UCSF Chimera tools. Results indicate that structural damage was observed in Arg168Cys (in terms of replacing buried charged residue R with a non-charged residue C, with RSA = 0.4%) and Gly375Asp (manifested as introducing a charged residue in a place of uncharged residue with RSA = 1.1%) (Figure 2B), while no damage was seen in the structure of the other mutants. Additionally, the 3D models of the mutants were superimposed onto the wild-type model for manual comparison of the structural similarities. Findings suggest that the variant Arg168Cys shows the highest similarity to the wild-type protein compared to Gly375Asp (Figure 2C).

2.8. Conservation of the HNF1A Amino Acids

Results from multiple sequence alignments using the Jalview software reveal that all amino acids at the variation sites of variants I, II, III, and IV are highly conserved across all selected species (Figure 3). These findings suggest a functional and/or evolutionary significance of the residues at the variation sites.

3. Materials and Methods

3.1. Variants Recruitment and Selection

Missense variants of the HNF1A gene were retrieved from the GnomAD v4.1.0 database (https://gnomad.broadinstitute.org, accessed on 3 October 2024). A file listing all the variants of the HNF1A gene was manually filtered using an MS Excel-supported file. Only missense variants classified as variants of unknown significance (VUS), have not been reported in ClinVar with HNF1A- related conditions, and located within the coding region of the HNF1A gene were selected for further analysis. The transcript (ENST00000257555.11) was used as a reference in accordance with the Human Genome Variation Society (HGVS) guidelines. The selected variants were analysed using various bioinformatics tools for variant prioritisation.

3.2. Predicting Pathogenicity of Variants

The following bioinformatics tools were used to predict and distinguish pathogenic variants from non-pathogenic ones: SNPs&GO (https://snps.biofold.org/snps-and-go/snps-and-go.html, accessed on 12 October 2024), Meta-SNP (https://snps.biofold.org/meta-snp/, accessed on 26 October 2024), Polymorphism Phenotyping v2 (Polyphen2) (http://genetics.bwh.harvard.edu/pph2/, accessed on 13 November 2024), protein analysis through evolutionary relationships (PANTHER) (https://pantherdb.org, accessed on 21 November 2024), sorting intolerant from tolerant (SIFT) (https://sift.bii.a-star.edu.sg, accessed on 5 December 2024), predictor of human deleterious SNP (PhD-SNP) (https://snps.biofold.org/phd-snp/phd-snp.html, accessed on 17 December 2024), and SNAP2 (https://bio.tools/snap2, accessed on 26 December 2024), and AlphaMissense (https://www.science.org/doi/10.1126/science.adg7492, accessed on 24 January 2025) [11].

3.3. Determining Variant Frequency

The Single Nucleotide Polymorphism Database (dbSNP) was used to calculate the variation allele frequency (VAF) of selected variants predicted as pathogenic by all the bioinformatic tools (https://www.ncbi.nlm.nih.gov/snp/, accessed on 2 January 2025).

3.4. Predicting Protein Secondary Structure

The prediction of the secondary structure of the HNF1A natural protein and the proteins of selected variants predicted as pathogenic by all the bioinformatics tools was conducted using the Quick2D (https://toolkit.tuebingen.mpg.de/tools/quick2d, accessed on 23 March 2025).

3.5. Predicting Protein Stability

The protein stability (PS) of selected variants, which were predicted as pathogenic by all the bioinformatic tools, was assessed using MUpro (http://mupro.proteomics.ics.uci.edu, accessed on 15 January 2025) and I-Mutant 2.0 (https://folding.biofold.org/i-mutant/i-mutant2.0.html, accessed on 18 January 2025).

3.6. Protein–Protein Interaction

The assessment of HNF1A interactive proteins was conducted using the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) (https://string-db.org, accessed on 18 January 2025), which relies on co-expression, experimental data, function, and gene fusion and interaction.

3.7. Predicting Protein 3D Structure

The 3D structure of the HNF1A natural protein and the proteins of selected variants, predicted to be pathogenic by all bioinformatic tools, were predicted using the Iterative Threading ASSEmbly Refinement (I-TASSER) tool (https://seq2fun.dcmb.med.umich.edu//I-TASSER/, accessed on 21 January 2025). Additionally, the Missense3D tool (https://missense3d.bc.ic.ac.uk/missense3d/, accessed on 24 January 2025), was employed to predict any possible structural damage in the 3D protein structure, caused by the amino acid substitutions. This tool calculates the relative solvent accessibility (RSA%) of the protein following the introduction of the substituted amino acid. It provides both quantitative and qualitative assessments of the structural impact. The UCSF Chimera software (https://www.cgl.ucsf.edu/chimera/, accessed on 24 January 2025) was used to visualise the 3D structure of the proteins and to overlay the normal and mutated proteins for comparative analysis of structural variations.

3.8. Conservation Analysis

The conservation assessment of the HNF1A protein among the species was conducted using Jalview software. The amino acid sequence of the human HNF1A protein, along with those of other species, was retrieved from the NCBI’s HomoloGene sub-database (https://www.ncbi.nlm.nih.gov, accessed on 25 January 2025). Multiple sequence alignment (MSA) conservation analysis, and consensus were carried out on Jalview software to examine the conserved amino acids among selected species and visualised by using Jalview software (version 2.11.4.1).

4. Discussion

The in silico approach is widely used and enables researchers to conduct targeted and functional variant analyses for specific diseases. In this study, we investigate missense variants in the HNF1A gene that are classified as having uncertain significance and have not been linked with HNF1A- related conditions. HNF1A is a well-known transcription factor, and its variants are strongly associated with the development of maturity-onset diabetes of the young (MMODY).
Expression analysis has shown that HNF1A, as a transcription factor, regulates key pathways involved in glucose metabolism, such as glycolysis, the Krebs cycle, and the respiratory chain. These pathways constitute a cascade system that directly contributes to insulin secretion, termed glucose-stimulated insulin secretion [12]. Additionally, HNF1A activates other transcription factors that also affect glucose metabolism [13,14]. Consequently, variants in the HNF1A gene are clinically linked to the pathogenesis of MODY.
The initial analysis identified 138 missense variants with clinically uncertain significance. The study’s major finding was the identification of four missense SNPs (Cys168Arg, Glu275Ala, Gly375Asp and Val411Phe) predicted to be pathogenic using seven bioinformatics tools. The variants (Arg168Cys, Glu275Ala, Gly375Asp and Val411Phe) were predicted to decrease protein stability. Additionally, three-dimensional (3D) analysis predicts that the variants (Arg168Cys and Gly375Asp) are associated with 3D structural damage.
The HNF1A protein is a transcription factor that possesses the typical structure of transcription factors, comprising an N-terminal dimerization domain, a DNA-binding domain, and a C-terminal transactivation domain [14,15,16]. The DNA-binding domain plays a crucial role in identifying the promoter region of target genes, facilitating the assembly and initiation of transcription machinery [15]. Therefore, structural and functional alterations in this sensitive region may provide a rationale for the abnormalities observed in MODY3 cases. Variant I (Arg168Cys) also lies within the α-helix of the DNA-binding domain. The substitution of arginine—a large, positively charged amino acid—for cysteine, a smaller, polar, and non-charged amino acid, may disrupt DNA binding with HNFA1. Although both amino acids are hydrophilic, arginine is structurally the largest amino acid in this context and carries a positive charge, whereas cysteine is structurally smaller and uncharged. Also, introducing a cysteine residue may also confer redox sensitivity. Under oxidative conditions, cysteine could form disulfide bonds that alter protein folding or promote dimerization, potentially affecting nuclear localisation, stability, or protein–protein interaction [17]. These biophysical changes support the structural and functional disruption predicted by the current bioinformatics analysis.
In addition, variant VI (Gly375Asp) is predicted to cause structural damage and affect HNF1A activity functionally. This variant is found in the C-terminal transactivation domain. While it is not directly involved in DNA binding, its role influences the transcription rate of target genes [18]. Amino acid substitutions may alter the interaction between HNF1A and its activators or co-activators, ultimately resulting in reduced transcriptional activity. Also, this region is classified as an intrinsically disordered area, enabling interactions with other transcription factors, repressors, and activators that regulate transcription activity.
From an evolutionary perspective, all the investigated variant positions are highly conserved among vertebrates, highlighting their critical role in protein function across species [19]. The high degree of conservation in this transcription factor may be attributed to its interactions with multiple proteins across various species. Therefore, these conserved regions are thought to resist the negative selection pressure associated with their variants [20].
Post-translational modifications (PTMs) of HNF1A, including phosphorylation and acetylation, are recognised as regulators of its transcriptional activity. Although the identified variants are not located within known PTM sites, it remains possible that structural changes induced by these mutations could affect the accessibility or recognition of nearby PTM sites [21].
Previous studies on MODY3 have indicated that the location of variants within the HNF1A gene can influence the disease phenotype [22,23]. Specifically, variants in the DNA-binding domain are linked to reduced HNF1A-DNA binding, resulting in decreased transcriptional activity of target genes. Moreover, certain variants may affect nuclear localisation [24]. Consequently, variants in the DNA-binding domain are typically regarded as more severe than those in the transactivation domain. Clinically, this severity presents as an earlier age of onset and severe hyperglycaemia, often necessitating early insulin therapy [22,23].
Additionally, the epigenetic regulation of HNF1A expression and its target genes may affect phenotypic variability, while lifestyle factors such as diet and metabolic stress can further alter disease severity. Thus, they can influence disease onset and severity even among individuals carrying the same pathogenic variant [25].

5. Conclusions

To date, the variants investigated have been classified as variants of uncertain significance (VUS). However, based on the current computational analysis, they may warrant reclassification as likely pathogenic. Nonetheless, this study has limitations that need to be addressed. Firstly, although the reported alleles are rare, destabilise the HNF1A protein, and are predicted to be pathogenic, they have not yet been associated with MODY onset cases. This could be attributed to the fact that MODY cases are rare, with most either undiagnosed or misdiagnosed. Additional, molecular dynamics simulation and in vitro functional assays of the mutant proteins, such as differential scanning fluorimetry and circular dichroism spectroscopy, would provide practical validation of the variants’ pathogenicity and their impact on protein function and stability.

Author Contributions

Conceptualization, H.A. and H.Z.H.; methodology, H.A. and H.Z.H.; software, H.A. and H.Z.H.; validation, H.A and H.Z.H.; formal analysis, H.A. and H.Z.H.; investigation, H.A. and H.Z.H.; data curation, H.A. and H.Z.H.; writing—original draft preparation, H.A. and H.Z.H.; writing—review and editing, H.A. and H.Z.H.; visualization, H.A. and H.Z.H.; supervision, H.A. and H.Z.H.; funding acquisition, H.A. All authors have read and agreed to the published version of the manuscript.

Funding

The researchers would like to thank the Deanship of Graduate Studies and Scientific Research at Qassim University for financial support (QU-APC-2025).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All relevant data are within the manuscript.

Conflicts of Interest

Both authors declare that they have no conflicts of interest.

References

  1. Fajans, S.S.; Bell, G.I. MODY: History, genetics, pathophysiology, and clinical decision making. Diabetes Care 2011, 34, 1878–1884. [Google Scholar] [CrossRef] [PubMed]
  2. Oliveira, S.C.; Neves, J.S.; Pérez, A.; Carvalho, D. Maturity-onset diabetes of the young: From a molecular basis perspective toward the clinical phenotype and proper management. Endocrinol. Diabetes Nutr. 2020, 67, 137–147. [Google Scholar] [CrossRef] [PubMed]
  3. Pihoker, C.; Gilliam, L.K.; Ellard, S.; Dabelea, D.; Davis, C.; Dolan, L.M.; Greenbaum, C.J.; Imperatore, G.; Lawrence, J.M.; Marcovina, S.M.; et al. Prevalence, characteristics and clinical diagnosis of maturity onset diabetes of the young due to mutations in HNF1A, HNF4A, and glucokinase: Results from the SEARCH for diabetes in Youth. J. Clin. Endocrinol. Metab. 2013, 98, 4055–4062. [Google Scholar] [CrossRef]
  4. Nkonge, K.M.; Nkonge, D.K.; Nkonge, T.N. The epidemiology, molecular pathogenesis, diagnosis, and treatment of maturity-onset diabetes of the young (MODY). Clin. Diabetes Endocrinol. 2020, 6, 1–10. [Google Scholar] [CrossRef]
  5. Behl, R.; Malhotra, N.; Joshi, V.; Poojary, S.; Middha, S.; Gupta, S.; Olaonipekun, A.B.; Okoye, I.; Wagh, B.; Biswas, D.; et al. Meta-analysis of HNF1A-MODY3 variants among human population. J. Diabetes Metab. Disord. 2022, 21, 1037–1046. [Google Scholar] [CrossRef]
  6. Valkovicova, T.; Skopkova, M.; Stanik, J.; Gasperikova, D. Novel insights into genetics and clinics of the HNF1A-MODY. Endocr. Regul. 2019, 53, 110–134. [Google Scholar] [CrossRef]
  7. Firdous, P.; Nissar, K.; Ali, S.; Ganai, B.A.; Shabir, U.; Hassan, T.; Masoodi, S.R. Genetic testing of maturity-onset diabetes of the young current status and future perspectives. Front. Endocrinol. 2018, 9, 253. [Google Scholar] [CrossRef]
  8. Iafusco, F.; Maione, G.; Mazzaccara, C.; Di Candia, F.; Mozzillo, E.; Franzese, A.; Tinto, N. NGS analysis revealed digenic heterozygous GCK and HNF1A variants in a child with mild hyperglycemia: A case report. Diagnostics 2021, 11, 1164. [Google Scholar] [CrossRef]
  9. Richards, S.; Aziz, N.; Bale, S.; Bick, D.; Das, S.; Gastier-Foster, J.; Grody, W.W.; Hegde, M.; Lyon, E.; Spector, E.; et al. Standards and guidelines for the interpretation of sequence variants: A joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 2015, 17, 405–424. [Google Scholar] [CrossRef]
  10. Thusberg, J.; Olatubosun, A.; Vihinen, M. Performance of mutation pathogenicity prediction methods on missense variants. Hum. Mutat. 2011, 32, 358–368. [Google Scholar] [CrossRef]
  11. Cheng, J.; Novati, G.; Pan, J.; Bycroft, C.; Žemgulyte, A.; Applebaum, T.; Pritzel, A.; Wong, L.H.; Zielinski, M.; Sargeant, T.; et al. Accurate proteome-wide missense variant effect prediction with AlphaMissense. Science 2023, 381, eadg7492. [Google Scholar] [CrossRef] [PubMed]
  12. Straub, S.G.; Sharp, G.W.G. Glucose-stimulated signaling pathways in biphasic insulin secretion. Diabetes. Metab. Res. Rev. 2002, 18, 451–463. [Google Scholar] [CrossRef]
  13. Bae, J.S.; Kim, T.H.; Kim, M.Y.; Park, J.M.; Ahn, Y.H. Transcriptional Regulation of Glucose Sensors in Pancreatic β-Cells and Liver: An Update. Sensors 2010, 10, 5031–5053. [Google Scholar] [CrossRef] [PubMed]
  14. Kind, L.; Molnes, J.; Tjora, E.; Raasakka, A.; Myllykoski, M.; Colclough, K.; Saint-Martin, C.; Adelfalk, C.; Dusatkova, P.; Pruhova, S.; et al. Molecular mechanism of HNF-1A–mediated HNF4A gene regulation and promoter-driven HNF4A-MODY diabetes. JCI Insight 2024, 9, e175278. [Google Scholar] [CrossRef]
  15. Chi, Y.I.; Frantz, J.D.; Oh, B.C.; Hansen, L.; Dhe-Paganon, S.; Shoelson, S.E. Diabetes mutations delineate an atypical POU domain in HNF-1alpha. Mol. Cell 2002, 10, 1129–1137. [Google Scholar] [CrossRef]
  16. Bach, I.; Yaniv, M. More potent transcriptional activators or a transdominant inhibitor of the HNF1 homeoprotein family are generated by alternative RNA processing. EMBO J. 1993, 12, 4229–4242. [Google Scholar] [CrossRef]
  17. Paulsen, C.E.; Carroll, K.S. Cysteine-mediated redox signaling: Chemistry, biology, and tools for discovery. Chem. Rev. 2013, 113, 4633–4679. [Google Scholar] [CrossRef]
  18. Bjørkhaug, L.; Bratland, A.; Njølstad, P.R.; Molven, A. Functional dissection of the HNF-1alpha transcription factor: A study on nuclear localization and transcriptional activation. DNA Cell Biol. 2005, 24, 661–669. [Google Scholar] [CrossRef]
  19. Worth, C.L.; Gong, S.; Blundell, T.L. Structural and functional constraints in the evolution of protein families. Nat. Rev. Mol. Cell Biol. 2009, 10, 709–720. [Google Scholar] [CrossRef]
  20. Simon, A.L.; Stone, E.A.; Sidow, A. Inference of functional regions in proteins by quantification of evolutionary constraints. Proc. Natl. Acad. Sci. USA 2002, 99, 2912–2917. [Google Scholar] [CrossRef]
  21. Zhang, C.; Xie, F.; Li, L.; Zhang, C.; Zhang, Y.; Ying, W.; Liu, L.; Yan, X.; Yin, F.; Zhang, L. Hepatocyte nuclear factor 1 alpha (HNF1A) regulates transcription of O-GlcNAc transferase in a negative feedback mechanism. FEBS Lett. 2019, 593, 1050–1060. [Google Scholar] [CrossRef] [PubMed]
  22. Bellanné-Chantelot, C.; Carette, C.; Riveline, J.P.; Valéro, R.; Gautier, J.F.; Larger, E.; Reznik, Y.; Ducluzeau, P.H.; Sola, A.; Hartemann-Heurtier, A.; et al. The type and the position of HNF1A mutation modulate age at diagnosis of diabetes in patients with maturity-onset diabetes of the young (MODY)-3. Diabetes 2008, 57, 503–508. [Google Scholar] [CrossRef] [PubMed]
  23. Harries, L.W.; Ellard, S.; Stride, A.; Morgan, N.G.; Hattersley, A.T.; Vaxillaire, M.; Tuomi, T.; Barbetti, E.; Njolstad, P.R.; Hansen, T.; et al. Isomers of the TCF1 gene encoding hepatocyte nuclear factor-1 alpha show differential expression in the pancreas and define the relationship between mutation position and clinical phenotype in monogenic diabetes. Hum. Mol. Genet. 2006, 15, 2216–2224. [Google Scholar] [CrossRef] [PubMed]
  24. Sneha, P.; Thirumal, K.D.; George Priya Doss, C.; Siva, R.; Zayed, H. Determining the role of missense mutations in the POU domain of HNF1A that reduce the DNA-binding affinity: A computational approach. PLoS ONE 2017, 12, e0174953. [Google Scholar]
  25. Zoldoš, V.; Horvat, T.; Novokmet, M.; Cuenin, C.; Mužinić, A.; Pučić, M.; Hufman, J.E.; Gornik, O.; Polašek, O.; Campbell, H.; et al. Epigenetic silencing of HNF1A associates with changes in the composition of the human plasma N-glycome. Epigenetics 2012, 7, 164–172. [Google Scholar] [CrossRef]
Figure 1. Protein–protein interaction using the STRING bioinformatic tool. The interaction prediction was based on known interactions, predicted interaction, and other data such as textmining, co-expression, and protein homology.
Figure 1. Protein–protein interaction using the STRING bioinformatic tool. The interaction prediction was based on known interactions, predicted interaction, and other data such as textmining, co-expression, and protein homology.
Ijms 26 03768 g001
Figure 2. The three-structure (3D) analysis of the wild-type HNF1A protein and mutants. (A) The 3D model of the normal HNF1Aprotein. (B) The 3D model of the mutants (Arg168Cys and Gly375Asp). (C) Superimposed models of the mutants with the wild-type HNF1A protein.
Figure 2. The three-structure (3D) analysis of the wild-type HNF1A protein and mutants. (A) The 3D model of the normal HNF1Aprotein. (B) The 3D model of the mutants (Arg168Cys and Gly375Asp). (C) Superimposed models of the mutants with the wild-type HNF1A protein.
Ijms 26 03768 g002
Figure 3. Conservation analysis of amino acids of the HNF1A protein using Jalview software. Variants of interest are highlighted in black boxes. Red boxes indicate the amino acid sequence of the human HNF1A.
Figure 3. Conservation analysis of amino acids of the HNF1A protein using Jalview software. Variants of interest are highlighted in black boxes. Red boxes indicate the amino acid sequence of the human HNF1A.
Ijms 26 03768 g003
Table 1. List of variants commonly predicted to be pathogenic by all tools.
Table 1. List of variants commonly predicted to be pathogenic by all tools.
Polyphen2SNPs&GoMetaSNPSIFTPantherPHD SNPSNAP2AlphaMissense
S.NoChr:bpAllelesAAAA CoordPredProbPredProbPredScorePredScorePredPreservation TimePredScorePredScorePreScore
I120989008 C/TCys/Arg 168 ProD 1 D 0.994 D0.56D 0.03 D0.693D 0.678 D0.625P0.97
II120994274A/CGlu/Ala 275 ProD1D0.989D0.502D0.01D0.729D0.562D0.695P0.85
III120996557G/TGly/Asp 375 ProD0.998D0.992D0.728D0D0.698D0.68D0.745P0.92
IV120996664G/TVal/Phe 411 ProD1D0.993D0.669D0.02D0.602D0.609D0.69LP0.78
Key: Chr: Chromosome; bp: base pair; AA: amino acid; AA coord: amino acid coordinate; Pred: prediction; Prob: probability; D: damaging; P: pathogenic; LP: likely pathogenic.
Table 2. List of variants showing decreases in the protein stability.
Table 2. List of variants showing decreases in the protein stability.
I-MutantMuPro
Variant No.rs IDAAAA CoordStabilityRIStabilityScore
Irs764434453Cys/Arg 168 decrease 3 decrease −0.57
IIrs199890776Glu/Ala275decrease 2 decrease −0.37
IIIrs1315462017Gly/Asp375decrease 5 decrease −0.24
IVrs767284188Val/Phe 411 decrease 9 decrease −1.43
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Aldharee, H.; Hamdan, H.Z. In Silico Analysis Identified Putative Pathogenic Missense Single Nucleotide Polymorphisms (SNPs) in the Human HNF1A Gene. Int. J. Mol. Sci. 2025, 26, 3768. https://doi.org/10.3390/ijms26083768

AMA Style

Aldharee H, Hamdan HZ. In Silico Analysis Identified Putative Pathogenic Missense Single Nucleotide Polymorphisms (SNPs) in the Human HNF1A Gene. International Journal of Molecular Sciences. 2025; 26(8):3768. https://doi.org/10.3390/ijms26083768

Chicago/Turabian Style

Aldharee, Hitham, and Hamdan Z. Hamdan. 2025. "In Silico Analysis Identified Putative Pathogenic Missense Single Nucleotide Polymorphisms (SNPs) in the Human HNF1A Gene" International Journal of Molecular Sciences 26, no. 8: 3768. https://doi.org/10.3390/ijms26083768

APA Style

Aldharee, H., & Hamdan, H. Z. (2025). In Silico Analysis Identified Putative Pathogenic Missense Single Nucleotide Polymorphisms (SNPs) in the Human HNF1A Gene. International Journal of Molecular Sciences, 26(8), 3768. https://doi.org/10.3390/ijms26083768

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop