In Silico Deciphering of the Potential Impact of Variants of Uncertain Significance in Hereditary Colorectal Cancer Syndromes

Fasano, Candida; Lepore Signorile, Martina; De Marco, Katia; Forte, Giovanna; Disciglio, Vittoria; Sanese, Paola; Grossi, Valentina; Simone, Cristiano

doi:10.3390/cells13161314

Open AccessReview

In Silico Deciphering of the Potential Impact of Variants of Uncertain Significance in Hereditary Colorectal Cancer Syndromes

by

Candida Fasano

^1,*,

Martina Lepore Signorile

¹,

Katia De Marco

¹,

Giovanna Forte

¹,

Vittoria Disciglio

¹,

Paola Sanese

¹,

Valentina Grossi

¹

and

Cristiano Simone

^1,2,*

¹

Medical Genetics, National Institute of Gastroenterology, IRCCS “Saverio de Bellis” Research Hospital, 70013 Castellana Grotte, Italy

²

Medical Genetics, Department of Precision and Regenerative Medicine and Jonic Area (DiMePRe-J), University of Bari Aldo Moro, 70124 Bari, Italy

^*

Authors to whom correspondence should be addressed.

Cells 2024, 13(16), 1314; https://doi.org/10.3390/cells13161314

Submission received: 5 June 2024 / Revised: 23 July 2024 / Accepted: 3 August 2024 / Published: 6 August 2024

Download

Browse Figure

Versions Notes

Abstract

:

Colorectal cancer (CRC) ranks third in terms of cancer incidence worldwide and is responsible for 8% of all deaths globally. Approximately 10% of CRC cases are caused by inherited pathogenic mutations in driver genes involved in pathways that are crucial for CRC tumorigenesis and progression. These hereditary mutations significantly increase the risk of initial benign polyps or adenomas developing into cancer. In recent years, the rapid and accurate sequencing of CRC-specific multigene panels by next-generation sequencing (NGS) technologies has enabled the identification of several recurrent pathogenic variants with established functional consequences. In parallel, rare genetic variants that are not characterized and are, therefore, called variants of uncertain significance (VUSs) have also been detected. The classification of VUSs is a challenging task because each amino acid has specific biochemical properties and uniquely contributes to the structural stability and functional activity of proteins. In this scenario, the ability to computationally predict the effect of a VUS is crucial. In particular, in silico prediction methods can provide useful insights to assess the potential impact of a VUS and support additional clinical evaluation. This approach can further benefit from recent advances in artificial intelligence-based technologies. In this review, we describe the main in silico prediction tools that can be used to evaluate the structural and functional impact of VUSs and provide examples of their application in the analysis of gene variants involved in hereditary CRC syndromes.

Keywords:

hereditary colorectal polyposis syndromes; hereditary nonpolyposis colorectal cancer; variants of uncertain significance; in silico prediction tools; protein stability; protein functions

1. Introduction

Colorectal cancer (CRC) is the third most common cancer in the world and accounts for more than 8% of deaths from all causes annually [1]. Between 6 and 10% of all CRC cases and around 20% of those detected before the age of 50 have identifiable hereditary pathogenic mutations in genes that significantly increase CRC susceptibility [2,3]. In most hereditary CRC syndromes, cancer arises from primary lesions such as polyps and adenomas, but the pathways leading to carcinoma development vary in the different disorders. The identification of the main hereditary mutations involved in these disorders has been crucial to improving the comprehension of the basic molecular processes responsible for CRC tumorigenesis [4]. Genetic susceptibility to CRC seems more widespread than previously expected. Recent reports uncovered disease-causing genetic variants in a wide variety of cancer susceptibility genes with high and moderate penetrance [5]. These pathogenic variants have been described in over 10% of patients diagnosed with advanced cancer, including CRC [6].

The advent of next-generation sequencing (NGS) technologies has significantly enhanced our ability to identify genetic variants. In addition to expediting the identification of recurrent pathogenic variants with established functional consequences, NGS has also revealed several rare uncharacterized genetic variants, which are, therefore, called variants of uncertain significance (VUSs) [7]. The majority of VUSs can be grouped into three main categories based on the type of genetic alteration, i.e., missense substitutions (most frequent), splice junction variants, and in-frame insertion or deletion variants (in-frame indels), but their functional classification has proven challenging when using multigene panels in genetic testing [4,8]. The assessment of the functional impact of a missense VUS is complex due to the specific biochemical properties of each amino acid, which modify the stability and function of the affected protein. Therefore, a missense substitution can have a variety of effects, ranging from no impact to completely abolishing protein function or even leading to the acquisition of new functions or increased stability. Splice junction variants can abrogate splicing or increase or decrease its efficiency. In particular, they can affect precursor mRNA-spliceosome interactions, leading to exon skipping, full intron inclusion, and alternative use of neighboring cryptic splice sites [9]. These events result in nucleotide insertions or deletions (in-frame indels) that impair protein structure and function due to extra or missing amino acids or even entire domains [10].

VUS assessment is particularly important when the variant occurs in a clinically significant gene, as the interpretation of its structural and functional implications can be very useful in clinical practice and for the surveillance of hereditary disorders [11]. Clinicians, therefore, need clear guidance regarding the significance of variants that may have practical consequences. Following genetic testing to detect mutations in germline CRC susceptibility genes, three possible outcomes can occur: (i) no variant is found; (ii) the identified variant is known as pathogenic or benign; or (iii) the identified variant is a VUS. If a pathogenic variant is detected, the patient should receive genetic counseling and be treated according to gene-specific guidelines and their personal and family history of cancer. Moreover, “cascade testing” should be performed on relatives at risk to ascertain whether they also carry the variant, and appropriate screening programs should be recommended, including earlier and more frequent colonoscopies [12,13].

According to the National Comprehensive Cancer Network (NCCN, https://www.nccn.org, accessed on 5 July 2024) guidelines, clinical surveillance for patients with a VUS in an oncogene associated with hereditary CRC syndromes should be the same as that indicated for the general population [14]. Still, in these cases, the clinical geneticist is responsible for evaluating the patient’s clinical phenotype and family history to decide whether a segregation analysis of the VUS in the family is warranted. Although VUSs are not used as markers to increase clinical surveillance, it should be noted that many of the variants originally classified as VUSs have been subsequently characterized as pathogenic, thus initially escaping NCCN-recommended clinical surveillance programs, with serious clinical implications for the affected patients. Therefore, the discovery of a VUS poses a problem because it is unclear if the mutation is benign or pathogenic, and family members cannot be stratified according to their risk of developing CRC. This makes clinical management more challenging. Clinicians can only evaluate the putative functional implications of a VUS based on information gathered from specific databases, which unfortunately are not updated on a regular basis, and current literature [15]. These limitations may be overcome, at least in part, by the use of in silico tools. This approach can provide valuable insights by predicting the potential impact of the identified variants on protein structure and function. As such, in silico tools are crucial resources for prioritizing specific VUSs for further investigation and guiding clinical decisions [11].

Ideally, the management of genetic disorders associated with CRC would require collaborative efforts from multidisciplinary teams to integrate computational predictions with experimental validations and genetic counseling. These three key aspects are essential for enhancing the accuracy of VUS interpretation and promoting more efficient clinical surveillance, with the ultimate goal of advancing personalized medicine.

In this review, we summarize current in silico methodologies available to assess the structural and functional implications of VUSs in key genes playing a role in hereditary CRC syndromes.

2. Pathology of Hereditary CRC Syndromes

CRC is an epithelial-originated cancer that typically begins as an adenoma. While the majority of CRCs occur in individuals with no family history of the disease or other risk conditions, approximately 30% of CRC patients have family members affected by the same cancer [16]. Current epidemiological evidence shows that people are more likely to develop CRC or adenomatous polyps if they have one or more first-degree relatives affected by these conditions. Although not fully clear yet, this may be due to a mix of shared environmental variables and genetic factors [17].

CRC screening guidelines recommend that most average-risk patients start screening at 50 years of age [18]. The suggested screening age and frequency may vary based on the presence of polyps with specific histotypes or a family history of CRC. Patients who have a first-degree relative diagnosed with CRC and a family history of the disease should have a colonoscopy every 5 years beginning at 40 years of age or 10 years before their relative’s diagnosis age [18,19].

Hereditary CRC syndromes are associated with a significant increase in CRC risk and early onset of the disease. Based on the number and histotype of CRC lesions, they can be classified into two major phenotypic categories: polyposis and nonpolyposis syndromes [1].

2.1. Hereditary CRC Polyposis Syndromes

Hereditary CRC polyposis syndromes comprise familial adenomatous polyposis (FAP), attenuated familial adenomatous polyposis (AFAP), MUTYH-associated polyposis (MAP), polymerase proofreading-associated polyposis (PPAP), NTHL1 tumor syndrome, Peutz–Jeghers syndrome (PJS), juvenile polyposis syndrome (JPS), PTEN hamartoma tumor syndrome (PHTS), hereditary mixed polyposis syndrome (HMPS), serrated polyposis syndrome (SPS), and the recently characterized gastric polyposis and desmoid FAP (GD-FAP) [1,20].

FAP is an autosomal dominant hereditary cancer syndrome caused by germline heterozygous mutations in the adenomatous polyposis coli (APC) gene, which is located on chromosome 5q21 and is considered the ‘gatekeeper’ tumor suppressor gene for CRC [21,22]. This condition is the second most prevalent inherited CRC syndrome, representing about 1% of all CRC cases [1]. FAP is characterized by the early (late childhood) appearance of hundreds to thousands of adenomatous polyps [22]. In patients with FAP, the development of CRC depends on the co-occurrence of two molecular events triggering the disease, as postulated by Knudson’s two-hit hypothesis. The first is a germline APC mutation, and the second may be an additional somatic mutation in APC or its loss of heterozygosity (LOH) [23]. Although further mutations in the KRAS, TP53, and SMAD4 genes may occur during FAP-related tumorigenesis, APC loss or germline mutations are crucial steps triggering CRC [24,25]. FAP genotypes are further complicated by the presence of several VUSs in the APC gene.

AFAP is a subtype of FAP in which patients develop less severe symptoms. AFAP patients exhibit fewer than 100 polyps, delayed initiation of colorectal adenomas, and a likely lower lifetime risk of CRC. In these patients, adenomas are often flat and located in the proximal colonic region and upper gastrointestinal tract [26]. Approximately 10% of AFAP patients display mutations in exon 9 as well as in the 5′ and 3′ terminal regions of the APC gene. Additionally, 7% of these patients have a genetic alteration in the MUTYH gene [27]. Based on the annotations recorded in the ClinVar Miner database (https://clinvarminer.genetics.utah.edu/, accessed on 7 April 2024 [28]), out of 10,625 APC variants associated with FAP and AFAP syndromes, 6139 are VUSs (accessed on 7 April 2024) (Table 1).

GD-FAP was recently described as a novel FAP clinical variant characterized by widespread gastric polyposis and the presence of desmoid tumors as extracolonic lesions. Genetically, GD-FAP patients exhibit germline mutations in the extreme 3′ end of the APC gene [20].

MAP is an autosomal recessive syndrome caused by biallelic germline variants in the MUTYH gene, which encodes a central effector of the DNA base excision repair (BER) pathway involved in oxidative stress response [29]. Patients with MAP show a phenotype that mimics FAP and AFAP syndromes, ranging from one colorectal adenocarcinoma and a few polyps to serrated polyps [30]. Monoallelic MUTYH mutations have been linked to a higher risk of CRC, particularly in MAP patients with first-degree relatives who had the disease [31]. Based on current ClinVar Miner data, 14 out of 51 genetic variants identified so far in the MUTYH gene are classified as VUSs.

PPAP is an autosomal dominant polyposis syndrome characterized by germline heterozygous missense variants located in the exonuclease (proofreading) domains of the polymerase-coding genes POLE or POLD [32]. PPAP patients may exhibit FAP or AFAP phenotypes along with other tumors showing somatic hypermutation [33]. Based on the annotations reported in the ClinVar Miner database, 2378 VUSs have been identified in the POLD gene and 379 in the POLE gene (Table 1).

NTHL1 tumor syndrome, a recently identified rare autosomal recessive polyposis, is caused by biallelic variations in the NTHL1 gene. NTHL1 is a DNA N-glycosylase that catalyzes the first step of the BER pathway [34,35]. Patients with NTHL1 tumor syndrome exhibit many tumors, all clinically associated with polyposis [35]. To date, 201 NTHL1 germline variants have been associated with NTHL1 tumor syndrome, more than half of which (102) are VUSs (Table 1).

Hamartomatous polyposis syndromes (HPS) are a subtype of CRC polyposis that exhibit autosomal dominant patterns of inheritance and include PJS, JPS, and PHTS [36].

PJS is caused by germline mutations in the tumor suppressor serine-threonine kinase STK11 gene (STK11) and is often associated with autosomal dominant mutations in the serine/threonine-protein kinase MTOR gene (MTOR) [37,38]. STK11 regulates cell proliferation, metabolism, and cell polarity [39,40]. Germline pathogenic STK11 mutations are detected in 50–70% of PJS patients [4]. Clinically, the appearance of PJS polyps occurs early, at an average age of 12 years. PJS patients may develop a variable number of polyps located exclusively in the small intestine and often exhibit mucocutaneous pigmentations and a family history of PJS [36]. Of note, out of 1912 germline STK11 mutations that have been associated with PJS, 837 are VUSs.

JPS is defined by the presence of several colonic and/or stomach hamartomas. Approximately 50–70% of JPS patients have been shown to harbor germline pathogenic mutations in the BMPR1A and SMAD4 genes. JPS is linked to a high risk of gastric and colorectal malignancies. People with SMAD4 mutations have an increased likelihood of developing hereditary hemorrhagic telangiectasia (HHT) [36]. Based on ClinVar Miner annotations, 1600 germline BMPR1A variants and 1348 germline SMAD4 variants are associated with JPS. Of these, 813 and 586 have been identified as VUSs, respectively.

PHTS is an autosomal dominant disease caused by germline pathogenic mutations in the PTEN gene. Rarely, it may also be caused by mosaicism with de novo PTEN mutations [36,41]. This syndrome comprises a spectrum of hamartoma conditions, including Bannayan–Riley–Ruvalcaba syndrome (BRRS), a congenital disease characterized by macrocephaly, lipomas, pigmented macules, and intestinal hamartomatous polyposis [42]; Cowden syndrome (CS), which is associated with a high risk for benign and malignant tumors of the thyroid, breast, kidney, and endometrium [36,43]; Lhermitte–Duclos disease (LDD), which is characterized by abnormal cerebellum growth [44]; segmental outgrowth-lipomatosis-arteriovenous malformation-epidermal nevus (SOLAMEN) syndrome, whose main clinical features are the presence of lipomas, hamartomatous polyps, macrocephaly, and a higher susceptibility to developing several tumors [45]; the PTEN-related Proteus syndrome (PS), which causes vascular abnormalities and various tissue overgrowths [46]; and macrocephaly-autism syndrome (MCEPHAS) [47]. Interestingly, out of 1848 PTEN germline mutations associated so far with PHTS diseases, 730 are VUSs (Table 1).

HMPS is characterized by multiple colorectal polyps of different histotypes (hamartomas, serrated lesions, and adenomas). The most frequent germline mutations detected in this polyposis are located in the coding and non-coding regions (upstream intron duplication) of the GREM1 gene [48]. Currently, all four germline variants detected in this gene are classified as VUSs (Table 1).

SPS is a rare condition defined by the occurrence of at least one of the following diagnostic criteria: (i) serrated polyp(s) in the proximal colon in a person who has a first-degree family member affected by the disease; (ii) more than five serrated polyps in the proximal colon, of which two are larger than 10 mm; and (iii) more than twenty serrated polyps [49]. Due to the low frequency of SPS cases, a driver gene has not been identified yet; however, emerging evidence suggests that germline mutations in the RNF43 gene may be associated with this polyposis [50,51,52]. Yet, out of 111 RNF43 germline mutations identified in patients with SPS, the vast majority (104) have been classified as VUSs (Table 1).

2.2. Hereditary Nonpolyposis CRC

Hereditary nonpolyposis colorectal cancer (HNPCC) syndromes are classified as DNA mismatch repair-deficient (MMR-d) or -proficient (MMR-p) based on the presence or absence of germline mutations in DNA MMR genes [50].

Lynch syndrome (LS) is an MMR-d HNPCC characterized by mutations in one or more DNA MMR genes (MLH1, MSH2, MSH6, and PMS2) [53]. These mutations have a high degree of penetrance and are thus linked to increased susceptibility to certain types of cancer [4]. An individual who has inherited a DNA MMR gene mutation faces a 70–80% chance of developing CRC during their lifetime, with this risk starting at a young age. Furthermore, women harboring genetic alterations in these genes have a significantly higher susceptibility to endometrial cancer, with a combined lifetime risk ranging from 40% to 60% [54]. The Amsterdam Criteria and Bethesda Guidelines, which are widely used for identifying individuals with LS, rely on the detection of particular site-specific malignancies that occur at an early age [4,55].

MMR-d cancers display marked instability at certain DNA microsatellites and are therefore classified as microsatellite instability-high (MSI-H). Additionally, these tumors are characterized by loss of expression of the affected DNA MMR protein, as determined by immunohistochemistry [53]. While CRC and endometrial cancer are the primary malignancies found in most LS families, individuals carrying these mutations also have a higher risk of developing ovarian, gastric, small intestinal, urinary tract, brain, pancreatic, and prostate cancer, as well as sebaceous neoplasms of the skin. MSI detection is based on PCR assays to amplify microsatellite sections of the DNA, followed by a comparison between normal and tumoral samples. This analysis can be used as a preliminary assessment to identify candidates for LS multigene panel testing [56]. Among the germline variants associated with LS, 828 have been identified in the MLH1 gene (86 of which are classified as VUSs), 1732 in the MSH2 gene (470 VUSs), 518 in the MSH6 gene (156 VUSs), and 225 in the PMS2 gene (61 VUSs) (Table 1).

Approximately 50% of the patients that fulfill the Amsterdam criteria for the diagnosis of HNPCC have MMR-p disease, with no detectable germline variants in MMR genes. These individuals have a lower CRC lifetime risk compared with LS patients and are not at higher risk for malignancies other than colon cancer [57]. Currently, the only gene that could be associated with MMR-p HNPCC is RPS20, which encodes for a ribosomal protein. To date, five VUSs potentially associated with HNPCC have been identified in this gene (Table 1) [4].

The diagnosis of hereditary CRC syndromes is based on the classification of the identified variants in databases such as ClinVar and SIFT; thus, in silico approaches are already integrated, at least in part, into current clinical practice. However, as reported in Table 1, there is a high number of variants whose functional impact and clinical significance have not been defined yet.

3. In Silico Prediction of VUS Impact on Protein Function in Hereditary CRC Syndromes

In recent years, the growing repository of genetic data has led to the identification of numerous VUSs, adding further complexity to clinical decision-making. On the other hand, the identification of a myriad of genetic variants resulting from NGS studies has accelerated the development of bioinformatics tools, allowing researchers to computationally predict the functional implications of sequence variations and identify pathogenic variants [58]. Several classes of sequence variations at the nucleotide level are involved in human diseases, including substitutions, insertions, deletions, frameshifts, and nonsense mutations. Frameshift mutations and nonsense mutations are highly likely to have a detrimental impact on protein function. Therefore, the efforts of bioinformaticians have mainly focused on the development of algorithms that predict the effects of missense variants based on different approaches, such as the conservation level of amino acids at a specific position across comparable sequences or the structural impact of the amino acid change in protein stability or function [59].

In silico tools leverage computational algorithms to predict the consequences of VUSs at the molecular level. The first tools were created about twenty years ago, such as SIFT (Sorting Intolerant From Tolerant, https://sift.bii.a-star.edu.sg/index.html, latest version updated on 25 April 2024, accessed on 14 April 2024 [60,61]) and PolyPhen (Polymorphism Phenotyping, later upgraded to PolyPhen2, http://genetics.bwh.harvard.edu/pph2 version polyphen-2.2.3-databases-2021_05.tar.bz2, accessed on 14 April 2024 [62,63]). SIFT uses sequence homology and the physical characteristics of amino acids to predict whether an amino acid substitution impacts the function of the affected protein. In particular, it calculates the probability that a given amino acid substitution at a particular position will be tolerated. If the normalized value is below a specific threshold, the amino acid substitution is predicted to have a deleterious effect on protein function [62]. PolyPhen2 is more focused on predicting the potential effect of coding nonsynonymous single nucleotide polymorphisms (SNPs) based on a Bayesian probabilistic classifier with machine learning techniques and has an excellent pipeline for multiple sequence alignment [63].

SIFT and PolyPhen were used by Chao and colleagues to develop a bioinformatic algorithm named multivariate analysis of protein polymorphisms-mismatch repair (MAPP-MMR) to specifically classify pathogenic and benign MLH1 and MSH2 missense variants associated with LS [64].

Similarly to SIFT and PolyPhen, PROVEAN (Protein Variation Effect Analyzer; http://provean.jcvi.org/, PROVEAN v1.1, accessed on 14 April 2024 [65]) is a software that predicts whether amino acid substitutions or indels affect the biological activity of a protein. It filters sequence variants to find critical nonsynonymous or indel variants that may have deleterious effects on protein function [65].

SIFT, PROVEAN, and PolyPhen-2, together with two other tools (PhD-SNP (version PhD-SNP2.0.7, accessed on 14 April 2024) and SNPs&GO last version 8.0, accessed on 14 April 2024), were used in a comparative in silico prediction analysis to identify three MSH6 missense mutations (G932Q, F1104Q, and E1234Q) that may contribute to protein dysfunction and CRC development [66]. In another study, Jansen and colleagues identified in silico nine predicted damaging missense variants in the POLD1 gene by performing an integrated prediction analysis with SIFT and PROVEAN [67].

In 2011, a novel in silico prediction tool named Mutation Assessor (http://mutationassessor.org/r3/, Release 3, accessed on 14 April 2024 [68]) was created to predict the functional consequences of amino acid substitutions by considering the evolutionary conservation level of the mutated amino acid in protein homologs. This algorithm has been validated on 60,000 germline and somatic variants of diseases recorded in the OMIM database (https://www.omim.org/, version 2024, accessed on 14 April 2024), including those identified in the Cancer Genome Atlas project (https://www.cbioportal.org/, version v6.0.14, accessed on 14 April 2024). Of note, this tool was used to filter the potential pathogenetic variants in a subset of CRC patients carrying germline and somatic mutations in APC and TP53 but not in other WNT genes (TCF7L2, AMER1, FBXW7, SOX9, CTNNB1). The final result of this multiple correspondence analysis was the identification of two CRC oncodriver signatures [69].

The Panther (Protein Analysis Through Evolutionary Relationships, https://www.pantherdb.org/tools, release 19.0, accessed on 14 April 2024 [70]) server is a classification system developed to provide details on the phylogeny, function, and functional impact of genetic variants that influence the evolution of protein-coding gene families. In an interesting work, Panther and other in silico tools were used to find novel pathogenetic missense variants (R358W, K306S, R310G, S433R, and R361C) in SMAD proteins, which are driver effectors of juvenile polyposis. In particular, the authors performed a comparative in silico analysis with different tools, including PANTHER, SIFT, PolyPhen, SNPs&GO, I-Mutant 3.0, and MUpro, to evaluate damaging missense variants in SMAD genes at both the structural and functional levels [71].

MutationTaster2 (https://www.mutationtaster.org/, version2021, accessed on 14 April 2024 [72]) is a web-based software designed to predict the potential impact of different types of genetic variants, with a particular focus on missense, intronic and synonymous variants, indel mutations, and variants in intron-exon junction regions. The MutationTaster2 predictor employs a Bayes classifier and interprets the clinical significance of the analyzed VUSs by using a comprehensive collection of SNPs from the ClinVar [73] and HGMD [74] public databases, which contain established disease variants. In a recent case report, the MutationTaster software was used to predict the functional impact of the MLH1 frameshift mutation p.(Glu34ArgfsTer4) identified in a patient with LS. The variant was predicted to result in a non-functional protein and have a disease-causing effect [75].

Unlike other tools mentioned above, SNAP2 (Screening for Non-Acceptable Polymorphisms, http://www.ngrl.org.uk/Manchester/page/snap-screening-nonacceptable-polymorphisms.html, version 2024, accessed on 14 April 2024) [76]) does not provide predictions on the likelihood of a variant to cause a disease. Instead, it is designed to specifically assess whether the variant affects the molecular function of the protein and can thus be very helpful when combined with other prediction methods in a comprehensive computational analysis. For instance, in a recent study, SNAP2 was used together with other tools to classify as deleterious seven nonsynonymous SNPs (C76Y, C124R, C124Y, C376Y, R443C, R480W, and W487R) found in the highly conserved regions of BMPR1A, a gene associated with JPS [77].

Align-GVGD (http://agvgd.hci.utah.edu/agvgd_input.php, accessed on 23 July 2024 [78]) is one of the first free software for multiple sequence alignments. Based on the physical and chemical characteristics of amino acids, it predicts the regions that are most likely to encompass missense substitutions with deleterious or neutral effects [78]. This in silico software was used to reclassify a VUS identified in a patient with multiple colonic adenomatous polyps. The patient had the heterozygous pathogenic variant c.1187G>A (p.Gly396Asp) in exon 13 and the VUS c.1379T>C (p.Leu460Ser) in exon 14 of the MUTYH gene [79]. The authors reclassified the VUS as pathogenic based on the genetic evidence that it was in trans with the pathogenic mutation, on the clinical phenotype, and on in silico prediction findings suggesting a deleterious effect [79].

Of note, a recent in silico phylogenetic study of pathogenic variants involved in DNA repair, and therefore in CRC tumorigenesis, identified a high degree of conservation of these variants only between modern and ancient humans and not between homologous proteins of different species [80]. This evidence seems to question the validity of in silico software (e.g., SIFT, Mutation Assessor) designed for the prediction of deleterious variants based on evolutionarily conserved amino acid positions in homologous proteins. On the other hand, another recent study showed that the outcomes of functional analyses of VUSs identified in MMR genes of potential LS patients agree with the findings of in silico prediction analyses based on the conservation of residue variations in the affected DNA repair proteins [81]. Interestingly, a computational study assessed the usefulness of in silico tools to topologically map variants to surface or buried regions of highly conserved protein structures. This study confirmed that benign variants were predominantly buried inside the proteins, while pathogenic variants were mainly located on their surface [82]. Overall, this evidence suggests that in silico methods designed for identifying deleterious variants in human cancer genes based on the evolutionary conservation of variant residues may be less informative about the clinical significance of a VUS than previously thought. Nonetheless, in silico analysis of the conserved regions between homologous proteins is very useful to establish whether a given VUS maps to a domain that is conserved in different species and, therefore, is likely critical for the biological function of the affected protein.

Despite their limitations, in silico predictions offer a valuable initial screening step in VUS interpretation. Discrepancies among prediction tools emphasize the need for complementary approaches to assess VUS significance. In this regard, the accuracy of variant classification can be enhanced by integrating multiple prediction algorithms and experimental data. Various experimental methodologies can be used to ascertain whether a VUS will impact mRNA and protein stability and/or biological functions. The effects on mRNA stability and function can be investigated by low-throughput techniques such as RT-PCR, Sanger sequencing, digital droplet PCR (ddPCR), and in vitro minigene and mutagenesis assays. The effects on protein structure and stability can be assessed by different approaches, including immunohistochemistry analysis to evaluate the presence/absence of the protein in patient-derived tissues and immunoblotting analysis, which is a semiquantitative technique allowing the identification of potentially truncated proteins. On the other hand, high-throughput methodologies such as nuclear magnetic resonance (NMR) spectroscopy, X-ray crystallography, and cryo-electron microscopy (cryoEM) are essential for analyzing structural changes in the tridimensional conformation of mutated proteins [83]. The impact of a VUS on protein function can be evaluated by different in vitro methodologies, such as pull-down and enzymatic assays (if the protein is an enzyme), and by high-throughput approaches, such as mass spectrometry analysis (to assess the loss of post-translational modifications site in mutated protein) or the recently developed multiplexed (functional) assays for variant effects (MAVEs). MAVEs allow the stratification of variants by their impact and are based on a one-by-one, post hoc approach that offers an in-depth understanding of sequence-function correlations based on a versatile methodology. Indeed, MAVE experiments enable the analysis of variants in several classes of sequence, including enhancers, promoters, mRNA untranslated regions, splice sites, and in parallel in different types of proteins [84]. Overall, each of these experimental methodologies alone may be poorly informative; therefore, it is often necessary to integrate various approaches according to the VUS type, the availability of resources, equipment, and skills, and a cost-benefit assessment. Generally speaking, the main advantage of low-throughput techniques is that they are less expensive, fast, and do not require high skills; however, they sometimes do not provide sufficient insight to answer the experimental question. Conversely, high-throughput techniques are more informative but also more expensive and time-consuming. Although necessary to validate the clinical significance of a VUS, experimental approaches have limitations in terms of time and costs, thus the availability of state-of-the-art in silico functional predictors for early VUS analysis remains crucial.

Future advancements in machine learning algorithms and the incorporation of multi-omics data are anticipated to improve the reliability of in silico predictions. Currently, clinical and experimental databases have proven very useful in improving the interpretation of VUS’s impact on protein function. In Table 2, we provide a list of databases that are commonly used by clinical experts and researchers faced with the challenge of interpreting the clinical significance of a VUS.

Despite having been created several years ago, these databases are still used by the scientific community to assess the clinical and functional implications of genetic variants, especially in hereditary disorders like CRC (Table 2). Below are some significant examples of their applications in clinical and functional studies on CRC hereditary syndromes.

The authors of a recent report analyzed the occurrence of second cancers in individuals with early-onset (aged less than 50 years) LS. They provided evidence from cBioPortal annotations to show that the FLT3 gene had the highest frequency of copy number alterations among 1438 CRC patients aged 18 to 48 years old with concomitant acute myeloid leukemia (AML). The presence of co-occurring genetic alterations in FLT3/JAK2 and JAK2/CTNNB1 was observed. The results provided valuable insights into the increased likelihood of AML and LS occurring together [102].

In another study, the LOVD database was employed to identify gene-phenotype associations and genotype-phenotype correlations in the BMPR1A gene. This information was then used to make recommendations for the clinical surveillance of JPS and modify the American College of Medical Genetics and Genomics (ACMG) classification of pathogenicity for BMPR1A or SMAD4 variants associated with JPS cases [103].

Recently, a tumor mutational signature analysis conducted using the COSMIC database identified the presence of homologous recombination deficiency (HRD) in familial CRC disorders. Remarkably, this report showed that pathogenic mutations in both BRCA1 and RNF43 were inherited together and were associated with CRC in a family with a specific type of familial CRC known as familial colorectal cancer type X (FCCTX) [52].

Notably, the gnomAD database was recently used to assess the novel pathogenic association of a series of genes, including NSD1, HDAC10, KRT24, ACACA, and TP63, with CRC predisposition [104], while other databases, i.e., ClinVar, HGMD, and InSight, were previously used in a meta-analysis to identify a new pathogenic variant associated with LS in MSH6 exon 4. In this pilot study, the authors suggested combining NGS testing and canonical MSI analysis in the diagnosis of LS in patients considered to have sporadic CRC. The inclusion criteria for NGS testing were MSI positivity, BRAF V600E, and MHL1 methylation negativity [105].

Other computational methods developed to accurately predict the pathogenicity of a VUS, such as Multivariate Analysis of Protein Polymorphism (MAPP, http://www.ngrl.org.uk, version 3.0, [106]) and Rare Exome Variant Ensemble Learner (Revel, https://sites.google.com/site/revelgenomics, release 3 May 2021, accessed on 23 July 2024 [107]), use algorithms based on statistically multivariate analysis [106]. MAPP is a software based on the analysis of physicochemical variation in sequence alignment columns, while REVEL is an ensemble method designed to predict the pathogenicity of missense variants based on a combination of scores from 13 individual tools [107,108].

Karabachev and colleagues evaluated the accuracy of these and other computational tools (Align-GVGD, SIFT, PolyPhen2, MAPP, and REVEL) in predicting the pathogenicity of 1800 APC VUS reported in the NCBI ClinVar database using multiple protein sequence alignments (PMSA) of 1924 APC missense variants. When used individually, prediction accuracies for pathogenic/likely pathogenic (range 17.5–75.0%) and benign/likely benign (range 25.0–82.5%) responses differed significantly for APC missense variants in ClinVar. Instead, creating a curated APC PMSA containing >3 substitutions/site, large enough for statistically significant in silico analysis, yielded predictions of 76.2–100% accuracy with the five methods integrated into the APC PMSA [106]. Computational approaches based on PMSA have the potential to serve as highly effective classifiers for different variations of hereditary cancer genes. Nevertheless, several attributes of the APC gene and protein might complicate the outcomes of in silico techniques. An organized examination of these characteristics could significantly enhance the mechanization of alignment-based methodologies and the application of prognostic algorithms in genes related to hereditary cancer [106].

4. In Silico Prediction of VUS Impact on Protein Structure in CRC Hereditary Syndromes

In the last decade, great efforts have been made by researchers and bioinformaticians to develop algorithms and data sources that could help predict the effects of germline and somatic mutations on the structural stability of cancer-associated proteins. Current methodologies are primarily based on in silico structural modeling software allowing to statically or dynamically study the identified variants [83]. During a biological process, proteins can assume different conformations thanks to their intrinsic flexibility, which is crucial for acquiring their native structure. The conformation of a mutant protein differs from the native one in terms of structure and stability, altering the fine balance that regulates the functional activity of the protein [109].

Molecular dynamics simulation (MDS) is a widely used method for investigating the conformational dynamics of biomolecules, particularly proteins [110]. It was shown to be especially valuable for modeling alterations in the three-dimensional (3D) structures of proteins resulting from mutations such as amino acid substitutions, which modify the bonds and locations of the atoms in the wild-type protein [111,112]. MDS computes the potential energy related to the spatial coordinates of each atom in the system. The system’s potential energy is determined by evaluating a range of chemical and physical properties associated with the protein. This approach allows researchers to accurately assess the effects of a missense mutation by measuring changes in atomic or residue distances, alterations in secondary and tertiary protein structures, and modifications to hydrogen, disulfide, and ionic bonds [109]. The precision of MDS is heavily reliant on the 3D configurations of biomolecules. The use of MDS software and the analysis of established force fields have effectively uncovered the structural modifications caused by mutations, which can lead to changes in the stability of a protein, thereby affecting its biological function. The five software packages most commonly used in this area are NAMD [111], MSCALE [113], CHARMM [114], GROMACS [115], and Amber [116].

Recently, a computational approach combining in silico structural analysis and MDS was used to investigate the relationship between PHTS-associated cancer and autism spectrum disorder (ASD) by analyzing 17 selected PTEN mutations detected in a cohort of 138 PHTS patients. Six mutations (p.L23F, p.Y65C, p.Y68H, p.I101T, p.I122S, and p.L220V) were found exclusively in patients with ASD, six mutations were found exclusively in patients with PHTS-associated cancer (p.D24G, p.D92A, p.R130G, p.M134R, p.M205V, and p.L345V), four mutations (p.R130Q, p.C136R, p.Y155C, and p.R173C) were found in both phenotypes in different patients, and one mutation was detected in a patient with both ASD and cancer (p.S170I). The MDS analysis performed using GROMACS v4.6.3 showed that the six PTEN mutations detected in PHTS-associated cancer patients strongly reduce the structural stability of the protein and increase the dynamics across the domain interfaces, causing a marked tendency to protein unfolding and the closure of the active site pocket. This ultimately results in the inactivation of the enzyme [117].

Another important example of the application of MDS in the analysis of the structural impact of VUSs is a novel protein structure-based algorithm called deep learning-Ramachandran plot-molecular dynamics simulation (DL-RP-MDS), which was recently used to assess the structural impact of MLH1 missense VUSs [118]. In this study, Tam and colleagues combined DL techniques with the RP-MDS method to analyze 447 MLH1 missense VUSs. Of these, 126 were predicted to have a deleterious effect on MLH1 structure and stability [118]. The RP-MDS method combines two in silico approaches to investigate the structural changes caused by a VUS [119]. RP captures the atomic angle distortion caused by amino acid substitution, while MDS simulates the physical movement of atoms and molecules after interacting for a fixed period, and the resulting trajectories are used to determine the macroscopic thermodynamic properties of the mutated protein [119]. In addition, these data were analyzed with an unsupervised learning model consisting of an auto-encoder and neural network classifier to identify the variants resulting in significant alterations in protein structure [119].

Ongoing advances in the methodologies used for studying 3D protein structures, such as NMR, X-ray crystallography, and cryoEM, have significantly increased the number of known protein structures archived in the Protein Data Bank (PDB) database (https://www.rcsb.org/, latest version updated in July 2024 [120]), which currently features 218,853 recorded structures and 1,068,577 computed structure models (accessed on 7 April 2024). The consistent growth of the PDB promoted the development of various in silico prediction tools to study the structural impact of a variant based on the structure of the wild-type protein recorded in this database. Algorithms that estimate the structural impact of a single amino acid substitution can be classified into two types based on whether or not they rely on free energy calculation [83]. Energy-based methods employ experimentally determined disparities in free energy (ΔΔG) between wild-type and variant structures to develop prediction models, while non-energy-based methods directly use structural features such as variation of hydrophobicity and surface accessibility [83]. These methods can then be used to predict the resulting functional implications. In Table 3, we provided a list of in silico software commonly used to analyze the 3D structures of protein variants and their potential effects on protein stability.

Several of these software tools have been taken advantage of to improve our knowledge about the structural impact of VUSs in CRC hereditary syndromes.

A few years ago, Doss et al. used I-Mutant 3.0, MUpro, SIFT, PolyPhen, PANTHER, and other tools to analyze the structural and functional effects of nonsynonymous SNPs in genes of the SMAD family. In this report, the primary mutations of SMAD native proteins, together with their amino acid locations (R358W, K306S, R310G, S433R, and R361C), were considered for structure analysis. To analyze the stability of the natural and mutant-modeled proteins, the authors used the SRide server [71]. SRide identified the stabilizing residues by calculating parameters like conservation score, stabilization center, long-range order, and surrounding hydrophobicity. The variation of potential energy and root mean square deviation values were calculated to compare the resulting native and modeled structures.

In 2022, DynaMut, DUET, and mCSM were used to predict the structural effect and the impact on gastric cancer hereditary susceptibility of a VUS (c.728G>A p.R243Q) identified in the MSH2 gene in a Tunisian family suspected of having both hereditary diffuse gastric cancer (HDGC) and LS. Structural prediction analysis of the variant revealed that it seems to disrupt the stability of the MSH2-MLH1 complex and its binding to the DNA [142]. Further molecular modeling investigation indicated that these effects may be due to changes in the electrostatic potential of the MSH2 interaction surface. Overall, this evidence suggested that the status of the variant should be revised from VUS to likely pathogenic [142].

In another study, I-Mutant3 and MUpro were used to identify MSH2 SNPs that could lead to structural and functional alterations resulting in CRC carcinogenesis. In particular, the authors performed a computational analysis of protein stability by integrating I-Mutant3 and MUpro support vector machine (SVM)-based algorithms. I-Mutant predicts alterations in protein stability caused by single amino acid substitutions based on the protein structure or sequence recorded in the ProTherm database. The ProTherm database comprises the most extensive and complete collection of experimental thermodynamic data. It specifically focuses on the changes in free energy resulting from mutations under various conditions and their effect on protein stability. MUpro is a machine learning-based tool that uses SVM and neural network algorithms to predict alterations in protein stability caused by individual amino acid substitutions [143]. In addition, four distinct computational tools (SIFT, PROVEAN, PANTHER, and PolyPhen) were used to predict the functional deleterious effects of MSH2 SNPs [143]. MDS techniques revealed that six SNPs located in the MSH2/MSH6 interaction domain have a significant impact on MSH2 stability and interactions [143].

In a more recent report, a comprehensive meta-analysis based on the use of various computational software tools allowed the authors to identify pathogenic missense variants in 26 genes (ABRAXAS1, ATM, BARD1, BLM, BRCA1, BRCA2, BRIP1, CDH1, CHEK2, EPCAM, MEN1, MLH1, MRE11, MSH2, MSH6, MUTYH, NBN, PALB2, PMS2, PTEN, RAD50, RAD51C, RAD51D, STK11, TP53, and XRCC2) examined in numerous NGS panels to assess the level of hereditary risk in various cancer types, including CRC. First, the authors collected over a thousand missense variations in these genes from ClinVar and a cohort of 355 breast cancer patients. The potential effects of missense variations on protein stability were evaluated with five distinct predictor programs (SAAF2EC, MUpro, MAESTRO, mCSM, and CUPSAT). Next, the authors used the protein structures predicted by AlphaFold (AF2), an artificial intelligence (AI) system, to perform a structure-based analysis of these hereditary cancer proteins. According to previous AF2-derived findings, the confidence score for a particular variant in the AF2 structure may predict pathogenicity more reliably than any stability predictor. This study confirmed that the AF2 confidence score can be used as a valid indicator of variant pathogenicity [144]. These studies are good examples of how in silico methods can be effectively used to locate putative pathogenic variants eligible for large-scale investigations.

In recent years, AI has proven to be a valuable tool for integrating the different in silico methods available for VUS analysis in order to expand the knowledge of VUS structure-function relationships and improve their clinical interpretation [145]. The latest advancements in AI prediction for missense variants, specifically focusing on protein structure-based approaches, highlight the complexity and the potential of this intriguing approach. Significantly, progress in protein structure prediction using deep learning, as is the case with AlphaFold2 [146] and RoseTTAFold [147], has enhanced AI models for estimating the effects of protein variants by including data on tertiary structures [83]. AlphaFold is a pioneering computational approach able to accurately predict protein structures at the atomic level, even in cases where a comparable structure is not available. RoseTTAFold (version 2.0, 2021) is an advanced software that employs deep learning techniques to rapidly and precisely predict protein structures with only a small amount of data. While ascertaining the configuration of a single protein can take several years of laboratory experimentation without the assistance of computational approaches, it can be estimated in just a few minutes using such dedicated software [148]. Importantly, these AI-based sequence and structural prediction algorithms are constantly being updated. For instance, the most recent version of Rosetta, RoseTTAFold All-Atom (RFAA), models complexes that contain proteins, nucleic acids, small molecules, metals, and covalent modifications based on their sequences and chemical structures [148]. Hopefully, in the near future, this tool will thus be integrated with an algorithm for determining the impact of genetic alterations on protein structure and function.

5. Conclusions

Recognizing whether a VUS is pathogenic or benign can help clinicians interpret the findings of genetic testing and provide guidance to patients and their family members who have inherited the variant. This enables a more informed clinical assessment of their “personalized” cancer risk and a better choice of follow-up options. According to recent research, cancer patients who have not responded to previous treatments might benefit from referring to multidisciplinary molecular tumor board teams [149,150]. Based on a thorough integrated review of the results of genetic testing, in silico prediction analysis, other laboratory results (imaging, pathology, biomarkers, etc.), the patient’s clinical and family history, and possibly available clinical trials, these interdisciplinary teams can then recommend tailored therapeutic solutions.

Considering that VUSs represent a high proportion of all genetic variants identified, the development of more accurate in silico predictors of their impact to support clinical surveillance decisions remains a riveting challenge [151]. The main advantage of these tools is that they provide initial insights into the potential pathogenic effect of a variant in a fast and affordable manner. Indeed, functional studies, although necessary for the classification of VUSs, cannot be considered the first approach to evaluate VUS clinical significance because they are expensive and time-consuming, which is unsustainable when dealing with rare syndromes.

In our opinion, there exists no single ideal tool capable of definitively addressing the crucial question of the possible pathogenicity of a VUS. Although different in silico tools are designed to evaluate specific effects of a VUS, in silico meta-analyses with multivariate approaches are needed to analyze multiple aspects of the clinical significance of a variant. For example, sequence-based algorithms are limited in interpreting the potential clinical significance of a VUS because they do not consider the three-dimensional structural features that determine the protein’s function. In fact, the development of AI-based algorithms that combined the structural and sequence features has significantly improved the performance of variant prediction.

However, in silico tools have limitations that can sometimes be confusing rather than clarifying. For example, different predictors can provide conflicting responses even when analyzing the same variant. This happens not only because these tools assess a variety of structural and functional characteristics (ΔΔG, conserved positions, surface or internal mutations in the three-dimensional structure of proteins, chemical alterations in secondary and tertiary structures, etc.) but also because their algorithms are frequently designed using inaccurate benchmarks. Currently, bioinformaticians who develop a novel prediction tool assess the performance of their new software by comparing it with previously published predictors using consolidated variant databases. This frequently introduces bias because predictor performance is evaluated with the same data used to create the tool. Thus, strong and impartial benchmarking by independent groups is necessary to develop more accurate tools [145].

Massive advancements have been achieved in the computational prediction of the structural and functional impact of genetic variants in recent years. Prediction tools offer a scalable and quick way for clinical and research laboratories to assess the potential effects of novel variants. However, determining to what extent clinicians can trust the findings from in silico prediction methods is still a challenging task. According to ACMG guidelines, the specificity of most in silico tools is rather low, which affects their reliability when it comes to predicting missense changes with a milder effect and causes missense variants to be overpredicted as deleterious [152]. While computational prediction methods alone are insufficient to ascertain the pathogenicity of a variant, they are very useful in selecting the VUSs that warrant experimental characterization to validate their clinical significance, especially for VUSs detected in patients (or their relatives) with hereditary CRC syndrome phenotypes.

Furthermore, these tools are usually based on complicated algorithms that are difficult to handle for non-experts, and their use is hindered by the difficulty of correctly interpreting the results. A further limitation to their application in clinical routine is the so-called data circularity [153]. Grimm and collaborators defined two types of circularities that can distort the evaluation of predictor tools. Type 1 circularity mostly impacts techniques that are based on machine learning. A technique is vulnerable to type 1 circularity when it reuses training data for the model in the validation of its execution. Type 2 circularity arises when the same datasets of protein variants are used for the training and evaluation of the tools employed for predicting the clinical significance of a VUS. This may lead to misleading conclusions on the predictive ability of the algorithms in the study of proteins that have an equal number of pathogenic and benign variants, potentially resulting in inaccurate predictions [153]. In particular, predictors are frequently tested for effectiveness using extensive datasets containing confirmed deleterious or benign genetic variations. The benchmarking data may overlap with the data used to train certain supervised predictors, resulting in data reuse or circularity. This, in turn, can lead to an overestimation of the performance and effectiveness of such predictors [145]. Large-scale functional tests known as deep mutational scans offer a possible solution to the problem of circularity by providing independent datasets of variant effect measurements. Such functional tests appear more reliable in predicting the clinical impact of mutations [145]. In addition, the remarkable developments made in protein structure prediction and MDS techniques not only demonstrate the potential benefits of AI in structural biology but also open new promising horizons in AI-assisted structural and functional studies of genetic variants, especially VUSs. As shown by the growing number of articles published on this topic, MDS and structural-functional predictors are becoming crucial for the assessment of the functional and clinical impact of VUSs. Previous reports have demonstrated that these integrated approaches are feasible and provided hints for creating learning models with even more accurate variant effect prediction capabilities despite also highlighting a variety of issues [83]. Predictive structural AI-based methodologies also have the potential to overcome the main limitations of in silico tools in VUS evaluation, allowing the development of increasingly personalized clinical management strategies for patients (Figure 1). A broader application of AI-based structure prediction tools for protein function analysis may accelerate the assessment of the clinical impact of a variant by reducing the time and number of experiments needed to confirm it. As a result, current research is focused on creating new algorithms designed to model protein structures and predict within a single in silico pipeline the structural and functional effects of VUSs, thereby allowing a more accurate interpretation of their clinical implications.

Author Contributions

Conceptualization, C.F. and C.S.; writing—original draft preparation, C.F.; writing—review and editing, C.F., V.G. and C.S.; visualization, P.S., G.F., M.L.S., K.D.M. and V.D.; supervision, C.S.; project administration, C.S.; funding acquisition, C.S., C.F., V.D., V.G., P.S. and M.L.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Italian Ministry of Health “Ricerca Corrente 2024–2026” to C.F. and V.D.; “Ricerca Corrente 2023–2025” to V.G.; “Ricerca Corrente 2024–2026” to C.S., “Starting Grant” SG-2019-12371540 to P.S., by the Italian Association for Cancer Research (AIRC) IG-23794 to C.S. and an AIRC Fellowship for Italy to M.L.S. (ID26678-2021).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The authors reported the link to access to each dataset, software, and server mentioned in this work.

Acknowledgments

We thank Francesco Paolo Jori for his helpful discussion during the manuscript preparation and editorial assistance.

Conflicts of Interest

The authors declare no conflict of interest.

References

Ma, H.; Brosens, L.A.A.; Offerhaus, G.J.A.; Giardiello, F.M.; de Leng, W.W.J.; Montgomery, E.A. Pathology and genetics of hereditary colorectal cancer. Pathology 2018, 50, 49–59. [Google Scholar] [CrossRef] [PubMed]
Siegel, R.L.; Miller, K.D.; Fuchs, H.E.; Jemal, A. Cancer statistics, 2022. CA. Cancer J. Clin. 2022, 72, 7–33. [Google Scholar] [CrossRef] [PubMed]
Siegel, R.L.; Wagle, N.S.; Cercek, A.; Smith, R.A.; Jemal, A. Colorectal cancer statistics, 2023. CA. Cancer J. Clin. 2023, 73, 233–254. [Google Scholar] [CrossRef] [PubMed]
Valle, L.; Vilar, E.; Tavtigian, S.V.; Stoffel, E.M. Genetic predisposition to colorectal cancer: Syndromes, genes, classification of genetic variants and implications for precision medicine. J. Pathol. 2019, 247, 574–588. [Google Scholar] [CrossRef] [PubMed]
Stern, B.; McGarrity, T.; Baker, M. Incorporating Colorectal Cancer Genetic Risk Assessment into Gastroenterology Practice. Curr. Treat. Options Gastroenterol. 2019, 17, 702–715. [Google Scholar] [CrossRef] [PubMed]
Yurgelun, M.B.; Kulke, M.H.; Fuchs, C.S.; Allen, B.A.; Uno, H.; Hornick, J.L.; Ukaegbu, C.I.; Brais, L.K.; McNamara, P.G.; Mayer, R.J.; et al. Cancer Susceptibility Gene Mutations in Individuals with Colorectal Cancer. J. Clin. Oncol. Off. J. Am. Soc. Clin. Oncol. 2017, 35, 1086–1095. [Google Scholar] [CrossRef] [PubMed]
Abbes, S.; Baldi, S.; Sellami, H.; Amedei, A.; Keskes, L. Molecular methods for colorectal cancer screening: Progress with next-generation sequencing evolution. World J. Gastrointest. Oncol. 2023, 15, 425–442. [Google Scholar] [CrossRef] [PubMed]
Valle, L.; Monahan, K.J. Genetic predisposition to gastrointestinal polyposis: Syndromes, tumour features, genetic testing, and clinical management. Lancet Gastroenterol. Hepatol. 2024, 9, 68–82. [Google Scholar] [CrossRef] [PubMed]
Oh, R.Y.; AlMail, A.; Cheerie, D.; Guirguis, G.; Hou, H.; Yuki, K.E.; Haque, B.; Thiruvahindrapuram, B.; Marshall, C.R.; Mendoza-Londono, R.; et al. A systematic assessment of the impact of rare canonical splice site variants on splicing using functional and in silico methods. Hum. Genet. Genomics Adv. 2024, 5, 100299. [Google Scholar] [CrossRef]
Ward, A.J.; Cooper, T.A. The pathobiology of splicing. J. Pathol. 2010, 220, 152–163. [Google Scholar] [CrossRef]
McInnes, G.; Sharo, A.G.; Koleske, M.L.; Brown, J.E.H.; Norstad, M.; Adhikari, A.N.; Wang, S.; Brenner, S.E.; Halpern, J.; Koenig, B.A.; et al. Opportunities and challenges for the computational interpretation of rare variation in clinically important genes. Am. J. Hum. Genet. 2021, 108, 535–548. [Google Scholar] [CrossRef]
Kahi, C.J.; Myers, L.J.; Slaven, J.E.; Haggstrom, D.; Pohl, H.; Robertson, D.J.; Imperiale, T.F. Lower endoscopy reduces colorectal cancer incidence in older individuals. Gastroenterology 2014, 146, 718–725.e3. [Google Scholar] [CrossRef] [PubMed]
Lower, A.H.; Moseley, R.H. Outcomes of colorectal cancer screening. Gastroenterology 2014, 146, 596–597. [Google Scholar]
Benson, A.B.; Venook, A.P.; Adam, M.; Chang, G.; Chen, Y.-J.; Ciombor, K.K.; Cohen, S.A.; Cooper, H.S.; Deming, D.; Garrido-Laguna, I.; et al. Colon Cancer, Version 3.2024, NCCN Clinical Practice Guidelines in Oncology. J. Natl. Compr. Cancer Netw. JNCCN 2024, 22, e240029. [Google Scholar] [CrossRef]
Zhunussova, G.; Afonin, G.; Abdikerim, S.; Jumanov, A.; Perfilyeva, A.; Kaidarova, D.; Djansugurova, L. Mutation Spectrum of Cancer-Associated Genes in Patients with Early Onset of Colorectal Cancer. Front. Oncol. 2019, 9, 673. [Google Scholar] [CrossRef]
Lewandowska, A.; Rudzki, G.; Lewandowski, T.; Stryjkowska-Góra, A.; Rudzki, S. Risk Factors for the Diagnosis of Colorectal Cancer. Cancer Control J. Moffitt Cancer Cent. 2022, 29, 10732748211056692. [Google Scholar] [CrossRef] [PubMed]
Berbecka, M.; Berbecki, M.; Gliwa, A.M.; Szewc, M.; Sitarz, R. Managing Colorectal Cancer from Ethology to Interdisciplinary Treatment: The Gains and Challenges of Modern Medicine. Int. J. Mol. Sci. 2024, 25, 2032. [Google Scholar] [CrossRef] [PubMed]
Schmoll, H.J.; Van Cutsem, E.; Stein, A.; Valentini, V.; Glimelius, B.; Haustermans, K.; Nordlinger, B.; van de Velde, C.J.; Balmana, J.; Regula, J.; et al. ESMO Consensus Guidelines for management of patients with colon and rectal cancer. a personalized approach to clinical decision making. Ann. Oncol. Off. J. Eur. Soc. Med. Oncol. 2012, 23, 2479–2516. [Google Scholar] [CrossRef] [PubMed]
Rex, D.K.; Boland, C.R.; Dominitz, J.A.; Giardiello, F.M.; Johnson, D.A.; Kaltenbach, T.; Levin, T.R.; Lieberman, D.; Robertson, D.J. Colorectal Cancer Screening: Recommendations for Physicians and Patients From the U.S. Multi-Society Task Force on Colorectal Cancer. Gastroenterology 2017, 153, 307–323. [Google Scholar] [CrossRef]
Disciglio, V.; Fasano, C.; Cariola, F.; Forte, G.; Grossi, V.; Sanese, P.; Lepore Signorile, M.; Resta, N.; Lotesoriere, C.; Stella, A.; et al. Gastric polyposis and desmoid tumours as a new familial adenomatous polyposis clinical variant associated with APC mutation at the extreme 3’-end. J. Med. Genet. 2020, 57, 356–360. [Google Scholar] [CrossRef]
Kinzler, K.W.; Vogelstein, B. Lessons from hereditary colorectal cancer. Cell 1996, 87, 159–170. [Google Scholar] [CrossRef] [PubMed]
Sadien, I.D.; Davies, R.J.; Wheeler, J. The genomics of sporadic and hereditary colorectal cancer. Ann. R. Coll. Surg. Engl. 2024, 106, 313–320. [Google Scholar] [CrossRef] [PubMed]
Knudson, A.G. Two genetic hits (more or less) to cancer. Nat. Rev. Cancer 2001, 1, 157–162. [Google Scholar] [CrossRef] [PubMed]
Fearon, E.R.; Vogelstein, B. A genetic model for colorectal tumorigenesis. Cell 1990, 61, 759–767. [Google Scholar] [CrossRef] [PubMed]
Waliszewski, P. Controversies about the genetic model of colorectal tumorigenesis. Pol. J. Pathol. Off. J. Pol. Soc. Pathol. 1995, 46, 239–243. [Google Scholar]
Brosens, L.A.A.; Wood, L.D.; Offerhaus, G.J.; Arnold, C.A.; Lam-Himlin, D.; Giardiello, F.M.; Montgomery, E.A. Pathology and Genetics of Syndromic Gastric Polyps. Int. J. Surg. Pathol. 2016, 24, 185–199. [Google Scholar] [CrossRef]
Grover, S.; Kastrinos, F.; Steyerberg, E.W.; Cook, E.F.; Dewanwala, A.; Burbidge, L.A.; Wenstrup, R.J.; Syngal, S. Prevalence and phenotypes of APC and MUTYH mutations in patients with multiple colorectal adenomas. JAMA 2012, 308, 485–492. [Google Scholar] [CrossRef] [PubMed]
Henrie, A.; Hemphill, S.E.; Ruiz-Schultz, N.; Cushman, B.; DiStefano, M.T.; Azzariti, D.; Harrison, S.M.; Rehm, H.L.; Eilbeck, K. ClinVar Miner: Demonstrating utility of a Web-based tool for viewing and filtering ClinVar data. Hum. Mutat. 2018, 39, 1051–1060. [Google Scholar] [CrossRef]
MUTYH-Associated Tumor Syndrome: The Other Face of MAP. Available online: https://pubmed.ncbi.nlm.nih.gov/35422474/ (accessed on 11 April 2024).
Guarinos, C.; Juárez, M.; Egoavil, C.; Rodríguez-Soler, M.; Pérez-Carbonell, L.; Salas, R.; Cubiella, J.; Rodríguez-Moranta, F.; de-Castro, L.; Bujanda, L.; et al. Prevalence and characteristics of MUTYH-associated polyposis in patients with multiple adenomatous and serrated polyps. Clin. Cancer Res. Off. J. Am. Assoc. Cancer Res. 2014, 20, 1158–1168. [Google Scholar] [CrossRef]
Win, A.K.; Dowty, J.G.; Cleary, S.P.; Kim, H.; Buchanan, D.D.; Young, J.P.; Clendenning, M.; Rosty, C.; MacInnis, R.J.; Giles, G.G.; et al. Risk of colorectal cancer for carriers of mutations in MUTYH, with and without a family history of cancer. Gastroenterology 2014, 146, 1208–1211.e5. [Google Scholar] [CrossRef]
Roberts, M.E.; Nimrichter, S.; Marshall, M.L.; Flynn, E.K.; Person, R.; Hruska, K.S.; Kruszka, P.; Juusola, J. Phenotypic continuum between POLE-related recessive disorders: A case report and literature review. Am. J. Med. Genet. A. 2022, 188, 3121–3125. [Google Scholar] [CrossRef] [PubMed]
Palles, C.; Cazier, J.-B.; Howarth, K.M.; Domingo, E.; Jones, A.M.; Broderick, P.; Kemp, Z.; Spain, S.L.; Guarino, E.; Salguero, I.; et al. Germline mutations affecting the proofreading domains of POLE and POLD1 predispose to colorectal adenomas and carcinomas. Nat. Genet. 2013, 45, 136–144. [Google Scholar] [CrossRef] [PubMed]
Weren, R.D.A.; Ligtenberg, M.J.L.; Kets, C.M.; de Voer, R.M.; Verwiel, E.T.P.; Spruijt, L.; van Zelst-Stams, W.A.G.; Jongmans, M.C.; Gilissen, C.; Hehir-Kwa, J.Y.; et al. A germline homozygous mutation in the base-excision repair gene NTHL1 causes adenomatous polyposis and colorectal cancer. Nat. Genet. 2015, 47, 668–671. [Google Scholar] [CrossRef] [PubMed]
Pinto, C.; Guerra, J.; Pinheiro, M.; Escudeiro, C.; Santos, C.; Pinto, P.; Porto, M.; Bartosch, C.; Silva, J.; Peixoto, A.; et al. Combined germline and tumor mutation signature testing identifies new families with NTHL1 tumor syndrome. Front. Genet. 2023, 14, 1254908. [Google Scholar] [CrossRef] [PubMed]
Gorji, L.; Albrecht, P. Hamartomatous polyps: Diagnosis, surveillance, and management. World J. Gastroenterol. 2023, 29, 1304–1314. [Google Scholar] [CrossRef] [PubMed]
Resta, N.; Pierannunzio, D.; Lenato, G.M.; Stella, A.; Capocaccia, R.; Bagnulo, R.; Lastella, P.; Susca, F.C.; Bozzao, C.; Loconte, D.C.; et al. Cancer risk associated with STK11/LKB1 germline mutations in Peutz–Jeghers syndrome patients: Results of an Italian multicenter study. Dig. Liver Dis. 2013, 45, 606–611. [Google Scholar] [CrossRef] [PubMed]
Clark, R.A.F.; Pavlis, M. Dysregulation of the mTOR pathway secondary to mutations or a hostile microenvironment contributes to cancer and poor wound healing. J. Investig. Dermatol. 2009, 129, 529–531. [Google Scholar] [CrossRef] [PubMed]
Bourouh, M.; Marignani, P.A. The Tumor Suppressor Kinase LKB1: Metabolic Nexus. Front. Cell Dev. Biol. 2022, 10, 881297. [Google Scholar] [CrossRef]
Choudhury, D.; Ghosh, D.; Mondal, M.; Singha, D.; Pothuraju, R.; Malakar, P. Polyploidy and mTOR signaling: A possible molecular link. Cell Commun. Signal. CCS 2024, 22, 196. [Google Scholar] [CrossRef]
Cavaillé, M.; Crampon, D.; Achim, V.; Bubien, V.; Uhrhammer, N.; Privat, M.; Ponelle-Chachuat, F.; Gay-Bellile, M.; Lepage, M.; Ouedraogo, Z.G.; et al. Diagnosis of PTEN mosaicism: The relevance of additional tumor DNA sequencing. A case report and review of the literature. BMC Med. Genom. 2023, 16, 166. [Google Scholar] [CrossRef]
Litzendorf, M.; Hoang, K.; Vaccaro, P. Recurrent and extensive vascular malformations in a patient with Bannayan--Riley--Ruvalcaba syndrome. Ann. Vasc. Surg. 2011, 25, 1138.e15–1138.e19. [Google Scholar] [CrossRef] [PubMed]
Yehia, L.; Heald, B.; Eng, C. Clinical Spectrum and Science Behind the Hamartomatous Polyposis Syndromes. Gastroenterology 2023, 164, 800–811. [Google Scholar] [CrossRef] [PubMed]
Alanazi, A.I.; Alanezi, T.; Aljofan, Z.F.; Alarabi, A.; Elwatidy, S. Lhermitte-Duclos disease: A systematic review. Surg. Neurol. Int. 2023, 14, 351. [Google Scholar] [CrossRef] [PubMed]
Cammarata, E.; Andreassi, M.; Gironi, L.C.; Savoia, P. Segmental overgrowth, lipomatosis, arteriovenous malformation and epidermal nevus (SOLAMEN) syndrome: Sarcomatous transformation. Ital. J. Dermatol. Venereol. 2022, 157, 298–299. [Google Scholar] [CrossRef] [PubMed]
Eng, C.; Thiele, H.; Zhou, X.P.; Gorlin, R.J.; Hennekam, R.C.; Winter, R.M. PTEN mutations and proteus syndrome. Lancet Lond. Engl. 2001, 358, 2079–2080. [Google Scholar] [CrossRef] [PubMed]
Dhamija, R.; Hoxworth, J.M. Imaging of PTEN-related abnormalities in the central nervous system. Clin. Imaging 2020, 60, 180–185. [Google Scholar] [CrossRef] [PubMed]
Lieberman, S.; Walsh, T.; Schechter, M.; Adar, T.; Goldin, E.; Beeri, R.; Sharon, N.; Baris, H.; Ben Avi, L.; Half, E.; et al. Features of Patients with Hereditary Mixed Polyposis Syndrome Caused by Duplication of GREM1 and Implications for Screening and Surveillance. Gastroenterology 2017, 152, 1876–1880.e1. [Google Scholar] [CrossRef]
Rosty, C.; Brosens, L.A.A. Pathology of Gastrointestinal Polyposis Disorders. Gastroenterol. Clin. N. Am. 2024, 53, 179–200. [Google Scholar] [CrossRef] [PubMed]
van Herwaarden, Y.J.; Koggel, L.M.; Simmer, F.; Vink-Börger, E.M.; Dura, P.; Meijer, G.A.; Nagengast, F.M.; Hoogerbrugge, N.; Bisseling, T.M.; Nagtegaal, I.D. RNF43 mutation analysis in serrated polyposis, sporadic serrated polyps and Lynch syndrome polyps. Histopathology 2021, 78, 749–758. [Google Scholar] [CrossRef]
Mikaeel, R.R.; Young, J.P.; Li, Y.; Poplawski, N.K.; Smith, E.; Horsnell, M.; Uylaki, W.; Tomita, Y.; Townsend, A.R.; Feng, J.; et al. RNF43 pathogenic Germline variant in a family with colorectal cancer. Clin. Genet. 2022, 101, 122–126. [Google Scholar] [CrossRef] [PubMed]
Chan, J.M.; Clendenning, M.; Joseland, S.; Georgeson, P.; Mahmood, K.; Joo, J.E.; Walker, R.; Como, J.; Preston, S.; Chai, S.M.; et al. Inherited BRCA1 and RNF43 pathogenic variants in a familial colorectal cancer type X family. Fam. Cancer 2024, 23, 9–21. [Google Scholar] [CrossRef] [PubMed]
Lepore Signorile, M.; Disciglio, V.; Di Carlo, G.; Pisani, A.; Simone, C.; Ingravallo, G. From Genetics to Histomolecular Characterization: An Insight into Colorectal Carcinogenesis in Lynch Syndrome. Int. J. Mol. Sci. 2021, 22, 6767. [Google Scholar] [CrossRef] [PubMed]
Nolano, A.; Medugno, A.; Trombetti, S.; Liccardo, R.; De Rosa, M.; Izzo, P.; Duraturo, F. Hereditary Colorectal Cancer: State of the Art in Lynch Syndrome. Cancers 2022, 15, 75. [Google Scholar] [CrossRef] [PubMed]
Vasen, H.F.; Watson, P.; Mecklin, J.P.; Lynch, H.T. New clinical criteria for hereditary nonpolyposis colorectal cancer (HNPCC, Lynch syndrome) proposed by the International Collaborative group on HNPCC. Gastroenterology 1999, 116, 1453–1456. [Google Scholar] [CrossRef]
Pantaleo, A.; Forte, G.; Cariola, F.; Valentini, A.M.; Fasano, C.; Sanese, P.; Grossi, V.; Buonadonna, A.L.; De Marco, K.; Lepore Signorile, M.; et al. Tumor Testing and Genetic Analysis to Identify Lynch Syndrome Patients in an Italian Colorectal Cancer Cohort. Cancers 2023, 15, 5061. [Google Scholar] [CrossRef] [PubMed]
Terradas, M.; Capellá, G.; Valle, L. Dominantly Inherited Hereditary Nonpolyposis Colorectal Cancer Not Caused by MMR Genes. J. Clin. Med. 2020, 9, 1954. [Google Scholar] [CrossRef] [PubMed]
Gyulkhandanyan, A.; Rezaie, A.R.; Roumenina, L.; Lagarde, N.; Fremeaux-Bacchi, V.; Miteva, M.A.; Villoutreix, B.O. Analysis of protein missense alterations by combining sequence- and structure-based methods. Mol. Genet. Genom. Med. 2020, 8, e1166. [Google Scholar] [CrossRef] [PubMed]
Choi, Y.; Sims, G.E.; Murphy, S.; Miller, J.R.; Chan, A.P. Predicting the functional effect of amino acid substitutions and indels. PLoS ONE 2012, 7, e46688. [Google Scholar] [CrossRef] [PubMed]
Ng, P.C.; Henikoff, S. Predicting deleterious amino acid substitutions. Genome Res. 2001, 11, 863–874. [Google Scholar] [CrossRef]
Ng, P.C.; Henikoff, S. SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res. 2003, 31, 3812–3814. [Google Scholar] [CrossRef]
Ramensky, V.; Bork, P.; Sunyaev, S. Human non-synonymous SNPs: Server and survey. Nucleic Acids Res. 2002, 30, 3894–3900. [Google Scholar] [CrossRef]
Adzhubei, I.A.; Schmidt, S.; Peshkin, L.; Ramensky, V.E.; Gerasimova, A.; Bork, P.; Kondrashov, A.S.; Sunyaev, S.R. A method and server for predicting damaging missense mutations. Nat. Methods 2010, 7, 248–249. [Google Scholar] [CrossRef]
Chao, E.C.; Velasquez, J.L.; Witherspoon, M.S.L.; Rozek, L.S.; Peel, D.; Ng, P.; Gruber, S.B.; Watson, P.; Rennert, G.; Anton-Culver, H.; et al. Accurate classification of MLH1/MSH2 missense variants with multivariate analysis of protein polymorphisms-mismatch repair (MAPP-MMR). Hum. Mutat. 2008, 29, 852–860. [Google Scholar] [CrossRef]
Choi, Y.; Chan, A.P. PROVEAN web server: A tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics 2015, 31, 2745–2747. [Google Scholar] [CrossRef]
Wang, L.; Tu, H.; Zeng, L.; Gao, R.; Luo, S.; Xiong, C. Identification and in silico Analysis of Nonsense SNPs of Human Colorectal Cancer Protein. J. Oleo Sci. 2022, 71, 363–370. [Google Scholar] [CrossRef] [PubMed]
Jansen, A.M.L.; Ghosh, P.; Dakal, T.C.; Slavin, T.P.; Boland, C.R.; Goel, A. Novel candidates in early-onset familial colorectal cancer. Fam. Cancer 2020, 19, 1–10. [Google Scholar] [CrossRef] [PubMed]
Reva, B.; Antipin, Y.; Sander, C. Predicting the functional impact of protein mutations: Application to cancer genomics. Nucleic Acids Res. 2011, 39, e118. [Google Scholar] [CrossRef] [PubMed]
De Nicola, F.; Goeman, F.; Pallocca, M.; Sperati, F.; Pizzuti, L.; Melucci, E.; Casini, B.; Amoreo, C.A.; Gallo, E.; Diodoro, M.G.; et al. Deep sequencing and pathway-focused analysis revealed multigene oncodriver signatures predicting survival outcomes in advanced colorectal cancer. Oncogenesis 2018, 7, 55. [Google Scholar] [CrossRef]
Mi, H.; Muruganujan, A.; Thomas, P.D. PANTHER in 2013: Modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees. Nucleic Acids Res. 2013, 41, D377–D386. [Google Scholar] [CrossRef]
George Priya Doss, C.; Nagasundaram, N.; Tanwar, H. Predicting the impact of deleterious single point mutations in SMAD gene family using structural bioinformatics approach. Interdiscip. Sci. Comput. Life Sci. 2012, 4, 103–115. [Google Scholar] [CrossRef]
Schwarz, J.M.; Cooper, D.N.; Schuelke, M.; Seelow, D. MutationTaster2: Mutation prediction for the deep-sequencing age. Nat. Methods 2014, 11, 361–362. [Google Scholar] [CrossRef] [PubMed]
Landrum, M.J.; Lee, J.M.; Riley, G.R.; Jang, W.; Rubinstein, W.S.; Church, D.M.; Maglott, D.R. ClinVar: Public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. 2014, 42, D980–D985. [Google Scholar] [CrossRef] [PubMed]
Stenson, P.D.; Mort, M.; Ball, E.V.; Shaw, K.; Phillips, A.D.; Cooper, D.N. The Human Gene Mutation Database: Building a comprehensive mutation repository for clinical and molecular genetics, diagnostic testing and personalized genomic medicine. Hum. Genet. 2014, 133, 1–9. [Google Scholar] [CrossRef]
Xu, J.; Song, J.; Zhu, W.; Zuo, L.; Wu, J.; Zhang, L.; Wang, T.; Guo, J. A novel germline frameshift mutation in the MLH1 gene in a patient with Lynch syndrome. Cancer Genet. 2023, 274–275, 54–58. [Google Scholar] [CrossRef] [PubMed]
Hecht, M.; Bromberg, Y.; Rost, B. Better prediction of functional effects for sequence variants. BMC Genom. 2015, 16 (Suppl. S8), S1. [Google Scholar] [CrossRef] [PubMed]
Islam, M.J.; Parves, M.R.; Mahmud, S.; Tithi, F.A.; Reza, M.A. Assessment of structurally and functionally high-risk nsSNPs impacts on human bone morphogenetic protein receptor type IA (BMPR1A) by computational approach. Comput. Biol. Chem. 2019, 80, 31–45. [Google Scholar] [CrossRef] [PubMed]
Mathe, E.; Olivier, M.; Kato, S.; Ishioka, C.; Hainaut, P.; Tavtigian, S.V. Computational approaches for predicting the biological effect of p53 missense mutations: A comparison of three sequence analysis based methods. Nucleic Acids Res. 2006, 34, 1317–1325. [Google Scholar] [CrossRef] [PubMed]
Kidambi, T.D.; Goldberg, D.; Nussbaum, R.; Blanco, A.; Umetsu, S.E.; Terdiman, J.P.; Lee, J.K. Novel variant of unknown significance in MUTYH in a patient with MUTYH-associated polyposis: A case to reclassify. Clin. J. Gastroenterol. 2018, 11, 457–460. [Google Scholar] [CrossRef]
Zhao, B.; Li, J.; Sinha, S.; Qin, Z.; Kou, S.H.; Xiao, F.; Lei, H.; Chen, T.; Cao, W.; Ding, X.; et al. Pathogenic variants in human DNA damage repair genes mostly arose in recent human history. BMC Cancer 2024, 24, 415. [Google Scholar] [CrossRef]
Mahdouani, M.; Zhuri, D.; Sezginer Guler, H.; Hmida, D.; Sana, M.; Azaza, M.; Ben Said, M.; Masmoudi, S.; Hmila, F.; Youssef, S.; et al. Functional analysis of MMR gene VUS from potential Lynch syndrome patients. PLoS ONE 2024, 19, e0304141. [Google Scholar] [CrossRef]
Duong, H.T.T.; Suzuki, H.; Katagiri, S.; Shibata, M.; Arai, M.; Yura, K. Computational study of the impact of nucleotide variations on highly conserved proteins: In the case of actin. Biophys. Physicobiol. 2022, 19, e190025. [Google Scholar] [CrossRef]
Li, C.; Luo, Y.; Xie, Y.; Zhang, Z.; Liu, Y.; Zou, L.; Xiao, F. Structural and functional prediction, evaluation, and validation in the post-sequencing era. Comput. Struct. Biotechnol. J. 2024, 23, 446–451. [Google Scholar] [CrossRef]
Gasperini, M.; Starita, L.; Shendure, J. The power of multiplexed functional analysis of genetic variants. Nat. Protoc. 2016, 11, 1782–1787. [Google Scholar] [CrossRef]
Krassowski, M.; Paczkowska, M.; Cullion, K.; Huang, T.; Dzneladze, I.; Ouellette, B.F.F.; Yamada, J.T.; Fradet-Turcotte, A.; Reimand, J. ActiveDriverDB: Human disease mutations and genome variation in post-translational modification sites of proteins. Nucleic Acids Res. 2018, 46, D901–D910. [Google Scholar] [CrossRef]
Cerami, E.; Gao, J.; Dogrusoz, U.; Gross, B.E.; Sumer, S.O.; Aksoy, B.A.; Jacobsen, A.; Byrne, C.J.; Heuer, M.L.; Larsson, E.; et al. The cBio cancer genomics portal: An open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2012, 2, 401–404. [Google Scholar] [CrossRef] [PubMed]
Sondka, Z.; Dhir, N.B.; Carvalho-Silva, D.; Jupe, S.; Madhumita; McLaren, K.; Starkey, M.; Ward, S.; Wilding, J.; Ahmed, M.; et al. COSMIC: A curated database of somatic variants and clinical data for cancer. Nucleic Acids Res. 2024, 52, D1210–D1217. [Google Scholar] [CrossRef]
Liu, X.; Jian, X.; Boerwinkle, E. dbNSFP: A lightweight database of human nonsynonymous SNPs and their functional predictions. Hum. Mutat. 2011, 32, 894–899. [Google Scholar] [CrossRef] [PubMed]
Sherry, S.T.; Ward, M.H.; Kholodov, M.; Baker, J.; Phan, L.; Smigielski, E.M.; Sirotkin, K. dbSNP: The NCBI database of genetic variation. Nucleic Acids Res. 2001, 29, 308–311. [Google Scholar] [CrossRef] [PubMed]
Lappalainen, I.; Lopez, J.; Skipper, L.; Hefferon, T.; Spalding, J.D.; Garner, J.; Chen, C.; Maguire, M.; Corbett, M.; Zhou, G.; et al. DbVar and DGVa: Public archives for genomic structural variation. Nucleic Acids Res. 2013, 41, D936–D941. [Google Scholar] [CrossRef]
Ainscough, B.J.; Griffith, M.; Coffman, A.C.; Wagner, A.H.; Kunisaki, J.; Choudhary, M.N.; McMichael, J.F.; Fulton, R.S.; Wilson, R.K.; Griffith, O.L.; et al. DoCM: A database of curated mutations in cancer. Nat. Methods 2016, 13, 806–807. [Google Scholar] [CrossRef]
Lek, M.; Karczewski, K.J.; Minikel, E.V.; Samocha, K.E.; Banks, E.; Fennell, T.; O’Donnell-Luria, A.H.; Ware, J.S.; Hill, A.J.; Cummings, B.B.; et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 2016, 536, 285–291. [Google Scholar] [CrossRef]
Stenson, P.D.; Mort, M.; Ball, E.V.; Evans, K.; Hayden, M.; Heywood, S.; Hussain, M.; Phillips, A.D.; Cooper, D.N. The Human Gene Mutation Database: Towards a comprehensive repository of inherited mutation data for medical research, genetic diagnosis and next-generation sequencing studies. Hum. Genet. 2017, 136, 665–677. [Google Scholar] [CrossRef] [PubMed]
Thompson, B.A.; Spurdle, A.B.; Plazzer, J.-P.; Greenblatt, M.S.; Akagi, K.; Al-Mulla, F.; Bapat, B.; Bernstein, I.; Capellá, G.; den Dunnen, J.T.; et al. Application of a 5-tiered scheme for standardized classification of 2,360 unique mismatch repair gene variants in the InSiGHT locus-specific database. Nat. Genet. 2014, 46, 107–115. [Google Scholar] [CrossRef] [PubMed]
LOVD: Easy Creation of a Locus-Specific Sequence Variation Database Using an “LSDB-in-a-Box” Approach-Fokkema-2005-Human Mutation. Available online: https://onlinelibrary.wiley.com/doi/10.1002/humu.20201 (accessed on 24 April 2024).
Amberger, J.S.; Hamosh, A. Searching Online Mendelian Inheritance in Man (OMIM): A Knowledgebase of Human Genes and Genetic Phenotypes. Curr. Protoc. Bioinform. 2017, 58, 1.2.1–1.2.12. [Google Scholar] [CrossRef]
Thorn, C.F.; Ellison, D.H.; Turner, S.T.; Altman, R.B.; Klein, T.E. PharmGKB summary: Diuretics pathway, pharmacodynamics. Pharmacogenet. Genom. 2013, 23, 449–453. [Google Scholar] [CrossRef]
Cariaso, M.; Lennon, G. SNPedia: A wiki supporting personal genome annotation, interpretation and analysis. Nucleic Acids Res. 2012, 40, D1308–D1312. [Google Scholar] [CrossRef]
Béroud, C.; Collod-Béroud, G.; Boileau, C.; Soussi, T.; Junien, C. UMD (Universal mutation database): A generic software to build and analyze locus-specific databases. Hum. Mutat. 2000, 15, 86–94. [Google Scholar] [CrossRef]
Laskowski, R.A.; Stephenson, J.D.; Sillitoe, I.; Orengo, C.A.; Thornton, J.M. VarSite: Disease variants and protein structure. Protein Sci. Publ. Protein Soc. 2020, 29, 111–119. [Google Scholar] [CrossRef] [PubMed]
Hu, Z.; Yu, C.; Furutsuki, M.; Andreoletti, G.; Ly, M.; Hoskins, R.; Adhikari, A.N.; Brenner, S.E. VIPdb, a genetic Variant Impact Predictor Database. Hum. Mutat. 2019, 40, 1202–1214. [Google Scholar] [CrossRef]
Lehrer, S.; Rheinstein, P.H. Increased Risk of Acute Myelogenous Leukemia After Early Onset but Not Late-Onset Colorectal Cancer. Am. J. Clin. Oncol. 2020, 43, 263–269. [Google Scholar] [CrossRef]
Papadopulos, M.E.; Plazzer, J.P.; Macrae, F.A. Genotype-phenotype correlation of BMPR1a disease causing variants in juvenile polyposis syndrome. Hered. Cancer Clin. Pract. 2023, 21, 12. [Google Scholar] [CrossRef] [PubMed]
Quintana, I.; Mur, P.; Terradas, M.; García-Mulero, S.; Aiza, G.; Navarro, M.; Piñol, V.; Brunet, J.; Moreno, V.; Sanz-Pamplona, R.; et al. Potential Involvement of NSD1, KRT24 and ACACA in the Genetic Predisposition to Colorectal Cancer. Cancers 2022, 14, 699. [Google Scholar] [CrossRef] [PubMed]
Kašubová, I.; Holubeková, V.; Janíková, K.; Váňová, B.; Sňahničanová, Z.; Kalman, M.; Plank, L.; Lasabová, Z. Next Generation Sequencing in Molecular Diagnosis of Lynch Syndrome—A Pilot Study Using New Stratification Criteria. Acta Medica 2018, 61, 98–102. [Google Scholar] [CrossRef] [PubMed]
Stone, E.A.; Sidow, A. Physicochemical constraint violation by missense substitutions mediates impairment of protein function and disease severity. Genome Res. 2005, 15, 978–986. [Google Scholar] [CrossRef] [PubMed]
Ioannidis, N.M.; Rothstein, J.H.; Pejaver, V.; Middha, S.; McDonnell, S.K.; Baheti, S.; Musolf, A.; Li, Q.; Holzinger, E.; Karyadi, D.; et al. REVEL: An Ensemble Method for Predicting the Pathogenicity of Rare Missense Variants. Am. J. Hum. Genet. 2016, 99, 877–885. [Google Scholar] [CrossRef] [PubMed]
Karabachev, A.D.; Martini, D.J.; Hermel, D.J.; Solcz, D.; Richardson, M.E.; Pesaran, T.; Sarkar, I.N.; Greenblatt, M.S. Curated multiple sequence alignment for the Adenomatous Polyposis Coli (APC) gene and accuracy of in silico pathogenicity predictions. PLoS ONE 2020, 15, e0233673. [Google Scholar] [CrossRef] [PubMed]
Scheraga, H.A.; Khalili, M.; Liwo, A. Protein-folding dynamics: Overview of molecular simulation techniques. Annu. Rev. Phys. Chem. 2007, 58, 57–83. [Google Scholar] [CrossRef] [PubMed]
Zhao, F.; Zheng, L.; Goncearenco, A.; Panchenko, A.R.; Li, M. Computational Approaches to Prioritize Cancer Driver Missense Mutations. Int. J. Mol. Sci. 2018, 19, 2113. [Google Scholar] [CrossRef] [PubMed]
Phillips, J.C.; Braun, R.; Wang, W.; Gumbart, J.; Tajkhorshid, E.; Villa, E.; Chipot, C.; Skeel, R.D.; Kalé, L.; Schulten, K. Scalable molecular dynamics with NAMD. J. Comput. Chem. 2005, 26, 1781–1802. [Google Scholar] [CrossRef]
Li, M.-H.; Luo, Q.; Xue, X.-G.; Li, Z.-S. Molecular dynamics studies of the 3D structure and planar ligand binding of a quadruplex dimer. J. Mol. Model. 2011, 17, 515–526. [Google Scholar] [CrossRef]
Woodcock, H.L.; Miller, B.T.; Hodoscek, M.; Okur, A.; Larkin, J.D.; Ponder, J.W.; Brooks, B.R. MSCALE: A General Utility for Multiscale Modeling. J. Chem. Theory Comput. 2011, 7, 1208–1219. [Google Scholar] [CrossRef]
Miller, B.T.; Singh, R.P.; Klauda, J.B.; Hodoscek, M.; Brooks, B.R.; Woodcock, H.L. CHARMMing: A new, flexible web portal for CHARMM. J. Chem. Inf. Model. 2008, 48, 1920–1929. [Google Scholar] [CrossRef] [PubMed]
Van Der Spoel, D.; Lindahl, E.; Hess, B.; Groenhof, G.; Mark, A.E.; Berendsen, H.J.C. GROMACS: Fast, flexible, and free. J. Comput. Chem. 2005, 26, 1701–1718. [Google Scholar] [CrossRef] [PubMed]
Case, D.A.; Cheatham, T.E., III; Darden, T.; Gohlke, H.; Luo, R.; Merz, K.M., Jr.; Onufriev, A.; Simmerling, C.; Wang, B.; Woods, R.J. The Amber biomolecular simulation programs. J. Comput. Chem. 2005, 26, 1668–1688. [Google Scholar] [CrossRef] [PubMed]
Smith, I.N.; Thacker, S.; Jaini, R.; Eng, C. Dynamics and Structural Stability Effects of Germline PTEN Mutations Associated with Cancer versus Autism Phenotypes. J. Biomol. Struct. Dyn. 2019, 37, 1766–1782. [Google Scholar] [CrossRef]
Tam, B.; Qin, Z.; Zhao, B.; Sinha, S.; Lei, C.L.; Wang, S.M. Classification of MLH1 Missense VUS Using Protein Structure-Based Deep Learning-Ramachandran Plot-Molecular Dynamics Simulations Method. Int. J. Mol. Sci. 2024, 25, 850. [Google Scholar] [CrossRef]
Tam, B.; Sinha, S.; Wang, S.M. Combining Ramachandran plot and molecular dynamics simulation for structural-based variant classification: Using TP53 variants as model. Comput. Struct. Biotechnol. J. 2020, 18, 4033–4039. [Google Scholar] [CrossRef]
Berman, H.M.; Westbrook, J.; Feng, Z.; Gilliland, G.; Bhat, T.N.; Weissig, H.; Shindyalov, I.N.; Bourne, P.E. The Protein Data Bank. Nucleic Acids Res. 2000, 28, 235–242. [Google Scholar] [CrossRef] [PubMed]
Masso, M.; Vaisman, I.I. AUTO-MUTE: Web-based tools for predicting stability changes in proteins due to single amino acid replacements. Protein Eng. Des. Sel. PEDS 2010, 23, 683–687. [Google Scholar] [CrossRef]
Jubb, H.C.; Saini, H.K.; Verdonk, M.L.; Forbes, S.A. COSMIC-3D provides structural perspectives on cancer genetics for drug discovery. Nat. Genet. 2018, 50, 1200–1202. [Google Scholar] [CrossRef]
Parthiban, V.; Gromiha, M.M.; Schomburg, D. CUPSAT: Prediction of protein stability upon point mutations. Nucleic Acids Res. 2006, 34, W239–W242. [Google Scholar] [CrossRef] [PubMed]
Rodrigues, C.H.; Pires, D.E.; Ascher, D.B. DynaMut: Predicting the impact of mutations on protein conformation, flexibility and stability. Nucleic Acids Res. 2018, 46, W350–W355. [Google Scholar] [CrossRef] [PubMed]
Pires, D.E.V.; Ascher, D.B.; Blundell, T.L. DUET: A server for predicting effects of mutations on protein stability using an integrated computational approach. Nucleic Acids Res. 2014, 42, W314–W319. [Google Scholar] [CrossRef] [PubMed]
Guerois, R.; Nielsen, J.E.; Serrano, L. Predicting changes in the stability of proteins and protein complexes: A study of more than 1000 mutations. J. Mol. Biol. 2002, 320, 369–387. [Google Scholar] [CrossRef] [PubMed]
Capriotti, E.; Fariselli, P.; Rossi, I.; Casadio, R. A three-state prediction of single point mutations on protein stability changes. BMC Bioinform. 2008, 9 (Suppl. 2), S6. [Google Scholar] [CrossRef] [PubMed]
Chen, C.-W.; Lin, J.; Chu, Y.-W. iStable: Off-the-shelf predictor integration for predicting protein stability changes. BMC Bioinform. 2013, 14 (Suppl. 2), S5. [Google Scholar] [CrossRef] [PubMed]
Laimer, J.; Hofer, H.; Fritz, M.; Wegenkittl, S.; Lackner, P. MAESTRO--multi agent stability prediction upon point mutations. BMC Bioinform. 2015, 16, 116. [Google Scholar] [CrossRef] [PubMed]
Pires, D.E.V.; Ascher, D.B.; Blundell, T.L. mCSM: Predicting the effects of mutations in proteins using graph-based signatures. Bioinformatics 2014, 30, 335–342. [Google Scholar] [CrossRef] [PubMed]
Ittisoponpisan, S.; Islam, S.A.; Khanna, T.; Alhuzimi, E.; David, A.; Sternberg, M.J.E. Can Predicted Protein 3D Structures Provide Reliable Insights into whether Missense Variants Are Disease Associated? J. Mol. Biol. 2019, 431, 2197–2212. [Google Scholar] [CrossRef]
Cheng, J.; Randall, A.; Baldi, P. Prediction of protein stability changes for single-site mutations using support vector machines. Proteins 2006, 62, 1125–1132. [Google Scholar] [CrossRef]
Wagih, O.; Galardini, M.; Busby, B.P.; Memon, D.; Typas, A.; Beltrao, P. A resource of variant effect predictions of single nucleotide variants in model organisms. Mol. Syst. Biol. 2018, 14, e8430. [Google Scholar] [CrossRef] [PubMed]
Giollo, M.; Martin, A.J.M.; Walsh, I.; Ferrari, C.; Tosatto, S.C.E. NeEMO: A method using residue interaction networks to improve prediction of protein stability upon mutation. BMC Genom. 2014, 15 (Suppl. 4), S7. [Google Scholar] [CrossRef] [PubMed]
Kelley, L.A.; Mezulis, S.; Yates, C.M.; Wass, M.N.; Sternberg, M.J.E. The Phyre2 web portal for protein modeling, prediction and analysis. Nat. Protoc. 2015, 10, 845–858. [Google Scholar] [CrossRef] [PubMed]
PhyreRisk: A Dynamic Web Application to Bridge Genomics, Proteomics and 3D Structural Data to Guide Interpretation of Human Genetic Variants-ScienceDirect. Available online: https://www.sciencedirect.com/science/article/pii/S0022283619302517?via%3Dihub (accessed on 25 April 2024).
López-Ferrando, V.; Gazzo, A.; de la Cruz, X.; Orozco, M.; Gelpí, J.L. PMut: A web-based tool for the annotation of pathological variants on proteins, 2017 update. Nucleic Acids Res. 2017, 45, W222–W228. [Google Scholar] [CrossRef] [PubMed]
Wainreb, G.; Wolf, L.; Ashkenazy, H.; Dehouck, Y.; Ben-Tal, N. Protein stability: A single recorded mutation aids in predicting the effects of other mutations in the same amino acid site. Bioinformatics 2011, 27, 3286–3292. [Google Scholar] [CrossRef] [PubMed]
Getov, I.; Petukh, M.; Alexov, E. SAAFEC: Predicting the Effect of Single Point Mutations on Protein Folding Free Energy Using a Knowledge-Modified MM/PBSA Approach. Int. J. Mol. Sci. 2016, 17, 512. [Google Scholar] [CrossRef] [PubMed]
Magyar, C.; Gromiha, M.M.; Pujadas, G.; Tusnády, G.E.; Simon, I. SRide: A server for identifying stabilizing residues in proteins. Nucleic Acids Res. 2005, 33, W303–W305. [Google Scholar] [CrossRef]
Quan, L.; Lv, Q.; Zhang, Y. STRUM: Structure-based prediction of protein stability changes upon single-point mutation. Bioinformatics 2016, 32, 2936–2946. [Google Scholar] [CrossRef] [PubMed]
Kabbage, M.; Ben Aissa-Haj, J.; Othman, H.; Jaballah-Gabteni, A.; Laarayedh, S.; Elouej, S.; Medhioub, M.; Kettiti, H.T.; Khsiba, A.; Mahmoudi, M.; et al. A Rare MSH2 Variant as a Candidate Marker for Lynch Syndrome II Screening in Tunisia: A Case of Diffuse Gastric Carcinoma. Genes 2022, 13, 1355. [Google Scholar] [CrossRef]
Singh, S.; Sharma, S.; Baranwal, M. Identification of SNPs in hMSH3/MSH6 interaction domain affecting the structure and function of MSH2 protein. Biotechnol. Appl. Biochem. 2022, 69, 2454–2465. [Google Scholar] [CrossRef]
Keskin Karakoyun, H.; Yüksel, Ş.K.; Amanoglu, I.; Naserikhojasteh, L.; Yeşilyurt, A.; Yakıcıer, C.; Timuçin, E.; Akyerli, C.B. Evaluation of AlphaFold structure-based protein stability prediction on missense variations in cancer. Front. Genet. 2023, 14, 1052383. [Google Scholar] [CrossRef]
Livesey, B.J.; Marsh, J.A. Interpreting protein variant effects with computational predictors and deep mutational scanning. Dis. Model. Mech. 2022, 15, dmm049510. [Google Scholar] [CrossRef] [PubMed]
Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef]
Baek, M.; DiMaio, F.; Anishchenko, I.; Dauparas, J.; Ovchinnikov, S.; Lee, G.R.; Wang, J.; Cong, Q.; Kinch, L.N.; Schaeffer, R.D.; et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science 2021, 373, 871–876. [Google Scholar] [CrossRef]
Krishna, R.; Wang, J.; Ahern, W.; Sturmfels, P.; Venkatesh, P.; Kalvet, I.; Lee, G.R.; Morey-Burrows, F.S.; Anishchenko, I.; Humphreys, I.R.; et al. Generalized biomolecular modeling and design with RoseTTAFold All-Atom. Science 2024, 384, eadl2528. [Google Scholar] [CrossRef]
Fernández Montes, A.; Alonso, V.; Aranda, E.; Élez, E.; García Alfonso, P.; Grávalos, C.; Maurel, J.; Vera, R.; Vidal, R.; Aparicio, J. SEOM-GEMCAD-TTD clinical guidelines for the systemic treatment of metastatic colorectal cancer (2022). Clin. Transl. Oncol. Off. Publ. Fed. Span. Oncol. Soc. Natl. Cancer Inst. Mex. 2023, 25, 2718–2731. [Google Scholar] [CrossRef] [PubMed]
Lauricella, S.; Rausa, E.; Pellegrini, I.; Ricci, M.T.; Signoroni, S.; Palassini, E.; Cavalcoli, F.; Pasanisi, P.; Colombo, C.; Vitellaro, M. Current management of familial adenomatous polyposis. Expert Rev. Anticancer Ther. 2024, 24, 363–377. [Google Scholar] [CrossRef] [PubMed]
Balmaña, J.; Digiovanni, L.; Gaddam, P.; Walsh, M.F.; Joseph, V.; Stadler, Z.K.; Nathanson, K.L.; Garber, J.E.; Couch, F.J.; Offit, K.; et al. Conflicting Interpretation of Genetic Variants and Cancer Risk by Commercial Laboratories as Assessed by the Prospective Registry of Multiplex Testing. J. Clin. Oncol. Off. J. Am. Soc. Clin. Oncol. 2016, 34, 4071–4078. [Google Scholar] [CrossRef] [PubMed]
Richards, S.; Aziz, N.; Bale, S.; Bick, D.; Das, S.; Gastier-Foster, J.; Grody, W.W.; Hegde, M.; Lyon, E.; Spector, E.; et al. Standards and guidelines for the interpretation of sequence variants: A joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 2015, 17, 405–423. [Google Scholar] [CrossRef]
Grimm, D.G.; Azencott, C.-A.; Aicheler, F.; Gieraths, U.; MacArthur, D.G.; Samocha, K.E.; Cooper, D.N.; Stenson, P.D.; Daly, M.J.; Smoller, J.W.; et al. The evaluation of tools used to predict the impact of missense variants is hindered by two types of circularity. Hum. Mutat. 2015, 36, 513–523. [Google Scholar] [CrossRef]

Figure 1. Schematic workflow to assess the clinical significance of a VUS. The process involves an initial in silico prediction analysis of the structural and functional effects of the variant and a final experimental validation to achieve a more personalized diagnosis and follow-up program.

Table 1. Number of driver genetic variants identified in hereditary CRC syndromes as reported in the ClinVar Miner database (https://clinvarminer.genetics.utah.edu/, accessed on 7 April 2024 ClinVar version 2024-03-31, [28]).

Hereditary CRC Syndrome	Driver Genes	Pathogenic	Likely Pathogenic	Uncertain Significance	Likely Benign	Benign	Total
FAP and AFAP	APC	1796	213	6139	2249	228	10,625
MAP	MUTYH	9	1	14	27	0	51
PPAP	POLD	4	2	2378	1732	98	4214
PPAP	POLE	16	12	379	231	60	698
NTHL1 tumor syndrome	NTHL1	58	38	102	3	0	201
PJS	STK11	193	55	837	750	77	1912
JPS	BMPR1A	164	41	813	552	30	1600
JPS	SMAD4	139	37	586	569	17	1348
PHTS	PTEN	487	122	730	456	53	1848
HMPS	GREM1	0	0	4	0	0	4
SPS	RNF43	2	2	104	2	1	111
LS	MLH1	527	120	86	40	55	828
	MSH2	756	255	470	165	86	1732
	MSH6	203	55	156	57	47	518
	PMS2	83	32	61	13	36	225
MMR-p HNPCC	RPS20	0	0	5	0	0	5

Table 2. List of major databases for analyzing the clinical significance of genetic variants. The resources are listed in alphabetical order.

Database	Description	Link	References	Number of Tool Citations *
ActiveDriverDB	Human proteo-genomics database that annotates disease mutations and population variants using post-translational modifications	https://activedriverdb.org/ (accessed on 14 April 2024)	[85]	3
cBioPortal (Cancer Genomics Portal)	Open-access resource that is useful to interactively explore multidimensional cancer genomics data sets. It presently provides access to data from about 100,000 tumor samples collected from 218 different cancer research studies	https://www.cbioportal.org/ (accessed on 14 April 2024)	[86]	2113
ClinVar (Clinical Variants)	Portal of human variations classified for diseases	https://www.ncbi.nlm.nih.gov/clinvar/ (accessed on 14 April 2024)	[73]	1055
ClinVar Miner (Clinical Variants Miner)	Portal for viewing and filtering ClinVar data	https://clinvarminer.genetics.utah.edu/ (accessed on 14 April 2024)	[28]	4
COSMIC (Catalogue Of Somatic Mutations In Cancer)	Curated database of somatic and germline mutations	https://cancer.sanger.ac.uk/cosmic (accessed on 14 April 2024)	[87]	212
dbNSFP (Database for Nonsynonymous SNPs’ Functional Predictions)	Database of functional predictions and annotations for human nonsynonymous SNPs	http://database.liulab.science/dbNSFP#database (accessed on 14 April 2024)	[88]	35
dbSNP (Single Nucleotide Polymorphism Database)	SNP catalog designed to facilitate large-scale studies and association between genetics, functional implications, population genetics, and evolutionary biology of SNPs	https://www.ncbi.nlm.nih.gov/snp/ (accessed on 14 April 2024)	[89]	1087
dbVar (Database of Genomic Variation)	Repository of structural variations in the human genome allowing to search, read, and download data from submitted studies	https://www.ncbi.nlm.nih.gov/dbvar/ (accessed on 14 April 2024)	[90]	27
DoCM (Database Of Curated Mutations)	Curated database of validated cancer driver mutations	http://www.docm.info/ (accessed on 14 April 2024)	[91]	16
GnomAD (GeNOMe Aggregation Database)	Collection of standardized exome and genome sequencing data from numerous large-scale sequencing initiatives	https://gnomad.broadinstitute.org/, accessed on 14 April 2024	[92]	866
HGMD (The Human Gene Mutation Database)	Comprehensive repository of inherited mutation data for medical research, genetic diagnosis, and NGS studies	https://www.hgmd.cf.ac.uk/ac/index.php/, accessed on 14 April 2024	[93]	225
InSiGHT (International Society for Gastrointestinal Hereditary Tumours)	Extensive database of DNA variations that have been re-sequenced in genes associated with gastrointestinal cancer	https://www.insight-group.org/variants/databases/ (accessed on 14 April 2024)	[94]	62
LoVD (Leiden Open Variation Database)	Web-based open-source database collecting DNA sequence variants associated with genetic (hereditary) diseases	https://www.lovd.nl/ (accessed on 14 April 2024)	[95]	158
OMIM (Online Mendelian Inheritance in Man)	Collection of genetic phenotypes associated with Mendelian inherited disorders	https://omim.org/ (accessed on 14 April 2024)	[96]	8182
PharmGKB (Pharmacogenomics Knowledge Base)	Comprehensive database providing researchers and clinicians with information regarding how genetic diversity affects drug response	https://www.pharmgkb.org/ (accessed on 14 April 2024)	[97]	546
SNPedia (Single Nucleotide Polymorphism encyclopedia)	Database referencing peer-reviewed scientific literature that gathers data on the impact of DNA polymorphisms with an emphasis on medical, phenotypic, and genealogical correlations of SNPs	https://www.snpedia.com/index.php/SNPedia (accessed on 14 April 2024)	[98]	19
UMD (Universal Mutation Database)	Database of driver mutations, focusing on their importance for the twelve main types of cancer	https://bio.tools/umd (accessed on 14 April 2024)	[99]	15
VarSite (Variant Site database)	Web service that maps natural variations from gnomAD and known disease-associated variants from UniProt and ClinVar onto 3D protein structures stored in the Protein Data Bank	https://www.ebi.ac.uk/thornton-srv/databases/VarSite (accessed on 14 April 2024)	[100]	4
VIPdb (Variant Impact Predictor Database)	Comprehensive resource that facilitates the exploration of suitable tools and aids in the creation of enhanced methods for accurately predicting the impact of genetic variants	https://genomeinterpretation.org/vipdb (accessed on 14 April 2024)	[101]	3

* Based on a PubMed search performed using the name or URL link of the tools as keywords (accessed July 2024).

Table 3. List of useful in silico prediction algorithms for predicting the impact of variations on protein structure and stability. The resources are listed in alphabetical order.

Resource	Description	Link	References	Number of Tool Citations *
AUTO-MUTE version 2.0	Software using ΔΔG calculations and knowledge-based potentials	http://proteins.gmu.edu/automute (accessed on 20 April 2024)	[121]	5
Cosmic-3D Release v99	Tool that analyzes cancer mutations within the framework of three-dimensional protein structures	https://cancer.sanger.ac.uk/cosmic3d/ (accessed on 20 April 2024)	[122]	4
CUPSAT	Software using ΔΔG calculations with mean force atom pair and torsion angle potentials	https://cupsat.brenda-enzymes.org/ (accessed on 20 April 2024)	[123]	34
DynaMut	Software using ΔΔG calculations to predict the effects of variants on protein flexibility	http://biosig.unimelb.edu.au/dynamut/ (accessed on 20 April 2024)	[124]	47
DUET	Software that predicts the effects of mutations on protein stability by calculating changes in ∆∆G	https://biosig.lab.uq.edu.au/duet (accessed on 20 April 2024)	[125]	11
FOLD-X Version 3.0	Software using empirical force fields to calculate ΔΔG	https://software.embl-em.de/software/6 (accessed on 20 April 2024)	[126]	39
i-Mutant 3.0	Software using support vector machines (SVMs) to calculate ΔΔG	http://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant3.0/I-Mutant3.0.cgi (accessed on 20 April 2024)	[127]	27
iStable Version 2.0	Software using SVMs to analyze protein stability and calculate ΔΔG	http://predictor.nchu.edu.tw/iStable (accessed on 20 April 2024)	[128]	37
MAESTRO Version 1.2.35	Software using ΔΔG calculations and multi-agent stability prediction	http://biwww.che.sbg.ac.at/MAESTRO (accessed on 20 April 2024)	[129]	62
mCSM	Software using graph-based signatures to calculate ΔΔG	https://biosig.lab.uq.edu.au/mcsm (accessed on 20 April 2024)	[130]	137
Missense3D (Release June 2019)	Tool that predicts structural alterations resulting from amino acid substitutions. Analysis of experimental coordinates and expected structures is also possible	http://missense3d.bc.ic.ac.uk/missense3d/ (accessed on 20 April 2024)	[131]	21
MUpro (Release 6.0, 2021)	Software using SVMs to predict variation in protein stability	http://mupro.proteomics.ics.uci.edu/ (accessed on 20 April 2024)	[132]	86
Mutfunc Version 2.0	Web resource reporting mutations that are either expected to cause instability in protein structure or that occur in functionally significant regions	www.mutfunc.com (accessed on 20 April 2024)	[133]	2
NeEMO	Software using amino acids involved in protein-to-protein interaction networks to calculate ΔΔG	https://biocomputingup.it/ (accessed on 20 April 2024)	[134]	18
Phyre 2 version 2.0	Tool that predicts protein sequence structure and function using automatic fold recognition	http://www.sbg.bio.ic.ac.uk/~phyre2/html/page.cgi?id=index (accessed on 20 April 2024)	[135]	191
PhyreRisk Version 1.0.1	Open-access program that maps human variations onto protein structure, integrating genomic, proteomic, and structural data	http://phyrerisk.bc.ic.ac.uk/ (accessed on 20 April 2024)	[136]	3
PMut Version 2017	Software designed to identify and predict pathological mutations. It labels mutations by processing several types of sequence information using neural networks	https://mmb.irbbarcelona.org/PMut (accessed on 20 April 2024)	[137]	183
ProMaya	Software using random forests regression for ΔΔG calculations	http://bental.tau.ac.il/ProMaya/ (accessed on 20 April 2024)	[138]	3
SAAFEC-SEQ Version 1.0	Software using multiple linear regression to calculate ΔΔG	http://compbio.clemson.edu/lab/ (accessed on 20 April 2024)	[139]	7
SRide	Server allowing for detection of stabilizing residues within proteins	http://sride.enzim.hu (accessed on 20 April 2024)	[140]	7
STRUM Version STRUM.tar.bz2	Software that predicts alterations caused by single-point nonsynonymous SNPs in protein folding stability by calculating changes in ∆∆G	https://zhanggroup.org/STRUM/ (accessed on 20 April 2024)	[141]	3

* Based on a PubMed search performed using the name or URL link of the tools as keywords (accessed July 2024).

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Fasano, C.; Lepore Signorile, M.; De Marco, K.; Forte, G.; Disciglio, V.; Sanese, P.; Grossi, V.; Simone, C. In Silico Deciphering of the Potential Impact of Variants of Uncertain Significance in Hereditary Colorectal Cancer Syndromes. Cells 2024, 13, 1314. https://doi.org/10.3390/cells13161314

AMA Style

Fasano C, Lepore Signorile M, De Marco K, Forte G, Disciglio V, Sanese P, Grossi V, Simone C. In Silico Deciphering of the Potential Impact of Variants of Uncertain Significance in Hereditary Colorectal Cancer Syndromes. Cells. 2024; 13(16):1314. https://doi.org/10.3390/cells13161314

Chicago/Turabian Style

Fasano, Candida, Martina Lepore Signorile, Katia De Marco, Giovanna Forte, Vittoria Disciglio, Paola Sanese, Valentina Grossi, and Cristiano Simone. 2024. "In Silico Deciphering of the Potential Impact of Variants of Uncertain Significance in Hereditary Colorectal Cancer Syndromes" Cells 13, no. 16: 1314. https://doi.org/10.3390/cells13161314

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

In Silico Deciphering of the Potential Impact of Variants of Uncertain Significance in Hereditary Colorectal Cancer Syndromes

Abstract

1. Introduction

2. Pathology of Hereditary CRC Syndromes

2.1. Hereditary CRC Polyposis Syndromes

2.2. Hereditary Nonpolyposis CRC

3. In Silico Prediction of VUS Impact on Protein Function in Hereditary CRC Syndromes

4. In Silico Prediction of VUS Impact on Protein Structure in CRC Hereditary Syndromes

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI