Next Article in Journal
The Interaction of Selective A1 and A2A Adenosine Receptor Antagonists with Magnesium and Zinc Ions in Mice: Behavioural, Biochemical and Molecular Studies
Next Article in Special Issue
Future Perspectives in Detecting EGFR and ALK Gene Alterations in Liquid Biopsies of Patients with NSCLC
Previous Article in Journal
Interspecific Variation in One-Carbon Metabolism within the Ovarian Follicle, Oocyte, and Preimplantation Embryo: Consequences for Epigenetic Programming of DNA Methylation
Previous Article in Special Issue
Identification of New Genes Involved in Germline Predisposition to Early-Onset Gastric Cancer
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Whole Exome Sequencing Identifies APCDD1 and HDAC5 Genes as Potentially Cancer Predisposing in Familial Colorectal Cancer

by
Diamanto Skopelitou
1,2,3,4,
Beiping Miao
1,2,3,
Aayushi Srivastava
1,2,3,4,
Abhishek Kumar
1,5,6,
Magdalena Kuświk
7,
Dagmara Dymerska
7,
Nagarajan Paramasivam
8,
Matthias Schlesner
9,
Jan Lubinski
7,
Kari Hemminki
1,10,11,
Asta Försti
1,2,3 and
Obul Reddy Bandapalli
1,2,3,4,*
1
Molecular Genetic Epidemiology, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
2
Hopp Children’s Cancer Center (KiTZ), 69120 Heidelberg, Germany
3
Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), 69120 Heidelberg, Germany
4
Medical Faculty, Heidelberg University, 69120 Heidelberg, Germany
5
Institute of Bioinformatics, International Technology Park, Bangalore 560066, India
6
Manipal Academy of Higher Education (MAHE), Manipal 576104, India
7
Department of Genetics and Pathology, Pomeranian Medical University, 71252 Szczecin, Poland
8
Computational Oncology, Molecular Diagnostics Program, National Center for Tumor Diseases (NCT), 69120 Heidelberg, Germany
9
Bioinformatics and Omics Data Analytics, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
10
Cancer Epidemiology, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
11
Biomedical Center, Faculty of Medicine in Pilsen, Charles University in Prague, 30605 Pilsen, Czech Republic
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2021, 22(4), 1837; https://doi.org/10.3390/ijms22041837
Submission received: 22 January 2021 / Revised: 6 February 2021 / Accepted: 7 February 2021 / Published: 12 February 2021
(This article belongs to the Special Issue Linking Genomic Changes with Cancer in the NGS Era)

Abstract

:
Germline mutations in predisposition genes account for only 20% of all familial colorectal cancers (CRC) and the remaining genetic burden may be due to rare high- to moderate-penetrance germline variants that are not explored. With the aim of identifying such potential cancer-predisposing variants, we performed whole exome sequencing on three CRC cases and three unaffected members of a Polish family and identified two novel heterozygous variants: a coding variant in APC downregulated 1 gene (APCDD1, p.R299H) and a non-coding variant in the 5′ untranslated region (UTR) of histone deacetylase 5 gene (HDAC5). Sanger sequencing confirmed the variants segregating with the disease and Taqman assays revealed 8 additional APCDD1 variants in a cohort of 1705 familial CRC patients and no further HDAC5 variants. Proliferation assays indicated an insignificant proliferative impact for the APCDD1 variant. Luciferase reporter assays using the HDAC5 variant resulted in an enhanced promoter activity. Targeting of transcription factor binding sites of SNAI-2 and TCF4 interrupted by the HDAC5 variant showed a significant impact of TCF4 on promoter activity of mutated HDAC5. Our findings contribute not only to the identification of unrecognized genetic causes of familial CRC but also underline the importance of 5’UTR variants affecting transcriptional regulation and the pathogenesis of complex disorders.

1. Introduction

Whole exome sequencing (WES) is gaining relevance for molecular genetic research of familial cancer and the identification of new cancer-predisposing variants. Among hereditary malignancies, colorectal cancer (CRC) shows one of the highest proportions of familial cases and heritable factors have been estimated to account for about 35% of CRC risk, according to twin studies [1]. Cancer-predisposing germline mutations of APC, MUTYH and mismatch repair genes are already known to be associated with familial CRC and to lead to: phenotypes of well-defined Mendelian CRC syndromes, familial adenomatous polyposis (FAP), resulting from APC gene mutations; MUTYH-associated polyposis (MAP); and Lynch syndrome, a hereditary non-polyposis colon cancer (HNPCC) syndrome caused by mismatch repair gene mutations (MLH1, MSH2, MSH6, PMS2 and EPCAM) [2]. CRC fulfilling the diagnostic criteria of Lynch syndrome but not linked to pathogenic mismatch repair gene mutations or the resulting microsatellite instability has been classified as Familial Colorectal Cancer Type X (FCCTX). Although FCCTX is considered as a heterogeneous group with unknown genetic etiology, candidate genes such as CENPE, KIF24, GALNT12, ZNF367, GABBR2 and BMP4 have been suggested as cancer predisposing in these patients [3]. Sequencing studies have further identified germline variants in HNRNPA0 and WIF1 genes in a family with susceptibility to multiple early onset cancers including CRC [4] as well as a germline mutation in NTHL1 gene in three unrelated families with adenomatous polyposis and various cancer types including CRC [5,6]. Moreover, germline as well as somatic mutations in POLE and POLD1 genes have been associated with both sporadic and familial CRC contributing to the genetic understanding of CRC inheritance [7,8]. Nevertheless, germline mutations in these or other established predisposition genes account for only 5 to 10% of all CRC [9]. Since the genetic background of most familial CRC cases has still not been sufficiently explored, the application of WES on these patients within pedigree-based studies bears great potential for the exploration of the remaining genetic burden.
As 98% of the genome is non-coding, a high proportion of variants are identified in this region [10] and are gaining relevance in the understanding of inherited cancer predisposition [11,12]. The great potential impact of variants within the 5′ untranslated region (UTR) of a gene or up to 1 kb upstream of transcription start sites can be attributed to possible changes in transcriptional regulatory elements, such as binding motifs in promoters, enhancers or super-enhancers. On the other hand, the 3′UTR of a gene and the flanking region downstream of transcription end sites can carry essential miRNA binding sites where small RNAs containing the respective complementary sequence can post-transcriptionally attach; hence, mRNA translation can be suppressed leading to the inhibition of gene expression [13]. By this and many other means, non-coding regions can play an important role in transcriptional and post-transcriptional regulation of gene expression, which is why genetic variation of non-coding DNA has to be considered in the analysis and prioritization of potentially cancer-causing variants.
We have developed the Familial Cancer Variant Prioritization Pipeline (FCVPPv2) for evaluation of both coding and non-coding variants and implemented it in the prioritization of novel missense variants in the tumor suppressor genes DICER1 in Hodgkin lymphoma and CPXM1 in papillary thyroid cancer and in the pathways enriched in these entities [14,15,16,17]. In the present study, tools such as the Combined Annotation Dependent Depletion v1.4 (CADD) tool [18], SNPnexus [19] and the Bedtools intersect function were applied as part of the non-coding analysis of our pipeline in order to identify important regulatory elements. Using FCVPPv2 and literature mining, we were able to prioritize two novel heterozygous variants in a family affected by CRC, a coding variant in the APC downregulated 1 (APCDD1) gene and a non-coding variant in the histone deacetylase 5 (HDAC5) gene. Whereas the APCDD1 variant was identified in 8 additional cases among 1705 CRC families, cell proliferation assays indicated an insignificant proliferative impact for the variant. We did not find any other familial CRC cases with the HDAC5 variant, but functional experiments showed a significant impact of the 5′UTR variant on expression of HDAC5, involved in cellular processes such as proliferation, differentiation, apoptosis and cell cycle progression. Luciferase reporter assays resulted in enhanced promoter activity of the HDAC5 gene carrying this variant compared to the wild-type sequence and targeting of transcription factor binding sites interrupted by this variant showed an impact of TCF4 on promoter activity of mutated HDAC5.

2. Results

2.1. FCVPPv2 Analysis of Coding Variants Prioritized a Missense Variant in APCDD1 Gene

Application of WES on the studied CRC-affected family identified 13,733 variants with MAF (minor allele frequency) ≤ 0.1%. Filtering according to the probability for each family member of being a Mendelian case (Figure 1, Supplementary Table S1) narrowed down the number of identified variants to 783. For analysis of coding variants (n = 101), synonymous variants (n = 35) were filtered out as they are generally considered to play a minor role in the development of diseases and cancer. The remaining 66 nonsynonymous variants, frameshift deletions/insertions or variants of unknown significance were further evaluated. Application of the PHRED-like CADD score cut-off of ≥10 reduced the number of variants to 51 and screening according to the three conservational scores GERP, PhastCons and PhyloP further narrowed down the number to 38 variants. Eighteen variants were annotated and at least 3 out of 4 intolerance scores were favorable and were further considered for deleteriousness screening. Since 12 of the variants did not fulfill the criterion of being annotated as deleterious by at least 60% of all deleteriousness scores, they were excluded. Lastly, 7 nonsynonymous variants passed all the criteria considering a MAF of 0.1% in the non-Finnish European population of gnomAD database: APCDD1 (p.R299H), FLNC (p.G553S), KCNH6 (p.L403V), LSR (p.A139D), MTX1 (p.Y228C), SDS (p.T185I), ZW10 (p.A732P) (Table 1).
SNAP2 analysis indicated a functional effect of the amino acid substitutions induced by the variants in the APCDD1 (p.R299H), ZW10 (p.A732P) and LSR (p.A139D) genes by predicting scores of >50, whereas application of CGI did not identify any cancer drivers among the variants. The lipolysis-stimulated lipoprotein receptor encoded by the LSR gene is generally known to play a role in metabolism by inducing the uptake of triglyceride-rich lipoproteins like chylomicrons, LDL and VLDL from blood into cells. Since further literature mining did not reveal any association with colorectal carcinogenesis, the identified variant in the LSR gene was considered to be of minor impact on the development of CRC in the studied family. The ZW10 gene encodes a protein of the mitotic checkpoint controlling chromosome segregation during cell division. On the background of causing chromosomal instability when mutated in a model system, Wang et al. have identified two somatic missense variants in ZW10 gene (p.N123T, p.S623G) in a panel of CRCs [21]. As the prioritized ZW10 variant identified in the studied family (p.A732P) is not located in the adjacent regions and is moreover located close to the end of the protein (779aa), its functional and potentially cancer-predisposing impact was considered as improbable.
The APCDD1 variant is located in the second of two functional APCDDC domains (51–283, 284–490), according to Interpro, Pfam and SMART [22,23,24], required for interaction with Wnt ligands and their receptors. Since the APCDD1 gene has been linked to CRC by being involved in the Wnt signaling pathway as a direct target of the beta-Catenin/TCF4 complex, the associated variant (p.R299H) was prioritized as the top cancer-predisposing candidate of all identified missense variants (Figure 2) [25]. Pedigree segregation of the APCDD1 variant was checked by IGV (Integrative Genomics Viewer) and further confirmed by targeted Sanger sequencing showing the heterozygous variant (p.R299H) for the two CRC cases (III7, III8) and the individual with polyps (III10) and the wild-type sequence for II7 (CRC at the age of 83 years) and the two controls of the family (III3 and IV3), respectively (Supplementary Figure S1a).

2.2. FCVPPv2 Analysis of Non-Coding Variants Prioritized a 5′UTR Variant in HDAC5 Gene

In agreement with the high proportion of non-coding DNA in the human genome, 674 of the 783 pedigree-filtered variants (86%) were located in the non-coding sequence such as intronic and intergenic regions, 1 kb downstream and upstream regions, the 5′UTR, 3′UTR and non-coding RNAs (ncRNAs), respectively (Figure 2). Screening of the 174 upstream/5′UTR as well as downstream/3′UTR variants, excluding intronic, intergenic and ncRNAs variants, for an updated PHRED-like CADD score ≥ 10 resulted in 38 variants. Filtering with conservational scores narrowed down this number to 8 5′UTR and 18 3′UTR variants. Application of non-coding scores derived from SNPnexus revealed that all 8 5′UTR variants reached at least 50% of the cut-off values, whereas 2 out of 18 3′UTR variants were excluded due to insufficient non-coding scores. The remaining 24 non-coding candidates were further evaluated for the presence of specific regulatory elements such as TFBS (transcription factor binding sites) and CpG islands for 5′UTR and miRNA binding sites for 3′UTR variants. Since all 5′UTR candidates showed either a CpG island or a TFBS identified by SNPnexus, CADD v1.4 or Bedtools intersect function, our analysis resulted in 8 top 5′UTR variants, summarized in Table 2.
Out of the remaining 16 3′UTR variants only 8 were annotated with a predicted miRNA target site by the Bedtools intersect function and with a mirSVR score ≤ −0.1, shortlisted as the top 3′UTR variants in Table 3.
Checking the shortlisted variants for their involvement in molecular mechanisms of colorectal carcinogenesis such as Wnt and Notch signaling pathways, which are generally known to play a crucial role in CRC initiation [29], revealed that the HDAC5 gene was implicated in colorectal carcinogenesis by upregulating the Delta-like 4 ligand (DLL4), a vascular specific Notch ligand essential for tumor angiogenesis [30,31]. The potentially pathogenic role of HDAC5 in CRC was clinically confirmed by a further study showing the upregulation of HDAC5 protein in patients with early colon field carcinogenesis [32]. Therefore, the short-listed 5′UTR variant of the HDAC5 gene (17_42200942_T_G) was considered as a promising cancer-predisposing candidate. As the variant was annotated to be located at an active transcription start site according to ChromHmm (cHmmTssA, Score = 0.984) and Segway (TSS) (Table 2), an activating impact of the 5′UTR variant on HDAC5 gene expression was hypothesized. The location of the variant in a CpG island and multiple TFBSs as well as the high PHRED-like CADD score of 21.9 supported the potential functional role of the identified HDAC5 variant in cancer predisposition, leading to its final prioritization (Figure 2). Pedigree segregation of the prioritized variant was checked by IGV and further confirmed by targeted Sanger sequencing showing the wild-type sequence for family members III3, III10 and IV3 and the heterozygous variant (T → G) for II7, III7 and III8, respectively (Supplementary Figure S1b).

2.3. Allele Frequency in a Large Familial CRC Cohort

Custom-made Taqman assays for screening of the APCDD1 and HDAC5 variants in 1705 familial CRC cases and 1674 healthy elderly individuals from Poland confirmed the variants in the family. Screening of APCDD1 resulted in identifying the variant in an additional 8 familial CRC cases and 2 healthy individuals (odds ratio (OR) = 4.44, 95% confidence interval (CI) = [0.96; 20.56], p = 0.06). Additionally, one individual, who originally was in the healthy control group but developed CRC at the age of 55 years, was heterozygous for the variant. That increased the OR to 4.93 (95%CI = [1.08; 22.53], p = 0.04). The existence of the heterozygous APCDD1 variant was confirmed by Sanger sequencing in all positive samples. All CRC patients were diagnosed at ages between 30 and 64 years and had at least one family member diagnosed with CRC, in some cases also with other cancers such as breast, cervical, female genital tract, kidney and lung cancer and leukemia. The sampling ages of the two healthy individuals were 55 years and 80 years and they had no family history of any cancer. No other familial CRC cases showing the HDAC5 variant were identified.

2.4. APCDD1 Variant Did Not Show a Significant Effect on Proliferation of HEK293T and HT-29 Cells

In order to investigate the functional impact of the identified APCDD1 variant, CCK-8 proliferation assays were conducted for pAPCDD1WT and pAPCDD1MUT using HEK293T and HT-29 cell lines. We did not find any significant difference in viable cell numbers between pAPCDD1WT and pAPCDD1MUT transfected cells at any measured time point (p = 0.05, Supplementary Figure S2). These results indicate an improbable proliferative impact of the variant in HEK293T cells as well as in colon cancer cells HT-29, excluding the APCDD1 variant as a sole potentially cancer-predisposing candidate in the studied family.

2.5. 5′ UTR Variant of HDAC5 Gene Enhances Promoter Activity

To test our hypothesis that the identified 5′UTR variant contributes to increased HDAC5 expression at transcriptional level, we performed luciferase reporter assays in HEK293T cells with both pHDAC5WT and pHDAC5MUT. The results of the reporter assays revealed a consistently higher luciferase activity (R = LF/LR) of cells transfected with pHDAC5MUT compared to pHDAC5WT after normalization to pGL4.10 vector (RE = 1), respectively (Figure 3). Despite the overall increasing tendency, only the first time point after 24 h post transfection resulted in a significant fold change in activity (ΔFAMUT/WT = 122.64%, t-test p < 0.01), whereas the later time points showed no significant difference between pHDAC5WT and pHDAC5MUT using t-tests at a significance level of α = 0.05. Since both plasmids, pHDAC5WT and pHDAC5MUT, only differ in the variant of interest (HDAC5: 17_42200942_T_G), the detected increase of luminescent signal can be traced back to the variant itself causing the enhanced promoter activity.

2.6. 5′ UTR Variant of HDAC5 Disrupts SNAI-2 and TCF4 Transcription Factor Binding Sites

Analysis of TFBS predicted by Jaspar2020 with the default relative profile score threshold of 80% resulted in the identification of 22 newly created TFBS (only present in HDACMUT) and 43 TFBS destroyed by the variant (only present in HDAC5WT). The overall 65 identified TFBS differing between HDAC5WT and HDACMUT were found to be targeted by 51 different transcription factors. Further restriction of the relative score to 85%, which is referring to the likelihood sequence for the motif, narrowed down the number of identified transcription factors to 17, targeting 21 different TFBS (Table 4). Literature mining showed an association of two specific transcription factors with colorectal carcinogenesis: SNAI-2 and TCF4 have been shown to be involved in Wnt as well as in Notch pathway and were thus considered as promising candidates for upregulation of HDAC5 promoter activity in further luciferase reporter assays [33,34]. According to Jaspar 2020, TCF4 was annotated to bind at 25 TFBS within the cloned HDAC5 sequence of which 2 were disrupted in HDACMUT, whereas SNAI-2 was reported to bind at 13 TFBS with 1 binding site disrupted in HDACMUT.

2.7. Co-Transfection of HDAC5 and TCF4 Increases Promoter Activity Due to 5′UTR Variant of HDAC5 Gene

To investigate the effect of potential regulatory transcription factors (SNAI-2 and TCF4) on the promoter activity of HDAC5 gene, HEK293T cells that do not express these transcription factors endogenously were co-transfected with the respective expression vectors of pSNAI-2 or pTCF4 followed by luciferase reporter assays. The results showed an enhanced promoter activity for almost all measured time points after expression of the transcription factors compared to the cells only transfected with respective pHDAC5WT/pHDAC5MUT vectors, respectively (Figure 3).
Co-transfection of pSNAI-2 led to a significant increase in luciferase activity of both pHDAC5WT and pHDAC5MUT cells compared to respective cells not expressing pSNAI-2, showing similar fold changes at 24 h and 36 h (t-test p < 0.05). To see if the described effect is partly mediated by the identified variant, we next compared the luciferase activity between pHDAC5WT and pHDAC5MUT cells co-transfected with pSNAI-2. This resulted in a significant fold change in promoter activity after 24 h and 36 h of transfection (ΔFAMUT,S/WT,S = 120.44% (24 h), 120.55% (36 h), t-test p < 0.05). Nevertheless, the fold change between pHDAC5WT and pHDAC5MUT cells not expressing pSNAI-2 was observed to be similar (ΔFAMUT/WT = 122.64% (24 h), t-test p < 0.01), which was also mirrored by the overlapping 95% confidence intervals of differences between pHDAC5WT and pHDAC5MUT cells respectively with or without pSNAI-2 expression (95% CIMUT,S-WT,S = [10.90; 54.57]; 95% CIMUT-WT = [12.37; 47.96]). In summary, SNAI-2 transcription factor increased HDAC5 promoter activity independent of the variant of interest after 24 h and 36 h of transfection.
Co-transfection of pTCF4 expression vector also led to enhanced luciferase activity in both pHDAC5WT and pHDAC5MUT cells compared to respective cells not expressing pTCF4, whereas a higher effect on the cells carrying the mutated sequence was observed. This stronger enhancement of pHDAC5MUT promoter activity by pTCF4 expression is well reflected in the comparison between pHDAC5WT and pHDAC5MUT cells: after co-transfection of pTCF4, pHDAC5MUT cells showed a significantly higher promoter activity compared to pHDAC5WT cells at all four measured time points, reaching its maximum at 72 h (ΔFAMUT,T/WT,T = 131.66%, 132.99%, 132.05%, 161.52%, respectively, t-test p < 0.05). In contrast to that, fold changes in promoter activity of pHDAC5MUT compared to pHDAC5WT without any co-transfection never exceeded the maximum value of 122.64% (24 h post transfection) and were thus lower than in pTCF4 co-transfected cells. Consistently, the difference between means of pHDAC5WT and pHDAC5MUT promoter activity was detected to be at least 2.5-fold higher in cells co-transfected with pTCF4 compared to cells not expressing pTCF4; here again the maximum of differences between means was reached after 72 h, showing an almost 8-fold value (ΔRMUT,T-WT,T = 119.9; ΔRMUT-WT = 15.29). Furthermore, comparison of the 95% confidence intervals of differences between pHDAC5WT and pHDAC5MUT revealed no overlapping for cells with or without pTCF4 expression 24 h post transfection (95% CI MUT,T-WT,T = [73.44; 81.18]; 95% CIMUT-WT = [12.37; 47.96]). Summing up the described results, the enhancing effect of pTCF4 on HDAC5 promoter activity can be partly traced back to a mechanism depending on the prioritized 5′UTR variant.
The dependency of TCF4-mediated promoter activity enhancement on the identified HDAC5 variant was confirmed in a second experiment of luciferase reporter assays, focusing only on the comparison of pTCF4 co-transfected cells at the same 4 time points: Fold changes in promoter activity of pHDAC5MUT compared to pHDAC5WT even reached values between 368.82% and 405.05% at all measured time points. Two-tailed t-tests resulted in extreme significance of promoter activity increase between pHDAC5WT and pHDAC5MUT cells (p < 0.0001).

3. Discussion

By applying our in-house developed FCVPPv2 on a CRC-affected family, we were able to prioritize two novel heterozygous germline variants, a coding variant in APCDD1 and a non-coding variant in the HDAC5 gene. The APCDD1 variant was identified in 8 additional familial CRC cases, 1 CRC case without family history and in 2 healthy elderly individuals without cancer family history, leading to a 4.9-fold increased CRC risk for the variant carriers (p = 0.04), while no other HDAC5 variants were identified among the 1705 familial CRC cases. Cell proliferation assays indicated an insignificant proliferative impact for the APCDD1 variant and luciferase reporter assay results showed an increased promoter activity by the 5′UTR variant of the HDAC5 gene.
APCDD1 was first identified as a direct Wnt target of the β-catenin/TCF4 transcription complex by Takahashi et al. who further reported increased APCDD1 expression in primary colon cancer tissue compared with corresponding healthy tissue [25]. In contrast, a study by De Sousa et al. showed that increased expression was restricted only to adenoma stages and not observed in carcinoma stages, and that Wnt target genes such as APCDD1 were epigenetically silenced by promoter methylation in different colon cancer cell lines. Since re-expression of APCDD1 was further associated with decreased Wnt signaling levels, the authors explained these observations in CRC by the known negative feedback regulation of Wnt signaling driven by several target genes including APCDD1 [35,36]. Showing decreased levels of β-catenin and Wnt target genes upon APCDD1 expression, Ordóñez-Morán et al. confirmed its role as a Wnt inhibitor and further proposed APCDD1 as a potential tumor suppressor in CRC [37]. Accordingly, both mentioned studies found a correlation of high APCDD1 expression with favorable prognosis for CRC patients [35,37]. Though a cell growth-promoting function was reported by Takahashi et al. for APCDD1 in vitro and in vivo using colon cancer LoVo cells [25], cell proliferation assays of this study using HEK293T and colon cancer cells HT-29 could not confirm the postulated proliferative impact of wild-type APCDD1 nor of the identified APCDD1 variant. Based on the potential tumor suppressor role of APCDD1 and the variant being relatively common among Polish CRC cases, the identified APCDD1 variant could be considered to play a possible role in colorectal carcinogenesis, though our cell proliferation experiments did not show any difference between the wild-type and mutated HT-29 cells.
On the other hand, our study showed that implementation of the identified 5′UTR variant of the HDAC5 gene increased the promoter activity. Experimental confirmation of pedigree segregation as well as the established functional role of HDAC5 in colorectal carcinogenesis supported a role for the 5′UTR variant in CRC predisposition: HDAC5 plays a crucial part in epigenetic modulation of gene expression. By removing acetyl groups from the N-acetyllysine residues of histones, HDACs are able to enhance chromatin condensation, leading to transcriptional repression of genes [38]. Thus, dysregulation of HDACs induces chromatin rearrangement possibly affecting tumor suppressor genes or oncogenes which may explain the well-established association of HDACs with carcinogenesis of various malignancies such as CRC, medulloblastoma [39] or hepatocellular carcinoma [40]. In particular, upregulation of Notch ligand DLL4 and the possibly resulting activation of Notch pathway and angiogenesis could represent an important carcinogenic mechanism induced by HDAC5 in CRC. Furthermore, HDAC inhibitors such as azaindolylsulfonamides have already been investigated in CRC xenografts and have shown promising results in tumor growth suppression [41].
We showed an enhancing impact of the identified non-coding variant on HDAC5 promoter activity with the help of luciferase reporter assays. A further upregulation of promoter activity by TCF4 expression, especially in cells carrying the mutation, could implicate TCF4 as a potential regulator of HDAC5 expression partly depending on the inserted variant. Since implementation of the HDAC5 5′UTR variant leads to loss of TCF4 binding sites, the described enhancement could be traced back to the reduction of a potentially repressive function of TCF4 on HDAC5 promoter activity at these specific sites. The enhancing effect of TCF4 as well on promoter activity of cells not carrying the mutation might be explained by further, still unexplored regulatory mechanisms as additional molecular interactions of TCF4 transcription factor with HDAC5 promoter sequence. This may be supported by the identification of 23 additional TFBS targeted by TCF4 within the cloned HDAC5 sequence. The results of this work indicate the involvement of TCF4 transcription factor in regulation of HDAC5 gene expression, resulting in HDAC5 upregulation and potentially promoting colorectal carcinogenesis in this family.
The proposed regulating impact of TCF4 on HDAC5 gene expression in the studied family with CRC aggregation is further supported by the generally established role of TCF4 in colorectal carcinogenesis. As part of the Wnt signaling pathway, TCF4 was shown to form the β-catenin/TCF4 transcription complex in the nucleus and induce gene expression of Wnt targets such as MYC [33,42,43,44], a known oncogene overexpressed in CRC [45,46,47]. As reported by Hatzis et al., genes upregulated by TCF4 are involved in cell proliferation, transcription, cell adhesion, negative regulation of programmed cell death, establishment and maintenance of chromatin [48]. Furthermore, TCF4 has been reported as a negative prognostic factor in CRC and is associated with shorter overall survival [49]. Although the molecular mechanisms leading to further HDAC5 upregulation after the assumed TCF4 binding site loss are not yet fully understood, the described carcinogenic role of TCF4 in CRC is supported by our results and supports reciprocally the postulated CRC promoting function of HDAC5 gene. A possible explanation approach for the underlying molecular mechanisms may consider the known dual regulatory role of TCF4 depending on the interaction with either transcriptional co-repressor (such as Groucho/transducin-like enhancer of split (Gro/TLE) family members and HDACs) or co-activator complexes (such as β-catenin and SMADs) [50,51,52,53,54,55,56].
The results of our functional experiments provided further evidence for the application of the FCVPPv2 to families with cancer aggregation and confirmed our pipeline’s significance in the prioritization of both the coding and non-coding variants. The integration of a variety of annotation tools by the FCVPPv2 enables the identification of functionally important coding regions and regulatory elements in the non-coding sequence of genes. Regarding the prediction of TFBSs modified by the prioritized variant, CADD v1.4 identified a relatively high number of overlapping ChIP-seq TFBSs whereas SNPnexus and the intersect function of Bedtools did not reveal any. The differing results for predicted TFBSs could be traced back to the different databases each tool is based on and different cell lines used in the studies. Thus, the synergetic application of all tools within the FCVPPv2 with subsequent integration of their respective predictions could be considered as a good approach for an all-encompassing analysis of TFBSs.

4. Materials and Methods

4.1. Patient Samples

A family with 8 confirmed CRC cases in 3 generations was identified at the Department of Genetics and Pathology, Pomeranian Medical University in Szczecin, Poland. Blood samples were collected from 3 CRC cases, 2 siblings who were diagnosed with CRC at the age of 52 and 35 years and their aunt who developed CRC at the age of 83, 1 individual with polyps diagnosed at the age of 56, 59 and 71 years and 2 unaffected family members (Figure 1). The study was approved by the Bioethics Committee of the Pomeranian Medical Academy in Szczecin (No: BN-001/174/05). All participating individuals gave informed consent.

4.2. Whole Exome Sequencing and Variant Evaluation

Genomic DNA was isolated using a modified Lahiri and Schnabel method [57] and WES was performed for 3 CRC cases (II-7, III-7, III-8), a family member with polyps (III-10) and 2 unaffected family members (III-3, IV-3) using Illumina-based small read sequencing. Mapping to reference human genome (assembly version Hs37d5) was performed using BWA [58] and duplicates were removed using Picard (http://broadinstitute.github.io/picard/, accessed on 22 January 2021).

4.3. Variant Calling, Annotation and Filtering

Single nucleotide variants (SNVs) were detected by using SAM tools [59] and indels by using Platypus [60]. Variants were annotated using ANNOVAR [61], 1000 Genomes project [62], dbSNP [63] and Exome Aggregation Consortium (ExAC) [64]. Variants with a quality score of greater than 20 and a coverage of greater than 5×, SNVs that passed the strand bias filter (a minimum one read support from both forward and reverse strand) and indels that passed all the Platypus internal filters were retained. With respect to the 1000 Genomes project Phase 3, non-TCGA ExAC data [64], NHLBI-ESP6500 and local datasets variants with minor allele frequency (MAF) less than 0.1% in the European population were selected. A pairwise comparison of shared rare variants among the family was performed to check for sample swaps and family relatedness.

4.4. Variant Filtering According to FCVPPv2

The Familial Cancer Variant Prioritization Pipeline version 2 (FCVPPv2) was applied for evaluation of identified variants as described below and summarized in Figure 2 [14].

4.4.1. Familial Segregation of the Cancer Predisposing Variants

Considering the probability of carrying the cancer-predisposing variant for each analyzed family member, they were classified as cases and controls according to the presence or absence of CRC. Generally, all affected family members should carry the variant of interest, with the following exception: Since the typical age of onset in hereditary CRC patients is considered to be lower than in the general population, such as 45 years in HNPCC families [65] compared to 63 years in sporadic CRC [66], family member II-7 developed CRC at a relatively high age of 83 years, atypical for familial inheritance. Therefore, she could be considered as a phenocopy in this family expressing the phenotypic disease, but not the underlying genotype. Whereas II-7 could possibly but not definitely carry the variant of interest, family members III-7 and III-8 are considered as certain cases and thus carriers of the variant due to their young age of onset typical for familial CRC. The unaffected family members III-3 and IV-3 should not show the cancer-predisposing variant of interest and are thus defined as controls in this family. On the other hand, family member III-10 was diagnosed with multiple colorectal polyps at the age of 56, 59 and 71 years. Since colorectal polyps could be a preliminary stage of CRC, especially when recurrent, III-10 could be considered as a possible carrier of the variant. Based on these definitions, the identified variants were filtered according to their presence in the cases, absence in the controls and presence or absence in the old-age CRC case II-7 and the polyp carrier.

4.4.2. Analysis of Coding Variants

All variants were ranked using the Combined Annotation Dependent Depletion (CADD) tool v1.3 [67]; evolutionary conservation scores: Genomic Evolutionary Rate Profiling (GERP > 2.0), PhastCons (>0.3) and PhyloP (≥3.0) [68,69,70]; intolerance scores based on allele frequency data from our in-house datasets, from ESP [71] and ExAC supplemented by the ExAC-derived Z-score [72]; and deleteriousness scoring tools accessed from dbNSFP v3.0 (database for nonsynonymous SNVs’ functional predictions) [73]. The variants should reach a PHRED-like CADD-score of ≥ 10 and fulfill at least 2 out of 3 conservational scores, 3 out of 4 intolerance scores and at least 60% of all 12 deleteriousness scores to be taken into account for further analysis. The final exonic candidates were further screened by considering the allele frequency in the non-Finnish European population in the latest version of gnomAD database (https://gnomad.broadinstitute.org/, accessed on 22 January 2021), the potential impact of amino acid substitutions with the help of Snap2 (https://rostlab.org/services/snap2web/, accessed on 22 January 2021), the prediction of cancer drivers by Cancer Genome Interpreter (https://www.cancergenomeinterpreter.org/, accessed on 22 January 2021) and the recent literature for reported gene–cancer relations and potentially cancer-related protein functions [74,75].

4.4.3. Analysis of Non-Coding Variants

Non-coding variants were analyzed with the updated version of CADD (CADD v1.4) that provides comprehensive information about the functional importance of non-coding regions by integrating a variety of scoring tools such as transcription factor binding sites (TFBS) located in the 5′UTR and 1 kb flanking region upstream of transcription start sites, mirSVR for ranking putative microRNA target sites [76], chromHmm and Segway that provide information about the biological function and active regulatory regions based on large-scale functional genomics datasets such as ChIP-seq data [26,28]. The variants were also analyzed by SNPnexus for identification of CpG islands and TFBS and for annotation of the functional impact of all non-coding variants [19]. Variants of the 5′UTR and 1 kb upstream region as well as 3′UTR and 1 kb downstream region were scanned for potential regulatory elements by means of Bedtools intersect function and respective databases: the FANTOM5 consortium and the super-enhancer archive (SEA) were used for identification of promoters, enhancers or super-enhancers [77] and Targetscan 7.0 was used for identification of microRNA target sites [78].
The literature was checked for any gene–cancer relations and potentially cancer-related protein functions of the top non-coding candidates.

4.5. Analysis of Transcription Factor Binding Sites

By uploading the wild-type sequence (HDAC5WT) and the sequence containing the variant in the 5′UTR of HDAC5 gene (HDAC5MUT) to Jaspar2020, potential TFBS were predicted and compared [79].

4.6. Variant Validation with IGV

Sequencing data of all prioritized variants were checked for correctness using the Integrative Genomics Viewer (IGV), a visualization tool for interactive exploration of large, integrated genomic datasets [80]. By this means, the identified variants were validated and the confidence in variant calls was increased.

4.7. Confirmation of Familial Segregation by Sanger Sequencing

Polymerase chain reaction (PCR) was performed to amplify the 5′UTR of HDAC5 gene (ENST00000225983.6) from DNA of the family members by using HotStarTaq DNA Polymerase (Qiagen, #203205) and following the manufacturer’s instructions. The primers were designed with Primer3 v.0.4.0 (http://bioinfo.ut.ee/primer3-0.4.0/, accessed on 22 January 2021) on 12 April 2019: HDAC5 forward 5′-gggggtctgggtctattttt-3′, reverse 5′-GAAGGGGCAAATCAGACAAC-3′. PCR was run at an annealing temperature of 62 °C with 5% dimethylsulfoxide (DMSO). The amplicons were validated by gel electrophoresis and purified with ExoSAP purification kit according to the manufacturer’s instructions. Sequencing reaction was performed with BigDye Terminator v3.1 Ready Reaction Cycle Sequencing kit (Thermo Fisher Scientific, Darmstadt, HE, Germany, #4337455). The electrophoretic profiles of HDAC5 sequences were analyzed manually.

4.8. Screening of Large Case and Control Cohorts

In order to determine the allele frequency of the HDAC5 and APCDD1 variants, 1705 familial CRC cases and 1674 healthy elderly individuals without cancer family history were checked using custom-made Taqman assays. The existence of heterozygous variants was confirmed by Sanger sequencing.

4.9. PCR-Based Cloning of Gene Reporter Constructs

Cloning primers were designed using Primer3 v.0.4.0 (http://bioinfo.ut.ee/primer3-0.4.0/, accessed on 22 January 2021) on 12 April 2019 for the 5′UTR including the variant of interest as well as the first exon and part of the following intron of HDAC5 gene (ENST00000225983.6). Adding specific restriction sites of Kpn I or Hind III and a 5′ leader sequence of 6 bp resulted in the following primer pair: forward 5′-TAAGCAGGTAC^C gcaccaaagtcagggaagtc-3′; reverse 5′-TGCTTA^AAGCTTgaaggggcaaatcagacaac-3′. PCR was performed with an annealing temperature of 59 °C with 5% DMSO to amplify the required HDAC5 insert with a total length of 1116 bp from human DNA. Digestion of the PCR amplicon and the promoter-less pGL4.10[luc2] vector purchased from Promega (#E6651) was performed using FastDigest Kpn I (Thermo Fisher Scientific, FD0524), FastDigest Hind III (Thermo Fisher Scientific, #FD0504) and FastDigest Buffer 10x (Thermo Fisher Scientific, #B64) according to the manufacturer’s instructions. The digested products were validated by gel electrophoresis and extracted with Monarch® DNA Gel Extraction Kit (New England BioLabs, Frankfurt, HE Germany, #T1020S). Ligation of the digested HDAC5 gene insert (1099 bp) and pGL4.10 vector (4210 bp) was done by using Quick Ligation™ Kit (New England BioLabs, #M2200S). The ligated product pGL4.10-HDAC5 (5309 bp), illustrated in Supplementary Figure S3, was again validated by gel electrophoresis and extracted with Monarch® DNA Gel Extraction Kit (New England BioLabs, #T1020S) for further use as the wild-type pGL4.10-HDAC5 (pHDAC5WT) construct. The mutant pGL4.10-HDAC5 (pHDAC5MUT) construct was created by site-directed mutagenesis using QuikChange II XL Site-Directed Mutagenesis Kit (Agilent, Waldbronn, BW, Germany, #200521) according to the manufacturer’s instructions and the following primer pair: forward 5′-gcggcagcaccgcctcgacggct-3′, reverse 5′-agccgtcgaggcggtgctgccgc-3′, designed based on Agilent QuikChange Primer Design (https://www.agilent.com/store/primerDesignProgram.jsp, accessed on 22 January 2021) on 28 November 2020. Both plasmids, pHDAC5WT and pHDAC5MUT, were confirmed by Sanger sequencing.

4.10. Cloning of SNAI-2, TCF4 and APCDD1

Human pENTR223-SNAI-2 (#172707094), pENTR223-TCF4 (#107260711) and pENTR223-APCDD1 clones (GPCF, #115154469) purchased from the Genomics and Proteomics Core Facility of the DKFZ Heidelberg (GPCF) were cloned into pDEST26 vector using Gateway™ LR Clonase™ II Enzyme mix (Thermo Fisher Scientific, #11791020). The identified variant was introduced into pDEST26-APCDD1 vector (pAPCDD1) using QuikChange II XL SDM Kit and the following SDM primers designed with Agilent QuikChange Primer Design: forward 5′-gggtgagccagcactgtgaggtgcg-3′, reverse 5′-cgcacctcacagtgctggctcaccc-3′. All sequences were confirmed by Sanger sequencing.

4.11. Plasmid Amplification and Extraction

Stellar chemically competent cells (Takara, Saint-Germain-en-Laye, France, #636763) were used for transformation of pHDAC5WT, pSNAI-2, pTCF4 and pAPCDD1WT, whereas pHDAC5MUT and pAPCDD1MUT were transformed into XL10-Gold Ultracompetent cells (Agilent, #200314) after site-directed mutagenesis. Plasmid extraction was performed using PureLink™ HiPure Plasmid Midiprep Kit (Thermo Fisher Scientific, #K210004).

4.12. Cell Line and Culture Conditions

Human embryonic kidney 293 (HEK293T) cells and human colon cancer cells HT-29 were a kind gift from Peter Krammer’s lab (DKFZ) and cultured in RPMI. Using Harmonizome, a database of processed datasets about genes and proteins, endogenous expression levels of proteins for HDAC5, SNAI-2 and TCF4 were checked and ruled out in HEK293T cells [81].

4.13. Cell Proliferation Assay—APCDD1

HEK293T and HT-29 cells were seeded in 24-well plates and 24 h later transfected with either 150 ng of pAPCDD1WT, pAPCDD1MUT or pDEST26 vector as negative control. Merck’s Cell Counting Kit-8 (CCK-8, Darmstadt, HE, Germany #96992) was used for quantitation of viable cell numbers in proliferation and cytotoxicity assays at four different time points: 0 h, 24 h, 48 h and 72 h post transfection. Briefly, a 100 μL cell suspension of each well was treated with 10 μL CCK-8 solution and incubated for 1 h at 37 °C. The absorbance was measured at 450 nm using a microplate reader and the number of viable cells was calculated based on a standard curve. By comparing the numbers of viable cells at different time points and the respective growth curves between pAPCDD1WT and pAPCDD1MUT, the proliferative impact of the implemented APCDD1 variant (p.R299H) could be estimated for each cell line.

4.14. Luciferase Reporter Assay—HDAC5

HEK293T cells were seeded in 48-well plates and 24 h later transfected with 100 ng of pHDAC5WT or pHDAC5MUT as a test reporter, 10 ng renilla as the control reporter and 25 μL Lipofectamine 2000 (Thermo Fisher Scientific, #11668030). Negative controls were considered by including cells transfected with promoter-less pGL4.10 vector (EMPTY, E). For investigating the impact of SNAI-2 and TCF4 transcription factors, cells were co-transfected with 20 ng of pSNAI-2 or pTCF4 expression vector and the corresponding negative controls were included: pGL4.10 vector in combination with each expression vector (pSNAI-2 or pTCF4) and pHDAC5MUT with the empty expression vector pDEST26. Luciferase assays were conducted using the dual-luciferase reporter assay system (Promega, Walldorf, BW, Germany, #E1910) at four different time points: 24 h, 36 h, 48 h and 72 h post transfection. Since renilla luminescence is measured for vector normalization, the relative ratio R of firefly luminescence LF to renilla luminescence LR (R = LF/LR) was calculated and later referred to as luciferase activity. After normalizing R values to the empty promoter-less pGL4.10 vector (RE = 1), the ratios were compared between pHDAC5WT and pHDAC5MUT for each condition. For this purpose, we calculated fold changes in promoter activity (ΔFold Activity MUT/WT = ΔFAMUT/WT = RMUT/RWT) as well as two-tailed t-tests at a significance level of α = 0.05. All experiments were conducted in triplicates and repeated at least thrice.

5. Conclusions

Application of the FCVPPv2 on a CRC-affected family identified a novel missense variant in the APCDD1 gene and a 5′UTR variant in the HDAC5 gene as potentially cancer predisposing. While the APCDD1 variant was relatively common among Polish CRC cases (AF = 0.003) and increased the risk of CRC 4.9-fold for the variant carriers, it did not seem to affect cell proliferation in vitro. On the other hand, the HDAC5 variant shows a low allele frequency in all world populations as well as an enhancing effect on HDAC5 promoter activity in luciferase reporter assays and thereby on HDAC5 gene expression. Our findings support the importance of taking into account both coding and non-coding variants in cancer predisposition, population screening and functional validation of variants.

Supplementary Materials

The following are available online at https://www.mdpi.com/1422-0067/22/4/1837/s1, Figure S1: Representative electropherograms depicting the APCDD1 and HDAC5 variants identified in the studied CRC family, Figure S2: Cell proliferation assays conducted for pAPCDD1WT and pAPCDD1MUT, Figure S3: Graphical overview of pGL4.10-HDAC5 reporter constructs, Table S1: Summary of family members analyzed in our study.

Author Contributions

Conceptualization, K.H., A.F. and O.R.B.; data curation, D.S., A.K., N.P., M.S. and O.R.B.; formal analysis, D.S., A.K., N.P. and O.R.B.; funding acquisition, K.H. and A.F.; investigation, D.S., B.M., A.S., M.K., A.F. and O.R.B.; methodology, D.S., B.M., A.S., D.D., J.L. and O.R.B.; project administration, K.H., A.F. and O.R.B.; resources, D.D. and J.L.; software, A.K., N.P., M.S. and O.R.B.; supervision, K.H., A.F. and O.R.B.; validation, D.S., B.M., A.S., M.K., D.D., J.L. and O.R.B.; visualization, D.S., B.M. and O.R.B.; writing—original draft, D.S. and O.R.B.; writing—review and editing, D.S., A.S., K.H., A.F. and O.R.B. All authors have read and agreed to the published version of the manuscript.

Funding

This article is based upon work from COST Action CA17118, supported by COST (European Cooperation in Science and Technology) and Transcan ERA-NET funding from the German Federal Ministry of Education and Research (BMBF). K.H. was supported from the EU Horizon 2020 program, grant No. 856620.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Bioethics Committee of the Pomeranian Medical Academy in Szczecin (No: BN-001/174/05, 25 October 2005).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study. Written informed consent has been obtained from the patient(s) to conduct this research and publish the results.

Data Availability Statement

Unfortunately, for reasons of ethics and patient confidentiality, we are not able to provide the sequencing data into a public database. The data underlying the results presented in the study are available from the corresponding author or from Asta Försti (Email: [email protected]).

Acknowledgments

We thank the DKFZ Genomics and Proteomics Core Facility for Illumina Sequencing Services and Omics IT and Data Management Core Facility (ODCF), DKFZ for managing the NGS data.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Lichtenstein, P.; Holm, N.V.; Verkasalo, P.K.; Iliadou, A.; Kaprio, J.; Koskenvuo, M.; Pukkala, E.; Skytthe, A.; Hemminki, K. Environmental and Heritable Factors in the Causation of Cancer—Analyses of Cohorts of Twins from Sweden, Denmark, and Finland. N. Engl. J. Med. 2000, 343, 78–85. [Google Scholar] [CrossRef] [PubMed]
  2. Jasperson, K.W.; Tuohy, T.M.; Neklason, D.W.; Burt, R.W. Hereditary and Familial Colon Cancer. Gastroenterology 2010, 138, 2044–2058. [Google Scholar] [CrossRef] [Green Version]
  3. Da Silva, F.C.; Wernhoff, P.; Dominguez-Barrera, C.; Dominguez-Valentin, M. Update on Hereditary Colorectal Cancer. Anticancer. Res. 2016, 36, 4399–4406. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Wei, C.; Peng, B.; Han, Y.; Chen, W.V.; Rother, J.; Tomlinson, G.E.; Boland, C.R.; Chaussabel, M.; Frazier, M.L.; Amos, C.I. Mutations of HNRNPA0 and WIF1 predispose members of a large family to multiple cancers. Fam. Cancer 2015, 14, 297–306. [Google Scholar] [CrossRef] [Green Version]
  5. Kuiper, R.P.; Hoogerbrugge, N. NTHL1 defines novel cancer syndrome. Oncotarget 2015, 6, 34069–34070. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Weren, R.D.; Ligtenberg, M.J.; Kets, C.M.; De Voer, R.M.; Verwiel, E.T.; Spruijt, L.; van Zelst-Stams, W.A.; Jongmans, M.C.; Gilissen, C.; Hehir-Kwa, J.Y.; et al. A germline homozygous mutation in the base-excision repair gene NTHL1 causes adenomatous polyposis and colorectal cancer. Nat. Genet. 2015, 47, 668–671. [Google Scholar] [CrossRef] [PubMed]
  7. Briggs, S.; Tomlinson, I. Germline and somatic polymerase epsilon and delta mutations define a new class of hypermutated colorectal and endometrial cancers. J. Pathol. 2013, 230, 148–153. [Google Scholar] [CrossRef] [Green Version]
  8. Palles, C.; Cazier, J.-B.; Howarth, K.M.; Domingo, E.; Jones, A.M.; Broderick, P.; Kemp, Z.; Spain, S.L.; Guarino, E.; Salguero, I.; et al. Germline mutations affecting the proofreading domains of POLE and POLD1 predispose to colorectal adenomas and carcinomas. Nat. Genet. 2013, 45, 136–144. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Lorans, M.; Dow, E.; Macrae, F.A.; Winship, I.M.; Buchanan, D.D. Update on Hereditary Colorectal Cancer: Improving the Clinical Utility of Multigene Panel Testing. Clin. Color. Cancer 2018, 17, e293–e305. [Google Scholar] [CrossRef] [Green Version]
  10. Gloss, B.S.; Dinger, M.E. Realizing the significance of noncoding functionality in clinical genomics. Exp. Mol. Med. 2018, 50, 1–8. [Google Scholar] [CrossRef] [Green Version]
  11. Mucaki, E.J.; Caminsky, N.G.; Perri, A.M.; Lu, R.; Laederach, A.; Halvorsen, M.; Knoll, J.H.M.; Rogan, P.K. A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer. BMC Med. Genom. 2016, 9, 19. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Alanazi, I.O.; Al Shehri, Z.S.; Ebrahimie, E.; Giahi, H.; Mohammadi-Dehcheshmeh, M. Non-coding and coding genomic variants distinguish prostate cancer, castration-resistant prostate cancer, familial prostate cancer, and metastatic castration-resistant prostate cancer from each other. Mol. Carcinog. 2019, 58, 862–874. [Google Scholar] [CrossRef] [PubMed]
  13. Shabalina, S.A.; Spiridonov, N.A. The mammalian transcriptome and the function of non-coding DNA sequences. Genome Biol. 2004, 5, 105. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Kumar, A.; Bandapalli, O.R.; Paramasivam, N.; Giangiobbe, S.; Diquigiovanni, C.; Bonora, E.; Eils, R.; Schlesner, M.; Hemminki, K.; Försti, A. Familial Cancer Variant Prioritization Pipeline version 2 (FCVPPv2) applied to a papillary thyroid cancer family. Sci. Rep. 2018, 8, 11635. [Google Scholar] [CrossRef] [Green Version]
  15. Bandapalli, O.R.; Paramasivam, N.; Giangiobbe, S.; Kumar, A.; Benisch, W.; Engert, A.; Witzens-Harig, M.; Schlesner, M.; Hemminki, K.; Försti, A. Whole genome sequencing reveals DICER1 as a candidate predisposing gene in familial Hodgkin lymphoma. Int. J. Cancer 2018, 143, 2076–2078. [Google Scholar] [CrossRef] [Green Version]
  16. Srivastava, A.; Kumar, A.; Giangiobbe, S.; Bonora, E.; Hemminki, K.; Försti, A.; Bandapalli, O.R. Whole Genome Sequencing of Familial Non-Medullary Thyroid Cancer Identifies Germline Alterations in MAPK/ERK and PI3K/AKT Signaling Pathways. Biomolecules 2019, 9, 605. [Google Scholar] [CrossRef] [Green Version]
  17. Srivastava, A.; Giangiobbe, S.; Kumar, A.; Paramasivam, N.; Dymerska, D.; Behnisch, W.; Witzens-Harig, M.; Lubinski, J.; Hemminki, K.; Försti, A.; et al. Identification of Familial Hodgkin Lymphoma Predisposing Genes Using Whole Genome Sequencing. Front. Bioeng. Biotechnol. 2020, 8, 179. [Google Scholar] [CrossRef] [Green Version]
  18. Rentzsch, P.; Witten, D.; Cooper, G.M.; Shendure, J.; Kircher, M. CADD: Predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 2019, 47, D886–D894. [Google Scholar] [CrossRef]
  19. Dayem Ullah, A.Z.; Oscanoa, J.; Wang, J.; Nagano, A.; Lemoine, N.R.; Chelala, C. SNPnexus: Assessing the functional relevance of genetic variation to facilitate the promise of precision medicine. Nucleic Acids Res. 2018, 46, W109–W113. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Stelzer, G.; Rosen, N.; Plaschkes, I.; Zimmerman, S.; Twik, M.; Fishilevich, S.; Stein, T.I.; Nudel, R.; Lieder, I.; Mazor, Y.; et al. The GeneCards Suite: From Gene Data Mining to Disease Genome Sequence Analyses. Curr. Protoc. Bioinform. 2016, 54, 1–30. [Google Scholar] [CrossRef]
  21. Wang, Z.; Cummins, J.M.; Shen, D.; Cahill, D.P.; Jallepalli, P.V.; Wang, T.-L.; Parsons, D.W.; Traverso, G.; Awad, M.; Silliman, N.; et al. Three Classes of Genes Mutated In Colorectal Cancers with Chromosomal Instability. Cancer Res. 2004, 64, 2998–3001. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Mitchell, A.L.; Attwood, T.K.; Babbitt, P.C.; Blum, M.; Bork, P.; Bridge, A.; Brown, S.D.; Chang, H.-Y.; El-Gebali, S.; Fraser, M.I.; et al. InterPro in 2019: Improving coverage, classification and access to protein sequence annotations. Nucleic Acids Res. 2019, 47, D351–D360. [Google Scholar] [CrossRef] [Green Version]
  23. El-Gebali, S.; Mistry, J.; Bateman, A.; Eddy, S.R.; Luciani, A.; Potter, S.C.; Qureshi, M.; Richardson, L.J.; Salazar, G.A.; Smart, A.; et al. The Pfam protein families database in 2019. Nucleic Acids Res. 2019, 47, D427–D432. [Google Scholar] [CrossRef] [PubMed]
  24. Letunic, I.; Bork, P. 20 years of the SMART protein domain annotation resource. Nucleic Acids Res. 2018, 46, D493–D496. [Google Scholar] [CrossRef] [PubMed]
  25. Takahashi, M.; Fujita, M.; Furukawa, Y.; Hamamoto, R.; Shimokawa, T.; Miwa, N.; Ogawa, M.; Nakamura, Y. Isolation of a novel human gene, APCDD1, as a direct target of the beta-Catenin/T-cell factor 4 complex with probable involvement in colorectal carcinogenesis. Cancer Res. 2002, 62, 5651–5656. [Google Scholar]
  26. Ernst, J.; Kellis, M. ChromHMM: Automating chromatin-state discovery and characterization. Nat. Methods 2012, 9, 215–216. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Roadmap Epigenomics, C.; Kundaje, A.; Meuleman, W.; Ernst, J.; Bilenky, M.; Yen, A.; Heravi-Moussavi, A.; Kheradpour, P.; Zhang, Z.; Wang, J.; et al. Integrative analysis of 111 reference human epigenomes. Nature 2015, 518, 317–330. [Google Scholar]
  28. Hoffman, M.M.; Buske, O.J.; Wang, J.; Weng, Z.; Bilmes, J.A.; Noble, W.S. Unsupervised pattern discovery in human chromatin structure through genomic segmentation. Nat. Methods 2012, 9, 473–476. [Google Scholar] [CrossRef] [Green Version]
  29. Fre, S.; Pallavi, S.K.; Huyghe, M.; Laé, M.; Janssen, K.-P.; Robine, S.; Artavanis-Tsakonas, S.; Louvard, D. Notch and Wnt signals cooperatively control cell proliferation and tumorigenesis in the intestine. Proc. Natl. Acad. Sci. USA 2009, 106, 6309–6314. [Google Scholar] [CrossRef] [Green Version]
  30. Badenes, M.; Trindade, A.; Pissarra, H.; Lopes-da-Costa, L.; Duarte, A. Delta-like 4/Notch signaling promotes Apc (Min/+) tumor initiation through angiogenic and non-angiogenic related mechanisms. BMC Cancer 2017, 17, 50. [Google Scholar]
  31. He, P.; Liang, J.; Shao, T.; Guo, Y.; Hou, Y.; Li, Y. HDAC5 promotes colorectal cancer cell proliferation by up-regulating DLL4 expression. Int. J. Clin. Exp. Med. 2015, 8, 6510–6516. [Google Scholar]
  32. Stypula-Cyrus, Y.; Damania, D.; Kunte, D.P.; Cruz, M.D.; Subramanian, H.; Roy, H.K.; Backman, V. HDAC Up-Regulation in Early Colon Field Carcinogenesis Is Involved in Cell Tumorigenicity through Regulation of Chromatin Structure. PLoS ONE 2013, 8, e64600. [Google Scholar] [CrossRef] [Green Version]
  33. Shah, M.; Rennoll, S.A.; Raup-Konsavage, W.M.; Yochum, G.S. A dynamic exchange of TCF3 and TCF4 transcription factors controls MYC expression in colorectal cancer cells. Cell Cycle 2015, 14, 323–332. [Google Scholar] [CrossRef] [Green Version]
  34. Findlay, V.J.; Wang, C.; Nogueira, L.M.; Hurst, K.; Quirk, D.; Ethier, S.P.; O’Carroll, K.F.S.; Watson, D.K.; Camp, E.R. SNAI2 Modulates Colorectal Cancer 5-Fluorouracil Sensitivity through miR145 Repression. Mol. Cancer Ther. 2014, 13, 2713–2726. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. de Sousa, E.M.F.; Colak, S.; Buikhuisen, J.; Koster, J.; Cameron, K.; de Jong, J.H.; Tuynman, J.B.; Prasetyanti, P.R.; Fessler, E.; van den Bergh, S.P.; et al. Methylation of cancer-stem-cell-associated Wnt target genes predicts poor prognosis in colorectal cancer patients. Cell Stem Cell 2011, 9, 476–485. [Google Scholar]
  36. Shimomura, Y.; Agalliu, D.; Vonica, A.; Luria, V.; Wajid, M.; Baumer, A.; Belli, S.; Petukhova, L.; Schinzel, A.; Brivanlou, A.H.; et al. APCDD1 is a novel Wnt inhibitor mutated in hereditary hypotrichosis simplex. Nature 2010, 464, 1043–1047. [Google Scholar] [CrossRef] [Green Version]
  37. Ordóñez-Morán, P.; Dafflon, C.; Imajo, M.; Nishida, E.; Huelsken, J. HOXA5 Counteracts Stem Cell Traits by Inhibiting Wnt Signaling in Colorectal Cancer. Cancer Cell 2015, 28, 815–829. [Google Scholar] [CrossRef] [Green Version]
  38. Marek, L.; Hamacher, A.; Hansen, F.K.; Kuna, K.; Gohlke, H.; Kassack, M.U.; Kurz, T. Histone Deacetylase (HDAC) Inhibitors with a Novel Connecting Unit Linker Region Reveal a Selectivity Profile for HDAC4 and HDAC5 with Improved Activity against Chemoresistant Cancer Cells. J. Med. Chem. 2013, 56, 427–436. [Google Scholar] [CrossRef] [PubMed]
  39. Milde, T.; Oehme, I.; Korshunov, A.; Kopp-Schneider, A.; Remke, M.; Northcott, P.A.; Deubzer, H.E.; Lodrini, M.; Taylor, M.D.; Von Deimling, A.; et al. HDAC5 and HDAC9 in Medulloblastoma: Novel Markers for Risk Stratification and Role in Tumor Cell Growth. Clin. Cancer Res. 2010, 16, 3240–3252. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  40. Fan, J.; Lou, B.; Chen, W.; Zhang, J.; Lin, S.; Lv, F.-F.; Chen, Y. Down-regulation of HDAC5 inhibits growth of human hepatocellular carcinoma by induction of apoptosis and cell cycle arrest. Tumor Biol. 2014, 35, 11523–11532. [Google Scholar] [CrossRef]
  41. Lee, H.-Y.; Tsai, A.-C.; Chen, M.-C.; Shen, P.-J.; Cheng, Y.-C.; Kuo, C.-C.; Pan, S.-L.; Liu, Y.-M.; Liu, J.-F.; Yeh, T.-K.; et al. Azaindolylsulfonamides, with a More Selective Inhibitory Effect on Histone Deacetylase 6 Activity, Exhibit Antitumor Activity in Colorectal Cancer HCT116 Cells. J. Med. Chem. 2014, 57, 4009–4022. [Google Scholar] [CrossRef] [PubMed]
  42. Hrckulak, D.; Janeckova, L.; Lanikova, L.; Kriz, V.; Horazna, M.; Babosova, O.; Vojtechova, M.; Galuskova, K.; Sloncova, E.; Korinek, V. Wnt Effector TCF4 Is Dispensable for Wnt Signaling in Human Cancer Cells. Genes 2018, 9, 439. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Korinek, V.; Barker, N.; Morin, P.J.; van Wichen, D.; de Weger, R.; Kinzler, K.W.; Vogelstein, B.; Clevers, H. Constitutive transcriptional activation by a beta-catenin-Tcf complex in APC-/- colon carcinoma. Science 1997, 275, 1784–1787. [Google Scholar] [CrossRef] [Green Version]
  44. Bienz, M.; Clevers, H. Linking colorectal cancer to Wnt signaling. Cell 2000, 103, 311–320. [Google Scholar] [CrossRef] [Green Version]
  45. Smith, D.R.; Myint, T.; Goh, H.S. Over-expression of the c-myc proto-oncogene in colorectal carcinoma. Br. J. Cancer 1993, 68, 407–413. [Google Scholar] [CrossRef] [Green Version]
  46. Erisman, M.D.; Rothberg, P.G.; Diehl, R.E.; Morse, C.C.; Spandorfer, J.M.; Astrin, S.M. Deregulation of c-myc gene expression in human colon carcinoma is not accompanied by amplification or rearrangement of the gene. Mol. Cell. Biol. 1985, 5, 1969–1976. [Google Scholar] [CrossRef] [Green Version]
  47. Rochlitz, C.F.; Herrmann, R.; De Kant, E. Overexpression and Amplification of c-myc during Progression of Human Colorectal Cancer. Oncology 1996, 53, 448–454. [Google Scholar] [CrossRef]
  48. Hatzis, P.; Van Der Flier, L.G.; Van Driel, M.A.; Guryev, V.; Nielsen, F.; Denissov, S.; Nijman, I.J.; Koster, J.; Santo, E.E.; Welboren, W.; et al. Genome-Wide Pattern of TCF7L2/TCF4 Chromatin Occupancy in Colorectal Cancer Cells. Mol. Cell. Biol. 2008, 28, 2732–2744. [Google Scholar] [CrossRef] [Green Version]
  49. Kriegl, L.; Horst, D.; Reiche, J.A.; Engel, J.; Kirchner, T.; Jung, A. LEF-1 and TCF4 expression correlate inversely with survival in colorectal cancer. J. Transl. Med. 2010, 8, 123. [Google Scholar] [CrossRef] [Green Version]
  50. Mao, C.D.; Byers, S.W. Cell-context dependent TCF/LEF expression and function: Alternative tales of repression, de-repression and activation potentials. Crit. Rev. Eukaryot. Gene Expr. 2011, 21, 207–236. [Google Scholar] [CrossRef] [Green Version]
  51. Hoverter, N.P.; Waterman, M.L. A Wnt-fall for gene regulation: Repression. Sci. Signal. 2008, 1, pe43. [Google Scholar] [CrossRef]
  52. Arce, L.; Yokoyama, N.N.; Waterman, M.L. Diversity of LEF/TCF action in development and disease. Oncogene 2006, 25, 7492–7504. [Google Scholar] [CrossRef] [Green Version]
  53. Cadigan, K.M. TCFs and Wnt/beta-catenin signaling: More than one way to throw the switch. Curr. Top Dev. Biol. 2012, 98, 1–34. [Google Scholar]
  54. Cadigan, K.M.; Waterman, M.L. TCF/LEFs and Wnt Signaling in the Nucleus. Cold Spring Harb. Perspect. Biol. 2012, 4, a007906. [Google Scholar] [CrossRef]
  55. Brantjes, H.; Roose, J.; Van De Wetering, M.; Clevers, H. All Tcf HMG box transcription factors interact with Groucho-related co-repressors. Nucleic Acids Res. 2001, 29, 1410–1419. [Google Scholar] [CrossRef] [Green Version]
  56. Wang, H.; Matise, M.P. Tcf7l2/Tcf4 Transcriptional Repressor Function Requires HDAC Activity in the Developing Vertebrate CNS. PLoS ONE 2016, 11, e0163267. [Google Scholar] [CrossRef] [PubMed]
  57. Lahiri, D.K.; Schnabel, B. DNA isolation by a rapid method from human blood samples: Effects of MgCl2, EDTA, storage time, and temperature on DNA yield and quality. Biochem. Genet. 1993, 31, 321–328. [Google Scholar] [CrossRef]
  58. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef] [Green Version]
  59. Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 2011, 27, 2987–2993. [Google Scholar] [CrossRef] [Green Version]
  60. Rimmer, A.; Phan, H.; Mathieson, I.; Iqbal, Z.; Twigg, S.R.; Consortium, W.G.S.; Wilkie, A.O.; McVean, G.; Lunter, G. Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications. Nat. Genet. 2014, 46, 912–918. [Google Scholar] [CrossRef] [Green Version]
  61. Wang, K.; Li, M.; Hakonarson, H. Annovar: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010, 38, e164. [Google Scholar] [CrossRef] [PubMed]
  62. Genomes Project, C.; Auton, A.; Brooks, L.D.; Durbin, R.M.; Garrison, E.P.; Kang, H.M.; Korbel, J.O.; Marchini, J.L.; McCarthy, S.; McVean, G.A.; et al. A global reference for human genetic variation. Nature 2015, 526, 68–74. [Google Scholar]
  63. Smigielski, E.M.; Sirotkin, K.; Ward, M.; Sherry, S.T. dbSNP: A database of single nucleotide polymorphisms. Nucleic Acids Res. 2000, 28, 352–355. [Google Scholar] [CrossRef] [Green Version]
  64. Lek, M.; Karczewski, K.J.; Minikel, E.V.; Samocha, K.E.; Banks, E.; Fennell, T.; O’Donnell-Luria, A.H.; Ware, J.S.; Hill, A.J.; Cummings, B.B.; et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 2016, 536, 285–291. [Google Scholar] [CrossRef] [Green Version]
  65. Steinke, V.; Engel, C.; Buttner, R.; Schackert, H.K.; Schmiegel, W.H.; Propping, P. Hereditary nonpolyposis colorectal cancer (HNPCC)/Lynch syndrome. Dtsch. Arztebl. Int. 2013, 110, 32–38. [Google Scholar] [CrossRef] [Green Version]
  66. Brandt, A.; Bermejo, J.L.; Sundquist, J.; Hemminki, K. Age of onset in familial cancer. Ann. Oncol. 2008, 19, 2084–2088. [Google Scholar] [CrossRef] [PubMed]
  67. Kircher, M.; Witten, D.M.; Jain, P.; O’Roak, B.J.; Cooper, G.M.; Shendure, J. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 2014, 46, 310–315. [Google Scholar] [CrossRef] [Green Version]
  68. Cooper, G.M.; Stone, E.A.; Asimenos, G.; Green, E.D.; Batzoglou, S.; Sidow, A. Distribution and intensity of constraint in mammalian genomic sequence. Genome Res. 2005, 15, 901–913. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  69. Siepel, A.; Bejerano, G.; Pedersen, J.S.; Hinrichs, A.S.; Hou, M.; Rosenbloom, K.; Clawson, H.; Spieth, J.; Hillier, L.W.; Richards, S.; et al. Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 2005, 15, 1034–1050. [Google Scholar] [CrossRef] [Green Version]
  70. Pollard, K.S.; Hubisz, M.J.; Rosenbloom, K.R.; Siepel, A. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res. 2009, 20, 110–121. [Google Scholar] [CrossRef] [Green Version]
  71. Petrovski, S.; Wang, Q.; Heinzen, E.L.; Allen, A.S.; Goldstein, D.B. Genic intolerance to functional variation and the interpretation of personal genomes. PLoS Genet. 2013, 9, e1003709. [Google Scholar] [CrossRef]
  72. Ward, L.D.; Kellis, M. HaploReg: A resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res. 2012, 40, D930–D934. [Google Scholar] [CrossRef] [Green Version]
  73. Liu, X.; Wu, C.; Li, C.; Boerwinkle, E. dbNSFP v3.0: A One-Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice-Site SNVs. Hum. Mutat. 2016, 37, 235–241. [Google Scholar] [CrossRef] [Green Version]
  74. Tamborero, D.; Rubio-Perez, C.; Deu-Pons, J.; Schroeder, M.P.; Vivancos, A.; Rovira, A.; Tusquets, I.; Albanell, J.; Rodon, J.; Tabernero, J.; et al. Cancer Genome Interpreter annotates the biological and clinical relevance of tumor alterations. Genome Med. 2018, 10, 25. [Google Scholar] [CrossRef]
  75. Hecht, M.; Bromberg, Y.; Rost, B. Better prediction of functional effects for sequence variants. BMC Genom. 2015, 16 (Suppl. 8), S1. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  76. Betel, D.; Koppal, A.; Agius, P.; Sander, C.; Leslie, C. Comprehensive modeling of microRNA targets predicts functional non-conserved and non-canonical sites. Genome Biol. 2010, 11, R90. [Google Scholar] [CrossRef] [Green Version]
  77. Wei, Y.; Zhang, S.; Shang, S.; Zhang, B.; Li, S.; Wang, X.; Wang, F.; Su, J.; Wu, Q.; Liu, H.; et al. SEA: A super-enhancer archive. Nucleic Acids Res. 2016, 44, D172–D179. [Google Scholar] [CrossRef] [Green Version]
  78. Agarwal, V.; Bell, G.W.; Nam, J.-W.; Bartel, D.P. Predicting effective microRNA target sites in mammalian mRNAs. eLife 2015, 4, e05005. [Google Scholar] [CrossRef] [PubMed]
  79. Fornes, O.; Castro-Mondragon, J.A.; Khan, A.; Van Der Lee, R.; Zhang, X.; Richmond, P.A.; Modi, B.P.; Correard, S.; Gheorghe, M.; Baranašić, D.; et al. JASPAR 2020: Update of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 2019, 48, D87–D92. [Google Scholar] [CrossRef]
  80. Robinson, J.T.; Thorvaldsdóttir, H.; Wenger, A.M.; Zehir, A.; Mesirov, J.P. Variant Review with the Integrative Genomics Viewer. Cancer Res. 2017, 77, e31–e34. [Google Scholar] [CrossRef] [Green Version]
  81. Rouillard, A.D.; Gundersen, G.W.; Fernandez, N.F.; Wang, Z.; Monteiro, C.D.; McDermott, M.G.; Ma’Ayan, A. The harmonizome: A collection of processed datasets gathered to serve and mine knowledge about genes and proteins. Database 2016, 2016. [Google Scholar] [CrossRef]
Figure 1. Pedigree of the CRC (colorectal cancer) family with APCDD1 and HADC5 variants analyzed in this study.
Figure 1. Pedigree of the CRC (colorectal cancer) family with APCDD1 and HADC5 variants analyzed in this study.
Ijms 22 01837 g001
Figure 2. Graphical representation of the filtering process according to the Familial Cancer Variant Prioritization Pipeline version 2 (FCVPPv2).
Figure 2. Graphical representation of the filtering process according to the Familial Cancer Variant Prioritization Pipeline version 2 (FCVPPv2).
Ijms 22 01837 g002
Figure 3. pHDAC5MUT shows a significantly increased luciferase activity compared to pHDAC5WT. Dual luciferase reporter assays performed for pGL4.10-HDAC5WT and pGL4.10-HDAC5MUT reporter constructs co-transfected with pSNAI-2 or pTCF4 or no transcription factor (TF) into HEK293T cells. Luciferase activity was measured at four different time points (ad) and normalized to the empty pGL4.10 reporter vector. Each bar represents the mean of three independent experiments with standard deviation. pHDAC5MUT shows a significantly increased luciferase activity compared to pHDAC5WT. Co-transfection of pTCF4 further enhanced the promoter activity of pHDAC5MUT significantly. *—p < 0.05, **—p < 0.01, ***—p < 0.001, ****—p < 0.0001.
Figure 3. pHDAC5MUT shows a significantly increased luciferase activity compared to pHDAC5WT. Dual luciferase reporter assays performed for pGL4.10-HDAC5WT and pGL4.10-HDAC5MUT reporter constructs co-transfected with pSNAI-2 or pTCF4 or no transcription factor (TF) into HEK293T cells. Luciferase activity was measured at four different time points (ad) and normalized to the empty pGL4.10 reporter vector. Each bar represents the mean of three independent experiments with standard deviation. pHDAC5MUT shows a significantly increased luciferase activity compared to pHDAC5WT. Co-transfection of pTCF4 further enhanced the promoter activity of pHDAC5MUT significantly. *—p < 0.05, **—p < 0.01, ***—p < 0.001, ****—p < 0.0001.
Ijms 22 01837 g003
Table 1. Overview of the top exonic missense variants prioritized in the studied CRC family. Chromosomal positions, pedigree segregation, allele frequencies in the non-Finnish European population (NFE), accessed on 28 November 2020, PHRED-like CADD (Combined Annotation Dependent Depletion v1.3) scores, conservational scores and the percentage of reached intolerance and deleteriousness scores are summarized for each variant. Snap2 results for the predicted amino acid changes with calculated effect scores and accuracies given in % as well as CGI predictions were accessed on 28 November 2020 and included in the table. Respective protein functions of the encoded gene products are derived from Genecards on 26 November 2020 [20]. ND—no driver mutation; .—not annotated.
Table 1. Overview of the top exonic missense variants prioritized in the studied CRC family. Chromosomal positions, pedigree segregation, allele frequencies in the non-Finnish European population (NFE), accessed on 28 November 2020, PHRED-like CADD (Combined Annotation Dependent Depletion v1.3) scores, conservational scores and the percentage of reached intolerance and deleteriousness scores are summarized for each variant. Snap2 results for the predicted amino acid changes with calculated effect scores and accuracies given in % as well as CGI predictions were accessed on 28 November 2020 and included in the table. Respective protein functions of the encoded gene products are derived from Genecards on 26 November 2020 [20]. ND—no driver mutation; .—not annotated.
Gene NameVariantPedigree SegregationAllele Frequency in NFECADD SCOREConservational ScoresIntolerance Scores (%)Deleteriousness Scores * (%)Amino Acid ChangeSnap2CGIProtein Function
ExACgnomADGERPPhyloPPhastConsScoreAccuracy (%)
APCDD118_10485580_G_AIII7, III8, III107.37 × 10−51.4 × 10−427.54.949.26110066.67R299H5475NDInhibition of Wnt signaling, controversial function
FLNC7_128480709_G_AIII7, III8, III105.25 × 10−42.57 × 10−429.35.029.821100100G553S3366NDAnchoring of membrane proteins for actin cytoskeleton
KCNH617_61613135_T_GIII7, III8, III104.51 × 10−56.16 × 10−523.43.393.96110091.67L403V653NDRegulation of neurotransmitter release, neuronal excitability, epithelial electrolyte transport
LSR19_35741380_C_AII7, III7, III8, III10..27.44.887.2117575A139D8091NDLipoprotein metabolism
MTX11_155181922_A_GIII7, III8, III109.25 × 10−51.52 × 10−423.33.642.89166.6766.67Y228C3266NDMitochondrial protein import
SDS12_113836573_G_AIII7, III8, III104.59 × 10−55.44 × 10−523.445.69175100T91I4671NDSerine and glycine metabolism, gluconeogenesis
ZW1011_113607367_C_GIII7, III8, III105.99 × 10−46.43 × 10−4346.177.4917575A732P7085NDChromosome segregation, mitotic checkpoint
* Following predictions given by deleteriousness scores were considered as favorable in our analysis: SIFT—Damaging (D); Polyphen2_HumDiv, Polyphen2_HumVar—Probably damaging (D) and Possibly damaging (P); LRT—Deleterious (D); MutationTaster—Disease causing (D) and disease causing automatic (A); MutationAssesor—High (H) and medium (M); FATHMM—Damaging (D); MetaSVM—Damaging (D); MetaLR—Damaging (D); Reliability Index ≥ 5; VEST3 ≥ 0.5; PROVEAN—Damaging (D).
Table 2. 5′UTR germline variants prioritized in the CRC family. Chromosomal position, pedigree segregation and allele frequencies in the non-Finnish European population (NFE), accessed on 28 March 2019, are listed for each variant. Promoters, enhancers, super-enhancers identified with Bedtools intersect function and SEA, FANTOM5 databases on 6 April 2019 are included. Overall deleteriousness, genomic conservation, chromatin state and location within transcription factor binding sites (TFBS) are annotated by CADD v1.4 on 30 March 2019. Further information on TFBS, CpG islands and the general functional impact in form of a summarizing percentage of positive non-coding scores are derived from SNPnexus on 31 March 2019. .—not annotated.
Table 2. 5′UTR germline variants prioritized in the CRC family. Chromosomal position, pedigree segregation and allele frequencies in the non-Finnish European population (NFE), accessed on 28 March 2019, are listed for each variant. Promoters, enhancers, super-enhancers identified with Bedtools intersect function and SEA, FANTOM5 databases on 6 April 2019 are included. Overall deleteriousness, genomic conservation, chromatin state and location within transcription factor binding sites (TFBS) are annotated by CADD v1.4 on 30 March 2019. Further information on TFBS, CpG islands and the general functional impact in form of a summarizing percentage of positive non-coding scores are derived from SNPnexus on 31 March 2019. .—not annotated.
Gene NameVariantPedigree SegregationAllele Frequency in NFEBedtools Intersect ICADD v1.4SNPnexus
ExACgnomAD NFECADD SCOREConservational ScoresChromatin StateTFBSTFBS Peaks IVNon Coding Scores V (%)CpG IslandCpG Ratio VITFBS
PhastConsPhyloPGERPChromHMM II StateChromHMM II ScoreSegway III
CA417_58227298_G_CIII7, III8, III10..P:58227287
..58227313,+
12.60.440.722.77Enh Biv0.24TF01450610.89.
CALM319_47105342_A_GII7, III7, III8, III10...19.80.871.942.49TssA0.87GS1150...
HDAC517_42200942_T_GII7, III7, III8.6.64 × 10−5.21.91.000.873.99TssA0.98TSS174250920.97
PLAA9_26947165_G_AIII7, III8, III10.6.49 × 10−5P:26947129
..26947212,-
220.990.924.8TssA0.94TSS416183.3720.86NRF2
PPTC712_111021082_G_CIII7, III8, III10.2.69 × 10−4SE: hg19_A549_2
12_111015565
16.51.003.854.29TssA0.98TSS265050831.13.
TMEM
115
3_50396814_C_GIII7, III8, III10.1.69 × 10−3.18.11.001.894.82TssA0.82GS182766.7640.67.
TPM29_35690678_C_TIII7, III8, III10.1.36 × 10−3.17.21.000.613.76TssA0.95GE266501090.76.
UBE2K4_39699921_G_CIII7, III8, III10.1.16 × 10−4.17.01.001.394.47TssA0.93TSS142566.7891.01.
I Bedtools intersect: Promoters (P) are listed with their specific genomic position (P: start..end, strand). Super-enhancers (SE) are shown including information about the used reference genome, cell line and genomic position. II ChromHMM: The ChromHmm score shows the proportion of 127 cell types of the Roadmap Epigenomics project in a particular chromatin state with scores closer to 1 indicating more cell types in the particular chromatin state. The 15 chromatin states are defined as follows: TssA—Active transcription start site (TSS), TssAFInk—Flanking active TSS, TxFlnk—Transcribed at gene 5’ and 3’, Tx—Strong transcription, TxWk—Weak transcription, EnhG—Genic enhancers, Enh—Enhancers, ZNF/Rpts—ZNF genes and repeats, Het—Heterochromatin, TssBiv—Bivalent/Poised TSS/Enhancers, BivFlnk—Flanking bivalent TSS/Enhancer, EnhBiv—Bivalent enhancers, ReprPC—Repressed PolyComb, ReprPCWk—Weak Repressed PolyComb, Quies—Quiescent/low [26,27]. III Segway: Segway uses a genomic segmentation method to annotate the chromatin state based on multiple datasets of ChIP-seq experiments. The chromatin states can be annotated as follows: D—dead, F0/1—FAIRE, R0/1/2/4/5—Repressed Region, H3K9me1—histone 3 lysine 9 monomethylation, L0/1—Low zone, GE0/1/2—Gene body (end),TF0/1/2—Transcription factor activity, C0—CTCF, GS—Gene body (start), E/GM—Enhancer/gene middle, GM0/1—Gene body (middle), TSS—Transcription start site, ZnfRpts—zinc finger repeats [28]. IV TFBS peaks: The number of overlapping ChIP TFBS peaks summed over different cell types/tissue. V Non-coding scores: Following cut-offs were used for the interpretation of the non-coding scores derived from SNPnexus: FitCons Score ≥ 0.2 with a p-value ≤ 0.05; EIGEN > 0 (at least 1 of 2 scores); FatHMM > 0.5; GWAVA > 0.4 (at least 2 of 3 scores); DeepSEA > 0.5 (at least 2 of 3 scores); FunSeq2 > 3, ReMM > 0.5. VI CpG Ratio: The ratio of observed to expected CpG in island.
Table 3. 3′UTR germline variants prioritized in the CRC family. Chromosomal position, pedigree segregation and allele frequencies in the non-Finnish European population (NFE), accessed on 28 March 2019, are listed for each variant. miRNA target sites identified with Bedtools intersect function and Targetscan 7.0 databases on 6 April 2019 are included. Overall deleteriousness, genomic conservation, chromatin state and the likelihood of target mRNA downregulation predicted by mirSVR score are annotated by CADD v1.4 on 30 March 2019. The general functional impact of each 3′UTR variant is summarized by the percentage of positive non-coding scores derived from SNPnexus on 31 March 2019. .—not annotated.
Table 3. 3′UTR germline variants prioritized in the CRC family. Chromosomal position, pedigree segregation and allele frequencies in the non-Finnish European population (NFE), accessed on 28 March 2019, are listed for each variant. miRNA target sites identified with Bedtools intersect function and Targetscan 7.0 databases on 6 April 2019 are included. Overall deleteriousness, genomic conservation, chromatin state and the likelihood of target mRNA downregulation predicted by mirSVR score are annotated by CADD v1.4 on 30 March 2019. The general functional impact of each 3′UTR variant is summarized by the percentage of positive non-coding scores derived from SNPnexus on 31 March 2019. .—not annotated.
Gene NameVariantPedigree SegregationAllele Frequency in NFEBedtools intersect ICADD v1.4SNPnexus
Non-Coding Scores IV (%)
ExACgnomADmiRNA Target SitesContext Score ++
Percentile
Site TypePHREDConservational ScoresChromatin StatemirSVR-Score
verPhConsverPhyloPGerpNChromHMM II StateChromHMM II ScoreSegway III
ACTN21_236927298_T_AIII7, III8, III10.6.48 × 10−5miR-450b-5p
miR-891b
36 717mer-m8 7mer-1a13.840.840.605.70Quies0.488R2−0.9066.7
CCNT22_135714351_C_TIII7, III8, III10..miR-4670-3p
miR-3606-5p
93 837mer-1a 8mer15.461.003.285.65Tx
Wk
0.551GE0−1.1183.3
CLK12_201717953_C_TIII7, III8.2.01 × 10−3miR-4718
let-7c-3p
98
96
7mer-m8
7mer-m8
15.570.680.385.34Tx0.921GE0−1.2185.7
DDX1722_38881603_ACT_AII7, III7, III8.2.84 × 10−3miR-550b-3p917mer-m814.910.660.974.36Tx0.858F1−0.35100
FH1_241661076_T_CIII7, III8, III10.3.89 × 10−3miR-4294947mer-m810.40.470.114.56Tx0.709GE0−0.9671.4
LRRC8C1_90180607_T_AIII7, III8, III10.2.59 × 10−4miR-499a-5p
miR-208-3p
miR-4432
miR-8087
96
91
81 90
8mer
7mer-1a
7mer-1a
7mer-1a
21.81.003.216.17Tx
Wk
0.583F1−0.7683.3
SEMA4B15_90772811_G_AIII7, III8, III10..miR-3918
miR-3127-5p
miR-506-5p
miR-10a-3p
99 99 92 948mer
8mer
7mer-m8
7mer-1a
13.650.533.014.95Tx0.606GE1−1.2783.3
TJP115_29993152_G_AIII7, III8.6.49 × 10−5miR-654-3p898mer15.031.001.485.79Tx0.441GE1−0.8083.3
I Bedtools intersect: Using Bedtools intersect function and Targetscan 7.0 database, miRNA target sites were predicted for each variant. Multiple predicted miRNA target sites of one variant are listed separated with commas. The Context Score ++ Percentile shows the percentage of sites for the miRNA with a less favorable Context Score ++ and thus less repression capacity. Hence, a higher Context Score ++ Percentile indicates greater repression at a specific site by a miRNA compared to all sites of this miRNA family. The annotation of the site type gives information about the site efficacy in the seed region and thus about the different numbers of targets identified per miRNA. The order from the strictest to the least strict is as follows: 8mer > 7mer-m8 > 7mer-A1 > 6mer. II ChromHMM: ChromHmm shows the proportion of 127 cell types of the Roadmap Epigenomics project in a particular chromatin state with scores closer to 1 indicating more cell types in the particular chromatin state. The 15 chromatin states are defined as follows: TssA—Active transcription start site (TSS), TssAFInk—Flanking active TSS, TxFlnk—Transcribed at gene 5’ and 3’, Tx—Strong transcription, TxWk—Weak transcription, EnhG—Genic enhancers, Enh—Enhancers, ZNF/Rpts—ZNF genes and repeats, Het—Heterochromatin, TssBiv—Bivalent/Poised TSS/Enhancers, BivFlnk—Flanking bivalent TSS/Enhancer, EnhBiv—Bivalent enhancers, ReprPC—Repressed PolyComb, ReprPCWk—Weak Repressed PolyComb, Quies—Quiescent/low [26,27]. III Segway: Segway uses a genomic segmentation method to annotate the chromatin state based on multiple datasets of ChIP-seq experiments. The chromatin states can be annotated as follows: D—dead, F0/1—FAIRE, R0/1/2/4/5—Repressed Region, H3K9me1—histone 3 lysine 9 monomethylation, L0/1—Low zone, GE0/1/2—Gene body (end),TF0/1/2—Transcription factor activity, C0—CTCF, GS—Gene body (start), E/GM—Enhancer/gene middle, GM0/1—Gene body (middle), TSS—Transcription start site, ZnfRpts—zinc finger repeats [28]. IV Non-coding scores: Following cut-offs were used for the interpretation of the non-coding scores derived from SNPnexus: FitCons Score ≥ 0.2 with a p-value ≤ 0.05; EIGEN > 0 (at least 1 of 2 scores); FatHMM > 0.5; GWAVA > 0.4 (at least 2 of 3 scores); DeepSEA > 0.5 (at least 2 of 3 scores); FunSeq2 > 3, ReMM > 0.5.
Table 4. Summary of transcription factor binding sites (TFBS) identified with Jaspar2020 and relative profile score threshold of 85% on 30 November 2020. Matrix ID, names of targeting transcription factors, relative scores, start and end position, strand information and respective binding sequences are included. A relative score of 1 represents the maximum likelihood sequence for the motif. TFBS newly created by the variant and thus exclusively present in HDAC5MUT sequence are annotated as MUT, whereas TFBS destroyed by the variant and thus exclusively present in HDAC5WT sequence are annotated as WT.
Table 4. Summary of transcription factor binding sites (TFBS) identified with Jaspar2020 and relative profile score threshold of 85% on 30 November 2020. Matrix ID, names of targeting transcription factors, relative scores, start and end position, strand information and respective binding sequences are included. A relative score of 1 represents the maximum likelihood sequence for the motif. TFBS newly created by the variant and thus exclusively present in HDAC5MUT sequence are annotated as MUT, whereas TFBS destroyed by the variant and thus exclusively present in HDAC5WT sequence are annotated as WT.
Matrix IDNameRelative ScoreExclusive for pHDAC5StartEndStrandPredicted Sequence
MA1631.1ASCL1(var.2)0.859965928WT710722-cagcacctcctcg
MA0598.1EHF0.869445857WT711718-acctcctc
MA0056.1MZF10.854364496WT710715+cgagga
MA0673.1NKX2-80.857521064WT708716-ctcctcgac
MA1558.1SNAI-10.875024573WT712721+aggaggtgct
MA0745.1SNAI-20.891814443WT712720+aggaggtgc
MA1563.1SOX180.858931866MUT714721-agcaccGc
MA0079.3SP10.871792316MUT710720-gcaccGcctcg
MA0079.3SP10.862428729WT710720-gcacctcctcg
MA0080.2SPI10.865925519WT712718+aggaggt
MA1566.1TBX30.876483102WT714723+gaggtgctgc
MA0806.1TBX40.854062458WT715722+aggtgctg
MA1567.1TBX60.867675674WT714723+gaggtgctgc
MA1648.1TCF12(var.2)0.868262172WT711721-agcacctcctc
MA0522.3TCF30.875470132WT711721-agcacctcctc
MA0522.2TCF30.869530427WT712721-agcacctcct
MA0830.2TCF40.851528821WT710722-cagcacctcctcg
MA0830.1TCF40.850775152WT712721-agcacctcct
MA0003.1TFAP2A0.860276583MUT707715-Gcctcgacg
MA0815.1TFAP2C(var.3)0.858234123MUT704716+agccgtcgaggCg
MA0815.1TFAP2C(var.3)0.851532375MUT704716-cGcctcgacggct
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Skopelitou, D.; Miao, B.; Srivastava, A.; Kumar, A.; Kuświk, M.; Dymerska, D.; Paramasivam, N.; Schlesner, M.; Lubinski, J.; Hemminki, K.; et al. Whole Exome Sequencing Identifies APCDD1 and HDAC5 Genes as Potentially Cancer Predisposing in Familial Colorectal Cancer. Int. J. Mol. Sci. 2021, 22, 1837. https://doi.org/10.3390/ijms22041837

AMA Style

Skopelitou D, Miao B, Srivastava A, Kumar A, Kuświk M, Dymerska D, Paramasivam N, Schlesner M, Lubinski J, Hemminki K, et al. Whole Exome Sequencing Identifies APCDD1 and HDAC5 Genes as Potentially Cancer Predisposing in Familial Colorectal Cancer. International Journal of Molecular Sciences. 2021; 22(4):1837. https://doi.org/10.3390/ijms22041837

Chicago/Turabian Style

Skopelitou, Diamanto, Beiping Miao, Aayushi Srivastava, Abhishek Kumar, Magdalena Kuświk, Dagmara Dymerska, Nagarajan Paramasivam, Matthias Schlesner, Jan Lubinski, Kari Hemminki, and et al. 2021. "Whole Exome Sequencing Identifies APCDD1 and HDAC5 Genes as Potentially Cancer Predisposing in Familial Colorectal Cancer" International Journal of Molecular Sciences 22, no. 4: 1837. https://doi.org/10.3390/ijms22041837

APA Style

Skopelitou, D., Miao, B., Srivastava, A., Kumar, A., Kuświk, M., Dymerska, D., Paramasivam, N., Schlesner, M., Lubinski, J., Hemminki, K., Försti, A., & Bandapalli, O. R. (2021). Whole Exome Sequencing Identifies APCDD1 and HDAC5 Genes as Potentially Cancer Predisposing in Familial Colorectal Cancer. International Journal of Molecular Sciences, 22(4), 1837. https://doi.org/10.3390/ijms22041837

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop