Next Article in Journal
Paving the Way to Solid Tumors: Challenges and Strategies for Adoptively Transferred Transgenic T Cells in the Tumor Microenvironment
Next Article in Special Issue
Defining A Liquid Biopsy Profile of Circulating Tumor Cells and Oncosomes in Metastatic Colorectal Cancer for Clinical Utility
Previous Article in Journal
Survival Study: International Multicentric Minimally Invasive Liver Resection for Colorectal Liver Metastases (SIMMILR-2)
Previous Article in Special Issue
Routine Immunohistochemical Analysis of Mismatch Repair Proteins in Colorectal Cancer—A Prospective Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Performance of the Use of Genetic Information to Assess the Risk of Colorectal Cancer in the Basque Population

by
Koldo Garcia-Etxebarria
1,2,*,
Ane Etxart
3,
Maialen Barrero
3,
Beatriz Nafria
3,
Nerea Miren Segues Merino
3,
Irati Romero-Garmendia
4,
Andre Franke
5,
Mauro D’Amato
6,7,8 and
Luis Bujanda
2,3
1
Biodonostia, Gastrointestinal Genetics Group, 20014 San Sebastián, Spain
2
Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), 08036 Barcelona, Spain
3
Biodonostia, Gastrointestinal Disease Group, Universidad del País Vasco (UPV/EHU), 20014 San Sebastián, Spain
4
Department of Genetics, Physical Anthropology and Animal Physiology, University of the Basque Country (Universidad del País Vasco/Euskal Herriko Unibertsitatea), 48940 Leioa, Spain
5
Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, 24105 Kiel, Germany
6
Gastrointestinal Genetics Lab, CIC bioGUNE, Basque Research and Technology Alliance, 48160 Derio, Spain
7
IKERBASQUE, Basque Foundation for Sciences, 48009 Bilbao, Spain
8
Department of Medicine and Surgery, LUM University, 70010 Casamassima, Italy
*
Author to whom correspondence should be addressed.
Cancers 2022, 14(17), 4193; https://doi.org/10.3390/cancers14174193
Submission received: 16 June 2022 / Revised: 18 August 2022 / Accepted: 26 August 2022 / Published: 29 August 2022

Abstract

:

Simple Summary

The risk of developing colorectal cancer (CRC) is partially associated with genetics. Different studies have provided valuable genetic information to understand the biology behind CRC and to build models of genetic risk. However, the study of the applicability of such genetic information within the Basque population is limited. Thus, our objectives were to find out if the genetic variants associated with CRC in other populations are the same in the Basque population and to assess the performance of the use of genetic information to calculate the risk of developing CRC. We found that the available genetic information can be applied to the Basque population, although local genetic variation can affect its use. Our findings will help to refine the use of CRC genetic risk calculation in the Basque population, and we expect that our findings could be useful for other populations.

Abstract

Although the genetic contribution to colorectal cancer (CRC) has been studied in various populations, studies on the applicability of available genetic information in the Basque population are scarce. In total, 835 CRC cases and 940 controls from the Basque population were genotyped and genome-wide association studies were carried out. Mendelian Randomization analyses were used to discover the effect of modifiable risk factors and microbiota on CRC. In total, 25 polygenic risk score models were evaluated to assess their performance in CRC risk calculation. Moreover, 492 inflammatory bowel disease cases were used to assess whether that genetic information would not confuse both conditions. Five suggestive (p < 5 × 10−6) loci were associated with CRC risk, where genes previously associated with CRC were located (e.g., ABCA12, ATIC or ERBB4). Moreover, the analyses of CRC locations detected additional genes consistent with the biology of CRC. The possible contribution of cholesterol, BMI, Firmicutes and Cyanobacteria to CRC risk was detected by Mendelian Randomization. Finally, although polygenic risk score models showed variable performance, the best model performed correctly regardless of the location and did not misclassify inflammatory bowel disease cases. Our results are consistent with CRC biology and genetic risk models and could be applied to assess CRC risk in the Basque population.

1. Introduction

In total, 10% of the cancers diagnosed in the world are colorectal cancers (CRC) and, in addition, CRC is the second cause of cancer death in developed countries [1,2]. The development of CRC can be sporadic or due to inflammatory processes [3]; the risk of CRC is influenced by the environment, genetics, and microbial composition [4,5]. Since CRC is a major public health issue, different strategies for its early detection and prognosis have been proposed and developed [6].
As mentioned, genetic factors are involved in CRC risk, or they can be associated with other risk factors related to CRC. As a consequence, their utility as biomarkers has been explored: their role in CRC risk has been studied by analyzing specific genetic variants [7,8,9], as well as, genome-wide association studies (GWAS) [10]. Moreover, the effect of genetic information on modifiable risk factors (e.g., lipids level) on CRC has been analyzed using Mendelian Randomization analyses [11,12], a method to estimate causal effects if specific assumptions are fulfilled. In addition, it has been detected that some genetic variants involved in the abundance of some microbial groups are related to CRC risk [13]. Finally, it has been proposed that polygenic risk scores (PRS) derived from different genetic studies are useful to predict the risk of CRC of one individual based on the carriership of risk genetic variants, among other factors [14,15].
Previously, 48 SNPs associated with CRC were analyzed in 230 CRC cases and 230 controls from the Basque population [16]. From those analyzed SNPs, only rs6687758 SNP was associated with CRC risk, and the application of those 48 SNPs as a model to predict PRS risk was successful [16]. Indeed, the Basque population has a particular genetic history compared to the rest of the European population, since the migrations associated with the Steppe pastoralism had less effect on that population, therefore, genetic variants from populations that lived in Europe in the Neolithic [17] or Iron Age [18] could be higher. Previously, a genetic study of this cohort showed that it was useful to study the effect of local genetic variants on the risk and ability to predict the risk of complex diseases [19]. In addition, according to the data available from the Basque Statistic Institute (https://en.eustat.eus, accessed: 1 August 2022), between 2016 and 2019, in the Basque Autonomous Community (Northern Spain) CRC caused 8356 hospitalizations (on average, 95.58 hospitalizations per 100.000 habitants per year), while in the rest of Spain there were 101.12 hospitalizations per 100.000 habitants per year (between 2016–2019, according with Instituto Nacional de Estadística, https://www.ine.es, accessed: 1 August 2022), and in Europe, there were 123.45 hospitalizations per 100.000 habitants per year (between 2016–2019, according to Eurostat, https://ec.europa.eu/eurostat, accessed: 1 August 2022).
In the present study, we analyze a larger Basque cohort (835 cases and 940 controls) to detect the risk factors for CRC that can be explained or inferred from the genetic component of CRC using genome-wide association studies and Mendelian Randomization to assess the applicability of existing CRC PRS models on this population.

2. Materials and Methods

2.1. Recruitment

CRC cases were diagnosed using standard criteria and the samples used in this study were obtained in the standard clinical practice, after informed consent, in Hospital Universitario Donostia (San Sebastian, Spain). The samples of non-CRC controls were obtained through the Basque Biobank; the samples were sourced from healthy blood donors (the age range to be eligible to be a blood donor is 18–65). The information of those blood donors is anonymized and only information about sex and age is made available. In total, 869 cases were recruited, and 987 controls were used.
The present study was approved by the Local Ethics Committee (Comité de Ética de la Investigación con medicamentos de Euskadi, code: PI+CES-BIOEF 2017-10).

2.2. Genotyping and Imputation

Illumina Global Screening Array was used to genotype the DNA samples of the individuals analyzed in this work. For this, Illumina iScan high-throughput screening system was used in the Institute of Clinical Molecular Biology (Kiel, Germany). Raw intensities were transformed to alleles using the GenCall algorithm available in Illumina GenomeStudio software.
Then, the called genotypes and samples were filtered using the following criteria: samples with ≥5% missing rates; markers with non-called alleles; markers with missing call rates > 0.05; related samples (PI-HAT > 0.1875); samples whose genotyped sex could not be determined; samples with high heterozygosity rate (more than 3 times SD from the mean) were excluded. In addition, only autosomal SNPs were kept; markers with Hardy–Weinberg equilibrium p < 1 × 10−5; markers whose P of difference in missingness between cases and control was <1 × 10−5; samples that were outliers, identified using principal component analysis (deviation of more than 6 times interquartile range), using FlashPCA (v2.0) [20], were removed.
Additional SNPs were imputed using the Sanger Imputation service. Release 1.1 of the Haplotype Reference Consortium was used as a reference panel, and the EAGLE2+PBWT pipeline was used to carry out the imputation [21,22,23]. Once imputed, markers with INFO score < 0.80, MAF < 0.01 and non-biallelic markers were removed.
After genotyping, quality control and imputation, 5,399,981 SNPs from 1775 individuals (835 cases and 940 controls) were kept.

2.3. Genetic Analyses

2.3.1. Admixture Analysis

Genotyped SNPs were pruned using Plink (v1.90) [24] and SNPs from regions with high linkage disequilibrium were removed. Admixture (v1.3) [25] was used to analyze the admixture of the samples of our cohort, with settings K between 1 and 10, and using the results with the lowest cross-validation value.

2.3.2. Genome-Wide Association Study

GWAS analyses of CRC cases and non-CRC controls were performed using logistic regression implemented in Plink [24], adjusting by sex, age and the first 4 principal components. In addition, GWAS of right colon cancer, left colon cancer, and rectal cancer vs non-CRC controls, as well as right colon cancer vs left colon cancer, and colon cancer vs rectal cancer were carried out using logistic regression implemented in Plink, and adjusting by sex, age and first 4 principal components.
To compare our results with SNPs previously associated with CRC, SNPs associated with the “Colorectal cancer” term (EFO_0005842) and studied in populations of European origin were retrieved from GWAS Catalog [26]. In total, 209 SNP from 34 studies were retrieved.
Moreover, CRC patients were compared to 492 inflammatory bowel disease patients without CRC [19] to find genetic differences in our cohort. To perform that analysis, a logistic regression implemented in Plink, adjusting by sex, age and first 4 principal components, was used. In addition, a comparison of CRC patients against the mentioned inflammatory bowel disease patients plus controls was carried out.

2.3.3. Mendelian Randomization Analyses

For carrying out Mendelian Randomization (MR) analyses TwoSampleMR (v0.5.6) [27] and gsmr (v1.0.9) [28] packages from R language (v4.0.5) were used [29], as we have used previously to study the effect of modifiable risk factors in CRC risk [13].
First, we selected the modifiable risk factors based on a previous work [12] which analyzed modifiable risk factors using Mendelian Randomization that affects CRC (BMI, cholesterol, triglycerides, selenium, iron, vitamin B12, metabolism, body fat percentage, waist circumference, IL6 receptor and height). Then, we retrieved the instruments available in MRC-IEU (https://gwas.mrcieu.ac.uk, accessed: 14 February 2022) of those traits through TwoSampleMR [27]. In addition, to analyze the effect of the microbiota in CRC cancer, we retrieved instruments of bacterial phyla which are available from MiBioGen consortium data [30].
Then, the analysis was carried out if 10 or more instruments were available, and HEIDI outlier analysis was used to discard heterogenous instruments. The strength of the instruments was measured by the F-statistic: F = R2(N – K − 1)/K(1 − R2), where R2 is the variance explained by genetic variance, N is the sample size, and K is the number of instruments [31]. In addition, I2 was calculated using TwoSampleMR R Package.
The MR analyses were carried out using Inverse Variance Weighted, Weighted Median and MR Egger methods. In addition, the heterogeneity Q test and pleiotropy test available in TwoSampleMR R Package were used as sensitivity tests. The analysis was applied to all CRC cases, as well as, right colon cancer, left colon cancer and rectal cancer analyses.

2.3.4. Polygenic Risk Scores

Polygenic risk scores (PRS) were retrieved from PGS Catalog [32]. 29 scores available in the “Colorectal cancer” term (EFO_0005842) derived using cohorts with >90% samples of European ancestry and whose assembly version was known were used for the PRS analysis [33,34,35,36,37,38,39,40,41]. From those 29 panels, our cohort had available SNPs to apply in 25 of them. In addition, the PRS used previously in the Basque population was tested [16]. The weights of the SNPs present in our data were applied in our cohort using Plink [24]. The performance of the PRS was measured by comparing the PRS score distribution of CRC cases and non-CRC controls using a T-test using R language [29]; the effect size of the T-test was calculated using Cohen’s d through the package rstatix (https://CRAN.R-project.org/package=rstatix, accessed: 28 April 2022) of R language, the area under de curve, sensitivity and specificity was calculated using pROC package of R language. The 95% of confidence interval of the area under the curve was calculated using that package and the DeLong method.
In addition, CRC PRS were applied in 492 patients with inflammatory bowel disease without CRC [19] to measure the ability to distinguish both conditions.
Additional statistical analyses and graphics were done using R language [29].

3. Results

In this study, we have analyzed 835 CRC cases and 940 population-based controls (Table 1). In the cases and the controls, around two-thirds of the individuals were males (63.47% and 67.13%, respectively), and cases were older (average age, 73.54) than the controls (average age, 41.53). The majority of the CRC patients were in stages II and III (37.61% and 26.71%, respectively), with located tumors in the rectum (28.14%) and left colon (26.23%) (Table 1).
The individuals with modern European ancestry overlapped with the Iberian population of 1000 Genomes data, while the ancient European ancestry was distanced from European populations (Supplementary Figure S1A). In addition, the PC1 of the principal component analysis of the samples was determined by the ancestry component of our cohort (Supplementary Figure S1B).

3.1. Genome-Wide Association Studies

The genome-wide association study of all CRC cases showed five suggestive (p < 5 × 10−6) signals (Table 2). The most significant SNP was rs77317240, located in chromosome 2 and upstream of ABCA12 and ATIC genes (p = 5.8 × 10−7; OR = 6.4; CI 95%, 3.1–13.2). Other suggestive SNPs were located in ERBB4 and MAGI2 genes, and downstream of the IL15 gene (Table 2).
When cancer locations were analyzed separately different signals were detected (Table 2): 16 in right colon cancer (the most significant signal was located in the NTF3 gene), 7 in left colon cancer (the most significant signal was located in the ABCC12 gene), and 10 in rectal cancer (the most significant signal was located in BRD7 gene). When locations were compared (Table 2), 2 signals were detected when comparing left and right colon cancers (the most significant genetic variant was located in the FERMT2 gene) and 3 when comparing rectal vs colon cancers (the most significant genetic variant was located in CNTNAP2 gene).
Among the SNPs previously associated with CRC (Supplementary Table S1), 16 SNPs (7.65% of SNPs previously associated) showed nominal association in our cohort. When those SNPs were analyzed by the location of cancer, 9 (4.31%) were nominally significant in right colon cancer, 12 (5.74%) in left colon cancer (including rs6687758, an SNP previously associated with CRC in the Basque population) and 12 (5.74%) in rectal cancer. Among the 31 SNPs previously associated with CRC in more than one study (Supplementary Table S1), 5 SNPs (16.13%) showed nominal association in CRC; 3 (9.68%) in right colon cancer; 3 (9.68%) in left colon cancer and 1 (3.23%) in rectal cancer.
Regarding the comparison with inflammatory bowel disease (Table 3), 11 genomic regions had suggestive different frequencies. Among them, the signal located upstream of the ATP8B4 gene (rs541295) reached a genome-wide significant p-value (p = 1.8 × 10−8). When colorectal cancer was compared with the pool of controls and inflammatory bowel disease (Table 3), the most significant signal in CRC vs controls (upstream of the ABCA12 and ATIC genes) was detected. In addition, 4 of the signals detected when CRC was compared with inflammatory bowel disease patients were suggestive: in the HLA region, in the DLGAP2 gene, downstream of the PTCHD3 gene and upstream of the ATP8B4 gene.

3.2. Mendelian Randomization

Mendelian Randomization analyses were carried out to analyze the effect of modifiable risk factors and the abundance of bacterial phyla on CRC risk. The instruments used seemed appropriate (Supplementary Table S2), although the modifiable risk factors were stronger than bacterial phyla (F-statistic between 55.82–211.35 in the former, 18.73–20.28 in the latter).
When analyzing the effect of modifiable risk factors on CRC, there were no significant results (Figure 1A, Supplementary Table S3). However, when the locations of CRC were separately analyzed, the MR Egger method showed the effect of total cholesterol (beta = 2.4 ± 1.1; p = 0.0395) on left-sided colon cancer risk, and the effect of BMI (beta = 8.7 ± 3.3; p = 0.0094) in rectal cancer risk. In the latter, pleiotropic effects were detected (p = 0.0112, Supplementary Table S3). In addition, Inverse Variance Weighted method showed the effect of LDL cholesterol (beta = 1.56 ± 0.64; p = 0.0148) on left-sided colon cancer risk.
In the case of bacterial phyla (Figure 1B, Supplementary Table S4), according to MR Egger method, Firmicutes phylum showed a significant effect on CRC and left colon cancer (beta=3.6 ± 1.7; p = 0.0364; beta = 6.4 ± 2.8; p = 0.0282, respectively), although pleiotropy was detected in both cases (p = 0.0347; p = 0.0456, respectively, Supplementary Table S4), as well as, heterogeneity in the used instruments (Q-test p = 0.0336 and p = 0.0107, respectively, Supplementary Table S4). In the case of Inverse Variance Weighted, there was an inverse effect of Cyanobacteria abundance on CRC risk and left colon cancer risk (beta = −0.86 ± 0.39; p = 0.0299; beta = −1.66 ± 0.68; p = 0.014, respectively).

3.3. Polygenic Risk Scores

Polygenic risk scores for our cohort were built using 25 different models available in PGS Catalog for CRC. From all of them (Figure 2 and Figure 3A), PGS000785 showed the best discrimination between the PRS values for cases and controls (T-test p = 2.12 × 10−14; small effect according to Cohen’s d), as well as, the best AUC value (0.6, CI 95% 0.58–0.62); followed by PGS000734 and PGS000765 (both p = 2.64 × 10−13; small effect according to Cohen’s d; AUC of 0.6, CI 95% 0.57–0.61). In addition, the PRS used previously in a Basque cohort showed lower significance (p = 0.0003; negligible effect according to Cohen’s d) and AUC value (0.55, CI 95% 0.52–0.56).
The PGS000785 PRS model had a good performance regardless of the location of CRC (Figure 3B): the distribution of the PRS score was significantly higher in right colon cancer (p = 3.05 × 10−6), left colon cancer (p = 7.49 × 10−6) and rectal cancer (p = 3.33 × 10−6) compared to controls, while there were no significant differences comparing locations. In addition, that model was able to differentiate inflammatory bowel disease patients from colorectal cancer patients (p = 2.36 × 10−10, Figure 3C), regardless of the type of inflammatory bowel disease (Crohn’s Disease, p = 2.61 × 10−7; Ulcerative colitis, p = 5.08 × 10−7; Figure 3D).

4. Discussion

The development of colorectal cancer (CRC) is influenced by environmental factors [4], microbiome composition [5] and genetic factors. In this work, we have analyzed the contribution of the genetic component to CRC risk in the Basque population, a population with a particular genetic history. That particular genetic history was reflected in the principal component analysis and, as it was done before [19], adjusting for PCs is enough to avoid artifacts due to the presence of two ancestries in the population.
Previously, selected SNPs were analyzed in CRC in the Basque population [16] and, in this study, we have used a GWAS approach and increased the sample size. In that previous work, the SNP rs6687758 was nominally significant [16] and we have been able to detect the nominal significance of that SNP in left colon cancer, as well as more genetic variants. We are aware that the sample size affected the results we obtained, and, for example, few previously associated SNPs with CRC were detected in our study. However, we were able to find nominally significant results for the SNPs detected in more than one study. In addition, the majority of SNPs detected in previous studies were not detected in other studies. Thus, the genetic risk of CRC could be partially due to local variation, therefore, it seems appropriate for the genetic analysis of CRC in new populations.
The most significant signal in CRC, although it was not genome-wide significant, was located between ABCA12 and ATIC genes. It has been reported that the expression of ABCA12 is upregulated in CRC [42,43], its expression is higher in the colon than in the rectum [43], and its expression is higher in colorectal adenoma than in hyperplastic polyp [44]. In the case of the ATIC gene, it has been proposed that its expression could be a prognostic marker for colon adenocarcinoma [45]; its presence in small extracellular vesicles in serum is useful to differentiate early colorectal neoplasia from advanced colorectal neoplasia [46].
Another suggestive signal was located on the ERBB4 gene. In cell culture and mice, it has been observed that ERBB4 expression and signaling can prevent apoptosis of the cells in an inflammatory environment [47], therefore, its chronic overexpression could contribute to the appearance of tumors, since apoptosis of colonic cells is inhibited [48]. In humans, it has been reported the overexpression of ERBB4 in CRC and that tumors with high levels of this receptor could have enhanced cell survival [49]. In addition, it has been suggested that the expression of ERBB4 is associated with unfavorable clinical outcomes in CRC [50] and that it could be a marker of a higher risk of recurrence [51]. Additionally, it has been reported that ERRB4 expression is positively associated with lymph node metastasis [50]; that ERBB4 could play a relevant role in a gene network associated with progression from colon adenocarcinoma to liver metastases [52], and that ERBB4 could be part of a pathway that enhances the invasion of CRC cells [53].
Additional suggestive signals were located in the MAGI2 gene and downstream of the IL15 gene. The SNP rs34931968 detected in our cohort is located in the MAGI2 gene, upstream of a lncRNA that is next to MAGI2 (called MAGI2-AS3), a lncRNA that has been involved in CRC [54,55,56]. In addition, the SNP rs34931968 is in linkage disequilibrium with an SNP (rs7783388) involved in CRC throughout changes in MAGI2-AS3 expression [56]. In the case of IL15, its expression has been associated with the outcome of CRC [57].
When the locations of the tumors were analyzed separately, other possible relevant genes were detected. In right colon cancer, the most significant signal was located in NTF3, a gene implicated in unfavorable prognosis in hepatocellular carcinoma [58,59]; in left colon cancer ABCC12 gene, another ATP-binding cassette as the previously discussed ABCA12; in rectal cancer BRD7 gene, a possible oncogene involved in CRC progression [60]. In addition, in rectal cancer the SNP rs13403794 was detected, an SNP located upstream of ADAM17, which is a gene that is part of the signaling pathway involved in colorectal cancer progression and chemoresistance [61]. When locations were compared, additional genes were detected: FERMT2, whose overexpression in CRC has been detected and associated with cell growth [62]; CNTNAP2, a gene that has not been associated with CRC. It has been observed that the genetic mechanisms behind CRC could be different depending on its location [63] and the differences in the genetic variants detected in our study are consistent with that suggestion.
On the whole, considering the biological role of some of the genes where the suggestive genetic variants were located, those genetic variants could be markers of the progression of CRC, at least in the Basque population, although follow-up analyses are needed to confirm their potential utility as markers.
Various modifiable risk factors have been observed to affect CRC risk [11,12,64], but we were not able to find those effects when all CRC patients were analyzed. However, when each location was analyzed, the effect of genetic risk to higher cholesterol levels (general levels or LDL) on left colon cancer and higher BMI on rectal cancer were detected, as has been suggested previously for CRC [11,12,13,64,65]. Although we tried to replicate the results obtained using Mendelian Randomization in previous works [12,13] and the traits and instruments used seem appropriate to replicate them, the results we obtained were limited or were detected only by one method. It could be possible that the size and characteristics of our cohort and GWAS analyses complicate the finding of clear causalities, since the traits we used to have strong instruments to avoid the biases of our cohort.
The genetic signature of the abundance of Firmicutes was associated with a higher risk of CRC and left colon cancer in our cohort, although the results should be taken with caution since heterogeneity was detected. In addition, that association had a pleiotropic effect, that is, rather than the presence of Firmicutes affecting the risk of CRC (cause and effect), there is a shared genetic component that affects both (common biologic mechanism). It has been described the importance of the microbiota in CRC risk and development [66,67], the differences in its composition between left and right colon cancer [68,69,70] and shared genetic variants in CRC risk and the abundance of Firmicutes [13]. Although the connection we have detected between CRC and Firmicutes is based only on their shared genetic variants, it has been observed that the involvement of Firmicutes in CRC risk was variable [68,69,70,71]: some genera of Firmicutes were enriched in CRC while others were depleted. In the case of Cyanobacteria, a higher abundance of that phylum has been observed in colorectal adenomas [72], and in animal models, it has been observed a higher abundance of Cyanobacteria when oxaliplatin is administered [73]. Therefore, follow-up analyses of Firmicutes and Cyanobacteria as a marker of CRC risk in the Basque cohort are needed. Although the involvement of Firmicutes and Cyanobacteria in CRC seems biologically possible, their connection through Mendelian Randomization in our work seems weak, since they have been detected only by one method. In addition, although the study of the effect of host genetics on microbial abundance has been a valuable resource [30], it could be possible that the available instruments are not still appropriate to carry out Mendelian Randomization analyses, at least in our cohort.
Finally, polygenic risk scores (PRS) have been proposed as a tool for risk prediction in colorectal cancer [15]. We applied several publicly available PRS models, and their performance was variable. The best model was built using different sources available in GWAS Catalog and the interplay between genetic risk and modifiable risk factors [37]. In the case of CRC, that work suggested that PRS was the primary determinant of risk stratification in their application of the PRS model in UK Biobank data [37]. Although our cohort has a slightly different genetic background, since there is a higher genetic component of ancient European ancestry, the application of the PRS was able to differentiate CRC cases from controls, regardless of the location of the tumors. Since the AUC was low and the effect small, additional genetic or non-genetic risk factors should be incorporated to build a model for better discrimination. In addition, this PRS did not confuse CRC and inflammatory bowel disease or its main types in our cohort, suggesting that when there are overlapping symptoms, the use of that PRS would not misclassify an IBD patient as a CRC patient. In addition, we found genetic variants that could be used to discriminate between CRC and inflammatory bowel disease in our cohort, although follow-up analyses are needed. Regarding the PRS previously used in Basques [16], the performance in our data was not as good as the best model, but the controls showed lower PRS than CRC cases (p = 0.003), similar to the previous analysis of Basques (p = 0.002 for the unweighted values, p = 0.036 for weighted values) [16]. Therefore, the incorporation of a different set of SNPs for the development of more precise PRS models is still necessary, and the performance of PRS models should be investigated in additional samples of this population.
Considering the results obtained in the different analyses we have carried out since the results are quite consistent with previous results, genetic CRC risk in the Basque population seems to be similar to other European populations. The suggestive signals from the GWAS were consistent with CRC biology, although in some variants the frequency in the Basque population was quite different. Mendelian Randomization analyses did not find clear causal relationships, although the traits used were reported to affect CRC risk in other cohorts, therefore, follow-up studies are needed to assess if our results are due to methodological constraints or differences in the specific mechanisms. Finally, the application of polygenic risk scores based on European populations seemed a feasible approach to capture the CRC risk in the Basque population, although they can be improved. Thus, as happened in inflammatory bowel disease [19], the genetic architecture of CRC risk in the Basque population is similar to other European populations but local genetic variation shapes the risk.

5. Conclusions

In conclusion, we have analyzed the genetic component of the risk of CRC in the Basque population. Although the sample size was limited and there were constraints in the analyses due to the cohort used, we detected genetic factors whose involvement in the risk of CRC is consistent with the biological mechanisms of CRC, and we identified plausible genetic markers and an appropriate polygenic risk score model to assess the genetic contribution to CRC risk in this population. In the future, those genetic factors and the polygenic risk score model should be validated in follow-up studies.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/cancers14174193/s1, Figure S1: PCA plots of analyzed Basque cohort; Table S1: Results of SNPs previously associated with CRC; Supplementary Table S2: Sensitivity analyses of used instruments in Mendelian Randomization analyses; Supplementary Table S3: Results of Mendelian Randomization analyses using modifiable risk factors as exposures; Supplementary Table S4: Results of Mendelian Randomization analyses using bacterial phyla as exposures.

Author Contributions

Conceptualization, K.G.-E., M.D. and L.B.; methodology, K.G.-E.; formal analysis, K.G.-E., I.R.-G.; resources, A.E., M.B., B.N., N.M.S.M., A.F., M.D., L.B.; data curation, A.E., M.B., B.N., N.M.S.M.; writing—original draft preparation, K.G.-E.; writing—review and editing, K.G.-E., I.R.-G., L.B.; supervision, K.G.-E.; funding acquisition, M.D., L.B. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially founded by Gipuzkoako Foru Aldundia/Diputación Foral de Gipuzkoa (Code: 111/17).

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by Comité de Ética de la Investigación con medicamentos de Euskadi (code: PI+CES-BIOEF 2017-10).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available on request from the authors. The data are not publicly available due to ethical reasons (genotype data cannot be shared).

Acknowledgments

Samples and data used in the present work were provided by the Basque Biobank (http://www.biobancovasco.org, accessed: 1 August 2022). We want to thank Miguel Ángel Vesga from the Basque Centre of Transfusion and Human Tissues for providing the access to control samples.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Stewart, B.; Wild, C. World Cancer Report 2014; IARC Publications: Lyon, France, 2014; ISBN 978-92-832-0443-5. [Google Scholar]
  2. Ferlay, J.; Soerjomataram, I.; Dikshit, R.; Eser, S.; Mathers, C.; Rebelo, M.; Parkin, D.M.; Forman, D.; Bray, F. Cancer incidence and mortality worldwide: Sources, methods and major patterns in GLOBOCAN 2012. Int. J. Cancer 2015, 136, E359–E386. [Google Scholar] [CrossRef] [PubMed]
  3. Lasry, A.; Zinger, A.; Ben-Neriah, Y. Inflammatory networks underlying colorectal cancer. Nat. Immunol. 2016, 17, 230–240. [Google Scholar] [CrossRef]
  4. Cross, A.J.; Ferrucci, L.M.; Risch, A.; Graubard, B.I.; Ward, M.H.; Park, Y.; Hollenbeck, A.R.; Schatzkin, A.; Sinha, R. A large prospective study of meat consumption and colorectal cancer risk: An investigation of potential mechanisms underlying this association. Cancer Res. 2010, 70, 2406–2414. [Google Scholar] [CrossRef] [PubMed]
  5. Wong, S.H.; Yu, J. Gut microbiota in colorectal cancer: Mechanisms of action and clinical applications. Nat. Rev. Gastroenterol. Hepatol. 2019, 16, 690–704. [Google Scholar] [CrossRef]
  6. Levin, B.; Lieberman, D.A.; McFarland, B.; Smith, R.A.; Brooks, D.; Andrews, K.S.; Dash, C.; Giardiello, F.M.; Glick, S.; Levin, T.R.; et al. Screening and surveillance for the early detection of colorectal cancer and adenomatous polyps, 2008: A joint guideline from the American Cancer Society, the US Multi-Society Task Force on Colorectal Cancer, and the American College of Radiology. CA. Cancer J. Clin. 2008, 58, 130–160. [Google Scholar] [CrossRef] [PubMed]
  7. Abulí, A.; Fernández-Rozadilla, C.; Alonso-Espinaco, V.; Muñoz, J.; Gonzalo, V.; Bessa, X.; González, D.; Clofent, J.; Cubiella, J.; Morillas, J.D.; et al. Case-control study for colorectal cancer genetic susceptibility in EPICOLON: Previously identified variants and mucins. BMC Cancer 2011, 11, 339. [Google Scholar] [CrossRef]
  8. Abulí, A.; Bessa, X.; González, J.R.; Ruizponte, C.; Cáceres, A.; Muñoz, J.; Gonzalo, V.; Balaguer, F.; Fernándezrozadilla, C.; González, D.; et al. Susceptibility genetic variants associated with colorectal cancer risk correlate with cancer phenotype. Gastroenterology 2010, 139, 788–796.e6. [Google Scholar] [CrossRef] [PubMed]
  9. Burns, M.B.; Montassier, E.; Abrahante, J.; Priya, S.; Niccum, D.E.; Khoruts, A.; Starr, T.K.; Knights, D.; Blekhman, R. Colorectal cancer mutational profiles correlate with defined microbial communities in the tumor microenvironment. PLoS Genet. 2018, 14, 1–24. [Google Scholar] [CrossRef]
  10. Law, P.J.; Timofeeva, M.; Fernandez-Rozadilla, C.; Broderick, P.; Studd, J.; Fernandez-Tajes, J.; Farrington, S.; Svinti, V.; Palles, C.; Orlando, G.; et al. Association analyses identify 31 new risk loci for colorectal cancer susceptibility. Nat. Commun. 2019, 10, 1–15. [Google Scholar] [CrossRef]
  11. Rodriguez-broadbent, H.; Law, P.J.; Sud, A.; Palin, K.; Tuupanen, S.; Kaasinen, E.; Sarin, A.; Ripatti, S.; Eriksson, J.G. Mendelian randomisation implicates hyperlipidaemia as a risk factor for colorectal cancer. Int. J. Cancer 2017, 140, 2701–2708. [Google Scholar] [CrossRef] [Green Version]
  12. Cornish, A.J.; Law, P.J.; Timofeeva, M.; Palin, K.; Farrington, S.M.; Palles, C.; Jenkins, M.A.; Casey, G.; Brenner, H.; Chang-Claude, J.; et al. Modifiable pathways for colorectal cancer: A mendelian randomisation analysis. Lancet Gastroenterol. Hepatol. 2020, 5, 55–62. [Google Scholar] [CrossRef]
  13. Garcia-Etxebarria, K.; Clos-Garcia, M.; Telleria, O.; Nafría, B.; Alonso, C.; Iruarrizaga-Lejarreta, M.; Franke, A.; Crespo, A.; Iglesias, A.; Cubiella, J.; et al. Interplay between genome, metabolome and microbiome in colorectal cancer. Cancers 2021, 13, 6216. [Google Scholar] [CrossRef] [PubMed]
  14. Ibáñez-Sanz, G.; Diéz-Villanueva, A.; Alonso, M.H.; Rodríguez-Moranta, F.; Pérez-Gómez, B.; Bustamante, M.; Martin, V.; Llorca, J.; Amiano, P.; Ardanaz, E.; et al. Risk Model for Colorectal Cancer in Spanish Population Using Environmental and Genetic Factors: Results from the MCC-Spain study. Sci. Rep. 2017, 7, 43263. [Google Scholar] [CrossRef]
  15. Thomas, M.; Sakoda, L.C.; Hoffmeister, M.; Rosenthal, E.A.; Lee, J.K.; van Duijnhoven, F.J.B.; Platz, E.A.; Wu, A.H.; Dampier, C.H.; de la Chapelle, A.; et al. Genome-wide Modeling of Polygenic Risk Score in Colorectal Cancer Risk. Am. J. Hum. Genet. 2020, 107, 432–444. [Google Scholar] [CrossRef] [PubMed]
  16. Alegria-Lertxundi, I.; Aguirre, C.; Bujanda, L.; Fernández, F.J.; Polo, F.; Ordovás, J.M.; Carmen Etxezarraga, M.; Zabalza, I.; Larzabal, M.; Portillo, I.; et al. Single nucleotide polymorphisms associated with susceptibility for development of colorectal cancer: Case-control study in a Basque population. PLoS ONE 2019, 14, e0225779. [Google Scholar] [CrossRef]
  17. Günther, T.; Valdiosera, C.; Malmström, H.; Ureña, I.; Rodriguez-Varela, R.; Sverrisdóttir, Ó.O.; Daskalaki, E.A.; Skoglund, P.; Naidoo, T.; Svensson, E.M.; et al. Ancient genomes link early farmers from Atapuerca in Spain to modern-day Basques. Proc. Natl. Acad. Sci. USA 2015, 112, 11917–11922. [Google Scholar] [CrossRef]
  18. Olalde, I.; Mallick, S.; Patterson, N.; Rohland, N.; Villalba-Mouco, V.; Silva, M.; Dulias, K.; Edwards, C.J.; Gandini, F.; Pala, M.; et al. The genomic history of the Iberian Peninsula over the past 8000 years. Science (80-) 2019, 363, 1230–1234. [Google Scholar] [CrossRef]
  19. Garcia-Etxebarria, K.; Merino, O.; Gaite-Reguero, A.; Rodrigues, P.M.; Herrarte, A.; Etxart, A.; Ellinghaus, D.; Alonso-Galan, H.; Franke, A.; Marigorta, U.M.; et al. Local genetic variation of inflammatory bowel disease in Basque population and its effect in risk prediction. Sci. Rep. 2022, 12, 3386. [Google Scholar] [CrossRef] [PubMed]
  20. Abraham, G.; Qiu, Y.; Inouye, M. FlashPCA2: Principal component analysis of Biobank-scale genotype datasets. Bioinformatics 2017, 33, 2776–2778. [Google Scholar] [CrossRef] [PubMed]
  21. Loh, P.R.; Danecek, P.; Palamara, P.F.; Fuchsberger, C.; Reshef, Y.A.; Finucane, H.K.; Schoenherr, S.; Forer, L.; McCarthy, S.; Abecasis, G.R.; et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 2016, 48, 1443–1448. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Durbin, R. Efficient haplotype matching and storage using the positional Burrows-Wheeler transform (PBWT). Bioinformatics 2014, 30, 1266–1272. [Google Scholar] [CrossRef]
  23. McCarthy, S.; Das, S.; Kretzschmar, W.; Delaneau, O.; Wood, A.R.; Teumer, A.; Kang, H.M.; Fuchsberger, C.; Danecek, P.; Sharp, K.; et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 2016, 48, 1279–1283. [Google Scholar] [CrossRef]
  24. Chang, C.C.; Chow, C.C.; Tellier, L.C.A.M.; Vattikuti, S.; Purcell, S.M.; Lee, J.J. Second-generation PLINK: Rising to the challenge of larger and richer datasets. Gigascience 2015, 4, 7. [Google Scholar] [CrossRef] [PubMed]
  25. Alexander, D.H.; Novembre, J.; Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009, 19, 1655–1664. [Google Scholar] [CrossRef] [PubMed]
  26. Buniello, A.; Macarthur, J.A.L.; Cerezo, M.; Harris, L.W.; Hayhurst, J.; Malangone, C.; McMahon, A.; Morales, J.; Mountjoy, E.; Sollis, E.; et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 2019, 47, D1005–D1012. [Google Scholar] [CrossRef]
  27. Hemani, G.; Zheng, J.; Elsworth, B.; Wade, K.H.; Haberland, V.; Baird, D.; Laurin, C.; Burgess, S.; Bowden, J.; Langdon, R.; et al. The MR-base platform supports systematic causal inference across the human phenome. Elife 2018, 7, 1–29. [Google Scholar] [CrossRef]
  28. Zhu, Z.; Zheng, Z.; Zhang, F.; Wu, Y.; Trzaskowski, M.; Maier, R.; Robinson, M.R.; McGrath, J.J.; Visscher, P.M.; Wray, N.R.; et al. Causal associations between risk factors and common diseases inferred from GWAS summary data. Nat. Commun. 2018, 9, 224. [Google Scholar] [CrossRef]
  29. R Development Core Team. R: A Language and Eviroment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2008. [Google Scholar]
  30. Kurilshikov, A.; Medina-Gomez, C.; Bacigalupe, R.; Radjabzadeh, D.; Wang, J.; Demirkan, A.; Le Roy, C.I.; Raygoza Garay, J.A.; Finnicum, C.T.; Liu, X.; et al. Large-scale association analyses identify host factors influencing human gut microbiome composition. Nat. Genet. 2021, 53, 156–165. [Google Scholar] [CrossRef] [PubMed]
  31. Burgess, S.; Thompson, S.G. Avoiding bias from weak instruments in Mendelian randomization studies. Int. J. Epidemiol. 2011, 40, 755–764. [Google Scholar] [CrossRef]
  32. Lambert, S.A.; Gil, L.; Jupp, S.; Ritchie, S.C.; Xu, Y.; Buniello, A.; McMahon, A.; Abraham, G.; Chapman, M.; Parkinson, H.; et al. The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nat. Genet. 2021, 53, 420–425. [Google Scholar] [CrossRef]
  33. Jia, G.; Lu, Y.; Wen, W.; Long, J.; Liu, Y.; Tao, R.; Li, B.; Denny, J.C.; Shu, X.-O.; Zheng, W. Evaluating the Utility of Polygenic Risk Scores in Identifying High-Risk Individuals for Eight Common Cancers. JNCI Cancer Spectr. 2020, 4, pkaa021. [Google Scholar] [CrossRef] [PubMed]
  34. Hsu, L.; Jeon, J.; Brenner, H.; Gruber, S.B.; Schoen, R.E.; Berndt, S.I.; Chan, A.T.; Chang-Claude, J.; Du, M.; Gong, J.; et al. A Model to Determine Colorectal Cancer Risk Using Common Genetic Susceptibility Loci. Gastroenterology 2015, 148, 1330–1339.e14. [Google Scholar] [CrossRef] [PubMed]
  35. Fritsche, L.G.; Patil, S.; Beesley, L.J.; VandeHaar, P.; Salvatore, M.; Ma, Y.; Peng, R.B.; Taliun, D.; Zhou, X.; Mukherjee, B. Cancer PRSweb: An Online Repository with Polygenic Risk Scores for Major Cancer Traits and Their Evaluation in Two Independent Biobanks. Am. J. Hum. Genet. 2020, 107, 815–836. [Google Scholar] [CrossRef]
  36. Huyghe, J.R.; Bien, S.A.; Harrison, T.A.; Kang, H.M.; Chen, S.; Schmit, S.L.; Conti, D.V.; Qu, C.; Jeon, J.; Edlund, C.K.; et al. Discovery of common and rare genetic risk variants for colorectal cancer. Nat. Genet. 2019, 51, 76–87. [Google Scholar] [CrossRef]
  37. Kachuri, L.; Graff, R.E.; Smith-Byrne, K.; Meyers, T.J.; Rashkin, S.R.; Ziv, E.; Witte, J.S.; Johansson, M. Pan-cancer analysis demonstrates that integrating polygenic risk scores with modifiable risk factors improves risk prediction. Nat. Commun. 2020, 11, 6084. [Google Scholar] [CrossRef] [PubMed]
  38. Shi, Z.; Yu, H.; Wu, Y.; Lin, X.; Bao, Q.; Jia, H.; Perschon, C.; Duggan, D.; Helfand, B.T.; Zheng, S.L.; et al. Systematic evaluation of cancer-specific genetic risk score for 11 types of cancer in The Cancer Genome Atlas and Electronic Medical Records and Genomics cohorts. Cancer Med. 2019, 8, 3196–3205. [Google Scholar] [CrossRef]
  39. Graff, R.E.; Cavazos, T.B.; Thai, K.K.; Kachuri, L.; Rashkin, S.R.; Hoffman, J.D.; Alexeeff, S.E.; Blatchins, M.; Meyers, T.J.; Leong, L.; et al. Cross-cancer evaluation of polygenic risk scores for 16 cancer types in two large cohorts. Nat. Commun. 2021, 12, 970. [Google Scholar] [CrossRef]
  40. Archambault, A.N.; Su, Y.-R.; Jeon, J.; Thomas, M.; Lin, Y.; Conti, D.V.; Win, A.K.; Sakoda, L.C.; Lansdorp-Vogelaar, I.; Peterse, E.F.P.; et al. Cumulative Burden of Colorectal Cancer–Associated Genetic Variants Is More Strongly Associated with Early-Onset vs Late-Onset Cancer. Gastroenterology 2020, 158, 1274–1286.e12. [Google Scholar] [CrossRef]
  41. Schmit, S.L.; Edlund, C.K.; Schumacher, F.R.; Gong, J.; Harrison, T.A.; Huyghe, J.R.; Qu, C.; Melas, M.; Van Den Berg, D.J.; Wang, H.; et al. Novel Common Genetic Susceptibility Loci for Colorectal Cancer. JNCI J. Natl. Cancer Inst. 2019, 111, 146–157. [Google Scholar] [CrossRef]
  42. Dvorak, P.; Pesta, M.; Soucek, P. ABC gene expression profiles have clinical importance and possibly form a new hallmark of cancer. Tumor Biol. 2017, 39, 1010428317699800. [Google Scholar] [CrossRef] [Green Version]
  43. Hlavata, I.; Mohelnikova-Duchonova, B.; Vaclavikova, R.; Liska, V.; Pitule, P.; Novak, P.; Bruha, J.; Vycital, O.; Holubec, L.; Treska, V.; et al. The role of ABC transporters in progression and clinical outcome of colorectal cancer. Mutagenesis 2012, 27, 187–196. [Google Scholar] [CrossRef] [PubMed]
  44. Wang, B.; Wang, X.; Tseng, Y.; Huang, M.; Luo, F.; Zhang, J.; Liu, J. Distinguishing colorectal adenoma from hyperplastic polyp by WNT2 expression. J. Clin. Lab. Anal. 2021, 35, 1–10. [Google Scholar] [CrossRef]
  45. Zhang, Z.; Zhu, H.; Li, Q.; Gao, W.; Zang, D.; Su, W.; Yang, R.; Zhong, J. Gene Expression Profiling of Tricarboxylic Acid Cycle and One Carbon Metabolism Related Genes for Prognostic Risk Signature of Colon Carcinoma. Front. Genet. 2021, 12, 1–14. [Google Scholar] [CrossRef]
  46. Chang, L.C.; Hsu, Y.C.; Chiu, H.M.; Ueda, K.; Wu, M.S.; Kao, C.H.; Shen, T.L. Exploration of the Proteomic Landscape of Small Extracellular Vesicles in Serum as Biomarkers for Early Detection of Colorectal Neoplasia. Front. Oncol. 2021, 11, 1–12. [Google Scholar] [CrossRef]
  47. Frey, M.R.; Edelblum, K.L.; Mullane, M.T.; Liang, D.; Polk, D.B. The ErbB4 Growth Factor Receptor Is Required for Colon Epithelial Cell Survival in the Presence of TNF. Gastroenterology 2009, 136, 217–226. [Google Scholar] [CrossRef] [PubMed]
  48. Frey, M.R.; Hilliard, V.C.; Mullane, M.T.; Polk, D.B. ErbB4 promotes cyclooxygenase-2 expression and cell survival in colon epithelial cells. Lab. Investig. 2010, 90, 1415–1424. [Google Scholar] [CrossRef]
  49. Williams, C.S.; Bernard, J.K.; Beckler, M.D.; Almohazey, D.; Washington, M.K.; Smith, J.J.; Frey, M.R. ERBB4 is over-expressed in human colon cancer and enhances cellular transformation. Carcinogenesis 2015, 36, 710–718. [Google Scholar] [CrossRef] [PubMed]
  50. Jia, X.; Wang, H.; Li, Z.; Yan, J.; Guo, Y.; Zhao, W.; Gao, L.; Wang, B.; Jia, Y. HER4 promotes the progression of colorectal cancer by promoting epithelial-mesenchymal transition. Mol. Med. Rep. 2020, 21, 1779–1788. [Google Scholar] [CrossRef]
  51. Baiocchi, G.; Lopes, A.; Coudry, R.A.; Rossi, B.M.; Soares, F.A.; Aguiar, S.; Guimarães, G.C.; Ferreira, F.O.; Nakagawa, W.T. ErbB family immunohistochemical expression in colorectal cancer patients with higher risk of recurrence after radical surgery. Int. J. Colorectal Dis. 2009, 24, 1059–1068. [Google Scholar] [CrossRef]
  52. Chu, S.; Wang, H.; Yu, M. A putative molecular network associated with colon cancer metastasis constructed from microarray data. World J. Surg. Oncol. 2017, 15, 115. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Bae, J.A.; Yoon, S.; Park, S.Y.; Lee, J.H.; Hwang, J.E.; Kim, H.; Seo, Y.W.; Cha, Y.J.; Hong, S.P.; Kim, H.; et al. An unconventional KITENIN/ErbB4-mediated downstream signal of EGF upregulates c-Jun and the invasiveness of colorectal cancer cells. Clin. Cancer Res. 2014, 20, 4115–4128. [Google Scholar] [CrossRef] [PubMed]
  54. Ren, H.; Li, Z.; Tang, Z.; Li, J.; Lang, X. Long noncoding MAGI2-AS3 promotes colorectal cancer progression through regulating miR-3163/TMEM106B axis. J. Cell. Physiol. 2020, 235, 4824–4833. [Google Scholar] [CrossRef] [PubMed]
  55. Poursheikhani, A.; Abbaszadegan, M.R.; Nokhandani, N.; Kerachian, M.A. Integration analysis of long non-coding RNA (lncRNA) role in tumorigenesis of colon adenocarcinoma. BMC Med. Genomics 2020, 13, 108. [Google Scholar] [CrossRef] [PubMed]
  56. Yang, X.; Wu, S.; Li, X.; Yin, Y.; Chen, R. MAGI2-AS3 rs7783388 polymorphism contributes to colorectal cancer risk through altering the binding affinity of the transcription factor GR to the MAGI2-AS3 promoter. J. Clin. Lab. Anal. 2020, 34, 1–11. [Google Scholar] [CrossRef]
  57. Mlecnik, B.; Bindea, G.; Angell, H.K.; Sasso, M.S.; Obenauf, A.C.; Fredriksen, T.; Lafontaine, L.; Bilocq, A.M.; Kirilovsky, A.; Tosolini, M.; et al. Functional network pipeline reveals genetic determinants associated with in situ lymphocyte proliferation and survival of cancer patients. Sci. Transl. Med. 2014, 6, 228ra37. [Google Scholar] [CrossRef]
  58. Yang, Q.-X.; Liu, T.; Yang, J.-L.; Liu, F.; Chang, L.; Che, G.-L.; Lai, S.-Y.; Jiang, Y.-M. Low expression of NTF3 is associated with unfavorable prognosis in hepatocellular carcinoma. Int. J. Clin. Exp. Pathol. 2020, 13, 2280–2288. [Google Scholar]
  59. Liu, R.; Li, R.; Yu, H.; Liu, J.; Zheng, S.; Li, Y.; Ye, L. NTF3 Correlates With Prognosis and Immune Infiltration in Hepatocellular Carcinoma. Front. Med. 2021, 8, 795849. [Google Scholar] [CrossRef]
  60. Zhao, R.; Liu, Y.; Wu, C.; Li, M.; Wei, Y.; Niu, W.; Yang, J.; Fan, S.; Xie, Y.; Li, H.; et al. BRD7 Promotes Cell Proliferation and Tumor Growth Through Stabilization of c-Myc in Colorectal Cancer. Front. Cell Dev. Biol. 2021, 9, 659392. [Google Scholar] [CrossRef]
  61. Pelullo, M.; Nardozza, F.; Zema, S.; Quaranta, R.; Nicoletti, C.; Besharat, Z.M.; Felli, M.P.; Cerbelli, B.; d’Amati, G.; Palermo, R.; et al. Kras/ADAM17-Dependent Jag1-ICD Reverse Signaling Sustains Colorectal Cancer Progression and Chemoresistance. Cancer Res. 2019, 79, 5575–5586. [Google Scholar] [CrossRef]
  62. Kiriyama, K.; Hirohashi, Y.; Torigoe, T.; Kubo, T.; Tamura, Y.; Kanaseki, T.; Takahashi, A.; Nakazawa, E.; Saka, E.; Ragnarsson, C.; et al. Expression and function of FERMT genes in colon carcinoma cells. Anticancer Res. 2013, 33, 167–173. [Google Scholar]
  63. Huyghe, J.R.; Harrison, T.A.; Bien, S.A.; Hampel, H.; Figueiredo, J.C.; Schmit, S.L.; Conti, D.V.; Chen, S.; Qu, C.; Lin, Y.; et al. Genetic architectures of proximal and distal colorectal cancer are partly distinct. Gut 2021, 70, 1325–1334. [Google Scholar] [CrossRef] [PubMed]
  64. Passarelli, M.N.; Newcomb, P.A.; Makar, K.W.; Burnett-Hartman, A.N.; Potter, J.D.; Upton, M.P.; Zhu, L.C.; Rosenfeld, M.E.; Schwartz, S.M.; Rutter, C.M. Blood lipids and colorectal polyps: Testing an etiologic hypothesis using phenotypic measurements and Mendelian randomization. Cancer Causes Control 2015, 26, 467–473. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  65. Ma, Y.; Yang, Y.; Wang, F.; Zhang, P.; Shi, C.; Zou, Y.; Qin, H. Obesity and risk of colorectal cancer: A systematic review of prospective studies. PLoS ONE 2013, 8, e53916. [Google Scholar] [CrossRef]
  66. Saus, E.; Iraola-Guzmán, S.; Willis, J.R.; Brunet-Vega, A.; Gabaldón, T. Microbiome and colorectal cancer: Roles in carcinogenesis and clinical potential. Mol. Aspects Med. 2019, 69, 93–106. [Google Scholar] [CrossRef] [PubMed]
  67. Flemer, B.; Lynch, D.B.; Brown, J.M.R.; Jeffery, I.B.; Ryan, F.J.; Claesson, M.J.; O’Riordain, M.; Shanahan, F.; O’Toole, P.W. Tumour-associated and non-tumour-associated microbiota in colorectal cancer. Gut 2017, 66, 633–643. [Google Scholar] [CrossRef] [PubMed]
  68. Miyake, T.; Mori, H.; Yasukawa, D.; Hexun, Z.; Maehira, H.; Ueki, T.; Kojima, M.; Kaida, S.; Iida, H.; Shimizu, T.; et al. The Comparison of Fecal Microbiota in Left-Side and Right-Side Human Colorectal Cancer. Eur. Surg. Res. 2021, 62, 248–254. [Google Scholar] [CrossRef]
  69. Phipps, O.; Quraishi, M.N.; Dickson, E.A.; Steed, H.; Kumar, A.; Acheson, A.G.; Beggs, A.D.; Brookes, M.J.; Al-Hassi, H.O. Differences in the on-and off-tumor microbiota between right-and left-sided colorectal cancer. Microorganisms 2021, 9, 1108. [Google Scholar] [CrossRef]
  70. Yang, X.; Pan, Y.; Wu, W.; Qi, Q.; Zhuang, J.; Xu, J.; Han, S. Analysis of prognosis, genome, microbiome, and microbial metabolome in different sites of colorectal cancer. J. Transl. Med. 2019, 17, 1–22. [Google Scholar] [CrossRef]
  71. Clos-Garcia, M.; Garcia, K.; Alonso, C.; Iruarrizaga-Lejarreta, M.; D’amato, M.; Crespo, A.; Iglesias, A.; Cubiella, J.; Bujanda, L.; Falcón-Pérez, J.M. Integrative analysis of fecal metagenomics and metabolomics in colorectal cancer. Cancers 2020, 12, 1142. [Google Scholar] [CrossRef]
  72. Lu, Y.; Chen, J.; Zheng, J.; Hu, G.; Wang, J.; Huang, C.; Lou, L.; Wang, X.; Zeng, Y. Mucosal adherent bacterial dysbiosis in patients with colorectal adenomas. Sci. Rep. 2016, 6, 26337. [Google Scholar] [CrossRef]
  73. Luo, H.; Liu, L.; Zhao, J.-J.; Mi, X.-F.; Wang, Q.-J.; Yu, M. Effects of oxaliplatin on inflammation and intestinal floras in rats with colorectal cancer. Eur. Rev. Med. Pharmacol. Sci. 2020, 24, 10542–10549. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Mendelian Randomization results. The beta value and standard error are depicted for each method used. (A) Modifiable risk factors. (B) Bacterial phyla.
Figure 1. Mendelian Randomization results. The beta value and standard error are depicted for each method used. (A) Modifiable risk factors. (B) Bacterial phyla.
Cancers 14 04193 g001
Figure 2. Performance of different Polygenic Risk Score sets. For each parameter, the value and 95% of confidence interval are depicted. Cohen’s d, the effect size of the T-test and 95% of the confidence interval, <0.2 negligible effects, 0.2–0.5 small effect, 0.5–0.8 moderate effect, >0.8 large effects.
Figure 2. Performance of different Polygenic Risk Score sets. For each parameter, the value and 95% of confidence interval are depicted. Cohen’s d, the effect size of the T-test and 95% of the confidence interval, <0.2 negligible effects, 0.2–0.5 small effect, 0.5–0.8 moderate effect, >0.8 large effects.
Cancers 14 04193 g002
Figure 3. Distribution of polygenic risk score based on PGS000785 set. P, the p-value of the T-test. (A) Colorectal cancer. (B) According to the location. (C) Comparison with patients with irritable bowel disease. (D) Comparison with patients of main types of irritable bowel disease.
Figure 3. Distribution of polygenic risk score based on PGS000785 set. P, the p-value of the T-test. (A) Colorectal cancer. (B) According to the location. (C) Comparison with patients with irritable bowel disease. (D) Comparison with patients of main types of irritable bowel disease.
Cancers 14 04193 g003
Table 1. Demographics of the participants.
Table 1. Demographics of the participants.
CasesControls
N835940
Male (%)530 (63.47%)631 (67.13%)
Female (%)305 (36.53%)309 (32.87%)
Age (SE)73.54 (11.38)41.53 (11.79)
Stage
037 (4.43%)
I130 (15.57%)
II314 (37.61%)
III223 (26.71%)
IV105 (12.57%)
Undetermined26 (3.11%)
Location
Right170 (20.36%)
Left219 (26.23%)
Rectal235 (28.14%)
Unspecific211 (25.27%)
Table 2. Suggestive signals (p < 5 × 10−6) detected in colorectal cancer and the locations. Gene, gene where is located the SNP or nearest gene 100kb upstream or downstream from the SNP. OR, odds ratio. CI 95%, confidence interval of 95% of the odds ratio. Freq, frequency of A1 in Basque cohort. Freq EUR, frequency of A1 in European populations of 1 KG.
Table 2. Suggestive signals (p < 5 × 10−6) detected in colorectal cancer and the locations. Gene, gene where is located the SNP or nearest gene 100kb upstream or downstream from the SNP. OR, odds ratio. CI 95%, confidence interval of 95% of the odds ratio. Freq, frequency of A1 in Basque cohort. Freq EUR, frequency of A1 in European populations of 1 KG.
Lead SNPPositionGeneA1A2OR (CI 95%)p-ValueFreqFreq EUR
Colorectal cancer vs. controls
rs793747322:212815957ERBB4TC8.5 (3.4–21.0)4.5 × 10−60.0320.022
rs773172402:216091445Upstream of ABCA12 and ATICTC6.4 (3.1–13.2)5.8 × 10−70.0390.024
rs1164431464:142699393Downstream of IL15GA16.3 (5.0–53.8)4.4 × 10−60.0130.02
rs349319687:79055118MAGI2TG29.7 (7.1–124.3)3.4 × 10−60.0110.01
rs169396716:86289580LINC01081GA11.4 (4.1–32.1)3.9 × 10−60.0170.024
Right colon cancer vs. controls
rs30046811:69054715Downstream of DEPDC1TG11.8 (4.3–32.7)2.0 × 10−60.0620.073
rs774454701:226800066Downstream of STUM and ITPKBGC18.5 (5.3–64.5)4.8 × 10−60.0440.055
rs766537934:47962934CNGA1, LOC101927157GT21.7 (6.4–73.8)7.9 × 10−70.0280.036
rs1424447384:106095747TET2, TET2-AS1AG51.1 (9.6–270.9)3.8 × 10−60.0110.005
rs46963374:153602674TMEM154, LOC105377495AC35.8 (8.2–156.2)2.0 × 10−60.0230.023
rs1394325454:174624195 GA48.4 (9.6–244.9)2.7 × 10−60.0120.022
rs132110796:36977349FGD2GC43.9 (9.2–210.2)2.2 × 10−60.0190.012
rs1905910667:89988294GTPBP10AG40.6 (8.8–186.4)1.9 × 10−60.0170.011
rs757722328:83689525 TC15.8 (4.9–51.2)4.3 × 10−60.0390.045
rs1180252649:119407781ASTN2, LOC105376240TC25.7 (6.4–102.7)4.3 × 10−60.0260.022
rs1693348912:5572210NTF3TC34.9 (9.1–133.3)2.0 × 10−70.020.044
rs7826362018:72995680TSHZ1TC43.6 (9.2–207.9)2.2 × 10−60.0110.019
rs14845220219:2527577GNG7AG34.6 (8.3–144.8)1.2 × 10−60.0220.01
rs3591412919:48115566BICRATG56.2 (11.2–283.0)1.0 × 10−60.0130.009
rs2849519722:36050632APOL6TC39.9 (9.1–174.2)9.4 × 10−70.0230.017
rs11782038122:40738486Downstream of TNRC6B, upstream of ADSLAG37.0 (8.4–163.1)1.8 × 10−60.0130.028
Left colon cancer vs. controls
rs1120335252:23176856 TG39.4 (8.2–189.6)4.5 × 10−60.0170.015
rs1393670402:173950614MAP3K20TC33.0 (7.7–142.5)2.8 × 10−60.0190.014
rs727744689:137697318COL5A1CT15.1 (5.0–45.3)1.3 × 10−60.0350.051
rs11414441716:48116976ABCC12TC149.8 (20.2–1112.0)9.7 × 10−70.010.008
rs1772160017:27268513PHF12, LOC101927018AG25.9 (6.9–97.7)1.6 × 10−60.0370.053
rs14010726918:1828990 TC26.8 (6.6–109.2)4.4 × 10−60.0230.027
rs6209328518:49252189 AG12.8 (4.3–38.4)4.9 × 10−60.0440.035
Rectal cancer vs. controls
rs781449881:102199388LINC01709CT54.9 (11.2–268.4)7.6 × 10−70.0130.018
rs134037942:9785060Upstream of YWHAQ and ADAM17CT65.5 (12.0–355.9)1.3 × 10−60.0120.021
rs3548562:142433670LRP1B, LOC107985779CT17.4 (5.5–55.0)1.1 × 10−60.0270.062
rs1164431464:142699393Downstream of IL15GA40.3 (9.2–176.9)9.7 × 10−70.0130.02
rs729093996:86581045 TG74.7 (13.5–414.7)8.1 × 10−70.0140.03
rs715161148:784674DLGAP2CT5.2 (2.6–10.4)2.7 × 10−60.1110.112
rs6184809710:50134508WDFY4, LRRC18GA8.6 (3.5–21.0)2.9 × 10−60.0730.089
rs7747080214:27547598LOC105370420GT12.4 (4.2–36.5)4.6 × 10−60.0270.033
rs7679978214:91624544DGLUCYAG18.9 (5.4–65.4)3.8 × 10−60.0290.039
rs14155382416:50380386BRD7CT45.8 (10.4–202.4)4.5 × 10−70.0170.05
Left colon cancer vs. right colon cancer
rs46553031:213834643LOC105372912TA2.2 (1.6–3.0)3.6 × 10−60.430.377
rs6200570414:53465150Downstream of DDHD1, upstream of FERMT2AG0.4 (0.3–0.6)9.8 × 10−70.4640.503
Rectal cancer vs. colon cancer
rs731719067:147986529CNTNAP2TC2.2 (1.6–2.9)6.4 × 10−70.230.154
rs97730258:6674458XKR5GA0.5 (0.3–0.6)1.5 × 10−60.4140.468
rs7961956221:38742422DYRK1ACT2.7 (1.8–4.1)1.8 × 10−60.10.093
Table 3. Suggestive signals (p < 5 × 10−6) detected in the comparison of colorectal cancer and inflammatory bowel disease. Gene, gene where is located the SNP or nearest gene 100kb upstream or downstream from the SNP. OR, odds ratio. CI 95%, confidence interval of 95% of the odds ratio. Freq, frequency of A1 in Basque cohort. Freq EUR, frequency of A1 in European populations of 1 KG.
Table 3. Suggestive signals (p < 5 × 10−6) detected in the comparison of colorectal cancer and inflammatory bowel disease. Gene, gene where is located the SNP or nearest gene 100kb upstream or downstream from the SNP. OR, odds ratio. CI 95%, confidence interval of 95% of the odds ratio. Freq, frequency of A1 in Basque cohort. Freq EUR, frequency of A1 in European populations of 1 KG.
Lead SNPPositionGeneA1A2OR (CI 95%)p-ValueFreqFreq EUR
Colorectal cancer vs inflammatory bowel disease
rs354936871:41285292KCNQ4AC0.4 (0.3–0.6)4.2 × 10−60.1220.147
rs768452712:73665817ALMS1TG0.3 (0.2–0.5)2.9 × 10−60.0430.048
rs67388052:231083171SP110CT0.4 (0.3–0.6)4.6 × 10−70.1350.128
rs100077844:81977690BMP3CT0.5 (0.4–0.7)1.8 × 10−60.2280.222
rs1812066735:25834969 CG0.3 (0.1–0.5)4.1 × 10−60.0390.0467
rs728407406:18745458 CT0.1 (0.0–0.2)1.1 × 10−60.0140.03
rs92713656:32586794Downstream of HLA-DRB1 and upstream of HLA-DQA1GT1.8 (1.4–2.3)2.2 × 10−60.3530.388
rs9511976:103210765 CA0.5 (0.4–0.7)5.6 × 10−70.4760.446
rs18756648:827824DLGAP2GA2.3 (1.6–3.3)2.8 × 10−60.1280.161
rs98887410:27684660Downstream of PTCHD3AT0.5 (0.3–0.6)1.6 × 10−60.1740.157
rs54129515:50056050Upstream of ATP8B4GA0.2 (0.1–0.4)1.8 × 10−80.0550.022
Colorectal cancer vs. controls + inflammatory bowel disease
rs75504861:14777040KAZNCT0.6 (0.5–0.7)1.3 × 10−60.4980.475
rs1156819842:216032071Upstream of ABCA12 and ATICTC4.2 (2.4–7.1)2.6 × 10−70.0340.026
rs728407416:18747455 GA0.1 (0.0–0.2)1.8 × 10−60.0140.03
rs50021786:32611590HLA-DQA1GA0.6 (0.5–0.7)6.8 × 10−70.330.374
rs9511976:103210765 CA0.6 (0.5–0.7)2.4 × 10−70.4840.446
rs18756648:827824DLGAP2GA2.2 (1.6–3.0)3.24 × 10−70.1240.161
rs98887410:27684660Downstream of PTCHD3AT0.5 (0.3–0.6)2.0 × 10−60.1710.157
rs15084004914:59165709Downstream of DACT1CT0.1 (0.1–0.3)2.6 × 10−60.0250.052
rs54129515:50056050Upstream of ATP8B4GA0.2 (0.1–0.4)5.3 × 10−80.0450.022
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Garcia-Etxebarria, K.; Etxart, A.; Barrero, M.; Nafria, B.; Segues Merino, N.M.; Romero-Garmendia, I.; Franke, A.; D’Amato, M.; Bujanda, L. Performance of the Use of Genetic Information to Assess the Risk of Colorectal Cancer in the Basque Population. Cancers 2022, 14, 4193. https://doi.org/10.3390/cancers14174193

AMA Style

Garcia-Etxebarria K, Etxart A, Barrero M, Nafria B, Segues Merino NM, Romero-Garmendia I, Franke A, D’Amato M, Bujanda L. Performance of the Use of Genetic Information to Assess the Risk of Colorectal Cancer in the Basque Population. Cancers. 2022; 14(17):4193. https://doi.org/10.3390/cancers14174193

Chicago/Turabian Style

Garcia-Etxebarria, Koldo, Ane Etxart, Maialen Barrero, Beatriz Nafria, Nerea Miren Segues Merino, Irati Romero-Garmendia, Andre Franke, Mauro D’Amato, and Luis Bujanda. 2022. "Performance of the Use of Genetic Information to Assess the Risk of Colorectal Cancer in the Basque Population" Cancers 14, no. 17: 4193. https://doi.org/10.3390/cancers14174193

APA Style

Garcia-Etxebarria, K., Etxart, A., Barrero, M., Nafria, B., Segues Merino, N. M., Romero-Garmendia, I., Franke, A., D’Amato, M., & Bujanda, L. (2022). Performance of the Use of Genetic Information to Assess the Risk of Colorectal Cancer in the Basque Population. Cancers, 14(17), 4193. https://doi.org/10.3390/cancers14174193

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop