Next Article in Journal
The First Complete Chloroplast Genome of Campanula carpatica: Genome Characterization and Phylogenetic Diversity
Previous Article in Journal
The Association between CDKAL1 Gene rs10946398 Polymorphism and Post-Transplant Diabetes in Kidney Allograft Recipients Treated with Tacrolimus
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Multibreed Genome-Wide Association Study for Cattle Leukocyte Telomere Length

by
Alexander V. Igoshin
1,†,
Nikolay S. Yudin
1,†,
Grigorii A. Romashov
1 and
Denis M. Larkin
2,*
1
The Federal Research Center Institute of Cytology and Genetics, The Siberian Branch of the Russian Academy of Sciences (ICG SB RAS), 630090 Novosibirsk, Russia
2
Royal Veterinary College, University of London, London NW1 0TU, UK
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Genes 2023, 14(8), 1596; https://doi.org/10.3390/genes14081596
Submission received: 25 June 2023 / Revised: 26 July 2023 / Accepted: 2 August 2023 / Published: 7 August 2023
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

:
Telomeres are terminal DNA regions of chromosomes that prevent chromosomal fusion and degradation during cell division. In cattle, leukocyte telomere length (LTL) is associated with longevity, productive lifespan, and disease susceptibility. However, the genetic basis of LTL in this species is less studied than in humans. In this study, we utilized the whole-genome resequencing data of 239 animals from 17 cattle breeds for computational leukocyte telomere length estimation and subsequent genome-wide association study of LTL. As a result, we identified 42 significant SNPs, of which eight were found in seven genes (EXOC6B, PTPRD, RPS6KC1, NSL1, AGBL1, ENSBTAG00000052188, and GPC1) when using covariates for two major breed groups (Turano–Mongolian and European). Association analysis with covariates for breed effect detected 63 SNPs, including 13 in five genes (EXOC6B, PTPRD, RPS6KC1, ENSBTAG00000040318, and NELL1). The PTPRD gene, demonstrating the top signal in analysis with breed effect, was previously associated with leukocyte telomere length in cattle and likely is involved in the mechanism of alternative lengthening of telomeres. The single nucleotide variants found could be tested for marker-assisted selection to improve telomere-length-associated traits.

1. Introduction

The development of the genome-wide association study (GWAS) approach, in which the values of a phenotypic trait are compared with genome-wide variation in the studied population, has made it possible to improve the understanding of the genetic contribution to complex phenotypes [1,2]. Due to cost reduction, high-throughput whole-genome sequencing (WGS) is a popular alternative to array-based genotyping in GWASs [3]. Commercial SNP arrays mainly contain genetic variants, frequent in populations involved in their development [4]. Compared with array-based genotyping, whole-genome sequencing offers the advantage of analyzing rare and unique genetic variants. The rare variants in GWAS help solve the problem of complex traits’ “missing heritability” [5]. In addition, raw sequencing data can be used to evaluate quantitative characteristics of the genome, such as the average telomere length (TL) [6], copy number variation [7], mitochondrial or ribosomal DNA copy numbers [8,9], etc.
Telomeres are the DNA regions found at the ends of chromosomes. Together with specific nucleoprotein complexes, they safeguard the chromosomes from degradation during cell division [10]. Telomeric DNA consists of tandem repetitive DNA sequences (e.g., TTAGGG hexanucleotide in vertebrates) several thousand base pairs in length [11]. Since DNA polymerase does not fully replicate the 3′-ends of a DNA, telomeres are shortened by 50–200 nucleotides with each new cell division [12]. Two main antagonistic biochemical pathways regulate telomere length in normal cells. Firstly, a ribonucleoprotein complex called “telomerase” prevents telomere shortening through a reverse transcription reaction [13]. Secondly, the shelterin complex inhibits telomere elongation [14]. The 3′-end of the telomere, consisting of a guanine-rich single-stranded DNA region of 150–200 base pairs, is known to interact with the double-stranded region, forming the so-called T-loop. Since shelterin proteins stabilize the formation of the T-loop, repair proteins cannot recognize the terminal end of the chromosome. In addition, the T-loops inhibit telomerase activity by preventing access to the 3′ tail.
The issue with the end replication leads to the truncation of telomeric DNA during each cell division. As a result, telomere length shortens with age in humans and most other animal species [15,16], probably except naked mole rats (Heterocephalus glaber) [17]. Telomere shortening triggers the activation of DNA damage response, which involves cell aging and signaling pathways. Eventually, this results in cell cycle arrest, apoptosis, and progressive tissue atrophy [18,19]. Therefore, TL has been proposed as an individual’s life expectancy and health status predictor. Indeed, short TL correlates with reduced life expectancy in humans [20], mice [21], sheep [22], dogs [23], wild birds [24], and other species [25]. TL is associated with myocardial infarction, hypertension, Parkinson’s disease, cancer, diabetes, and other age-dependent pathological conditions in humans [26,27,28].
One of the goals of cattle selection is a productive lifespan [29]. However, selection progress is limited by complications in phenotype measurement and a low trait heritability [30]. Because dairy cows are usually culled when in poor health or fertility, their productive lifespan differs substantially from the longevity of humans and wild species [31]. TL, however, could be a proxy phenotype for productive lifespan and health measured at an early age, which would help to improve selection. Just like in most other animals, the telomeres of cattle tend to shorten as animals age [32,33]. It has been shown that adult Holstein cows with short telomeres are more likely to be culled in the subsequent year than cows with long telomeres [34]. In another study, productive life expectancy in Holsteins was positively correlated with TL at the age of one year [35]. Later, the same authors showed that the length of cattle’s productive life is associated with TL at birth [36]. Finally, they concluded that TL attrition was the best predictor of productive lifespan compared with an average TL or TL at the age of one year [37]. Apart from Holsteins, there is additional evidence for a link between TL and productive lifespan. According to Iannuzzi et al. [33], telomere length in the Agerolese cattle breed, showing a long productive lifespan, is significantly higher than in Holsteins, considering animals of the same age.
It is known that a multitude of age-independent pathological conditions related to immunity and fertility are associated with TL. TL at birth was associated with the risk of mastitis in Holstein cows [36]. Short TL was found to be over-represented among patients with unsuccessful pulmonary tuberculosis treatment outcomes [38] and patients with AIDS [39]. TL was also associated with several forms of female and male infertility in humans [40] and semen quality in bulls [41].
The TL trait is influenced by both genetic and environmental factors, making it a complex phenotype. It is known that TL differs significantly between Caenorhabditis elegans strains [42], inbred laboratory mouse strains [43], and dog [23] and cattle [33,44] breeds. TL heritability in Holstein–Friesian cattle ranges from 0.32 to 0.38 [35]. In another study, the monthly heritability of TL in Holsteins ranged from 0.36 to 0.47 [45]. A GWAS in Holstein–Friesian cattle identified 14 candidate genes for TL at birth and 9 at first lactation [36]. However, the genomic architecture of many quantitative traits in livestock differs substantially across various breeds [46,47]. Therefore, the GWAS of TL in non-Holstein breeds may help to gain a complete view of the genetic basis of TL variation in cattle.
Our study aimed to undertake seqGWAS among 17 cattle breeds, combining commercial and native populations with different genetic and environmental backgrounds to identify TL-associated genetic variants and confirm the effects of loci reported in previous studies.
Our recent work summarizing the results of 18 genome-wide association studies for telomere length in four species, including humans, showed that the overlap of candidate genes between species is virtually absent, except pot-2 (Caenorhabditis elegans) and POT1 (Homo sapiens) orthologs [48]. At the same time, multiple human studies show that the top associated genes (e.g., TERC, TERT, STN1, POT1, DCAF4, RTEL1, and NAF1) are consistently detected in samples from different ethnic groups [48]. At first sight, this observation could suggest that TL-control genes differ across animals but are the same within different populations (breeds) of the same species. To some extent, this assumption is reasonable. However, it should be remembered that the level of genetic variation between genes may differ and often is low for genes with high effect, thus affecting their detection by GWAS. Therefore, we hypothesize that relying on GWAS exclusively leads to underestimating the number of TL control genes shared by different species. As per our hypothesis, the study revealed that a few genes associated with telomeres were also related to telomere function in species other than cattle. In other species, however, this correlation has been identified through methods alternative to GWAS.

2. Materials and Methods

2.1. Samples and Sequencing

Whole-blood-extracted DNA genome resequencing data for 239 animals with known age and sex from 17 cattle breeds, of which 15 are reared in Russia, were used for variant calling and leukocyte telomere length estimation (Table S1). Thirteen of these breeds were fully or mostly of European origin (Alatau, Bestuzhev, Charolais, Hereford, Kholmogory, Kostroma, Kazakh Whiteheaded, Black Pied, Red Steppe, Holstein, Tagil, Ukrainian Grey, and Yaroslavl). Four breeds had predominantly Turano–Mongolian genetic backgrounds (Buryat, Kalmyk, Wagyu, and Yakut) [49]. Resequencing data for 86 individuals were downloaded from the NCBI database (see Table S1 for SRA IDs), whereas 153 samples were resequenced (paired-end 2 × 150 bp) by Illumina’s Novaseq6000 technology at Novogene Co., Ltd. (Hong Kong, China) to ~50 Gbp each (~15× coverage). The latter samples’ DNA was extracted using cell lysation and the phenol–chloroform method. Before sequencing, DNA quality was assessed with agarose gel electrophoresis and by measuring 260/280 ratio with a BioTek Epoch spectrophotometer.

2.2. Variant Calling and Data Filtering

The samples were processed using the 1000 Bull Genomes Project guideline [50]. Briefly, raw sequencing reads were cleaned with Trimmomatic v.0.38 [51] and then mapped to the reference cattle genome (ARS-UCD1.2) using BWA-MEM v.0.7.17 [52]. Duplicate reads were marked with Picard v.2.18.2 (http://broadinstitute.github.io/picard/ (accessed on 4 January 2023)), after which we performed a base quality score recalibration (using the 1000 Bull Genome Project’s ARS1.2PlusY_BQSR_v3 dataset). BAM files obtained after this step were used for leukocyte telomere length estimation. A variant calling procedure was performed with GATK v.3.8-1-0-gf15c1c3ef [53].
The gVCF files obtained were merged using the GenotypeGVCFs subroutine of the GATK package. The resulting VCF files were filtered to have only biallelic single nucleotide polymorphisms, to which a “hard filtering” procedure was then applied (GATK’s filtering expression “QD < 2.0||FS > 60.000||MQ < 40.00||MQRankSum < −12.5||ReadPosRankSum < −8.0”). Low-quality genotypes (GQ < 12) were removed using VCFtools v. 0.1.16 [54], and then loci were filtered for minor allele frequency (--maf 0.025) and missing call rate (--geno 0.1) using PLINK v.1.9 software [55]. As filtering for possible paralogous sequence variants, we removed SNPs with excessive heterozygosity (>70%) and high D-statistics values (|D| > 4) calculated using HDplot_python.py script [56].

2.3. Power Estimate

To check whether the sample size of 239 individuals was sufficient to detect significant associations with the TL phenotype, we performed power analysis with QUANTO v.1.2.4 software [57], assuming continuous outcome and additive mode of inheritance. Type I error rate (two-sided) was set to 1.2 × 10−7, as obtained from the effective number of tests (0.05/412,450), which was in turn estimated by LD pruning method (PLINK: --indep-pairwise 5000 100 0.3) [58] applied to 13,923,772 SNPs left after filtering.

2.4. Leukocyte Telomere Length Estimation

Leukocyte telomere length was estimated from BAM files using TelSeq software version 0.0.2 [59]. TL calculated by TelSeq is the cumulative length of telomere repeats and includes interstitial telomere repeats. We modified parameters in the TelSeq source code to adapt it to the cattle genome, which includes changing the number of chromosomal ends, read length, and total GC content (bp). The parameters TELOMERE_ENDS, READ_LENGTH, and GENOME_LENGTH_AT_TEL_GC were set equal to 60, 150, and 321,325,950, respectively. We calculated the latter parameter by measuring the total length of 150 base pair windows in the cattle genome (ARS-UCD1.2), with a GC content between 48% and 52%.

2.5. Association Analysis and Variant Annotation

A genome-wide association analysis was performed with EMMAX software (INTEL compiled version). EMMAX exploits a mixed linear model accounting for population stratification and relatedness between individuals [60]. Initial telomere length estimates were normalized using the “rankit” method [61], implemented in the “rcompanion” R package [62]. The transformed values were then used as a phenotype under study. The association analysis was performed for the 13,923,772 SNPs left after the filtering steps. The kinship matrix (BN matrix) was computed based on a subset of 45,212 SNPs obtained by LD pruning (PLINK: --indep-pairwise 5000 100 0.1).
We conducted two types of GWAS, both including age and sex as covariates, but differently dealing with the multibreed composition of our dataset. In the first type of analysis, one additional covariate accounting for the influence of phylogenetic origin (European/Turano–Mongolian) on telomere length was added. For the second test, we included covariates for each breed’s effect using a “dummy coding” scheme [63] to account for between-breed phenotypic differences. To estimate the inflation of test statistics, λGC factor was calculated [64]. To control for the multiple testing error rate, the false discovery rate (FDR) was applied using the “qvalue” R function [65]. Q-values of 0.05 and 0.10 were the thresholds for significant and suggestive levels, respectively. Single nucleotide polymorphisms showing significant or suggestive association levels were annotated with SnpEff software [66]. Where needed, a search for gene homologs was carried out using the BlastP algorithm [67]. If a candidate gene was associated with telomere length in another study, which utilized a different bovine assembly, the corresponding variant coordinates were remapped to the ARS-UCD1.2 positions using the LiftOver tool [68].
To confirm that the effects of candidate SNPs are independent of age and breed, we regressed the initial and normalized LTL values on age and breed covariates, obtained model residuals, which were further utilized as the adjusted phenotypes, and then analyzed their association with doses of minor alleles using the “lm” R function.

3. Results

3.1. Variant Calling and Data Filtering

In total, 53,240,581 polymorphisms were detected with the GATK HaplotypeCaller. The consecutive removal of indels, multiallelic sites, and low-quality variants resulted in 46,734,220, 45,835,735, and 41,705,108 SNPs, respectively. The filtering for missing genotype rate, minor allele frequency, heterozygosity, and D-statistics removed 7,187,088, 20,225,052, 46,283, and 322,913 SNPs, which resulted in 13,923,772 variants further used in association testing. Their genome distribution was similar to that in other mammals [69,70], with distal chromosome regions having higher SNP density (Figure S1).

3.2. Power Estimate

According to a previous GWAS in Holstein cattle, top SNPs with additive effect explained 10% and 11% of total phenotypic variance in telomere length [36]. Our statistical power estimation analysis showed that the sample size of 239 animals has the power of 0.39 and 0.49 to detect loci with such effects, respectively. This statistical power level is typically regarded as moderate and, in actual use, is enough to uncover associations between genotype and phenotype [71,72,73,74].

3.3. Leukocyte Telomere Length Estimation

The estimated leukocyte telomere lengths ranged from 4.2 Kbp to 84.3 Kbp (Table S1). The highest values were observed in the Kazakh Whiteheaded (mean length: 27.2 Kbp, mean age: 2.7 years), Hereford (22.5 Kbp, two years), and Buryat (22.7 Kbp, 3.3 years) breeds. The cattle breeds with the lowest telomere lengths were the Wagyu, measuring 6.9 Kbp and 2.3 years, and the Ukrainian Grey, measuring 6.1 Kbp and 8.2 years. Overall, the estimated telomere length was negatively correlated with the animal’s age (Pearson’s r = −0.21, p-value < 0.01; Spearman’s ρ = −0.305, p-value < 1 × 10−5).

3.4. Association Analysis and Variant Annotation

The QQ plots and λGC factor values suggest the absence of statistical inflation (Figure S2). Genome-wide association testing with breed covariates (Figure 1, Table S2) resulted in 63 statistically significant (q-value < 0.05) SNPs found on 21 chromosomes. The top association signal (BTA8:36516223, p-value = 2.78 × 10−11, q-value = 2.5 × 10−4) was found in the PTPRD gene. Of the 63 SNPs detected, 3 were downstream gene variants, 3 were upstream gene variants, 44 were found in intergenic regions, 12 were intronic variants, and 1 was a missense variant. The intron and missense SNPs were found in five protein-coding genes (intron: PTPRD, EXOC6B, RPS6KC1, and NELL1 and missense: ENSBTAG00000040318) (Table 1). At the suggestive significance level, there were 56 additional SNPs: downstream (1), intergenic (25), intron (29), and upstream (1) variants.
The analysis with breed group covariate identified 42 SNPs (Figure 1, Table S2) located on 15 chromosomes, with the strongest signal (BTA29:44612473, p-value = 1.89 × 10−15, q-value = 2.63 × 10−8) found in the CTSF-CCDC87 intergenic region. Overall, the SNPs found in this type of analysis were downstream gene variants (2), intergenic region variants (31), intron variants (8), and upstream gene variants (1). The intron variants were found in seven protein-coding genes (EXOC6B, PTPRD, NSL1, RPS6KC1, AGBL1, ENSBTAG00000052188, and GPC1) (Table 1). At the suggestive significance level, 52 additional SNPs were detected: downstream (3), upstream (2), intergenic (22), intron (24), and missense (1) variants.
Minor allele frequencies (MAF) of significant SNPs were in the range of 0.025–0.228 and 0.025–0.186 for the “breed effect” and “group effect” designs, respectively. However, most loci (84% in the former analysis and 95% in the latter) had MAF < 0.05. The number of breeds in which an associated SNP was polymorphic ranged from 2 to 17 (Tables S2 and S3). In general, when the minor allele frequency of an SNP is higher, it tends to segregate in a larger number of breeds.
We conducted a point-based association analysis on the top gene PTPRD, previously linked to leukocyte telomere length in cattle [36]. The analysis was performed using adjusted phenotypes to test SNPs. We selected two SNPs that were found to be significant in both types of our GWAS analysis, namely BTA8:36496551 and BTA8:36516223. Our analysis shows that the minor alleles of these SNPs are associated with reduced LTL independently of animals’ age and breed (Figure 2). This is observed both for initial and normalized LTL values. Other SNPs from the PTPRD gene should have similar effects, as they are in strong linkage disequilibrium (r2 = 0.85–1.0) with BTA8:36496551 and BTA8:36516223.

4. Discussion

In this study, we examine the genetic structure of telomere length in various taurine cattle breeds by analyzing their whole-genome resequencing data. This improves the likelihood of identifying causative variants compared with SNP arrays [75]. Including multiple breeds in the design could lead to more accurate QTL mapping, as the linkage disequilibrium patterns vary between individual breeds [76]. Also, our experiment was designed to point to alleles with effects across multiple breeds [77] and avoid breed- or family-specific effects [78]. Our dataset consists of animals from various populations and environments. Therefore, the variants identified may play a role in different environments where taurine cattle can be found.
Human studies have shown that the length of telomeres at birth is a reliable indicator of a person’s telomere length throughout their life, from childhood to adulthood [79]. As has been shown for Holstein cattle, however, there is only a weak correlation (Pearson’s r = 0.30) between leukocyte telomere length values at birth and at first lactation [36], which suggests that different genetic mechanisms could be involved at different ages, at least in this breed. Since our study involves animals of varying ages, we anticipate that some associated variants will have a lasting impact throughout their lifespan. Our SNPs could therefore be tested as markers of TL in breeding strategies.
The findings of this study agree with previously published data. According to a microarray-based GWAS in Holstein–Friesian cattle, the PTPRD gene was the closest to the most significant SNP associated with telomere length at first lactation [36]. In our study, SNPs from the PTPRD were detected in both experimental designs. Additionally, within our breed design, this gene exhibits the most significant statistical signal. However, according to remapping results, the distance between the variant from the abovementioned study and ours is 1.13 Mb. This falls within the range of distances between causal polymorphism and observed association signals [80]. This is even less surprising considering an extensive level of linkage disequilibrium in Holstein–Friesian cattle resulting from a high level of inbreeding and intense artificial selection in this breed [81]. The PTPRD is a tumor suppressor often inactivated and mutated in tumors [82,83]. The exact way this protein could affect telomere length is unknown but is likely related to the alternative lengthening of telomeres (ALT), representing a way to maintain telomeres unrelated to telomerase activity. Thus, according to Hartlieb et al. [84], deletions in PTPRD are enriched in ALT-positive tumors. The GWAS in humans confirms the participation of genes from the ALT pathway in telomere length variation, though their contribution is relatively minor [85].
All detected SNPs from the PTPRD gene segregate in at least four breeds: Charolais, Hereford, Kazakh Whiteheaded, and Red Steppe. In Holsteins, however, none of the associated SNPs were polymorphic. This may be connected to the limited number of animals engaged (15) and/or genetic differences between Holstein populations used in the study by Ilska-Warner et al. [36] and ours. As was confirmed by the additional analysis of adjusted phenotypes, the top PTPRD gene’s SNPs have breed- and age-independent modes of action. Therefore, we could propose them for further validation and subsequent marker-assisted selection to improve telomere-length-associated traits in the breeds where this haplotype segregates.
Another gene with evidence of possible involvement in TL maintenance is GPC1, which has been found in our “group effect” GWAS. This gene encodes for glypican-1 protein belonging to a family of heparan sulfate proteoglycans. Liu et al. [86] demonstrated that genes differentially expressed between GPC1 high- and low-expressing colon adenocarcinoma tumors are enriched by signaling pathways related to telomere maintenance.
The NSL1 gene may have a potential role in controlling telomere length. It encodes for the NSL1 component of the MIS12 kinetochore complex. As shown by Zhang et al. [87], instability of the MIS12 complex accelerates the senescence of human mesenchymal progenitor cells (hMPCs). Moreover, MIS12-knockout hMPCs have an approximately four-fold decrease in telomere length.
In addition to genes with significant SNPs, there were also potential gene candidates that showed association signals nearby and others that had SNPs at a suggestive significance level. Thus, we detected a significant SNP upstream of ANKRD17, a telomere-associated gene responsible for longevity in naked mole rats [88]. Another candidate is CPSF4, with an intronic variant detected at the suggestive level. According to Chen et al. [89], CPSF4 is a promoter-regulating protein of human telomerase reverse transcriptase (hTERT) capable of enhancing telomerase activity in lung cancer cells and normal lung cells.
The only missense variant found (BTA18:47718019, Lys1887Met) was in ENSBTAG00000040318—the uncharacterized gene. The ENSBTAG00000040318 gene shares 93.5% amino acid sequence identity with the WDR87 gene of Bubalus bubalis (92% cover). This gene encodes for testis development protein NYD-SP11 (https://www.genecards.org/cgi-bin/carddisp.pl?gene=WDR87 (accessed on 22 July 2023)). We found no link between WDR87 and telomeres. However, according to a comparative phylogenetic study by Muntane and colleagues [90], WDR87 is associated with lifespan in primates. The telomere shortening rate is closely linked to lifespan, implying that WDR87 may play a role in telomere biology [16].
Overall, the relation to telomere biology for genes identified in this study was previously shown in cattle (PTPRD), humans (GPC1, NSL1, CPSF4), and naked mole rats (ANKRD17). In agreement with our hypothesis, candidate genes with relation to telomeres in noncattle species had their possible involvement in telomere biology demonstrated by non-GWAS techniques. This stresses the importance of using distinct methodologies for understanding telomere-related mechanisms and pathways shared by animal species.
Considering the relatively small number of genes identified in this study, it is hard to resolve the predominant mechanisms and pathways underlying the genomic architecture of bovine leukocyte telomere length variation. Based on the information collected here and in previous studies, the PTPRD gene may significantly affect telomere length variation in cattle. This could indicate that alternative lengthening of telomeres plays a substantial role in this species. This is also supported by the fact that genes known to be directly related to telomerase have not been detected at the level of significance in our study. On the other hand, according to a study in humans, top association signals harbored by core components of proteins regulating the assembly and activity of the telomerase mostly come from common variants with small effects [85]. Therefore, it is possible that due to a smaller sample size leading to moderate statistical power, SNPs in such genes have not been detected in our study.

5. Conclusions

Our study utilizing whole-genome resequencing of diverse cattle samples reveals several genomic loci associated with bovine leukocyte telomere length. The top results coupled with the previous finding suggest that the PTPRD gene might be one of the major contributors to leucocyte TL variation in cattle. The detected variants could be further tested in marker-assisted selection to improve telomere-length-associated traits such as longevity, productive lifespan, stress resistance, and disease susceptibility in cattle.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/genes14081596/s1, Figure S1: The number of SNPs within 0.5 Mb window size; Figure S2: Quantile–quantile plot of GWAS analysis; Table S1: The list of animals engaged in the study and their corresponding descriptive information; Table S2: Single nucleotide polymorphisms detected at significant and suggestive levels using covariates for breed and group of breeds; Table S3: Animals’ genotypes for detected SNPs.

Author Contributions

Conceptualization, N.S.Y. and D.M.L.; methodology, N.S.Y.; formal analysis, A.V.I. and G.A.R.; resources, N.S.Y. and D.M.L.; writing—original draft preparation, A.V.I. and N.S.Y.; writing—review and editing, D.M.L.; project administration, N.S.Y.; funding acquisition, D.M.L. and N.S.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Russian Scientific Foundation (RSF) grant no. 22-26-00143, https://rscf.ru/project/22-26-00143/ (accessed on 23 June 2023).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The whole-genome sequence’s raw data are available from the corresponding author on a reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Tam, V.; Patel, N.; Turcotte, M.; Bossé, Y.; Paré, G.; Meyre, D. Benefits and Limitations of Genome-Wide Association Studies. Nat. Rev. Genet. 2019, 20, 467–484. [Google Scholar] [CrossRef] [PubMed]
  2. Marigorta, U.M.; Rodríguez, J.A.; Gibson, G.; Navarro, A. Replicability and Prediction: Lessons and Challenges from GWAS. Trends Genet. 2018, 34, 504–517. [Google Scholar] [CrossRef] [PubMed]
  3. McMahon, A.; Lewis, E.; Buniello, A.; Cerezo, M.; Hall, P.; Sollis, E.; Parkinson, H.; Hindorff, L.A.; Harris, L.W.; MacArthur, J.A.L. Sequencing-Based Genome-Wide Association Studies Reporting Standards. Cell Genom. 2021, 1, 100005. [Google Scholar] [CrossRef] [PubMed]
  4. Geibel, J.; Reimer, C.; Weigend, S.; Weigend, A.; Pook, T.; Simianer, H. How Array Design Creates SNP Ascertainment Bias. PLoS ONE 2021, 16, e0245178. [Google Scholar] [CrossRef]
  5. Génin, E. Missing Heritability of Complex Diseases: Case Solved? Hum. Genet. 2020, 139, 103–113. [Google Scholar] [CrossRef]
  6. Lee, M.; Napier, C.E.; Yang, S.F.; Arthur, J.W.; Reddel, R.R.; Pickett, H.A. Comparative Analysis of Whole Genome Sequencing-Based Telomere Length Measurement Techniques. Methods 2017, 114, 4–15. [Google Scholar] [CrossRef]
  7. Chen, J. Statistical Considerations on NGS Data for Inferring Copy Number Variations. Methods Mol. Biol. 2021, 2243, 27–58. [Google Scholar] [CrossRef]
  8. Battle, S.L.; Puiu, D.; Verlouw, J.; Broer, L.; Boerwinkle, E.; Taylor, K.D.; Rotter, J.I.; Rich, S.S.; Grove, M.L.; Pankratz, N.; et al. A Bioinformatics Pipeline for Estimating Mitochondrial DNA Copy Number and Heteroplasmy Levels from Whole Genome Sequencing Data. NAR Genom. Bioinform. 2022, 4, lqac034. [Google Scholar] [CrossRef]
  9. Hall, A.N.; Morton, E.; Queitsch, C. First Discovered, Long out of Sight, Finally Visible: Ribosomal DNA. Trends Genet. 2022, 38, 587–597. [Google Scholar] [CrossRef]
  10. Monaghan, P.; Ozanne, S.E. Somatic Growth and Telomere Dynamics in Vertebrates: Relationships, Mechanisms and Consequences. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 2018, 373, 20160446. [Google Scholar] [CrossRef]
  11. Jenner, L.P.; Peska, V.; Fulnečková, J.; Sýkorová, E. Telomeres and Their Neighbors. Genes 2022, 13, 1663. [Google Scholar] [CrossRef] [PubMed]
  12. Saretzki, G. Telomeres, Telomerase and Ageing. Subcell. Biochem. 2018, 90, 221–308. [Google Scholar] [CrossRef] [PubMed]
  13. Schrumpfová, P.P.; Fajkus, J. Composition and Function of Telomerase-A Polymerase Associated with the Origin of Eukaryotes. Biomolecules 2020, 10, 1425. [Google Scholar] [CrossRef] [PubMed]
  14. de Lange, T. Shelterin-Mediated Telomere Protection. Annu. Rev. Genet. 2018, 52, 223–247. [Google Scholar] [CrossRef] [PubMed]
  15. Shammas, M.A. Telomeres, Lifestyle, Cancer, and Aging. Curr. Opin. Clin. Nutr. Metab. Care 2011, 14, 28–34. [Google Scholar] [CrossRef]
  16. Whittemore, K.; Vera, E.; Martínez-Nevado, E.; Sanpera, C.; Blasco, M.A. Telomere Shortening Rate Predicts Species Life Span. Proc. Natl. Acad. Sci. USA 2019, 116, 15122–15127. [Google Scholar] [CrossRef] [PubMed]
  17. Adwan Shekhidem, H.; Sharvit, L.; Leman, E.; Manov, I.; Roichman, A.; Holtze, S.; Huffman, D.M.; Y Cohen, H.; Bernd Hildebrandt, T.; Shams, I.; et al. Telomeres and Longevity: A Cause or an Effect? Int. J. Mol. Sci. 2019, 20, 3233. [Google Scholar] [CrossRef]
  18. Jafri, M.A.; Ansari, S.A.; Alqahtani, M.H.; Shay, J.W. Roles of Telomeres and Telomerase in Cancer, and Advances in Telomerase-Targeted Therapies. Genome Med. 2016, 8, 69. [Google Scholar] [CrossRef]
  19. Aviv, A.; Shay, J.W. Reflections on Telomere Dynamics and Ageing-Related Diseases in Humans. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 2018, 373, 20160436. [Google Scholar] [CrossRef]
  20. Crocco, P.; De Rango, F.; Dato, S.; Rose, G.; Passarino, G. Telomere Length as a Function of Age at Population Level Parallels Human Survival Curves. Aging 2021, 13, 204–218. [Google Scholar] [CrossRef]
  21. Vera, E.; Bernardes de Jesus, B.; Foronda, M.; Flores, J.M.; Blasco, M.A. The Rate of Increase of Short Telomeres Predicts Longevity in Mammals. Cell Rep. 2012, 2, 732–737. [Google Scholar] [CrossRef] [PubMed]
  22. Wilbourn, R.V.; Moatt, J.P.; Froy, H.; Walling, C.A.; Nussey, D.H.; Boonekamp, J.J. The Relationship between Telomere Length and Mortality Risk in Non-Model Vertebrate Systems: A Meta-Analysis. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 2018, 373, 20160447. [Google Scholar] [CrossRef] [PubMed]
  23. Fick, L.J.; Fick, G.H.; Li, Z.; Cao, E.; Bao, B.; Heffelfinger, D.; Parker, H.G.; Ostrander, E.A.; Riabowol, K. Telomere Length Correlates with Life Span of Dog Breeds. Cell Rep. 2012, 2, 1530–1536. [Google Scholar] [CrossRef]
  24. Bichet, C.; Bouwhuis, S.; Bauch, C.; Verhulst, S.; Becker, P.H.; Vedder, O. Telomere Length Is Repeatable, Shortens with Age and Reproductive Success, and Predicts Remaining Lifespan in a Long-Lived Seabird. Mol. Ecol. 2020, 29, 429–441. [Google Scholar] [CrossRef]
  25. Liu, J.; Wang, L.; Wang, Z.; Liu, J.-P. Roles of Telomere Biology in Cell Senescence, Replicative and Chronological Ageing. Cells 2019, 8, 54. [Google Scholar] [CrossRef]
  26. Armanios, M. The Role of Telomeres in Human Disease. Annu. Rev. Genom. Hum. Genet. 2022, 23, 363–381. [Google Scholar] [CrossRef]
  27. Revy, P.; Kannengiesser, C.; Bertuch, A.A. Genetics of Human Telomere Biology Disorders. Nat. Rev. Genet. 2023, 24, 86–108. [Google Scholar] [CrossRef]
  28. Rossiello, F.; Jurk, D.; Passos, J.F.; d’Adda di Fagagna, F. Telomere Dysfunction in Ageing and Age-Related Diseases. Nat. Cell Biol. 2022, 24, 135–147. [Google Scholar] [CrossRef]
  29. Hu, H.; Mu, T.; Ma, Y.; Wang, X.; Ma, Y. Analysis of Longevity Traits in Holstein Cattle: A Review. Front. Genet. 2021, 12, 695543. [Google Scholar] [CrossRef]
  30. Zhang, H.; Liu, A.; Wang, Y.; Luo, H.; Yan, X.; Guo, X.; Li, X.; Liu, L.; Su, G. Genetic Parameters and Genome-Wide Association Studies of Eight Longevity Traits Representing Either Full or Partial Lifespan in Chinese Holsteins. Front. Genet. 2021, 12, 634986. [Google Scholar] [CrossRef]
  31. De Vries, A. Symposium Review: Why Revisit Dairy Cattle Productive Lifespan? J. Dairy Sci. 2020, 103, 3838–3845. [Google Scholar] [CrossRef] [PubMed]
  32. Miyashita, N.; Shiga, K.; Yonai, M.; Kaneyama, K.; Kobayashi, S.; Kojima, T.; Goto, Y.; Kishi, M.; Aso, H.; Suzuki, T.; et al. Remarkable Differences in Telomere Lengths among Cloned Cattle Derived from Different Cell Types. Biol. Reprod. 2002, 66, 1649–1655. [Google Scholar] [CrossRef] [PubMed]
  33. Iannuzzi, A.; Albarella, S.; Parma, P.; Galdiero, G.; D’Anza, E.; Pistucci, R.; Peretti, V.; Ciotola, F. Characterization of Telomere Length in Agerolese Cattle Breed, Correlating Blood and Milk Samples. Anim. Genet. 2022, 53, 676–679. [Google Scholar] [CrossRef] [PubMed]
  34. Brown, D.E.; Dechow, C.D.; Liu, W.S.; Harvatine, K.J.; Ott, T.L. Hot Topic: Association of Telomere Length with Age, Herd, and Culling in Lactating Holsteins. J. Dairy Sci. 2012, 95, 6384–6387. [Google Scholar] [CrossRef]
  35. Seeker, L.A.; Ilska, J.J.; Psifidi, A.; Wilbourn, R.V.; Underwood, S.L.; Fairlie, J.; Holland, R.; Froy, H.; Salvo-Chirnside, E.; Bagnall, A.; et al. Bovine Telomere Dynamics and the Association between Telomere Length and Productive Lifespan. Sci. Rep. 2018, 8, 12748. [Google Scholar] [CrossRef]
  36. Ilska-Warner, J.J.; Psifidi, A.; Seeker, L.A.; Wilbourn, R.V.; Underwood, S.L.; Fairlie, J.; Whitelaw, B.; Nussey, D.H.; Coffey, M.P.; Banos, G. The Genetic Architecture of Bovine Telomere Length in Early Life and Association With Animal Fitness. Front. Genet. 2019, 10, 1048. [Google Scholar] [CrossRef]
  37. Seeker, L.A.; Underwood, S.L.; Wilbourn, R.V.; Dorrens, J.; Froy, H.; Holland, R.; Ilska, J.J.; Psifidi, A.; Bagnall, A.; Whitelaw, B.; et al. Telomere Attrition Rates Are Associated with Weather Conditions and Predict Productive Lifespan in Dairy Cattle. Sci. Rep. 2021, 11, 5589. [Google Scholar] [CrossRef]
  38. Katoto, P.D.M.C.; Kayembe-Kitenge, T.; Pollitt, K.J.G.; Martens, D.S.; Ghosh, M.; Nachega, J.B.; Nemery, B.; Nawrot, T.S. Telomere Length and Outcome of Treatment for Pulmonary Tuberculosis in a Gold Mining Community. Sci. Rep. 2021, 11, 4031. [Google Scholar] [CrossRef]
  39. Auld, E.; Lin, J.; Chang, E.; Byanyima, P.; Ayakaka, I.; Musisi, E.; Worodria, W.; Davis, J.L.; Segal, M.; Blackburn, E.; et al. HIV Infection Is Associated with Shortened Telomere Length in Ugandans with Suspected Tuberculosis. PLoS ONE 2016, 11, e0163153. [Google Scholar] [CrossRef]
  40. Vasilopoulos, E.; Fragkiadaki, P.; Kalliora, C.; Fragou, D.; Docea, A.O.; Vakonaki, E.; Tsoukalas, D.; Calina, D.; Buga, A.M.; Georgiadis, G.; et al. The Association of Female and Male Infertility with Telomere Length (Review). Int. J. Mol. Med. 2019, 44, 375–389. [Google Scholar] [CrossRef]
  41. Iannuzzi, A.; Della Valle, G.; Russo, M.; Longobardi, V.; Albero, G.; De Canditiis, C.; Kosior, M.A.; Pistucci, R.; Gasparrini, B. Evaluation of Bovine Sperm Telomere Length and Association with Semen Quality. Theriogenology 2020, 158, 227–232. [Google Scholar] [CrossRef] [PubMed]
  42. Cook, D.E.; Zdraljevic, S.; Tanny, R.E.; Seo, B.; Riccardi, D.D.; Noble, L.M.; Rockman, M.V.; Alkema, M.J.; Braendle, C.; Kammenga, J.E.; et al. The Genetic Basis of Natural Variation in Caenorhabditis Elegans Telomere Length. Genetics 2016, 204, 371–383. [Google Scholar] [CrossRef] [PubMed]
  43. Manning, E.L.; Crossland, J.; Dewey, M.J.; Van Zant, G. Influences of Inbreeding and Genetics on Telomere Length in Mice. Mamm. Genome 2002, 13, 234–238. [Google Scholar] [CrossRef] [PubMed]
  44. Tilesi, F.; Di Domenico, E.G.; Pariset, L.; Bosco, L.; Willems, D.; Valentini, A.; Ascenzioni, F. Telomere Length Diversity in Cattle Breeds. Diversity 2010, 2, 1118–1129. [Google Scholar] [CrossRef]
  45. Seeker, L.A.; Ilska, J.J.; Psifidi, A.; Wilbourn, R.V.; Underwood, S.L.; Fairlie, J.; Holland, R.; Froy, H.; Bagnall, A.; Whitelaw, B.; et al. Longitudinal Changes in Telomere Length and Associated Genetic Parameters in Dairy Cattle Analysed Using Random Regression Models. PLoS ONE 2018, 13, e0192864. [Google Scholar] [CrossRef]
  46. Sun, X.; Niu, Q.; Jiang, J.; Wang, G.; Zhou, P.; Li, J.; Chen, C.; Liu, L.; Xu, L.; Ren, H. Identifying Candidate Genes for Litter Size and Three Morphological Traits in Youzhou Dark Goats Based on Genome-Wide SNP Markers. Genes 2023, 14, 1183. [Google Scholar] [CrossRef]
  47. Saatchi, M.; Beever, J.E.; Decker, J.E.; Faulkner, D.B.; Freetly, H.C.; Hansen, S.L.; Yampara-Iquise, H.; Johnson, K.A.; Kachman, S.D.; Kerley, M.S.; et al. QTLs Associated with Dry Matter Intake, Metabolic Mid-Test Weight, Growth and Feed Efficiency Have Little Overlap across 4 Beef Cattle Studies. BMC Genom. 2014, 15, 1004. [Google Scholar] [CrossRef]
  48. Ignatieva, E.V.; Yudin, N.S.; Larkin, D.M. Compilation and Functional Classification of Telomere Length-Associated Genes in Humans and Other Animal Species. Vavilovskii Zhurnal Genet. Sel. 2023, 27, 283–292. [Google Scholar] [CrossRef]
  49. Porter, V.; Alderson, L.; Hall, S.J.G.; Sponenberg, D.P. Mason’s World Encyclopedia of Livestock Breeds and Breeding, 2 Volume Pack; CABI: Wallingford, UK, 2016. [Google Scholar]
  50. 1000 Bull Genomes Project. 1000 Bulls GATK Fastq to GVCF Guidelines (GATKv3.8). 2019. Available online: http://www.1000bullgenomes.com/doco/1000bullsGATK3.8pipelineSpecifications_Run8_Revision_20191101.docx (accessed on 7 August 2021).
  51. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A Flexible Trimmer for Illumina Sequence Data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef]
  52. Li, H. Aligning Sequence Reads, Clone Sequences and Assembly Contigs with BWA-MEM. arXiv 2013, arXiv:1303.3997. [Google Scholar]
  53. McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M.; et al. The Genome Analysis Toolkit: A MapReduce Framework for Analyzing next-Generation DNA Sequencing Data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef] [PubMed]
  54. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T.; et al. The Variant Call Format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef]
  55. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef] [PubMed]
  56. McKinney, G.J.; Waples, R.K.; Seeb, L.W.; Seeb, J.E. Paralogs Are Revealed by Proportion of Heterozygotes and Deviations in Read Ratios in Genotyping-by-Sequencing Data from Natural Populations. Mol. Ecol. Resour. 2017, 17, 656–669. [Google Scholar] [CrossRef] [PubMed]
  57. Gauderman, W.J. Sample Size Requirements for Matched Case-Control Studies of Gene-Environment Interaction. Stat. Med. 2002, 21, 35–50. [Google Scholar] [CrossRef]
  58. Sobota, R.S.; Shriner, D.; Kodaman, N.; Goodloe, R.; Zheng, W.; Gao, Y.-T.; Edwards, T.L.; Amos, C.I.; Williams, S.M. Addressing Population-Specific Multiple Testing Burdens in Genetic Association Studies. Ann. Hum. Genet. 2015, 79, 136–147. [Google Scholar] [CrossRef]
  59. Ding, Z.; Mangino, M.; Aviv, A.; Spector, T.; Durbin, R. Estimating Telomere Length from Whole Genome Sequence Data. Nucleic Acids Res. 2014, 42, e75. [Google Scholar] [CrossRef] [PubMed]
  60. Kang, H.M.; Sul, J.H.; Service, S.K.; Zaitlen, N.A.; Kong, S.-Y.; Freimer, N.B.; Sabatti, C.; Eskin, E. Variance Component Model to Account for Sample Structure in Genome-Wide Association Studies. Nat. Genet. 2010, 42, 348–354. [Google Scholar] [CrossRef] [PubMed]
  61. Bliss, C.I. Statistics in Biology, Statistical Methods for Research in the Natural Sciences; McGraw-Hill Book Company: New York, NY, USA, 1967. [Google Scholar]
  62. Mangiafico, S.S. An R Companion for the Handbook of Biological Statistics, Version 1.3.2; Rutgers Cooperative Extension: New Brunswick, NJ, USA, 2015. [Google Scholar]
  63. Alkharusi, H. Categorical Variables in Regression Analysis: A Comparison of Dummy and Effect Coding. Int. J. Educ. 2012, 4, 202–210. [Google Scholar] [CrossRef]
  64. Price, A.L.; Zaitlen, N.A.; Reich, D.; Patterson, N. New Approaches to Population Stratification in Genome-Wide Association Studies. Nat. Rev. Genet. 2010, 11, 459–463. [Google Scholar] [CrossRef]
  65. Storey, J.D.; Bass, A.J.; Dabney, A.; Robinson, D. Qvalue: Q-Value Estimation for False Discovery Rate Control. 2021. Available online: https://github.com/StoreyLab/qvalue (accessed on 25 March 2021).
  66. Cingolani, P. Variant Annotation and Functional Prediction: SnpEff. Methods Mol. Biol. 2022, 2493, 289–314. [Google Scholar] [CrossRef]
  67. Johnson, M.; Zaretskaya, I.; Raytselis, Y.; Merezhuk, Y.; McGinnis, S.; Madden, T.L. NCBI BLAST: A Better Web Interface. Nucleic Acids Res. 2008, 36, W5–W9. [Google Scholar] [CrossRef] [PubMed]
  68. Kuhn, R.M.; Haussler, D.; Kent, W.J. The UCSC Genome Browser and Associated Tools. Brief. Bioinform. 2013, 14, 144–161. [Google Scholar] [CrossRef] [PubMed]
  69. Stoffel, M.A.; Johnston, S.E.; Pilkington, J.G.; Pemberton, J.M. Genetic Architecture and Lifetime Dynamics of Inbreeding Depression in a Wild Mammal. Nat. Commun. 2021, 12, 2972. [Google Scholar] [CrossRef] [PubMed]
  70. Liu, Z.; Li, H.; Zhong, Z.; Jiang, S. A Whole Genome Sequencing-Based Genome-Wide Association Study Reveals the Potential Associations of Teat Number in Qingping Pigs. Animals 2022, 12, 1057. [Google Scholar] [CrossRef]
  71. Tomaszewski, M.; Charchar, F.J.; Barnes, T.; Gawron-Kiszka, M.; Sedkowska, A.; Podolecka, E.; Kowalczyk, J.; Rathbone, W.; Kalarus, Z.; Grzeszczak, W.; et al. A Common Variant in Low-Density Lipoprotein Receptor-Related Protein 6 Gene (LRP6) Is Associated with LDL-Cholesterol. Arterioscler. Thromb. Vasc. Biol. 2009, 29, 1316–1321. [Google Scholar] [CrossRef]
  72. Ren, Y.; Wang, L.; Dai, H.; Qiu, G.; Liu, J.; Yu, D.; Liu, J.; Lyu, C.-Z.; Liu, L.; Zheng, M. Genome-Wide Association Analysis of Anti-TNF-α Treatment Response in Chinese Patients with Psoriasis. Front. Pharmacol. 2022, 13, 968935. [Google Scholar] [CrossRef]
  73. Monir, M.M.; Zhu, J. Comparing GWAS Results of Complex Traits Using Full Genetic Model and Additive Models for Revealing Genetic Architecture. Sci. Rep. 2017, 7, 38600. [Google Scholar] [CrossRef]
  74. Hayward, R.A.; Kent, D.M.; Vijan, S.; Hofer, T.P. Multivariable Risk Prediction Can Greatly Enhance the Statistical Power of Clinical Trial Subgroup Analysis. BMC Med. Res. Methodol. 2006, 6, 18. [Google Scholar] [CrossRef]
  75. Hayes, B.; Macleod, I.; Daetwyler, H.D.; Phil, B.; Chamberlain, A.; Vander Jagt, C.; Capitan, A.; Pausch, H.; Stothard, P.; Liao, X.; et al. Genomic Prediction from Whole Genome Sequence in Livestock: The 1000 Bull Genomes Project. In Proceedings of the 10th World Congress of Genetics Applied to Livestock Production, Vancouver, BC, Canada, 17–22 August 2014. [Google Scholar]
  76. Raven, L.-A.; Cocks, B.G.; Hayes, B.J. Multibreed Genome Wide Association Can Improve Precision of Mapping Causative Variants Underlying Milk Production in Dairy Cattle. BMC Genom. 2014, 15, 62. [Google Scholar] [CrossRef]
  77. Sevillano, C.A.; Ten Napel, J.; Guimarães, S.E.F.; Silva, F.F.; Calus, M.P.L. Effects of Alleles in Crossbred Pigs Estimated for Genomic Prediction Depend on Their Breed-of-Origin. BMC Genom. 2018, 19, 740. [Google Scholar] [CrossRef]
  78. Carvalheiro, R.; Costilla, R.; Neves, H.H.R.; Albuquerque, L.G.; Moore, S.; Hayes, B.J. Unraveling Genetic Sensitivity of Beef Cattle to Environmental Variation under Tropical Conditions. Genet. Sel. Evol. 2019, 51, 29. [Google Scholar] [CrossRef] [PubMed]
  79. Martens, D.S.; Van Der Stukken, C.; Derom, C.; Thiery, E.; Bijnens, E.M.; Nawrot, T.S. Newborn Telomere Length Predicts Later Life Telomere Length: Tracking Telomere Length from Birth to Child- and Adulthood. eBioMedicine 2021, 63, 103164. [Google Scholar] [CrossRef] [PubMed]
  80. Goddard, M.E.; Kemper, K.E.; MacLeod, I.M.; Chamberlain, A.J.; Hayes, B.J. Genetics of Complex Traits: Prediction of Phenotype, Identification of Causal Polymorphisms and Genetic Architecture. Proc. Biol. Sci. 2016, 283, 20160569. [Google Scholar] [CrossRef] [PubMed]
  81. Kim, E.-S.; Kirkpatrick, B.W. Linkage Disequilibrium in the North American Holstein Population. Anim. Genet. 2009, 40, 279–288. [Google Scholar] [CrossRef]
  82. Veeriah, S.; Brennan, C.; Meng, S.; Singh, B.; Fagin, J.A.; Solit, D.B.; Paty, P.B.; Rohle, D.; Vivanco, I.; Chmielecki, J.; et al. The Tyrosine Phosphatase PTPRD Is a Tumor Suppressor That Is Frequently Inactivated and Mutated in Glioblastoma and Other Human Cancers. Proc. Natl. Acad. Sci. USA 2009, 106, 9435–9440. [Google Scholar] [CrossRef]
  83. Szaumkessel, M.; Wojciechowska, S.; Janiszewska, J.; Zemke, N.; Byzia, E.; Kiwerska, K.; Kostrzewska-Poczekaj, M.; Ustaszewski, A.; Jarmuz-Szymczak, M.; Grenman, R.; et al. Recurrent Epigenetic Silencing of the PTPRD Tumor Suppressor in Laryngeal Squamous Cell Carcinoma. Tumour Biol. J. Int. Soc. Oncodev. Biol. Med. 2017, 39, 1010428317691427. [Google Scholar] [CrossRef]
  84. Hartlieb, S.A.; Sieverling, L.; Nadler-Holly, M.; Ziehm, M.; Toprak, U.H.; Herrmann, C.; Ishaque, N.; Okonechnikov, K.; Gartlgruber, M.; Park, Y.-G.; et al. Alternative Lengthening of Telomeres in Childhood Neuroblastoma from Genome to Proteome. Nat. Commun. 2021, 12, 1269. [Google Scholar] [CrossRef]
  85. Codd, V.; Wang, Q.; Allara, E.; Musicha, C.; Kaptoge, S.; Stoma, S.; Jiang, T.; Hamby, S.E.; Braund, P.S.; Bountziouka, V.; et al. Polygenic Basis and Biomedical Consequences of Telomere Length Variation. Nat. Genet. 2021, 53, 1425–1433. [Google Scholar] [CrossRef]
  86. Liu, Y.; Ren, H.; Yang, M.-Q.; Li, J.-Y. GPC1 Is Associated with Poor Prognosis and Treg Infiltration in Colon Adenocarcinoma. Comput. Math. Methods Med. 2022, 2022, 8209700. [Google Scholar] [CrossRef]
  87. Zhang, S.; Wu, Z.; Shi, Y.; Wang, S.; Ren, J.; Yu, Z.; Huang, D.; Yan, K.; He, Y.; Liu, X.; et al. FTO Stabilizes MIS12 and Counteracts Senescence. Protein Cell 2022, 13, 954–960. [Google Scholar] [CrossRef] [PubMed]
  88. Morgan, C.C.; Mc Cartney, A.M.; Donoghue, M.T.A.; Loughran, N.B.; Spillane, C.; Teeling, E.C.; O’Connell, M.J. Molecular Adaptation of Telomere Associated Genes in Mammals. BMC Evol. Biol. 2013, 13, 251. [Google Scholar] [CrossRef] [PubMed]
  89. Chen, W.; Qin, L.; Wang, S.; Li, M.; Shi, D.; Tian, Y.; Wang, J.; Fu, L.; Li, Z.; Guo, W.; et al. CPSF4 Activates Telomerase Reverse Transcriptase and Predicts Poor Prognosis in Human Lung Adenocarcinomas. Mol. Oncol. 2014, 8, 704–716. [Google Scholar] [CrossRef] [PubMed]
  90. Muntané, G.; Farré, X.; Rodríguez, J.A.; Pegueroles, C.; Hughes, D.A.; de Magalhães, J.P.; Gabaldón, T.; Navarro, A. Biological Processes Modulating Longevity across Primates: A Phylogenetic Genome-Phenome Analysis. Mol. Biol. Evol. 2018, 35, 1990–2004. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Manhattan plots for genome-wide association analysis with covariates for a group of breeds (A) and breed effect (B). Solid horizontal lines indicate a significant threshold (q-value = 0.05). In red are genes harboring associated SNPs. In brown are genes with significant upstream or downstream variants.
Figure 1. Manhattan plots for genome-wide association analysis with covariates for a group of breeds (A) and breed effect (B). Solid horizontal lines indicate a significant threshold (q-value = 0.05). In red are genes harboring associated SNPs. In brown are genes with significant upstream or downstream variants.
Genes 14 01596 g001
Figure 2. Boxplots illustrating the effect of two top SNPs from the PTPRD gene using the raw and normalized telomere lengths adjusted for age and breed.
Figure 2. Boxplots illustrating the effect of two top SNPs from the PTPRD gene using the raw and normalized telomere lengths adjusted for age and breed.
Genes 14 01596 g002
Table 1. Genic SNPs significantly associated with LTL in at least one type of analysis.
Table 1. Genic SNPs significantly associated with LTL in at least one type of analysis.
Position (BTA)Ref/Alt AlleleGeneVariant Typeq-Value 1q-Value 2
3:119873308A */GGPC1intron variant3.31 × 10−2NA
8:36400561C */TPTPRDintron variantNA1.45 × 10−2
8:36433838T */GPTPRDintron variantNA2.79 × 10−2
8:36441863A */CPTPRDintron variantNA2.79 × 10−2
8:36496551G */TPTPRDintron variant8.15 × 10−32.57 × 10−3
8:36516223G */APTPRDintron variant1.59 × 10−32.5 × 10−4
8:36554362G */APTPRDintron variantNA2.5 × 10−2
8:36583129A */GPTPRDintron variantNA3.03 × 10−2
11:12087073T/G *EXOC6Bintron variant1.77 × 10−66.21 × 10−3
16:70386226T/C *RPS6KC1intron variant6.22 × 10−33.59 × 10−2
16:70795459G */TNSL1intron variant3.48 × 10−38.89 × 10−2
18:47718019T */AENSBTAG00000040318missense variant7.42 × 10−21.93 × 10−2
21:17460307G/A *AGBL1intron variant1.07 × 10−28.51 × 10−2
21:65476950C/T *ENSBTAG00000052188intron variant2.44 × 10−2NA
29:24118907T */CENSBTAG00000048576 (NELL1)intron variantNA2.33 × 10−3
29:24142137A */GENSBTAG00000048576 (NELL1)intron variantNA4.79 × 10−3
29:24168343A */GENSBTAG00000048576 (NELL1)intron variantNA1.17 × 10−2
*—Allele contributing to higher LTL. 1—Design with covariate for groups of breeds. 2—Design with covariates for individual breeds.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Igoshin, A.V.; Yudin, N.S.; Romashov, G.A.; Larkin, D.M. A Multibreed Genome-Wide Association Study for Cattle Leukocyte Telomere Length. Genes 2023, 14, 1596. https://doi.org/10.3390/genes14081596

AMA Style

Igoshin AV, Yudin NS, Romashov GA, Larkin DM. A Multibreed Genome-Wide Association Study for Cattle Leukocyte Telomere Length. Genes. 2023; 14(8):1596. https://doi.org/10.3390/genes14081596

Chicago/Turabian Style

Igoshin, Alexander V., Nikolay S. Yudin, Grigorii A. Romashov, and Denis M. Larkin. 2023. "A Multibreed Genome-Wide Association Study for Cattle Leukocyte Telomere Length" Genes 14, no. 8: 1596. https://doi.org/10.3390/genes14081596

APA Style

Igoshin, A. V., Yudin, N. S., Romashov, G. A., & Larkin, D. M. (2023). A Multibreed Genome-Wide Association Study for Cattle Leukocyte Telomere Length. Genes, 14(8), 1596. https://doi.org/10.3390/genes14081596

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop