Next Article in Journal
The Phylogenetic Relationship of Lamiinae (Coleoptera: Cerambycidae) Using Mitochondrial Genomes
Previous Article in Journal
The Spherical Evolutionary Multi-Objective (SEMO) Algorithm for Identifying Disease Multi-Locus SNP Interactions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Association Analysis of Reproductive Traits in Chinese Holstein Cattle

1
Tianjin Key Laboratory of Animal Molecular Breeding and Biotechnology, Tianjin Engineering Research Center of Animal Healthy Farming, Institute of Animal Science and Veterinary, Tianjin Academy of Agricultural Sciences, Tianjin 300381, China
2
College of Animal Science and Veterinary Medicine, Tianjin Agricultural University, Tianjin 300384, China
3
Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
*
Author to whom correspondence should be addressed.
Current address: Tianjin Institute of Animal Science and Veterinary, Tianjin 300381, China.
Genes 2024, 15(1), 12; https://doi.org/10.3390/genes15010012
Submission received: 28 October 2023 / Revised: 15 December 2023 / Accepted: 17 December 2023 / Published: 20 December 2023
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

:
This study was to explore potential SNP loci for reproductive traits in Chinese Holstein cattle and identify candidate genes. Genome-wide Association Study based on mixed linear model was performed on 643 Holstein cattle using GeneSeek Bovine 50 K SNP chip. Our results detected forty significant SNP loci after Bonferroni correction. We identified five genes (VWC2L, STAT1, PPP3CA, LDB3, and CTNNA3) as being associated with pregnancy ratio of young cows, five genes (PAEP, ACOXL, EPAS1, GLRB, and MARVELD1) as being associated with pregnancy ratio of adult cows, and nine genes (PDE1B, SLCO1A2, ARHGAP26, ADAM10, APBB1, MON1B, COQ9, CDC42BPB, MARVELD1, and HPSE2) as being associated with daughter pregnancy rate. Our study may provide valuable insights into identifying genes related to reproductive traits and help promote the application of molecular breeding in dairy cows.

1. Introduction

Genome-wide Association Study (GWAS) was proposed by Risch and Merikan-gas in 1996 when studying the genetic base of human complex diseases [1]. GWAS applies a Single Nucleotide Polymorphism (SNP) as a molecular genetic marker at genome-wide level and identifies SNPs and candidate genes for target traits [2]. The analysis basis of GWAS is the phenomenon of linkage disequilibrium (LD) caused by recombination in the long-term evolution of natural populations [3]. According to the decay relationship between SNP locus and target trait, the SNP locus can be analyzed using statistics to identify those closely related to phenotypic variation. With the development of sequencing technology and high-throughput genotyping, GWAS has become an important method for analyzing genetic variation. Since the introduction of cattle high-density chips, there are more extensive and in-depth livestock-related GWAS research.
Reproductive traits are important economic traits of dairy cows, including calving interval (CI), calving ease (CE), heifer conception rate (HCR), cow conception rate (CCR), and daughter pregnancy rate (DPR). Among them, HCR, CCR, and DPR can more intuitively represent the reproductive ability of cows. In the past few decades, most intensive selection on milk production traits has led to a decline in the reproductive function of dairy cows, for example, CI, CE, HCR, CCR, and DPR, which indirectly affected the economic benefits [4,5]. Nowadays, the concept of balanced breeding has been popularized and recognized, and the reproductive traits of dairy cows have been given attention. With the continuous maturity of sequencing technology, GWAS studies on the reproductive traits of dairy cows are been reported successively, but there is still a lot of room for research. Olsen et al. [6] performed a GWAS analysis of 2552 Norwegian Red Bull groups from 108 bull families, and target traits mainly included stillbirth, dystocia, and 56-days non-return rate, and found 16 SNPs significantly associated with stillbirth and dystocia. Among them, SNP-rs29012179, located in BTA 12, affects both reproductive and milk production traits. Sahana et al. [7] analyzed on 11 reproductive traits using GWAS and identified 24 QTL regions on 14 chromosomes composed of highly significant SNPs in Danish and Swedish Holstein. Hering et al. [8] performed a GWAS analysis of semen quality with 320 bulls as a test population, finding 34 SNPs that reached genome-level significance. Huang et al. [9] studied the reproductive traits of Holstein cattle using the selective DNA pooling method and found twenty-two SNPs related to fertilization rate and five SNPs related to blastocyst rate. Liu et al. [10]. conducted genome-wide association analysis on the first birth age of HOST cattle and found genes including KLHL 4 Ls, TRAM 1, TRAM 2, ZNF438 and MATK were associated with reproductive disorders within 200 kb upstream and downstream of the significant SNP. Multiple SNP sites within the 1 Mb region of chromosome 7 were significantly associated with the day age of primary labor. However, GWAS analysis of reproductive traits including CI, CE, HCR, CCR, and DPR was not fully explored.
In this study, GWAS was used to explore SNP loci affecting important traits including HCR, CCR and DPR in dairy cows and detect the SNP loci associated with target traits.

2. Materials and Methods

2.1. Sample Genotype and Quality Control

In this study, the tail root blood was collected from 643 Holstein cows, and 3–5 drops of fresh blood from the tail root of dairy cows were drawn for DNA extraction. The first step involved cell lysis, exposing and rupturing membrane barriers such as cell and nuclear membranes. The next step was to remove membrane lipids from the sample; finally, DNA precipitation involved the removal of DNA-associated proteins by protease and removal of RNAs by RNase. The extracted DNA samples were quantitatively detected by a microspectrophotometer, adding DNA samples directly to the base, using two optical fibers to form the detection path, and with full-spectrum detection, the OD value of the DNA samples could be obtained. The DNA genome of Holstein cattle was scanned and analyzed using the Genomic Profiler Bovine 100 K chip containing 88,107 SNPs. Final genotyping data were obtained for the SNP. Quality control of individual data and SNP data in PLINK (v1.9) [11], began by first converting the SNP chip data into files that could be read by the PLINK (v1.9) software (*.map and *.Pad). The converted files included: individual ID, male parent ID (0 without record), female parent ID (0 without record), sex, SNP allele, chromosome, physical location, and genetic distance. A total of 637 individuals and 88,107 SNPs were obtained. The quality control conditions were as follows: (1) individual data with SNP genotype missing rates greater than 10% were filtered out; (2) SNP data with detection rate less than 90% were filtered out; (3) SNP data with a minimum allele frequency of less than 5% were removed; and (4) SNP data with Hardy-Weinberg equilibrium test P values less than 1.0 × 10−7 were removed.
After quality control, a total of 76,160 high-quality SNPs from 637 samples were used for subsequent GWAS analysis. The quality control command in PLINK software were as follows: PLINK- -bfile filename- -mind 0.1- -geno 0.1- -maf 0.05- -hwe 0.0000001- -make-bed- -out outfilename. The output file contained the individual data information and the SNP data information after quality control (Table 1). Information on the distribution of SNP markers on each chromosome before and after quality control is shown in Table 2.

2.2. Statistical Analysis

Principal component analysis (PCA) was evaluated with PLINK (v1.9). The population was corrected for PCA results, thus avoiding false positives [12] due to population stratification. GWAS analysis between reproductive traits and genome-wide SNPs is performed using a mixed linear model in the GEMMA (v0.98.5) software [13], establishing the model as follows:
y = μ + Xβ + Zv + e
In the formula: y is a vector of n × 1, for breeding values after removing genomic effects; μ represents the mean value of the trait phenotypic values; X is the mixed-effect matrix of n × q; β is a vector of q × 1, for the coefficient of the fixed effect; Z is the random polygene effect relationship matrix of n × t; v is the vector influenced by many factors, following the distribution of N (0, K σ α 2 ), where K is related to all SNP genotypes and σ α 2 additive variance, and is the relatedness matrix of t × t; e is the residual vector following the N (0, I σ α 2 ) distribution about the unknown variance σ α 2 .

2.3. Significance Test

Bonferroni correction was conducted for multiple tests. The significance level of the single test is: chromosome level (1/n), genome-wide level (0.05/n), where n is the number of SNP after quality control. Therefore, the chromosome level significant threshold is (1/76160 = 1.31 × 10−5), and the genome-wide level significant threshold is (0.05/76160 = 6.57 × 10−7).

2.4. Population Stratification Analysis

To determine whether the population stratification affected the association between SNPs and traits, Q-Q plots (Quantile-Quantile plot) were drawn using R v3.5.1 [14].

2.5. Screening and Annotation of the Candidate Genes

Based on National Animal Genome Research Program and National Center for Biotechnology databases, ARS-UCD 1.2 was used as the reference genome. Candidate genes for HCR, CCR, and DPR were searched within 50 kb the upstream or downstream of significant SNPs, and genes functions were annotated [15].

2.6. Enrichment Analysis of the Candidate Genes

The enrichment analysis of reproductive trait-related candidates obtained by GWAS analysis was conducted with the DAVID database, and identified the pathways in which these genes were involved.

2.7. Software and the Database Website Used in This Study

PLINK (v1.9): http://pngu.mgh.harvard.edu/~purcell/PLINK/ (accessed on 3 February 2023).
GEMMA (v0.98.5): https://github.com/genetics-statistics/GEMMA (accessed on 15 March 2023).
R (3.5.1): https://cran.r-project.org/mirrors.html (accessed on 18 March 2023).
The UCSC database: http://genome.ucsc.edu/ (accessed on 6 April 2023).
The NCBI database: http://www.ncbi.nlm.nih.gov/ (accessed on 6 April 2023).
ENSEMBL database: http://asia.ensembl.org/index.html (accessed on 11 May 2023).
Animal QTL database: http://www.animalgenome.org/cgi-bin/QTLdb/index/ (accessed on 6 April 2023).
The DAVID database: http://david.ncifcrf.gov/list.jsp (accessed on 26 August 2023).
The KEGG database: https://www.kegg.jp/ (accessed on 26 August 2023).

3. Results

3.1. Population Stratification

Figure 1 main component analysis revealed the sample population was divided into groups (PC1: 0~−1.5, PC2: 0~−0.05; PC1: −0.05~0, PC2: 0.2~0.25; PC1: −0.1~0.1, PC2: −0.1~0.05). Therefore, when performing association analysis on the whole genome after quality control, using population relatedness as a random effect relationship matrix in the mixed linear model can avoid false positive [16,17,18] of results due to population stratification. The Q-Q diagram was drawn with R (3.5.1), as shown in Figure 2, Figure 3 and Figure 4. The ordinate is −log10 (GWAS p-value observations), the abscissa is −log10 (GWAS p-value predictions), and the red line is straight line Y = X. It can be seen the distribution of the χ2 statistic calculated by the SNP association analysis does not deviate prematurely from the hypothesis test, indicating the analytical model used complies with this study. A deviation occurs on the upper right side of the figure, indicating SNP sites [19] are significantly correlated with HCR, CCR, and DPR, respectively.

3.2. Genome-Wide Association Study

After quality control, the genotypes of 88,107 SNPs were obtained in 637 cows. Genome-wide association analysis was conducted on HCR, CCR, and DPR in Chinese Holstein cattle, and a Manhattan plot was generated as shown in Figure 5, Figure 6 and Figure 7 (different colors represent the different chromosome numbers). After Bonferroni correction, eight SNPs were significantly associated with HCR at the chromosome level (P < 1/76160), eight SNPs were significantly associated with CCR at the chromosome level (P < 1/76160), and twenty-four SNPs were significantly associated with DPR at the chromosome level (P < 1/76160).

3.3. Significant SNPs and the Candidate Genes

After Bonferroni correction, 40 SNPs were found to be supported by statistical significance: eight SNPs significantly related to HCR were detected on chromosomes 2, 6, 12, and 28 (Figure 5); eight SNPs significantly connected to CCR were located on chromosomes 1, 8, 11, 17, and 26 (Figure 6); and twenty-four SNPs loci significantly associated with DPR were located on chromosomes 2, 5, 7, 10, 15, 18, 21, and 26 (Figure 7). These 40 significant SNPs were annotated, and the 20 closest candidate loci were obtained (Table 3).

3.4. Enrichment Analysis of the Candidate Genes

GO enrichment of reproductive trait candidate genes using DAVID database found 28 biological reactions were associated in vivo, each involving zero to fifteen genes (Figure 8). The KEGG analysis revealed no pathway where the candidate genes were located.

4. Discussion

In this study, we performed a genome-wide association analysis based on a mixed linear model of reproductive-related traits for a population using the GeneSeek Genomic Profiler Bovine 50 K SNP chip in 643 Chinese Holstein cattle. The PLINK software was used to perform the quality control and principal component analysis of the individual and genotype data. The principal component analysis found the sample population could be divided into three groups, which had an indicative role in avoiding the emergence of false positives and the selection of models due to population stratification [20]. In the mixed linear model, SNP effects and population stratification effects were used as fixed effects and kinship matrix as random effects for subsequent GWAS analysis, which greatly improved the accuracy of the association analysis [21,22].

4.1. Genome-Wide Association Study

We identified candidate genes for production traits. Candidate genes were identified by the NCBI and other databases; five candidate genes were associated with HCR: von Willebrand factor C domain containing 2 like (VWC2L), signal transducer and activator of transcription 1 (STAT1), protein phosphatase 3 catalytic subunit α (PPP3CA), LIM domain binding 3 (LDB3), and catenin α 3 (CTNNA3); five candidate genes associated with CCR: progestagen associated endometrial protein (PAEP), acyl-CoA oxidase like (ACOXL), endothelial PAS domain protein 1 (EPAS1), glycine receptor β (GLRB), and MARVEL domain containing 1 (MARVELD1); and ten candidate genes were associated with DPR: phosphodiesterase 1B (PDE1B), solute carrier organic anion transporter family member 1A2 (SLCO1A2), Rho GTPase activating protein 26 (ARHGAP26), ADAM metallopeptidase domain 10 (ADAM10), amyloid β precursor protein binding family B member 1 (APBB1), MON1 homolog B, secretory trafficking associated (MON1B), coenzyme Q9 (COQ9), CDC42 binding protein kinase β (CDC42BPB), MARVEL domain containing 1 (MARVELD1), and heparanase 2 (HPSE2).

4.1.1. Candidate Genes Associated with the HCR

VWC2L is located on chromosome 2 in dairy cows. VWC2L was previously identified as a member of the cysteine desmin (CKP) family. A previous study suggests it is a novel secreted protein that regulates the function of osteoblast cells and promotes matrix mineralization by regulating the Osterix expression of TGF-βsuperfamily growth factor signaling pathway [23]. Wang Kejun et al. [24] found VWC2L gene was significantly associated with food conversion rate, involved in the growth metabolism, and indirectly associated with the growth traits of pigs. Therefore, VWC2L promotes the growth and metabolism of Holstein cattle, and it promotes the pregnancy of young cows by accelerating the formation of fetal bone.
STAT1 was detected on chromosome 2. Signal transducers and activators of transcription (STAT) proteins are essential for the regulation of many biological processes. In cattle, microarray analysis identified STAT1 as differentially expressed genes in the implanted surrounding endometrium. To gain new insights regarding STAT1 during the estrous cycle and early pregnancy, Carvalho et al. [25] investigated STAT1 transcript and protein expression and its biological activity in bovine tissue and endometrial-derived cells. This study found pregnancy increased STAT1 expression on day 16, STAT1 was located in endometrial cells on day 20 of pregnancy, and increased binding of STAT1 to interferon regulatory factor 1 (IRF1), cytokine-induced SH2-containing protein (CISH), and cytokine signaling 1 and 3 (SOCS1, SOCS3) gene promoters was consistent with the induction of its transcripts. The high expression of STAT1 in endometrial cells during pregnancy confirmed its association with pregnancy in young cows.
The PPP3CA gene, a candidate gene for goat high reproductive traits revealed by genome-wide association study, is located on chromosome 6 in cattle and is significantly associated with the pregnancy matching rate in young cows. Yangyang et al. [26] explored the genetic variation of goat PPP3CA and evaluated the genetic impact on litter size. They found only a 20bp insertion-deletion polymorphism in northern Shaanxi white cashmere goats was significantly associated with litter number (p < 0.05), and individuals with deletion/deletion (DD) genotype showed a primary phenotype compared with individuals with other genotypes. This study demonstrated deletion mutations within the PPP3CA gene in goats significantly affected litter size. By UniProt comparison, the PPP3CA protein of Holstein cattle and goats are as high as 98.40%, and the PPP3CA gene, as a candidate gene for the pregnancy rate of young cows, may have some influence on the reproductive traits of Holstein cattle population.
LDB3, located on bovine chromosome 28, is a well-characterized striated PDZ-LIM protein that regulates mechanical stress signaling [27] by association with the mechanosensing domain in filamin C. CTNNA3, also located on bovine chromosome 28, is implicated with preferential expression of the maternal allele in early gestational placental tissue [28] and differentiated villous or extravillous trophoblast. Whether these two genes can be used as candidate genes in young cows needs further confirmation.

4.1.2. Candidate Genes Associated with the CCR

PAEP is located on chromosome 11 in cattle. PAEP is a progesterone-associated endometrial protein, and studies demonstrate this gene enhances cell proliferation through the receptor-mediated membrane IgM receptor, and is the main component in regulating cell proliferation in milk [29]. As a 28-kD glycoprotein, PAEP constitutively expresses in both the human reproductive tract and the hematopoietic system [30]. Therefore, PAEP promotes cell proliferation under the action of progesterone, which has a positive impact on the pregnancy matching rate in adult cows.
ACOXL is located on chromosome 11. Ana Paula Sbardella et al. [31] used genome-wide association analysis to study reproductive traits in Nelore cattle and found ACOXL was significantly associated with multiple reproductive traits, and this gene could also serve as a candidate gene affecting reproductive traits in Chinese Holstein cattle.
MARVELD1 is located on chromosome 26 of dairy cows, which is associated with CCR and DPR. By gene mapping of the mouse cell cycle, Zeng Fanli et al. [32] found overexpression of mouse MARVELD1 resulted in significant inhibition of cell proliferation, G1 arrest, and reduced cell migration. This study showed MARVELD1 was a microtubule-associated protein that played an important role in cell cycle progression and migration and had an effect on adult cows and daughter pregnancies.
EPAS1 is located on chromosome 11 in cattle and plays multiple supporting roles in maintaining specific aspects of adipocyte function, including regulation of glucose uptake and lipid synthesis [33]. GLRB (glycine receptor β), located on chromosome 17, is a regulator of motor neuron numbers [34]. These two genes may be candidates for the pregnancy rate in adult cows and require experimental validation.

4.1.3. Candidate Genes Associated with the DPR

PDE1B is located on bovine chromosome 5. Previous studies have shown the PDE1B gene could be regarded as a positional and functional candidate gene for carcass traits in beef cattle. Zhou Di et al. [35] used high-throughput sequencing to analyze the MyoD1 gene knockout transcriptome of MDBK cells (bovine kidney cells) and found PDE1B was highly expressed in the vine leg muscle, dorsal long muscle, and shoulder, which was related to muscle formation and differentiation. Therefore, PDE1B has a certain promoting effect on fetal muscle formation and differentiation, and it has a positive effect on daughter pregnancy, and PDE1B can be further identified as a candidate gene for DPR.
SLCO1A2 is located on bovine chromosome 5; it is a member of the solute carrier organic anion transporter family. Joachim Geyer et al. [36] encode two different full-length SLCO1A2 cDNA as 666 amino acid membrane proteins containing 12 putative transmembrane-spanning domains. Bovine SLCO1A2 expression was detected in the liver, kidney, brain, and adrenal glands. This study also confirmed bovine SLCO1A2 had functional homology with human SLCO1A2, which had a certain role in the growth and development of various fetal organs, and could be further verified as a candidate gene for DPR.
ADAM10 is located on bovine chromosome 10. It is found ADAM10 initiates the hydrolysis of regulatory membrane proteins by shedding the ectodomains of many different substrates, participates in cell-cell interaction and cell migration [37], has a certain role in the functional maintenance of organs and tissues during pregnancy, and can be further identified as a candidate gene related to DPR.
APBB1 is located on bovine chromosome 15. APBB1 is described as a bridging protein, acting in concert with the β- amyloid precursor protein (APP) and Tip60 (histone acetyltransferase). Studies have shown APBB1 played a dominant role in nuclear signaling [38]. This gene could be further identified as a candidate gene for DPR.
MON1B is located on bovine chromosome 18. It has been shown MON1B participated in blastocyst formation to promote embryonic development, and gene expression [39] existed in mature oocytes and embryos.
COQ9 is located on bovine chromosome 18. To explore the effects of single nucleotide polymorphisms in COQ9 on mitochondrial, ovarian function, and fertility in Holstein cows, M Sofia Ortega et al. [39] evaluated fertility phenotype measures of up to five lactation in a population of 2273 Holstein cows and showed the coenzyme Q9 mutation affected ovarian function. The A allele was associated with increased mitochondrial DNA copy number in oocytes, with overdominance effects on oocyte COQ9 expression, follicle number, and anti-mullerian hormone concentration. COQ9 is associated with fertility in Holstein cattle and can be a candidate gene influencing DPR.
ARHGAP26, CDC42BPB, HPSE2 are located on chromosomes 7, 21, and 26, respectively. The mechanism of these three genes related to pregnancy is not clear, and its effect on the pregnancy rate needs to be further explored.

4.2. Candidate-Gene Enrichment Analysis

GO term function enrichment and KEGG pathway enrichment for the overall different genes, do not distinguish which different gene is up- or down-regulated, and cannot reflect the overall form of activation or inhibition. However, these analyses can categorize genes according to the cell components, molecular function, and biological processes following a multiple gene set analysis, so that researchers can better understand gene function. KEGG pathway integration of the information of genomics, biochemistry, and system functional omics helps researchers to study gene and expression information as a whole, to better understand the mechanism of gene regulation, and also to judge whether the candidate genes of each trait are reasonable, and provide reference for subsequent identification tests.
In this study, GO enrichment found 28 biological reactions were correlated with reproductive traits, with each reaction involving zero to fifteen genes. The specific regulatory mechanism needs to be explored.

5. Conclusions

In this study, genome-wide association analysis of reproductive traits in 643 Chinese Holstein cattle population in Tianjin, based on a mixed linear model, identified multiple SNP loci significantly associated with the target trait and identified corresponding candidate genes, but most of the genes were not confirmed and a further identification is needed. This study provides the theoretical basis and breeding direction for the improvement of the breeding performance of Chinese Holstein cattle, and also lays a foundation for further mining and identifying the important gene loci of dairy cows in Tianjin, improving the molecular breeding theory and improving the production performance of dairy cows.

Author Contributions

J.L.: analyzed the data, wrote the article, and approved the final draft. L.X.: analyzed the data, wrote the article, and approved the final draft. X.D.: conceptualized and designed experiments, analyzed data, prepared numbers and/or tables, reviewed the draft paper, and approved the final draft. Y.M.: conceptualized and designed experiments, analyzed data, prepared numbers and/or tables, reviewed the draft paper, and approved the final draft. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Special Seed Industry Project of Tianjin Academy of Agricultural Sciences—Research on innovation and Efficient Breeding Technology of Livestock and poultry germplasm Resources (2023ZYCX011); Tianjin Science and Technology Plan Project—Integrated Demonstration of High Quality Beef Cattle Breeding (22ZYCGSN00020); Tianjin Science and Technology Plan Project—Key Technology Research and germplasm innovation of High-yield Milk Breeding (22ZXZYSN00020).

Institutional Review Board Statement

Experimental procedure was approved by the Animal Care and Use Committee of Animal Husbandry and Veterinary Research Institute of Tianjin, China. Data and model availability statement: The data were confidential and not deposited in an official repository.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

I would like to grant oceans of thanks to all my respected teachers and friends who have offered me cordial assistance. Especially Ma Yi, who provided me with project support. I would also like to thank Xu Lingyang for guiding my paper writing and helping me revise the presentation and paper format. It was difficult to finish this article smoothly without their help.

Conflicts of Interest

The author declares there is no competitive advantage.

References

  1. Risch, N.; Merikangas, K. The Future of Genetic Studies of Complex Human Diseases. Science 1996, 273, 1516–1517. [Google Scholar] [CrossRef] [PubMed]
  2. Iniguez_Luy, H.A.G.B. Association mapping of seed quality traits inBrassica napusl using GWAS and candidate QTL approaches. Mol. Breed. 2015, 35, 143. [Google Scholar]
  3. Hirschhorn, J.N.; Daly, M.J. Genome-wide association studies for common diseases and complex traits. Nat. Rev. Genet. 2005, 6, 95–108. [Google Scholar] [CrossRef] [PubMed]
  4. Hermas SA Young, C.W.; Rust, J.W. Genetic Relationships and Additive Genetic Variation of Productive and Reproductive Traits in Guernsey Dairy Cattle. J. Dairy Sci. 1987, 70, 1252–1257. [Google Scholar] [CrossRef] [PubMed]
  5. Ali, I.; Muhammad Suhail, S.; Shafiq, M. Heritability estimates and genetic correlations of various production and reproductive traits of different grades of dairy cattle realed under subtropical condition. Reprod. Domest. Anim. 2019, 54, 1023–1026. [Google Scholar] [CrossRef] [PubMed]
  6. Olsen, H.G.; Hayes, B.J.; Kent, M.P.; Nome, T.; Svendsen, M.; Lien, S. A genome wide association study for QTL affecting direct and maternal effects of stillbirth and dystocia in cattle. Anim. Genet. 2010, 41, 273–280. [Google Scholar] [CrossRef] [PubMed]
  7. Sahana, G.; Guldbrandtsen, B.; Thomsen, B.; Lund, M.S. Confirmation and fine-mapping of clinical mastitis and somatic cell score QTL in Nordic Holstein cattle. Anim. Genet. 2013, 44, 620–626. [Google Scholar] [CrossRef]
  8. Hering, D.; Olenski, K.; Kaminski, S. Genome-wide association study for poor sperm motility in Holstein-Friesian bulls. Anim. Reprod. Sci. 2014, 146, 89–97. [Google Scholar] [CrossRef]
  9. Huang, W.; Kirkpatrick, B.W.; Rosa, G.J.M.; Khatib, H. A genome-wide association study using selective DNA pooling identifies candidate markers for fertility in Holstein cattle. Anim. Genet. 2010, 41, 570–578. [Google Scholar] [CrossRef]
  10. Liu, A.; Guo, G.; Wang, Y. Genetic assessment and genome-wide association analysis of Holstein cattle in China. J. Anim. Husb. Vet. Med. 2015, 46, 373–381. [Google Scholar]
  11. Chang, C.C.; Chow, C.C.; Tellier, L.C.; Vattikuti, S.; Purcell, S.M.; Lee, J.J. Second-generation PLINK: Rising to the challenge of larger and richer datasets. GigaScience 2015, 4, 7. [Google Scholar] [CrossRef] [PubMed]
  12. Yang, J.; Weedon, M.N.; Purcell, S.; Lettre, G.; Estrada, K.; Willer, C.J.; Visscher, P.M. Genomic inflation factors under polygenic inheritance. Eur. J. Hum. Genet. 2011, 19, 807–812. [Google Scholar] [CrossRef] [PubMed]
  13. Zhou, X.; Stephens, M. Genome-wide efficient mixed-model analysis for association studies. Nat. Genet. 2012, 44, 821–824. [Google Scholar] [CrossRef] [PubMed]
  14. Jiang, L.; Liu, J.; Sun, D.; Ma, P.; Ding, X.; Yu, Y.; Zhang, Q. Genome Wide Association Studies for Milk Production Traits in Chinese Holstein Population. PLoS ONE 2010, 5, e13661. [Google Scholar] [CrossRef] [PubMed]
  15. Wang, T.; Gao, X.; Song, W.; Yao, D.; Chen, L.; Chen, C.; Ma, Y. Genome-wide association analysis of limb-hoof structure and breast morphology in Chinese Holstein cattle. J. Livest. Ecol. 2021, 42, 13–18. [Google Scholar]
  16. Pritchard, J.K.; Stephens, M.; Rosenberg, N.A.; Donnelly, P. Association Mapping in Structured Populations. Am. J. Hum. Genet. 2000, 67, 170–181. [Google Scholar] [CrossRef] [PubMed]
  17. Wang, W.Y.S.; Barratt, B.J.; Clayton, D.G.; Todd, J.A. Genome-wide association studies: Theoretical and practical concerns. Nat. Rev. Genet. 2005, 6, 109–118. [Google Scholar] [CrossRef]
  18. Ioannidis, J.P.A. Why Most Published Research Findings Are False. PLoS Med. 2005, 2, e124. [Google Scholar] [CrossRef]
  19. Li, Y. Genome-Wide Association Analysis of Biochemical Components of Blood in Chinese Holstein Dairy Cows. Master’s Thesis, Fujian Agriculture and Forestry University, Fuzhou, China, 2020. [Google Scholar]
  20. Vilhjálmsson, B.J.; Nordborg, M. The nature of confounding in genome-wide association studies. Nat. Rev. Genet. 2013, 14, 1–2. [Google Scholar] [CrossRef]
  21. Kim, S.; Xing, E.P. Statistical estimation of correlated genome associations to a quantitative traits network. PLoS Genet. 2009, 5, e1000587. [Google Scholar] [CrossRef]
  22. O’Reilly, P.F.; Hoggart, C.J.; Pomyen, Y.; Calboli, F.C.; Elliott, P.; Jarvelin, M.R.; Coin, L.J. Multi-Phen: Joint model of multiple phenotypes can increase discovery in GWAS. PLoS ONE 2012, 7, e34861. [Google Scholar] [CrossRef] [PubMed]
  23. Ohyama, Y.; Katafuchi, M.; Almehmadi, A.; Venkitapathi, S.; Jaha, H.; Ehrenman, J.; Morcos, J.; Aljamaan, R.; Mochida, Y. Modulation of matrix mineralization by Vwc2-like protein and its novel splicing isoforms. Biochem. Biophys. Res. Commun. 2012, 418, 12–16. [Google Scholar] [CrossRef] [PubMed]
  24. Wang, K.; Liu, D.; Hernandez-Sanchez, J.; Chen, J.; Liu, C.; Wu, Z.; Fang, M.; Li, N. Genome Wide Association Analysis Reveals New Production Trait Genes in a Male Duroc Population. PLoS ONE 2015, 10, e0139207. [Google Scholar] [CrossRef] [PubMed]
  25. Carvalho, A.V.; Eozenou, C.; Healey, G.D.; Forde, N.; Reinaud, P.; Chebrout, M.; Gall, L.; Rodde, N.; Padilla, A.L.; Delville, C.G.; et al. Analysis of STAT1 expression and biological activity reveals interferon-tau-dependent STAT1-regulated SOCS genes in the bovine endometrium. Reprod. Fertil. Dev. 2016, 28, 459–474. [Google Scholar] [CrossRef] [PubMed]
  26. Bai, Y.; Li, J.; Zhu, H.; Liu, J.; Dong, S.; Li, L.; Qu, L.; Chen, H.; Song, X.; Lan, X. Deletion mutation within the goat PPP3CA gene identified by GWAS significantly affects litter size. Reprod. Fertil. Dev. 2021, 33, 476–483. [Google Scholar] [CrossRef]
  27. Pathak, P.; Blech-Hermoni, Y.; Subedi, K.; Mpamugo, J.; Obeng-Nyarko, C.; Ohman, R.; Molloy, I.; Kates, M.; Hale, J.; Stauffer, S.; et al. Myopathy associated LDB3 mutation causes Z-disc disassembly and protein aggregation through PKCα and TSC2-mTOR downregulation. Commun. Biol. 2021, 4, 355. [Google Scholar] [CrossRef]
  28. van Dijk, M.; Mulders, J.; Könst, A.; Janssens, B.; van Roy, F.; Blankenstein, M.; Oudejans, C. Differential downregulation of αT-catenin expression in placenta: Trophoblast cell type-dependent imprinting of the CTNNA3 gene. Gene Expr. Patterns 2004, 5, 61–65. [Google Scholar] [CrossRef]
  29. Tai, C.S.; Chen, Y.Y.; Chen, W.L. β-Lactoglobulin Influences Human Immunity and Promotes Cell Proliferation. BioMed Res. Int. 2016, 2016, 7123587. [Google Scholar] [CrossRef]
  30. Kämäräinen, M.; Riittinen, L.; Seppälä, M.; Palotie, A.; Andersson, L.C. Progesterone-associated endometrial protein--a constitutive marker of human erythroid precursors. Blood 1994, 84, 467–473. [Google Scholar] [CrossRef]
  31. Sbardella, A.P.; Watanabe, R.N.; da Costa, R.M.; Bernardes, P.A.; Braga, L.G.; Baldi Rey, F.S.; Lôbo, R.B.; Munari, D.P. Genome-Wide Association Study Provides Insights into Important Genes for Reproductive Traits in Nelore Cattle. Animals 2021, 11, 1386. [Google Scholar] [CrossRef]
  32. Zeng, F.; Tian, Y.; Shi, S.; Wu, Q.; Liu, S.; Zheng, H.; Yue, L.; Li, Y. Identification of mouse MARVELD1 as a microtubule associated protein that inhibits cell cycle progression and migration. Mol. Cells 2011, 31, 267–274. [Google Scholar] [CrossRef] [PubMed]
  33. Shimba, S.; Wada, T.; Hara, S.; Tezuka, M. EPAS1 Promotes Adipose Differentiation in 3T3-L1 Cells. J. Biol. Chem. 2004, 279, 40946–40953. [Google Scholar] [CrossRef] [PubMed]
  34. Baguma-Nibasheka, M.; Fracassi, A.; Costain, W.J.; Moreno, S.; Kablar, B. Role of skeletal muscle in motor neuron development. Histol. Histopathol. 2016, 31, 699–719. [Google Scholar] [PubMed]
  35. Zhou, D.; Wang, Y.; Yang, R.; Wang, F.; Zhao, Z.; Wang, X.; Xie, L.; Tian, X.; Wang, G.; Li, B.; et al. The MyoD1 Promoted Muscle Differentiation and Generation by Activating CCND2 in Guanling Cattle. Animals 2022, 12, 2571. [Google Scholar] [CrossRef] [PubMed]
  36. Geyer, J.; Döring, B.; Failing, K.; Petzinger, E. Molecular cloning and functional characterization of the bovine (Bos taurus) organic anion transporting polypeptide Oatp1a2 (Slco1a2). Comp. Biochem. Physiol. Part B Biochem. Mol. Biol. 2004, 137, 317–329. [Google Scholar] [CrossRef]
  37. Wild-Bode, C.; Fellerer, K.; Kugler, J.; Haass, C.; Capell, A. A Basolateral Sorting Signal Directs ADAM10 to Adherens Junctions and Is Required for Its Function in Cell Migration. J. Biol. Chem. 2006, 281, 23824–23829. [Google Scholar] [CrossRef]
  38. Yang, Z.; Cool, B.H.; Martin, G.M.; Hu, Q. A Dominant Role for FE65 (APBB1) in Nuclear Signaling. J. Biol. Chem. 2006, 281, 4207–4214. [Google Scholar] [CrossRef]
  39. Ortega, M.S.; Wohlgemuth, S.; Tribulo, P.; Siqueira, L.G.B.; Null, D.J.; Cole, J.B.; Da Silva, M.V.; Hansen, P.J. A single nucleotide polymorphism in COQ9 affects mitochondrial and ovarian function and fertility in Holstein cows. Biol. Reprod. 2017, 96, 652–663. [Google Scholar] [CrossRef]
Figure 1. Principal component analysis of 643 Chinese Holstein cattle.
Figure 1. Principal component analysis of 643 Chinese Holstein cattle.
Genes 15 00012 g001
Figure 2. HCR Q-Q Figure.
Figure 2. HCR Q-Q Figure.
Genes 15 00012 g002
Figure 3. CCR Q-Q Figure.
Figure 3. CCR Q-Q Figure.
Genes 15 00012 g003
Figure 4. DPR Q-Q Figure.
Figure 4. DPR Q-Q Figure.
Genes 15 00012 g004
Figure 5. The Manhattan plot of the HCR.
Figure 5. The Manhattan plot of the HCR.
Genes 15 00012 g005
Figure 6. The Manhattan plot of the CCR.
Figure 6. The Manhattan plot of the CCR.
Genes 15 00012 g006
Figure 7. The Manhattan plot of the DPR.
Figure 7. The Manhattan plot of the DPR.
Genes 15 00012 g007
Figure 8. GO enrichment results for candidate genes for reproductive traits.
Figure 8. GO enrichment results for candidate genes for reproductive traits.
Genes 15 00012 g008
Table 1. Descriptive statistics for reproductive traits.
Table 1. Descriptive statistics for reproductive traits.
TraitNumberAverageMaximumMinimumVarianceStandard Deviation
HCR6370.814.1−2.41.301.14
CCR6370.624.9−3.11.691.30
DPR6370.000313.4−3.21.271.13
Table 2. Information on the distribution of SNP markers on each chromosome before and after quality control.
Table 2. Information on the distribution of SNP markers on each chromosome before and after quality control.
Before Quality ControlAfter Quality Control
ChromosomeLength (Mb)SNPs CountDensity (kb/snp)SNPs CountDensity (kb/snp)
1158.3555635.1469329.6
2137.1468834.2391728.6
3121.4450837.1374930.9
4120.8404933.5339028.1
5121.2452337.3373430.8
6119.5436436.5361730.3
7112.6390334.7323228.7
8113.4380533.6328929.0
9105.7369535.0318130.1
10104.3362634.8311629.9
11107.3380135.4321630.0
1291.2304433.4256328.1
1384.2306436.4261731.1
1484.6304536.0256130.3
1585.3311936.6267431.3
1681.7282634.6233828.6
1775.2266835.5228630.4
1866.0260539.5219733.3
1964.1272642.5222634.7
2072.0273738.0225331.3
2171.6257335.9216730.3
2261.4220135.8187830.6
2352.5211040.2177433.4
2462.7225936.0193430.8
2542.9172640.2144333.6
2651.7182335.3155130.0
2745.4169937.4148832.8
2846.3173537.5150632.5
2951.5187136.3157030.5
Table 3. Results of genome-wide association analysis of reproductive traits in Chinese Holstein cattle.
Table 3. Results of genome-wide association analysis of reproductive traits in Chinese Holstein cattle.
CharacterSNPChromosomeSite (bp)p ValueThe Gene Region Distance (bp)
HCRBovineHD02000296282102,699,0789.18 × 10−3VWC2L81,620
BovineHD0200022967279,537,8463.11 × 10−5STAT121,534
BTA-37416-no-rs623,807,2528.69 × 10−4PPP3CA36,965
BovineHD280007432841,387,8947.36 × 10−5LDB32984
BovineHD28000058912822,263,7286.23 × 10−5CTNNA310,962
CCRLGB_X14710_517411103,259,1437.68 × 10−3PAEP1719
BovineHD1100000394111,230,0123.72 × 10−4ACOXL6654
BovineHD4100008487112,827,4282.86 × 10−4EPAS173,158
BovineHD17000119521742,103,9304.12 × 10−4GLRB6465
BovineHD26000048432618,941,1264.61 × 10−5MARVELD168,791
DPRARS-USMARC-666525,527,6141.51 × 10−6PDE1B11,893
BovineHD0500025349588,911,7246.17 × 10−4SLCO1A222,340
BovineHD0700016212754,281,3136.66 × 10−4ARHGAP263543
Hapmap58695-rs290198991051,673,8751.95 × 10−5ADAM105385
BovineHD15000135311546,637,6061.29 × 10−5APBB111,182
UFL-rs41859871184,419,2241.94 × 10−5MON1B3866
COQ9_rs1093015861825,446,3234.27 × 10−4COQ92139
ARS-BFGL-NGS-1150622167,739,8848.39 × 10−3CDC42BPB18,754
ARS-BFGL-NGS-1159802620,196,7471.74 × 10−5HPSE261,344
BovineHD26000048432618,941,1264.61 × 10−5MARVELD168,791
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, J.; Xu, L.; Ding, X.; Ma, Y. Genome-Wide Association Analysis of Reproductive Traits in Chinese Holstein Cattle. Genes 2024, 15, 12. https://doi.org/10.3390/genes15010012

AMA Style

Liu J, Xu L, Ding X, Ma Y. Genome-Wide Association Analysis of Reproductive Traits in Chinese Holstein Cattle. Genes. 2024; 15(1):12. https://doi.org/10.3390/genes15010012

Chicago/Turabian Style

Liu, Jiashuang, Lingyang Xu, Xiangbin Ding, and Yi Ma. 2024. "Genome-Wide Association Analysis of Reproductive Traits in Chinese Holstein Cattle" Genes 15, no. 1: 12. https://doi.org/10.3390/genes15010012

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop