Next Article in Journal
Methane Emission, Carbon Footprint and Productivity of Specialized Dairy Cows Supplemented with Bitter Cassava (Manihot esculenta Crantz)
Previous Article in Journal
Evaluation of Soybean Ingredients in Pet Foods Applications: Systematic Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Descriptive Statistics and Genome-Wide Copy Number Analysis of Milk Production Traits of Jiangsu Chinese Holstein Cows

1
Joint International Research Laboratory of Agriculture and Agri-Product Safety, Yangzhou University, Yangzhou 225009, China
2
College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
3
Center for Quantitative Genetics and Genomics, Aarhus University, 8000 Aarhus C, Denmark
4
International Joint Research Laboratory, Universities of Jiangsu Province of China for Domestic Animal Germplasm Resources and Genetic Improvement, Yangzhou 225009, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Animals 2024, 14(1), 17; https://doi.org/10.3390/ani14010017
Submission received: 24 October 2023 / Revised: 5 December 2023 / Accepted: 18 December 2023 / Published: 19 December 2023
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

:

Simple Summary

The aim of this study was to describe the basic status of milk production traits of Chinese Holstein cows in Jiangsu, and locate the QTLs and functional genes that affect the five milk production traits (milk yield, milk fat percentage, milk fat yield, milk protein percentage, and milk protein yield) of first-born cows. The five milk production traits showed significant phenotypic positive correlation and high genetic positive correlation among the three parities, and we identified 1731 CNVs and 236 CNVRs in the 29 autosomes of 4173 Holstein dairy cows, and 19 CNVRS were significantly associated with milk production traits. The study introduced the basic status of the milk production traits of Chinese Holstein cows in Jiangsu province, and 19 CNVRs were identified to be significantly related to milk production traits of Chinese Holstein cows in the first fetus. This study will provide a theoretical basis for molecular marker-assisted selection of dairy cows, help to analyze the genetic structure of milk production traits of dairy cows, and contribute to the genetic improvement of Chinese Holstein cattle in Jiangsu.

Abstract

Milk production traits are the most important quantitative economic traits in dairy cow production; improving the yield and quality of milk is an important way to ensure the production efficiency of the dairy industry. This study carried out a series of in-depth statistical genetics studies and molecular analyses on the Chinese Holstein cows in the Jiangsu Province, such as descriptive statistics and copy number variation analysis. A genetic correlation, phenotypic correlation, and descriptive statistical analysis of five milk production traits (milk yield, milk fat percentage, milk fat yield, milk protein percentage, and milk protein yield) of the dairy cows were analyzed using the SPSS and DMU software. Through quality control, 4173 cows and their genomes were used for genomic study. Then, SNPs were detected using DNA chips, and a copy number variation (CNV) analysis was carried out to locate the quantitative trait loci (QTL) of the milk production traits by Perl program software Penn CNV and hidden Markov model (HMM). The phenotypic means of the milk yield, milk fat percentage, milk fat mass, milk protein percentage, and milk protein mass at the first trimester were lower than those at the other trimesters by 8.821%, 1.031%, 0.930%, 0.003%, and 0.826%, respectively. The five milk production traits showed a significant phenotypic positive correlation (p < 0.01) and a high genetic positive correlation among the three parities. Based on the GGPBovine 100 K SNP data, QTL-detecting research on the fist-parity milk performance of dairy cows was carried out via the CNV. We identified 1731 CNVs and 236 CNVRs in the 29 autosomes of 984 Holstein dairy cows, and 19 CNVRs were significantly associated with the milk production traits (p < 0.05). These CNVRs were analyzed via a bioinformatics analysis; a total of 13 gene ontology (GO) terms and 20 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were significantly enriched (p < 0.05), and these terms and pathways are mainly related to lipid metabolism, amino acid metabolism, and cellular catabolic processes. This study provided a theoretical basis for the molecular-marker-assisted selection of dairy cows by developing descriptive statistics on the milk production traits of dairy cows and by locating the QTL and functional genes that affect the milk production traits of first-born dairy cows. The results describe the basic status of the milk production traits of the Chinese Holstein cows in Jiangsu and locate the QTL and functional genes that affect the milk production traits of the first-born cows, providing a theoretical basis for the molecular-marker-assisted selection of dairy cows.

1. Introduction

Milk production traits are the most important economic traits of dairy cows, including milk yield, milk fat percentage, milk fat amount, milk protein percentage, and milk protein amount [1]. Milk production is a quantitative trait, and its performance is influenced by heredity and the environment. The genetic effect can be divided into the additive effect, epistatic effect, and dominant effect [2]. The additive effect is the accumulation of the genotype values of multiple minor genes that affect the quantitative traits, which can be truly inherited by offspring. Therefore, it is also called the breeding value of traits, and it is the main component of the phenotypic value of traits [3]. The main purpose of dairy breeding is to breed the individuals with the best genetic performance (the highest breeding value) through the selection of the economic traits, health traits, and reproductive traits of dairy cows so that their offspring, as a whole, contain more desired genes or contain a higher probability of desired genotypes than the current selection. Thus, the purpose of improving the genetic advantage of individual traits and the population production level can be achieved [4]. In early dairy breeding work, people mainly relied on the methods of Dairy Herd Improvement (DHI) and progeny determination to evaluate the production performance and genetic improvement of the dairy herd. Although effective, there are also problems, such as long generation intervals, high costs, and slow genetic progress, in the face of difficult-to-measure traits [5]. With the development of the quantitative genetics theory and practice as well as the rapid progress of computer technology and biotechnology, dairy workers have made genetic improvements to the economic traits of dairy cows based on traditional genetics combined with the theories and achievements of molecular genetics and bioinformatics [6].
Since it was recognized that single-nucleotide polymorphisms (SNPs) have the function of regulating the expression of single or multiple genes for a variety of important economic traits in dairy cows, cow genetics have made great progress [7,8]. However, recent studies have shown that genomic variations are caused by SNPs, with a significant impact on the performance of various traits of dairy cows, such as their milk yield, reproduction, health, and feed efficiency [9,10]. Structural variation includes translocation, inversion, copy number variation (CNV), etc. [11]. A CNV is the most common type of structural variation in human, plant, and animal genomes, and it refers to an unbalanced structural variation in the genome of an individual base in the form of DNA fragment loss or acquisition [12]. The total number of nucleotides covered by SNVs greatly exceeds the total number of single nucleotide polymorphisms, which greatly enriches the diversity of genomic genetic variation [13]. CNVs can lead to a phenotypic variation related to single-gene diseases, complex diseases, or quantitative traits via molecular mechanisms such as a gene structure change, gene fusion, and the dose effect [14]. For example, the genome-wide association study (GWAS), based on CNVs, identified the functional genes associated with feed conversion and growth and development [15]. Therefore, CNVs are one of the important factors that cause genetic and phenotypic variations as well as an important genetic resource for biological evolution [16]. The presence of CNVs in the protein-coding region of the genome may change the protein function, while the presence of CNVs in the regulatory region of the genome may change the expression level of the genes [17].
Previous CNV detection in most populations was based on comparative genomic hybridization (GCH) and SNP genotyping microarrays [18]. With the development and cost reduction in next-generation sequencing (NGS) technology, CNV detection and analysis can be performed using whole-genome sequencing, which has a higher CNV detection rate than genotyping because it covers the whole genome [19]. At present, there are many algorithms and pieces of software that can be used to identify genomic CNVs and CNVRs [20,21]. The PennCNV software uses the hidden Markov model (HMM) algorithm to identify CNVs; it can integrate the signal strength value (the Log R ratio (LRR)), the B allele frequency (BAF), the population frequency of the B allele (PFB) of SNPs, the distance between adjacent SNPs, the SNP allele frequency, pedigree information, etc. At the same time, the software can fit the genomic GC content of the regression model to overcome the interference of the genomic wave during CNV recognition, improving the accuracy of CNV recognition [22].
It has been found that CNVs evolve about 2.5 times faster than SNPs, indicating that CNVs can cause organisms to better adapt to different environments [23]. At present, there have been many research reports on bovine CNVs. Silva confirmed that the incidence of the CNV overlap in common cattle from Europe and the Middle East is higher than that in Zebu cattle from the Indian subcontinent and speculated that Zebu cattle have a greater diversity in CNVs [24]. Pezar believed that the variation in the CNV quantity may be related to differences in the effective population size and selection process among different populations of cattle [25]. Bae used the PennCNV software to identify a total of 368 CNVRs from 265 Korean Hanwoo cattle based on a 50 K SNP chip covering a 15.8 Mb genome without considering the lineage information or genome waves [26]. At the same time, it has been reported that CNVs are closely related to the milk production traits of Holstein cattle [27].
In this study, based on the population data of the Chinese Holstein cattle in the Jiangsu area, we explored the lactation performance and (covariance) components of the milk production traits in different parities of the cattle and used CNVs to detect the QTL of the milk production traits of the first parity of the Chinese Holstein cattle in Jiangsu.

2. Materials and Methods

2.1. Ethical Statement

The procedures of hair sample and phenotypic trait data collection were carried out according to the plan proposed by the Ministry of Agriculture of the People’s Republic of China and the Chinese Animal Protection Commission. The study was approved by the Ethics Committee for Animal Researchers of Yangzhou University (License No. SYXK(SU)IACUC 2012-0029). No animals were harmed during the study.

2.2. Phenotypic Data Collection and Quality Control

We collected population genetic improvement (DHI) records from 15,126 lactating cows in 4 medium and large farms in Jiangsu, requiring each cow to have at least 7 DHI records during the complete lactation period. Then, a total of 149,065 DHI records were collected. In the DHI records, five traits (milk yield (MY), milk fat percentage (MFR), milk fat yield (MFR), milk protein percentage (MPR), and milk protein yield (MPY)) were selected for genetic assessment in this study. Before analysis, all DHI records were quality-controlled as follows: (1) only 1–3 parity data were retained; (2) the first calving month is between 22 and 36 months; (3) only records of lactation days (DIM) from 5 to 305 days were kept; (4) the daily milk production was maintained in the range of 5–80 kg, and the somatic cell score (SCS) was maintained in the range of 0–9; (5) for each trait, values that exceeded the mean plus or minus a range of three standard deviations were removed. Before analysis, all 5 milk production traits after quality control were calibrated to 305-day performance based on the methods we have [28]. The requirement for convergence of the model used is that the norm of the update vector is less than 1.0 × 10−7 or the norm of the gradient vector (AI) is less than 1.0 × 10−6. The kinship matrices of animals need to be constructed based on pedigrees [29].

2.3. Analysis of Phenotypic and Genotypic Parameters

The paired Pearson coefficient and descriptive statistics of the phenotypic dataset were analyzed using the computer software SPSS (v16.0, Chicago, IL, USA). The variance component estimates were calculated using the DMU software (V5.6, Aarhus, Denmark). We estimated genetic parameters in the following multiple-trait animal model:
y = 1 μ + Z a + e
where y is the 305-day performance; Z is the random effects structure matrix; a is the random effect vector; e is a random residual vector [30]. Putting the resulting variance components into the following formula to calculate genetic and phenotypic correlations:
r A = C o v ( α 1 , α 2 ) σ α 1 2 σ α 2 2 r p = C o v ( P 1 , P 2 ) σ P 1 2 σ P 2 2
r A was the genetic correlation among adjusted traits; r p was a phenotypic correlation among adjusted traits; C o v ( α 1 , α 2 ) is the additive effect covariance of traits α1 and α2; σ α 1 2 is the additive variance of the trait α 1 ; σ α 2 2 is the additive variance of the trait α 2 ; C o v P 1 , P 2 is the phenotypic covariance of P 1 and P 2 ; σ P 1 2 is the phenotypic variance of the trait P 1 ; σ P 2 2 is the phenotypic variance of the trait P 2 .

2.4. Genotyping Data and Quality Control

Hair follicle samples were collected from the tail roots of selected cows with at least 50 hair follicles per cow. The sample was sent to the company and genotyped using the GGPBovine 100 K SNP Chips (Newqin Biotechnology, Shanghai, China), in which the bovine ARS-UCD1.2 assembly genome was used as the reference genome. Then, PLINK software was used for genotype quality control according to the following requirements: (1) SNPs with a detection rate >90% were retained; (2) individuals with an SNP detection rate greater than 95% were retained; (3) sites with minimum allele frequency (MAF) < 0.05 were proposed; (4) SNPs that deviated significantly from Hardy–Weinberg equilibrium (HWE) were removed (p < 1.0 × 106) [31].

2.5. Population Structure Analysis

Since the subgroups in this study came from four different pastures in Jiangsu, in order to identify the population structure of the subgroups, we used the SNPRelate package in R (v 4.0.2, Auckland, New Zealand) to conduct cluster analysis and identity by state (IBS) analysis for the populations, and to determine the effect of population stratification on phenotypic observations.

2.6. Detection of CNVR

In this study, the Perl program software Penn CNV and the hidden Markov model (HMM) were used to infer the CNV of the Chinese Holstein cattle population in Jiangsu. During the process of inference, pennCNV software integrates SNP signal strength file information, SNP location information, population pedigree information, etc. And the GC content of each SNP within the 1 Mb range of the genome can be substituted to correct the effect of genomic waves on CNVs inference caused by different genomic G/C content. The signal strength (Log R Ratio, LRR) and allele strength ratio (B allele frequency, BAF) of all SNPs were generated by Illumina BeadStudio3.5 software (Illumina, San Diego, CA, USA). Genotypic populations have complete genealogical records. The PFB (B allele of population frequency) file is generated based on the BAF of each marker in this population. The signal strength of each SNP affected by genomic waves is adjusted according to the GC content of the 500 kb genomic region on either side of it [32].
In order to improve the accuracy of CNV results detection, we used Perl programming language to conduct quality control on the obtained individual copy number variation results. The quality control criteria were as follows: (1) LRR standard deviation was less than 0.3; (2) the standard deviation of BAF was less than 0.01; (3) the absolute value of Waviness factor (GCWF) was less than 0.05; (4) confidence score was greater than 10; (5) the number of SNPs in each detected CNV were greater than or equal to 10; (6) it had to be detected simultaneously in at least two individuals [33].
All CNVs after quality control were merged according to the following criteria: if adjacent CNVs overlap and the overlap area is not less than 20% of the total length, and these CNVs had the same copy number variation state (increased or missing), they were merged into a new CNV. If at least one SNP overlapped between adjacent CNVs, the CNVs obtained by quality control were merged into the CNV Region (CNVR). Since the starting and ending position information of each CNVR is not fixed, we took the starting site of the CNV at the front of the chromosome as the starting site of the merged CNVR, and the ending site of the CNV at the back of the chromosome as the ending site of the merged CNVR. CNVRs were used for subsequent analysis of association with milk production traits.

2.7. Association Analysis Based on CNVR

The general linear model (GLM) of CNVR Ruler software was used to conduct a correlation analysis of the milk production characteristics (milk yield, milk fat percentage, milk fat amount, milk protein percentage, and milk protein amount) obtained in the CNVR region, respectively. Five milk production traits were calibrated to 305-day performance based on the models we have [28]. The GLM model used is as follows:
y = μ + Gi + ei
Among them, y was the corrected phenotypic values of milk yield, milk fat percentage, milk fat yield, milk protein percentage, and milk protein yield at 305 days of the first gestational term. μ is the population mean; Gi is the genotype effect of the ith CNVR; ei is the random residual effect of the ith CNVR, and the result is corrected using the FDR (q-value) method.

2.8. Bioinformatics Analysis of Candidate Genes

Annotation and gene function analysis were performed using the online Database for Annotation, Visualization, and Integrated Discovery (DAVID) software (https://david.ncifcrf.gov/tools.jsp), accessed on 25 June 2022.
We performed kyoko Encyclopedia of Genes and Genomes (KEGG) pathway analysis and Gene ontology (Gene Ontology, GO) analysis of CNVRs annotated in exonic regions by KOBA3.0 annotation outside show (http://kobas.cbi.pku.edu.cn/), accessed on 25 June 2022 the results only retained the path entries with corrected p-values less than 0.05 were retained.

3. Results

3.1. Descriptive Statistics of Milk Production Traits of the Chinese Holstein Cattle in Jiangsu

Through quality control, 94,595 measurement day records for 9828 cows out of 15,126 lactating cows were retained for subsequent analysis. The descriptive statistics of the milk production traits of the Chinese Holstein cattle in Jiangsu are shown in Table 1. Among them, for the milk yield trait, the number of measurement day records for first-born cows accounted for 49.02% of the total number of daily records, and the number of first-born cows accounted for 65.46% of the total. For the milk fat percentage, milk fat content, milk protein percentage, and milk protein content, the number of measurement day records for first-born cows accounted for 49.39% of the total number of daily records, and the number of first-born cows accounted for 64.02% of the total. The phenotypic mean of the five milk production traits of the first parity was lower than those of the other parities. The standard errors of the five milk production traits in the first trimester were lower than those in the other trimesters. In addition to the milk protein percentage, the skewness of the milk yield, milk fat percentage, milk fat content, and milk protein content was closest to zero in the first parity.

3.2. Phenotypic and Genetic Correlation among the Different Parities’ Milk Production Traits of the Dairy Cows in Jiangsu

Table 2 shows the phenotypic and genetic correlation of the milk production traits among the different parities of the Chinese Holstein cattle in Jiangsu. The five milk production traits studied in this paper showed a significant phenotypic positive correlation (p < 0.01) and a high genetic positive correlation among the three parities. In addition, the phenotypic correlation and genetic correlation of most of these traits between the first and third parity were higher than those between the first and second parities (the phenotypic correlation between the first and third parity was lower than that between the first and second parities). We also found that the phenotypic correlation values of the five milk production traits were smaller than the corresponding genetic correlation values between the different parities.

3.3. Group Structure Analysis

Through genotype quality control, 4173 individual cows and 87,598 SNPs were retained. A cluster analysis and state homology (IBS) were performed on 4173 dairy herds using chip data, and neither method significantly distinguished the study groups, indicating a high correlation of individuals within the study groups (Figure 1A,B).

3.4. Genome Copy Number Variation in the Chinese Holstein Cattle in Jiangsu

In this study, bovine ARS-UCD1.2 was used as the reference genome, and a total of 1731 CNVs were detected in the bovine autosomes after quality control via the PennCNV software. Among them, there were 14 CNVs with a copy number of zero (about 0.80%), and the average length was 207,115 bp. There were 68 CNVs with a copy number of one (about 3.93%), and the average length was 302,899 bp. There were 1550 CNVs with a copy number of three (89.54%), and the average length was 440,638 bp. There were 99 CNVs with a copy number of four (about 5.73%) and an average length of 422,555 bp. Copy number replication (types 3 and 4) accounts for more than 95% of the copy number variations (Table 3). Of all the 1731 CNVs detected, CNVs between 200 and 500 kb in length accounted for the majority (Figure 2A), and the average length of the CNVs detected in the duplication type was larger than that of the deletion type (Figure 2B); meanwhile, there were more copy number variations on chromosomes 18 and 21 (Figure 2C,D).

3.5. Regional Distribution of Copy Number Variations in the Chinese Holstein Cattle in Jiangsu

In this study, the PennCNV software was used to analyze the copy number variation in the 100 K chip data of 4173 Holstein cattle in Jiangsu and 1731 CNVs were obtained. After the combination, 236 copy number variation regions (CNVs) were obtained after the CNVRs expressed only in a single sample were removed. Duplicate CNVRs occupy the vast majority of all CNVRs. The proportion of copy number variations in the 29 autosomes of the dairy cows ranged from 3.9% to 20.7% among which chromosome 28 (20.7%) had the largest copy number variation region, while chromosome 8 (3.9%) had the smallest copy number variation region. There were six chromosomes with copy number variation regions covering more than 10% (Figure 3).

3.6. Association Analysis of the CNVRs and Milk Production Traits of the Chinese Holstein Cattle in Jiangsu

After FDR correction, a total of 19 CNVRs was significantly correlated with five milk production traits (milk yield, milk fat percentage, milk fat volume, milk protein percentage, and milk protein volume) in the first parity of the dairy cows in Jiangsu. SNVRs significantly associated with milk production traits were mainly concentrated on chromosomes 11, 14, and 17. The study did not find any CNVRs significantly associated with milk protein volume.
Three CNVRs, located on chromosomes 17, 17, and 11, were significantly correlated with the milk yield, and their copy number variation types were “deletion”, “deletion”, and “mixed”. Four CNVRs were significantly correlated with the milk fat percentage, and they were located on chromosomes 14, 15, 21, and 3, respectively. Their copy number variation types were “mixed”, “duplicate”, “deletion”, and “duplicate”. Six CNVRs were significantly correlated with the milk fat content, and they were located on chromosomes 17, 11, 12, 14, 17, and 18, respectively. Their copy number variation types were “duplicate”, “mixed”, “duplicate”, “mixed”, “deletion”, and “duplicate”. Six CNVRs, located on chromosomes 11, 17, 17, 11, 5, and 5, were significantly correlated with the milk protein rate, and their copy number variation types were “mixed”, “deletion”, “deletion”, “mixed”, “mixed”, and “mixed” (Table 4). The study did not find any CNVRs significantly associated with milk protein volume.

3.7. Enrichment Analysis of Candidate Genes

In order to better understand the biological function of the 19 CNVRs, we performed a KEGG pathway analysis and a GO biological function enrichment analysis for the genes within these CNVR regions and upstream and downstream these regions within 200 Kb, and a total of 216 genes were selected (Supplementary Table S1). These genes were significantly enriched into 20 KEGG pathways; among them, AKT1, VAV2, and NOTCH1 had more repeats (Table 5). After removing the GO terms concentrated in only one gene, a total of 13 GO terms were selected, which are mainly involved in biological processes such as amino acid metabolism, cellular catabolism, and lipid metabolism, and the genes involved are DGAT1, GPT, SCO2, DBH, etc. (Figure 4; Supplementary Table S2).

4. Discussion

In this study, the variance components, and the genetic and phenotypic correlations of the five milk production traits (the milk yield, milk fat percentage, milk fat yield, milk protein percentage, and milk protein yield) among the different parities of the Chinese Holstein cattle in Jiangsu were evaluated. The phenotypic and genetic correlation between the milk production traits of the Chinese Holstein cattle in Jiangsu was similar to some domestic and foreign research results [34,35]. The same milk production trait showed a significant positive phenotypic correlation and a high positive genetic correlation at one to three parities. Therefore, it is feasible to evaluate the overall performance of dairy cows according to the performance of milk production traits at one parity.
Copy number variations are one of the common forms of genomic structural variations in plants and animals, and CNV variations cover a large number of SNPs; therefore, they often cause changes in the gene sequences of large segments of biological genomes, resulting in changes in the gene structure, changes in the expression dose, and even a loss of gene function, thus affecting the phenotype [36]. Currently, there are PennCNV, cnvPartition, and QuantiSNP as well as other software used to infer CNVs and CNVRs based on SNP chip data [37,38]. Studies have shown that the PennCNV software can use more information (such as the relatives, the total signal intensity of the chip data, the B allele frequency, etc.) in the process of CNV inference, and its algorithm is more reliable, resulting in less false positives [39]. In this study, the PennCNV software was used to detect a total of 1731 CNVs and 236 CNVRs in 29 autosomes of the Chinese Holstein cattle in Jiangsu among which the duplication types of the CNVs and CNVRs were higher than the deletion types (Figure 3). This may be related to the selection and breeding of the cows [40].
CNVs can play a role in variety formation and environmental adaptation, and copy number differences between different varieties and the same variety in different environments can lead to quantitative trait variations [41,42]. According to some reports, the size and quantity of the CNVRs of the Chinese Holstein cattle in different regions also vary greatly. Jiang identified 367 CNVRs in the Chinese Holstein cattle through a PennCNV analysis of 50 K SNP chip genotyping data from 96 Chinese Holstein cattle in Beijing [43]. In this study, 236 CNVR regions of the Chinese Holstein cattle in Jiangsu were excavated using 100 K chips. Such differences in the number of CNVRs may have been caused by differences in the environment, chip density, population size, and structure [44]. Although SNP chips from cattle can be used for CNV detection, the SNP probes on the chips are neither dense enough nor evenly distributed enough to obtain an accurate high-resolution map of the copy number variations in the bovine genome. The average interval between adjacent SNPs in the bovine 100 K SNP chip was about 25 Kb, and the main application scenarios of this chip are genome-wide association analysis and genome selection; additionally, a large part of the probes may be located outside the CNVRs [45]. Therefore, only CNVRs with a certain length may be found in this study, and all CNVRs with a short length and no probe label are difficult to detect based on chip data alone. Studies on humans have shown that smaller-length CNVRs appear more frequently in the genome than larger-length CNVRs [46,47]. With the application of bovine high-density 800 K chips or next-generation sequencing methods, it is expected that more and more cross-genome CNVs and CNVRs will be identified.
It has been confirmed that the CNVs in the dairy cow genome are closely related to their milk production traits. For example, Gao found that the functional differences of the CNVs and CNVRs in the genomes of cows with a high milk fat percentage and high milk protein percentage and those of cows with a low milk fat percentage and low milk protein percentage mainly involved fat and protein metabolism and other related biological processes. Based on this, 10 candidate genes related to the milk production traits of dairy cows were identified for the subsequent molecular breeding of the dairy cows [48]. This study also identified 19 CNVRs that were significantly associated with the milk production traits of the Chinese Holstein cattle, and these CNVR regions could be used as the key genomic candidate regions for the milk production traits of the dairy cows.
Based on the existing data, this study only preliminarily explored the copy number variation areas and individual production traits. A total of 19 copy number variation regions were found to have significant effects on five milk production traits of Jiangsu Holstein dairy cows. The study did not find any CNVRs significantly associated with milk protein volume, which requires further exploration.
A KEGG pathway analysis was performed on 216 genes in the 19 screened CNVRs that were closely related to the first fetal milk production traits of the dairy cows. In order to find the core biological categories and corresponding genes that were closely related to the milk production traits, we eliminated the genes that were enriched in only one biological category. Finally, 13 GO terms with significantly enriched biological functions were retained, which mainly involved lipid synthesis, amino acid metabolism, glucose metabolism, etc., and were mostly related to the milk production performance of the dairy cows [49,50]. There were also 10 candidate genes that were highly involved in the enrichment of these GO terms. The DGAT1 gene is a basic metabolic enzyme that plays an important role in the biosynthesis of triglyceride, the metabolism of triglyceride, and the digestion and absorption of the fat in milk. Many studies have revealed the relationship between DGAT1 and the milk production performance of dairy cows. In particular, the polymorphism of K232A in DGAT1 can significantly affect the expression of the milk fat and milk protein content of dairy cows [51,52]. VPS28 may regulate milk fat synthesis through ubiquitination in bovine mammary epithelial cells [53]. MAF1 has been identified as a candidate gene associated with the milk yield and milk protein content in Canadian Holstein cattle studies [54]. By using mid-infrared mass spectrometry and GWAS techniques, GPT was listed as a candidate gene for the characteristics associated with the production of French Holstein milk cheese [55]. At the same time, we found that DAGT1 and GPT reappeared in the GO enrichment results, and they may be important candidate genes affecting the performance of the first litter of the Chinese Holstein cattle in Jiangsu.

5. Conclusions

This study introduced the basic status of the milk production traits of the Chinese Holstein cows in the Jiangsu Province, and 19 CNVRs were identified to be significantly related to the milk production traits of the Chinese Holstein cows in the first parity. SNVRs significantly associated with milk production traits were mainly concentrated on chromosomes 11, 14, and 17. CNVR_156 on chromosome 17 was found to be significantly correlated with milk yield, milk quality, and milk protein percentage. CNVR_124 on chromosome 14 was significantly associated with milk fat. CNVRs significantly correlated with milk fat percentage and milk fat content did not show overlapping regions.
We also found that AKT1, VAV2, NOTCH1, DGAT1, GPT, SCO2, and DBH may be important candidate genes affecting the performance of the first litter of the Chinese Holstein cattle in Jiangsu. This study provides a theoretical basis for the molecular-marker-assisted selection of dairy cows, helps analyze the genetic structure of the milk production traits of dairy cows, and contributes to the development of the quantitative genetics of dairy cows and the genetic improvement in the Chinese Holstein cattle in Jiangsu.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ani14010017/s1, Supplementary Table S1. genes_within_CNVRs; Supplementary Table S2. GO result.

Author Contributions

Conceptualization, H.Z. and T.X.; methodology, X.L.; validation, H.J. and X.L.; formal analysis, H.Z.; investigation, H.Z.; resources, X.L.; data curation, H.Z.; writing—original draft preparation, H.Z.; writing—review and editing, T.X.; project administration, Z.Y. and T.X.; funding acquisition, Z.Y. and T.X. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the China Postdoctoral Science Foundation (Xubin Lu, 2023M732999); the Open Project Program of Joint International Research Laboratory of Agriculture and Agri-Product Safety, the Ministry of Education of China, Yangzhou University (Tianle Xu, JILAR-KF202204); the Open Project Program of International Joint Research Laboratory in Universities of Jiangsu Province of China for Domestic Animal Germplasm Resources and Genetic Improvement: (Tianle Xu, IJRLD-KF202214).

Institutional Review Board Statement

The study was also accepted by the Ethics Committee for Animal Researchers of Yangzhou University (License No. SYXK(SU)IACUC 2012-0029).

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are available upon request from the authors.

Acknowledgments

We appreciate the support of the sample provisions from the dairy farms located in the four correlated farms.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Dallas, D.; Nielsen, S.D. Milk Peptidomics to Identify Functional Peptides and for Quality Control of Dairy Products. Methods Mol. Biol. 2018, 1719, 223–240. [Google Scholar] [PubMed]
  2. Nayeri, S.; Sargolzaei, M.; Abo-Ismail, M.K.; May, N.; Miller, S.P.; Schenkel, F.; Moore, S.S.; Stothard, P. Genome-wide association for milk production and female fertility traits in Canadian dairy Holstein cattle. BMC Genet. 2016, 17, 75. [Google Scholar] [CrossRef] [PubMed]
  3. Atsumi, K.; Lagisz, M.; Nakagawa, S. Nonadditive genetic effects induce novel phenotypic distributions in male mating traits of F1 hybrids. Evolution 2021, 75, 1304–1315. [Google Scholar] [CrossRef] [PubMed]
  4. Boichard, D.; Brochard, M. New phenotypes for new breeding goals in dairy cattle. Animal 2012, 6, 544–550. [Google Scholar] [CrossRef] [PubMed]
  5. Brito, L.F.; Bedere, N.; Douhard, F.; Oliveira, H.R.; Arnal, M.; Peñagaricano, F.; Schinckel, A.P.; Baes, C.F.; Miglior, F. Review: Genetic selection of high-yielding dairy cattle toward sustainable farming systems in a rapidly changing world. Animal 2021, 15 (Suppl. S1), 100292. [Google Scholar] [CrossRef]
  6. Seo, D.; Lee, D.H.; Jin, S.; Won, J.I.; Lim, D.; Park, M.; Kim, T.H.; Lee, H.K.; Kim, S.; Choi, I.; et al. Long-term artificial selection of Hanwoo (Korean) cattle left genetic signatures for the breeding traits and has altered the genomic structure. Sci. Rep. 2022, 12, 6438. [Google Scholar] [CrossRef]
  7. Xiang, R.; MacLeod, I.M.; Bolormaa, S.; Goddard, M.E. Genome-wide comparative analyses of correlated and uncorrelated phenotypes identify major pleiotropic variants in dairy cattle. Sci. Rep. 2017, 7, 9248. [Google Scholar] [CrossRef]
  8. van den Berg, I.; Boichard, D.; Lund, M.S. Comparing power and precision of within-breed and multibreed genome-wide association studies of production traits using whole-genome sequence data for 5 French and Danish dairy cattle breeds. J. Dairy Sci. 2016, 99, 8932–8945. [Google Scholar] [CrossRef]
  9. de Almeida Santana, M.H.; Junior, G.A.; Cesar, A.S.; Freua, M.C.; da Costa Gomes, R.; da Luz E Silva, S.; Leme, P.R.; Fukumasu, H.; Carvalho, M.E.; Ventura, R.V.; et al. Copy number variations and genome-wide associations reveal putative genes and metabolic pathways involved with the feed conversion ratio in beef cattle. J. Appl. Genet. 2016, 57, 495–504. [Google Scholar] [CrossRef]
  10. Jiang, L.; Jiang, J.; Yang, J.; Liu, X.; Wang, J.; Wang, H.; Ding, X.; Liu, J.; Zhang, Q. Genome-wide detection of copy number variations using high-density SNP genotyping platforms in Holsteins. BMC Genom. 2013, 14, 131. [Google Scholar] [CrossRef]
  11. Alkan, C.; Coe, B.P.; Eichler, E.E. Genome structural variation discovery and genotyping. Nat. Rev. Genet. 2011, 12, 363–376. [Google Scholar] [CrossRef] [PubMed]
  12. Lye, Z.N.; Purugganan, M.D. Copy Number Variation in Domestication. Trends Plant Sci. 2019, 24, 352–365. [Google Scholar] [CrossRef] [PubMed]
  13. Macé, A.; Kutalik, Z.; Valsesia, A. Copy Number Variation. Methods Mol. Biol. 2018, 1793, 231–258. [Google Scholar] [PubMed]
  14. Pös, O.; Radvanszky, J.; Buglyó, G.; Pös, Z.; Rusnakova, D.; Nagy, B.; Szemes, T. DNA copy number variation: Main characteristics, evolutionary significance, and pathological aspects. Biomed. J. 2021, 44, 548–559. [Google Scholar] [CrossRef]
  15. Zhou, Y.; Utsunomiya, Y.T.; Xu, L.; el Hamidi Hay, A.; Bickhart, D.M.; Alexandre, P.A.; Rosen, B.D.; Schroeder, S.G.; Carvalheiro, R.; de Rezende Neves, H.H.; et al. Genome-wide CNV analysis reveals variants associated with growth traits in Bos indicus. BMC Genom. 2016, 17, 419. [Google Scholar] [CrossRef]
  16. Emerson, J.J.; Cardoso-Moreira, M.; Borevitz, J.O.; Long, M. Natural selection shapes genome-wide patterns of copy-number polymorphism in Drosophila melanogaster. Science 2008, 320, 1629–1631. [Google Scholar] [CrossRef]
  17. Liu, G.E.; Ventura, M.; Cellamare, A.; Chen, L.; Cheng, Z.; Zhu, B.; Li, C.; Song, J.; Eichler, E.E. Analysis of recent segmental duplications in the bovine genome. BMC Genom. 2009, 10, 571. [Google Scholar] [CrossRef]
  18. Zhang, Z.; Peng, M.; Wen, Y.; Chai, Y.; Liang, J.; Yang, P.; Liu, X.; Li, J.; Huang, Y.; Li, L.; et al. Copy number variation of EIF4A2 loci related to phenotypic traits in Chinese cattle. Vet. Med. Sci. 2022, 8, 2147–2156. [Google Scholar] [CrossRef]
  19. Huddleston, J.; Chaisson, M.J.P.; Steinberg, K.M.; Warren, W.; Hoekzema, K.; Gordon, D.; Graves-Lindsay, T.A.; Munson, K.M.; Kronenberg, Z.N.; Vives, L.; et al. Discovery and genotyping of structural variation from long-read haploid genome sequence data. Genome Res. 2017, 27, 677–685. [Google Scholar] [CrossRef]
  20. Zhang, Z.; Hao, K. Using SAAS-CNV to Detect and Characterize Somatic Copy Number Alterations in Cancer Genomes from Next Generation Sequencing and SNP Array Data. Methods Mol. Biol. 2018, 1833, 29–47. [Google Scholar]
  21. Sahajpal, N.S.; Barseghyan, H.; Kolhe, R.; Hastie, A.; Chaubey, A. Optical Genome Mapping as a Next-Generation Cytogenomic Tool for Detection of Structural and Copy Number Variations for Prenatal Genomic Analyses. Genes 2021, 12, 398. [Google Scholar] [CrossRef] [PubMed]
  22. Yang, L.; Gao, Y.; Oswalt, A.; Fang, L.; Boschiero, C.; Neupane, M.; Sattler, C.G.; Li, C.J.; Seroussi, E.; Xu, L.; et al. Towards the detection of copy number variation from single sperm sequencing in cattle. BMC Genom. 2022, 23, 215. [Google Scholar] [CrossRef] [PubMed]
  23. Paudel, Y.; Madsen, O.; Megens, H.J.; Frantz, L.A.; Bosse, M.; Crooijmans, R.P.; Groenen, M.A. Copy number variation in the speciation of pigs: A possible prominent role for olfactory receptors. BMC Genom. 2015, 16, 330. [Google Scholar] [CrossRef] [PubMed]
  24. Silva, V.H.; Regitano, L.C.; Geistlinger, L.; Pértille, F.; Giachetto, P.F.; Brassaloti, R.A.; Morosini, N.S.; Zimmer, R.; Coutinho, L.L. Genome-Wide Detection of CNVs and Their Association with Meat Tenderness in Nelore Cattle. PLoS ONE 2016, 11, e0157711. [Google Scholar] [CrossRef] [PubMed]
  25. Pezer, Ž.; Harr, B.; Teschke, M.; Babiker, H.; Tautz, D. Divergence patterns of genic copy number variation in natural populations of the house mouse (Mus musculus domesticus) reveal three conserved genes with major population-specific expansions. Genome Res. 2015, 25, 1114–1124. [Google Scholar] [CrossRef] [PubMed]
  26. Bae, J.S.; Cheong, H.S.; Kim, L.H.; NamGung, S.; Park, T.J.; Chun, J.Y.; Kim, J.Y.; Pasaje, C.F.; Lee, J.S.; Shin, H.D. Identification of copy number variations and common deletion polymorphisms in cattle. BMC Genom. 2010, 11, 232. [Google Scholar] [CrossRef]
  27. Xu, L.; Cole, J.B.; Bickhart, D.M.; Hou, Y.; Song, J.; VanRaden, P.M.; Sonstegard, T.S.; Van Tassell, C.P.; Liu, G.E. Genome wide CNV analysis reveals additional variants associated with milk production traits in Holsteins. BMC Genom. 2014, 15, 683. [Google Scholar] [CrossRef]
  28. Lu, X.; Arbab, A.A.I.; Abdalla, I.M.; Liu, D.; Zhang, Z.; Xu, T.; Su, G.; Yang, Z. Genetic Parameter Estimation and Genome-Wide Association Study-Based Loci Identification of Milk-Related Traits in Chinese Holstein. Front. Genet. 2022, 12, 799664. [Google Scholar] [CrossRef]
  29. Li, J.; Gao, H.; Madsen, P.; Li, R.; Liu, W.; Bao, P.; Xue, G.; Gao, Y.; Di, X.; Su, G. Impact of the Order of Legendre Polynomials in Random Regression Model on Genetic Evaluation for Milk Yield in Dairy Cattle Population. Front. Genet. 2020, 11, 586155. [Google Scholar] [CrossRef]
  30. Nazar, M.; Lu, X.; Abdalla, I.M.; Ullah, N.; Fan, Y.; Chen, Z.; Arbab, A.A.I.; Mao, Y.; Yang, Z. Genome-Wide Association Study Candidate Genes on Mammary System-Related Teat-Shape Conformation Traits in Chinese Holstein Cattle. Genes 2021, 12, 2020. [Google Scholar] [CrossRef]
  31. Marees, A.T.; de Kluiver, H.; Stringer, S.; Vorspan, F.; Curis, E.; Marie-Claire, C.; Derks, E.M. A tutorial on conducting genome-wide association studies: Quality control and statistical analysis. Int. J. Methods Psychiatr. Res. 2018, 27, e1608. [Google Scholar] [CrossRef] [PubMed]
  32. Hou, Y.; Liu, G.E.; Bickhart, D.M.; Cardone, M.F.; Wang, K.; Kim, E.S.; Matukumalli, L.K.; Ventura, M.; Song, J.; VanRaden, P.M.; et al. Genomic characteristics of cattle copy number variations. BMC Genom. 2011, 12, 127. [Google Scholar] [CrossRef] [PubMed]
  33. Kang, X.; Li, M.; Liu, M.; Liu, S.; Pan, M.G.; Wiggans, G.R.; Rosen, B.D.; Liu, G.E. Copy number variation analysis reveals variants associated with milk production traits in dairy goats. Genomics 2020, 112, 4934–4937. [Google Scholar] [CrossRef] [PubMed]
  34. Duchemin, S.I.; Nilsson, K.; Fikse, W.F.; Stålhammar, H.; Buhelt Johansen, L.; Stenholdt Hansen, M.; Lindmark-Månsson, H.; de Koning, D.J.; Paulsson, M.; Glantz, M. Genetic parameters for noncoagulating milk, milk coagulation properties, and detailed milk composition in Swedish Red Dairy Cattle. J. Dairy Sci. 2020, 103, 8330–8342. [Google Scholar] [CrossRef] [PubMed]
  35. Xue, X.; Hu, H.; Zhang, J.; Ma, Y.; Han, L.; Hao, F.; Jiang, Y.; Ma, Y. Estimation of Genetic Parameters for Conformation Traits and Milk Production Traits in Chinese Holsteins. Animals 2022, 13, 100. [Google Scholar] [CrossRef] [PubMed]
  36. Hou, Y.; Liu, G.E.; Bickhart, D.M.; Matukumalli, L.K.; Li, C.; Song, J.; Gasbarre, L.C.; Van Tassell, C.P.; Sonstegard, T.S. Genomic regions showing copy number variations associate with resistance or susceptibility to gastrointestinal nematodes in Angus cattle. Funct. Integr. Genom. 2012, 12, 81–92. [Google Scholar] [CrossRef] [PubMed]
  37. Ma, N.; Xi, H.; Chen, J.; Peng, Y.; Jia, Z.; Yang, S.; Hu, J.; Pang, J.; Zhang, Y.; Hu, R.; et al. Integrated CNV-seq, karyotyping and SNP-array analyses for effective prenatal diagnosis of chromosomal mosaicism. BMC Med Genom. 2021, 14, 56. [Google Scholar] [CrossRef]
  38. de Araújo Lima, L.; Wang, K. PennCNV in whole-genome sequencing data. BMC Bioinform. 2017, 18 (Suppl. S11), 383. [Google Scholar] [CrossRef]
  39. Marenne, G.; Rodríguez-Santiago, B.; Closas, M.G.; Pérez-Jurado, L.; Rothman, N.; Rico, D.; Pita, G.; Pisano, D.G.; Kogevinas, M.; Silverman, D.T.; et al. Assessment of copy number variation using the Illumina Infinium 1M SNP-array: A comparison of methodological approaches in the Spanish Bladder Cancer/EPICURO study. Hum. Mutat. 2011, 32, 240–248. [Google Scholar] [CrossRef]
  40. Redon, R.; Ishikawa, S.; Fitch, K.R.; Feuk, L.; Perry, G.H.; Andrews, T.D.; Fiegler, H.; Shapero, M.H.; Carson, A.R.; Chen, W.; et al. Global variation in copy number in the human genome. Nature 2006, 444, 444–454. [Google Scholar] [CrossRef]
  41. Sebat, J.; Lakshmi, B.; Troge, J.; Alexander, J.; Young, J.; Lundin, P.; Månér, S.; Massa, H.; Walker, M.; Chi, M.; et al. Large-scale copy number polymorphism in the human genome. Science 2004, 305, 525–528. [Google Scholar] [CrossRef] [PubMed]
  42. Bickhart, D.M.; Hou, Y.; Schroeder, S.G.; Alkan, C.; Cardone, M.F.; Matukumalli, L.K.; Song, J.; Schnabel, R.D.; Ventura, M.; Taylor, J.F.; et al. Copy number variation of individual cattle genomes using next-generation sequencing. Genome Res. 2012, 22, 778–790. [Google Scholar] [CrossRef] [PubMed]
  43. Jiang, L.; Jiang, J.; Wang, J.; Ding, X.; Liu, J.; Zhang, Q. Genome-wide identification of copy number variations in Chinese Holstein. PLoS ONE 2012, 7, e48732. [Google Scholar] [CrossRef] [PubMed]
  44. Fernandes, A.C.; da Silva, V.H.; Goes, C.P.; Moreira, G.C.M.; Godoy, T.F.; Ibelli, A.M.G.; Peixoto, J.O.; Cantão, M.E.; Ledur, M.C.; de Rezende, F.M.; et al. Genome-wide detection of CNVs and their association with performance traits in broilers. BMC Genom. 2021, 22, 354. [Google Scholar] [CrossRef] [PubMed]
  45. Liu, G.E.; Bickhart, D.M. Copy number variation in the cattle genome. Funct. Integr. Genom. 2012, 12, 609–624. [Google Scholar] [CrossRef] [PubMed]
  46. Conrad, D.F.; Pinto, D.; Redon, R.; Feuk, L.; Gokcumen, O.; Zhang, Y.; Aerts, J.; Andrews, T.D.; Barnes, C.; Campbell, P.; et al. Origins and functional impact of copy number variation in the human genome. Nature 2010, 464, 704–712. [Google Scholar] [CrossRef] [PubMed]
  47. Perry, G.H.; Ben-Dor, A.; Tsalenko, A.; Sampas, N.; Rodriguez-Revenga, L.; Tran, C.W.; Scheffer, A.; Steinfeld, I.; Tsang, P.; Yamada, N.A.; et al. The fine-scale and complex architecture of human copy-number variation. Am. J. Hum. Genet. 2008, 82, 685–695. [Google Scholar] [CrossRef] [PubMed]
  48. Gao, Y.; Jiang, J.; Yang, S.; Hou, Y.; Liu, G.E.; Zhang, S.; Zhang, Q.; Sun, D. CNV discovery for milk composition traits in dairy cattle using whole genome resequencing. BMC Genom. 2017, 18, 265. [Google Scholar] [CrossRef]
  49. Nafikov, R.A.; Schoonmaker, J.P.; Korn, K.T.; Noack, K.; Garrick, D.J.; Koehler, K.J.; Minick-Bormann, J.; Reecy, J.M.; Spurlock, D.E.; Beitz, D.C. Polymorphisms in lipogenic genes and milk fatty acid composition in Holstein dairy cattle. Genomics 2014, 104 Pt 6, 572–581. [Google Scholar] [CrossRef]
  50. Osorio, J.S.; Lohakare, J.; Bionaz, M. Biosynthesis of milk fat, protein, and lactose: Roles of transcriptional and posttranscriptional regulation. Physiol. Genom. 2016, 48, 231–256. [Google Scholar] [CrossRef]
  51. Bovenhuis, H.; Visker, M.H.P.W.; Poulsen, N.A.; Sehested, J.; van Valenberg, H.J.F.; van Arendonk, J.A.M.; Larsen, L.B.; Buitenhuis, A.J. Effects of the diacylglycerol o-acyltransferase 1 (DGAT1) K232A polymorphism on fatty acid, protein, and mineral composition of dairy cattle milk. J. Dairy Sci. 2016, 99, 3113–3123. [Google Scholar] [CrossRef] [PubMed]
  52. Yao, D.W.; Ma, J.; Yang, C.L.; Chen, L.L.; He, Q.Y.; Coleman, D.N.; Wang, T.Z.; Jiang, X.L.; Luo, J.; Ma, Y.; et al. Phosphatase and tensin homolog (PTEN) suppresses triacylglycerol accumulation and monounsaturated fatty acid synthesis in goat mammary epithelial cells. J. Dairy Sci. 2021, 104, 7283–7294. [Google Scholar] [CrossRef] [PubMed]
  53. Liu, L.; Zhou, J.; Chen, C.J.; Zhang, J.; Wen, W.; Tian, J.; Zhang, Z.; Gu, Y. GWAS-Based Identification of New Loci for Milk Yield, Fat, and Protein in Holstein Cattle. Animals 2020, 10, 2048. [Google Scholar] [CrossRef] [PubMed]
  54. Oliveira, H.R.; Cant, J.P.; Brito, L.F.; Feitosa, F.L.B.; Chud, T.C.S.; Fonseca, P.A.S.; Jamrozik, J.; Silva, F.F.; Lourenco, D.A.L.; Schenkel, F.S. Genome-wide association for milk production traits and somatic cell score in different lactation stages of Ayrshire, Holstein, and Jersey dairy cattle. J. Dairy Sci. 2019, 102, 8159–8174. [Google Scholar] [CrossRef]
  55. Sanchez, M.P.; Ramayo-Caldas, Y.; Wolf, V.; Laithier, C.; El Jabri, M.; Michenet, A.; Boussaha, M.; Taussat, S.; Fritz, S.; Delacroix-Buchet, A.; et al. Sequence-based GWAS, network and pathway analyses reveal genes co-associated with milk cheese-making properties and milk composition in Montbéliarde cows. Genet. Sel. Evol. 2019, 51, 34. [Google Scholar] [CrossRef]
Figure 1. Population cluster analysis and identity-country analysis. (A) Cluster analysis; (B) identity-country analysis.
Figure 1. Population cluster analysis and identity-country analysis. (A) Cluster analysis; (B) identity-country analysis.
Animals 14 00017 g001
Figure 2. Copy number variation distribution in the bovine reference genome ARS-UMD1.2. (A) Length distribution of CNVs; (B) size distribution of different types of CNVs; (C,D) distribution of different types of CNVs on autosomes of bovine.
Figure 2. Copy number variation distribution in the bovine reference genome ARS-UMD1.2. (A) Length distribution of CNVs; (B) size distribution of different types of CNVs; (C,D) distribution of different types of CNVs on autosomes of bovine.
Animals 14 00017 g002
Figure 3. Genome distribution of Holestin CNVRs in Jiangsu province. Red = duplicate; green = deletion type; black silk = mixed type.
Figure 3. Genome distribution of Holestin CNVRs in Jiangsu province. Red = duplicate; green = deletion type; black silk = mixed type.
Animals 14 00017 g003
Figure 4. Significant Gene Ontology (GO) terms enriched by the genes within CNVRS. After quality control, 13 GO terms that are enriched by 10 genes are revealed in the circus plots (p < 0.05).
Figure 4. Significant Gene Ontology (GO) terms enriched by the genes within CNVRS. After quality control, 13 GO terms that are enriched by 10 genes are revealed in the circus plots (p < 0.05).
Animals 14 00017 g004
Table 1. Descriptive statistics of milk production traits in Chinese Holstein cows in parity 1–3 in Jiangsu.
Table 1. Descriptive statistics of milk production traits in Chinese Holstein cows in parity 1–3 in Jiangsu.
TraitsParityNINRMeanSDMedianMinMaxSkewKurtosis
MY (kg)1643746,74930.397.9830.705.0075.00−0.080.25
2340224,86436.2810.7936.005.0080.000.120.17
3312423,16236.9410.9536.505.0079.000.10−0.10
1–3983495,37533.336.2333.005.0080.000.310.43
MFR (%)1626345,8143.840.963.790.339.990.481.55
2319623,7853.911.023.870.329.990.501.34
3308823,1043.971.043.920.369.570.551.23
1–3949792,7633.881.003.820.309.990.521.46
MFY (kg)1626345,8141.170.401.140.064.580.701.62
2319623,7851.410.531.350.084.900.771.41
3308823,1041.460.551.400.085.740.771.47
1–3949792,7631.290.501.230.605.740.932.00
MPR (%)1626345,8143.270.363.250.309.460.746.50
2319623,7853.290.383.260.757.270.713.43
3308823,1043.300.393.271.227.220.622.94
1–3949792,7633.280.373.250.309.460.754.95
MPY (kg)1626345,8141.000.251.000.112.43−0.010.59
2319623,7851.180.341.180.173.270.220.83
3308823,1041.210.331.20.132.760.070.26
1–3949792,7631.090.321.070.104.580.361.02
Parity = parity; NI = number of recorded individuals; NR = number of test-day records; Mean = mean; SD = standard deviation; Median = median; Min = minimum; Max = maximum; Skew = skewness; Kurtosis = Kurtosis.
Table 2. Phenotypic and genetic correlations of a trait in different parities of Chinese Holstein dairy cows in Jiangsu Province.
Table 2. Phenotypic and genetic correlations of a trait in different parities of Chinese Holstein dairy cows in Jiangsu Province.
TraitsParityParity 1Parity 2Parity 3
MY11.0000.478 **0.512 **
20.596 (0.098)1.0000.453 **
30.857 (0.088)0.867 (0.102)1.000
MFR11.0000.380 **0.421 **
20.913 (0.078)1.0000.419 **
30.963 (0.141)0.989 (0.143)1.000
MFY11.0000.362 **0.358 **
20.623 (0.158)1.0000.324 **
30.779 (0.171)0.975 (0.205)1.000
MPR11.0000.444 **0.554 **
20.902 (0.054)1.0000.546 **
30.963 (0.053)0.985 (0.078)1.000
MPY11.0000.420 **0.436 **
20.552 (0.117)1.0000.399 **
30.858 (0.122)0.815 (0.144)1.000
The data in the table is expressed as R-square. Data below the diagonal lines are genetic correlations, data above the diagonal lines are phenotypic correlations, and the diagonal lines are phenotypic correlations and genetic correlations; ** means p < 0.01.
Table 3. Statistical information of the CNVs in the bovine reference genome ARS-UMD1.2.
Table 3. Statistical information of the CNVs in the bovine reference genome ARS-UMD1.2.
CNV TypeCountsLengthMin-LengthMax-Length
014207,11560,725323,251
168302,89950,880768,584
31550440,63850,683998,290
499414,04750,735980,071
0 and 1 represent deletions; 3 and 4 represent duplications.
Table 4. Information of CNVRs that significantly associated with milk-related traits in Holstein dairy cows in Jiangsu.
Table 4. Information of CNVRs that significantly associated with milk-related traits in Holstein dairy cows in Jiangsu.
TraitsCNVR IDChrStartEndSizeDescriptionAdjusted
p-Value
MYCNVR_1551771,644,23971,885,069240,831Loss0.031
CNVR_1561772,803,79973,155,293351,495Loss0.033
CNVR_10611103,230,152104,256,7421,026,591Mixed0.040
MFRCNVR_12414146,715891,340744,626Mixed0.000
CNVR_1401569,494,37270,392,722898,351Gain0.011
CNVR_1842169,165,60769,778,711613,105Loss0.012
CNVR_32363,079,01565,566,9902,487,976Gain0.015
MFYCNVR_1481722,459,66023,169,353709,694Gain0.01
CNVR_10611103,230,152104,256,7421,026,591Mixed0.018
CNVR_1131258,857,55560,002,3951,144,841Gain0.021
CNVR_12414146,715891,340744,626Mixed0.028
CNVR_1561772,803,79973,155,293351,495Loss0.030
CNVR_1611848,610,25448,869,4651,026,591Gain0.047
MPRCNVR_10611103,230,152104,256,7421,026,591Mixed0.013
CNVR_1551771,644,23971,885,069240,831Loss0.020
CNVR_1561772,803,79973,155,293351,495Loss0.035
CNVR_10711104,418,358106,541,5212,123,164Mixed0.037
CNVR_595119,074,417119,998,002923,586Mixed0.042
CNVR_403109,548,750110,571,0301,022,281Mixed0.048
CNVR ID = CNVR number; Chr = chromosome number; Start = CNVR start position; End = CNVR end position; Size = CNVR size; Adjusted p value = adjusted p value.
Table 5. Details of the KEGG pathways significantly enriched from genes in CNVRs.
Table 5. Details of the KEGG pathways significantly enriched from genes in CNVRs.
PathwayDescriptionGene NameFDR(q)-p-Values
bta04920Adipocytokine signaling pathwayAKT1, NFKBIB, RXRA, TRAF2, CPT1B0.0003
bta01522Endocrine resistanceNOTCH1, AKT1,JAG2, MAPK12, MAPK110.001
bta04658Th1 and Th2 cell differentiationNOTCH1, JAG2, NFKBIB, MAPK12, MAPK110.0013
bta04664Fc epsilon RI signaling pathwayAKT1, VAV2, MAPK12, MAPK110.0026
bta04621NOD-like receptor signaling pathwayCARD9, SHARPIN, NFKBIB, TRAF2, MAPK12, MAPK110.0051
bta00513Various types of N-glycan biosynthesisMAN1B1, MAN1B1, ALG120.0052
bta00510N-Glycan biosynthesisMAN1B1, MAN1B1, ALG120.0088
bta00564Glycerophospholipid metabolismAGPAT2, PLD4, CHKB0.0104
bta04625C-type lectin receptor signaling pathwayCARD9, AKT1, MAPK12, MAPK110.0107
bta04622RIG-I-like receptor signaling pathwayNFKBIB, TRAF2, MAPK12, MAPK110.0111
bta04370VEGF signaling pathwayAKT1, MAPK12, MAPK110.0119
bta04659Th17 cell differentiationNFKBIB, RXRA, MAPK12, MAPK110.015
bta04071Sphingolipid signaling pathwayAKT1, TRAF2, MAPK12, MAPK110.0168
bta04722Neurotrophin signaling pathwayAKT1, NFKBIB, MAPK12, MAPK110.0178
bta00601Glycosphingolipid biosynthesis—lacto and neolacto seriesABO, FUT70.0249
bta04917Prolactin signaling pathwayAKT1, MAPK12, MAPK110.0315
bta04015Rap1 signaling pathwayAKT1, VAV2, GRIN1, MAPK12, MAPK110.0326
bta04914Progesterone-mediated oocyte maturationAKT1, MAPK12, MAPK110.0376
bta00250Alanine, aspartate and glutamate metabolismGPT, ADSS10.0387
bta00350Tyrosine metabolismCOMT, DBH0.0405
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhu, H.; Lu, X.; Jiang, H.; Yang, Z.; Xu, T. Descriptive Statistics and Genome-Wide Copy Number Analysis of Milk Production Traits of Jiangsu Chinese Holstein Cows. Animals 2024, 14, 17. https://doi.org/10.3390/ani14010017

AMA Style

Zhu H, Lu X, Jiang H, Yang Z, Xu T. Descriptive Statistics and Genome-Wide Copy Number Analysis of Milk Production Traits of Jiangsu Chinese Holstein Cows. Animals. 2024; 14(1):17. https://doi.org/10.3390/ani14010017

Chicago/Turabian Style

Zhu, Hao, Xubin Lu, Hui Jiang, Zhangping Yang, and Tianle Xu. 2024. "Descriptive Statistics and Genome-Wide Copy Number Analysis of Milk Production Traits of Jiangsu Chinese Holstein Cows" Animals 14, no. 1: 17. https://doi.org/10.3390/ani14010017

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop