Next Article in Journal
Landscape Homogeneity May Drive the Distribution of Koala Vehicle Collisions on a Major Highway in the Clarke-Connors Range in Central Queensland, Australia
Previous Article in Journal
Morphological and Metabolic Features of Brain Aging in Rodents, Ruminants, Carnivores, and Non-Human Primates
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Genetic Characteristics of FT-MIRS-Predicted Milk Fatty Acids in Chinese Holstein Cows

1
Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China
2
Frontiers Science Center for Animal Breeding and Sustainable Production, Huazhong Agricultural University, Wuhan 430070, China
3
Hebei Livestock Breeding Station, Shijiazhuang 050060, China
4
Dairy Association of China, Beijing 100192, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Animals 2024, 14(19), 2901; https://doi.org/10.3390/ani14192901
Submission received: 5 August 2024 / Revised: 3 October 2024 / Accepted: 4 October 2024 / Published: 8 October 2024
(This article belongs to the Section Cattle)

Abstract

:

Simple Summary

Fourier Transform Infrared Spectroscopy (FT-MIRS) is widely used in milk quality detection, dairy herd improvement (DHI), and other fields. It is an economical, fast, accurate, and nondestructive batch tool for determining production performance phenotypes of dairy cows. Milk is one of the most important ways to provide the human body with the fatty acids it needs. There are a huge number of dairy cows in China. Therefore, it is possible to control fatty acid production in the milk source through targeted husbandry and breeding on large pastures to improve the quality of milk production. However, this work has not yet officially begun in China. In summary, our work uses FT-MIRS for the first time to study the phenotypic properties of milk fatty acid content and the genetic mechanism of its formation and to estimate genetic parameters. At the same time, SNPs significantly related to fatty acid content were discovered and the genes or adjacent genes had critical regulatory effects on milk fat synthesis, milk protein synthesis, adipocyte differentiation, mammary gland development, milk synthesis, and growth and development in dairy cows, thus providing a new perspective for cow genetic selection in China.

Abstract

Fourier Transform Mid-Infrared Spectroscopy (FT-MIRS) can be used for quantitative detection of milk components. Here, milk samples of 458 Chinese Holstein cows from 11 provinces in China were collected and we established a total of 22 quantitative prediction models in milk fatty acids by FT-MIRS. The coefficient of determination of the validation set ranged from 0.59 (C18:0) to 0.76 (C4:0). The models were adopted to predict the milk fatty acids from 2138 cows and a new high-throughput computing software HiBLUP was employed to construct a multi-trait model to estimate and analyze genetic parameters in dairy cows. Finally, genome-wide association analysis was performed and seven novel SNPs significantly associated with fatty acid content were selected, investigated, and verified with the FarmCPU method, which stands for “Fixed and random model Circulating Probability Unification”. The findings of this study lay a foundation and offer technical support for the study of fatty acid trait breeding and the screening and grouping of characteristic dairy cows in China with rich, high-quality fatty acids. It is hoped that in the future, the method established in this study will be able to screen milk sources rich in high-quality fatty acids.

1. Introduction

Milk is a crucial source of human nutrition and essential to people’s lives and the agricultural economy [1]. Milk fat is a vital nutrient and also the primary source of essential fatty acids (linoleic acid and α-linolenic acid) for human beings [2,3,4]. Almost every kind of individual fatty acid has significant implications for human nutrition, health, and disease control [4,5,6].
Fatty acids can be divided into saturated fatty acids (about 70%) and unsaturated fatty acids (about 25% monounsaturated fatty acids and 5% polyunsaturated fatty acids) according to the chemical bonds of carbon chains [3,6,7]. They can also be classified into short-chain fatty acids, medium-chain fatty acids, and long-chain fatty acids according to the carbon chain length [8].
Supplementation and control of human consumption of fatty acids through milk boast essential biological and economic values in the future. Rapid batch examination of milk fatty acid content via genetic parameters accurately screens cows with high-yield high-quality fatty acids, presenting significant breeding value [1,3], and can also provide a reference for feed formula adjustment and healthy and efficient dairy cattle breeding [9].
Fourier Transform Mid-Infrared Spectroscopy (FT-MIRS) can analyze the frequencies and peaks absorbed by specific chemical bonds of molecules in milk substances, which can be utilized to identify the properties or content of milk substances [10,11,12]. FT-MIRS has been extensively utilized in milk composition detection and global dairy herd improvement (DHI) due to its low-cost, simple, rapid, batch-testing, accurate, and nondestructive nature [13,14].
According to the Beer–Lambert law, FT-MIRS absorbance values are linearly correlated with relevant milk components [15]. Thus, as the most widely used machine learning algorithm, partial least squares regression (PLSR) was employed to establish a quantitative prediction model based on FT-MIRS, especially in the establishment of human consumption fatty acid prediction models [9,10,16,17,18]. Before model training, spectral data should be preprocessed, and essential waves should be selected. Preprocessing is the step of performing various transformations and processing of raw spectral data to improve the accuracy and robustness of the model and to reduce noise and interference for better extraction of useful information. Common preprocessing algorithms include Savitzky–Golay (SG) [19], multivariate scatter correction (MSC) [20,21], standard normal variable transformation (SNV) [22], first derivative (D1) [21,23], second derivative (D2) [3], etc. [24]. Moreover, it was revealed that random forest regression (RFR), support vector machine regression (SVR), and ridge regression (RR) algorithms could also construct quantitative prediction models for milk composition [25,26,27].
Fatty acids are a quantitative and complex trait modulated by multiple factors [28], including genetics, physiological properties (breed, parity, stage of lactation, and calving season), and rearing environment [10,29,30,31,32,33,34,35]. In particular, the diet of dairy cows has a significant impact on the fatty acid composition of milk [30,32]. There are some phenotypic and genetic correlations among various human consumption fatty acids and between fatty acids and milk production traits [36,37]. In addition, it was suggested that FT-MIRS waves were heritable and related to the genetic characteristics of relevant milk components [11,38,39,40,41,42]. These results provided a new research idea for selecting high-yield and high-quality fatty acid cows.
Genome-wide association study (GWAS) is a prominent method for analyzing genetic variations of complex traits. There have been numerous studies concerning human/animal/plant genes based on GWAS. Regarding dairy cattle, GWAS has been applied to study milk production, reproduction, and health for years [43,44]. Multiple Single Nucleotide Polymorphism (SNP) chips have been developed to assist detection [45]. 50k, 100k, and 777k SNP chips based on UMD_3.1 have been broadly applied [22,44,46,47,48,49,50,51,52,53,54], while studies based on ARS_UCD 1.2 are fewer [55]. Moreover, the GWAS model based on multiple types of information directly affects the accuracy of SNP screening [51,56]. Currently, models extensively employed in research are the General Linear Model (GLM) and Mixed Linear Model (MLM) based on pedigree and SNP chip information [57,58]. Hence, constructing new models and algorithms for various fields or species is significant to GWAS application development.
Fatty acid traits have medium–high heritability and are controlled by multiple genes [54,59,60,61,62,63], such as the DGAT1, which regulates milk fat synthesis [51]. However, genes regulating fatty acid content in milk and their SNPs remain to be elucidated. GWAS can effectively screen significant SNPs and genes substantially affecting fatty acid content [22,43,51,64].
This study intended to build quantitative models to predict milk fatty acid content in a large sample of Chinese Holstein cows using FT-MIRS. Then, a fatty acid multi-trait model was constructed to evaluate the fatty-acid breeding value, heritability, repeatability, phenotypic and genetic correlations, etc. Eventually, phenotypic indices of fatty acid content predictions were processed for GWAS to screen SNPs significantly associated with fatty acid content. Validation was carried out in an independent herd to further identify fatty acid content and critical SNPs. This study delves into phenotypic traits of milk fatty acid content and its genetic mechanism. It provides new technologies and insights for selecting cows that produce milk rich in high-quality fatty acids and establishing a core cattle herd.

2. Materials and Methods

2.1. Milk Sample Collection, FT-MIRS, and Fatty Acid Content Reference Value Measurement

The Animal Management and Ethics Committee of Huazhong Agricultural University reviewed and approved the experimental protocol for this project (HZAUCA-2019-004). The feeding and management conditions of all cows were similar to previous studies [65]. All cows were reared in large pens with the total mixed ration (TMR) feeding mode (twice a day) with ad libitum access to water and free milking. The TMR mainly contains the following feeds: roughage such as corn silage and alfalfa hay; concentrated feed such as corn, soybean meal, and barley; and vitamins, minerals, and feed additives. The average ambient temperature was 8 °C–27 °C, and the relative humidity was 45–75%.
Milk samples were collected from 458 Chinese Holstein cows from 36 dairy farms in 11 provinces of China from December 2019 to May 2021 (Table S1). According to the NY/T 1450-2007 technical specification for the performance measurement of Chinese Holstein cattle, from each cow was harvested one milk sample via an automatic milking device. Each milk sample (40 mL) was collected and added into new cylindrical bottles (diameter 3.5 cm, height 9 cm). The bottles were numbered in order. Bronopol preservative (0.35 μL) was immediately added to each sampling bottle and slowly shaken to dissolve it thoroughly. Ice bags (2–4 °C) were placed around the milk samples to prevent spoilage during transportation back to the laboratory.
In order to ensure the results’ reliability, all experiments were carried out on fixed instruments in the same laboratory, and the instruments were standardized regularly. After arriving at the laboratory, samples were placed in the water bath at 42 °C for 15–20 min. FT + (Foss, Hillerød, Denmark) milk composition detector was utilized, and the solid fiber probe was dipped into the liquid, mixing, and scanning samples to generate FT-MIRS data. The data consisted of 1060 waves in the wavenumber interval from 925.66 cm−1–5010.15 cm−1, covering two noise-like regions (1597.21 cm−1–1712.95 cm−1, 3063.25 cm−1–3641.95 cm−1); the other areas were fingerprint regions. The reference value of human consumption fatty acids was determined using gas chromatography (GC) of Hou et al. [66]. Lipids were extracted with chloroform/methanol extraction from milk samples before the determination, and then the fatty acids were esterified with a KOH methanol solution. Finally, the supernatant was filtered with a membrane and uploaded to the device. In total, 37 individual fatty acids were determined, expressed as g/100 g total fat.
Milk samples were screened by the following criteria: milk fat percentage 1.5–9%; milk protein percentage 1–7%; somatic cell counts ≤ 1,000,000/mL; the relative error of two repeated measurements of the same fatty acid in the same sample not be greater than 10%; and FT-MIRS standard Mahalanobis distance (GH) ≤ 3, calculated by the following formula:
G H ( x , y ) = ( x y ) T S 1 ( x y )
where x and y denote two random samples, and S is the covariance matrix. A total of 359 milk samples were selected for the subsequent prediction model.

2.2. Fatty Acid Content Prediction Model Construction

According to GC results, 14 individual fatty acids and 8 fatty acid groups were selected to establish the prediction model, and the composition of fatty acid groups is shown in Table 1 [54]. For each type of fatty acid, 75% of samples were randomly chosen for model training and random 10-fold cross-validation, and 25% for model testing. For each prediction model, we separated the windows according to the spectral absorption region characteristics of different chemical functional groups, and then manually added or subtracted the critical wavelengths of each window; in the end, 459–709 characteristic waves were selected. The spectrum of water expression was not deleted when selecting characteristic waves because some studies showed that the absorbance of human consumption fatty acid-related chemical functional groups might be related to water absorbance [39,67,68,69,70]. FT-MIRS data were preprocessed twice before the model establishment, with first derivative (D1) or second derivative (D2) employed [71]. Studies have demonstrated that the D1 and D2 preprocessing methods contribute to enhancing the signal and extracting spectral points relevant to milk fatty acids. The model was trained using the PLSR algorithm, and the principal components of PCA were 8–10. The development environment was Jupyter Notebook version 6.5.4 based on Python version 3.8.
The coefficient of determination (Rcv2) and root mean square error (RMSEcv) between the cross-validation set and the predicted phenotypes, the coefficient of determination (Rt2) and root mean square error (RMSEt) between the test set and the predicted phenotypes, and the residual prediction bias (RPDcv) of the cross-validation set were used to evaluate the performance of the prediction models. The optimal model was obtained for each sort of fatty acid. All the above evaluation metrics were calculated using the SciPy package in Python version 3.8; the formulas are as follows:
R M S E = n = 1 N y n y ^ n 2 N R 2 = 1 n = 1 N y n y ^ n 2 n = 1 N y n y 2
R P D = S T D E V R M S E
where y n and y ^ n represent the reference value and predicted value. y ¯ is the mean of the y values, N represents the sample size, and STDEV represents the standard deviation of the sample.

2.3. Fatty Acid Content Prediction and Correction

Considering the samples from North China accounted for the most significant proportion in the prediction model with about 1/3 of the total samples, 2138 Chinese Holstein cows were selected from five cattle farms in North China. Corresponding pedigree data were collected and statistically analyzed. The prediction models based on 14 individual fatty acids and 8 fatty acid groups predicted 22 fatty acids in 12,236 milk samples from 2138 cows. Corrected phenotypic values of 22 fatty acids were obtained using the general linear mixed model established by the R (V4.2.1) package lmertest. The corrected formula is as follows:
Y i j k m = μ + P i + L j + S k + H T D m + e i j k m
where μ stands for the mean value of the predicted traits; Pi represents parity, which is divided into 5 levels (i = 1, 2, 3, 4 and 5–10); Lj denotes the lactation period, ranging from 5 to 365 days, divided into 12 levels with 30 days as a unit (j = A–L); Sk stands for calving season effect, which is divided into 4 levels (k = Spring, Summer, Autumn, and Winter); HTDm represents herd-test-date random effects (the date format: DD/MM/YY); eijklm is the random residual effect. As for GWAS, the corrected phenotype was the mean of all predicted values per cow minus associated fixed effects (Folder S1).

2.4. Genetic Parameter Evaluation

In this study, the multi-trait animal model in HiBLUP v1.5.0 (a new high-throughput computing software) was used to construct the relationship matrix (HRM) based on pedigree and genome data [72]. This was the case so that the influence of the “bull factor” would already be included in the model, which takes into account the combined effects of pedigree and genotype. This approach (of not including the bull factor when using HiBLUP) has been already used to capture genetic variance in high-throughput genomic studies. There are 2578 individual animals included in the pedigree. The variance components of fatty acid traits were estimated using the SSGBLUP model with the AIREML algorithm [73,74], and the heritability, repeatability, and phenotypic and genetic correlations of the traits were calculated based on the variance. Moreover, the individual breeding value was estimated. The accuracy of breeding value estimation was assessed by the correlation coefficient between the phenotype and individual breeding value after model correction. Because the phenotypic and genetic correlations between 22 fatty acids and 5 conventional milk components (milk fat, milk protein, lactose, total solids, and urea nitrogen, measured by FOSS milk component analyzer) were to be investigated in the process of genetic parameter estimation, 27 traits are in the multi-model analysis. Taking the predicted values of two traits as an example, the model is defined as follows:
y 1 y 2 = X 1 0 0 X 2 b 1 b 2 + Z 1 0 0 Z 2 u 1 u 2 + W 1 0 0 W 2 c 1 c 2 + e 1 e 2
In the model, y1 and y2 represent the vectors of predicted trait values; and b1 and b2 stand for vectors of fixed effects, including the parity effect and lactation effect. Parity is divided into 5 levels (1, 2, 3, 4, and 5–10) and lactation (5 to 365 days) is divided into 12 levels with 30 days. u1 and u2 are vectors of individual additive genetic effects, which distributes in u-N (0, σ u 2 ). c1 and c2 are individual permanent environmental effects vectors, which accord with the distribution c-N (0, σ c 2 ). e1 and e2 are residual effect vectors, consistent with the distribution e-N (0, σ e 2 ). X1, X2, Z1, Z2, W1, and W2 are the structure matrices of fixed effects, individual additive genetic effects, and permanent environmental effects, respectively. The maximum number of iterations of the AIREML algorithm is 300 and the convergence is assessed by the formula:
Σ β t β t 1 2 Σ β t 2 < 1 × 10 6
where β is the vector of all variances and covariances to be estimated, and t is the number of iterations.
In the estimation of model variance components, σ p 2 denotes the total variance of the predicted value, σ u 2 is the additive genetic variance, σ c 2 describes the permanent environmental variance, and σ e 2 represents the residual variance. Thus, σ p 2   = σ u 2   +   σ c 2   + σ e 2 . The formulas for heritability and repeatability are h2 = σ u 2 σ u 2 + σ c 2 + σ e 2 and t = σ u 2 + σ c 2 σ u 2 + σ c 2 + σ e 2 , respectively, and the formulas for genetic and phenotypic correlations are rG = C o v u 1 u 2 σ u 1 σ u 2 and rP = C o v p 1 p 2 σ p 1 σ p 2 .

2.5. GWAS and Screening of Fatty Acid-Related SNPs

In this study, Illumina Bovine SNP 50k (47873 SNPs, genome version ARS_UCD 1.2) was applied to detect the genotypes of 1873 Chinese Holstein cows. The selected cows were healthy, with consistent feeding and management conditions and no illness within half a year. The parity was 1–10, the lactation period was 5–365 d, and the calving season was evenly distributed in spring, summer, autumn, and winter. GWAS was performed to preliminarily screen significant SNPs based on the corrected fatty acid content of 1873 Chinese Holstein cows. 50k chip data quality control was performed before GWAS. A total of 30 chromosomes were selected for SNP analysis, including 29 autosomes (numbered 1–29) and 1 sex chromosome (X). A total of 47,873 SNP genotypes of 1873 dairy cows were analyzed by Plink 1.9 software [75]. The SNP loci and individual cows with a genotype detection rate greater than 0.98 and SNPs with a Hardy–Weinberg equilibrium of less than 0.0001 were screened out. Finally, the remaining 1712 cows and 42,500 SNP were obtained for fatty acid GWAS.
The analysis tool was the rMVP package in the R language [76]. The relationship matrix was constructed using pedigree data, phenotypic data, and SNP markers. In order to control the impact of population structure and kinship on GWAS and reduce the generation of false positives, three models were used in rMVP: the general linear model (GLM), mixed linear model (MLM), and FarmCPU were adopted to analyze the relationship matrix. The GLM used principal component analysis; the formula can be presented as y = PCs + S + e, where y represents the phenotype, PCs represents the principal components of the population structure, S represents the gene markers, and e represents the residual [77]. The MLM adds kinship (K) as a covariate to improve the test power of the model (y = PCs + S + K + e) [78], while FarmCPU eliminates the confounding between PCs and K, allowing fixed effect and random effect models to be executed independently [79]. The value of PCs in this study was determined to be 5. SNPs significantly associated with fatty acid content were preliminarily selected based on three models above. The threshold for acceptance in GWAS is approximately 1 × 10 6 , calculated via 0.05 ( a l p h a   v a l u e ) 42500 ( S N P   m a k e r   s i z e ) . We use the gene-based module in ANNOVAR to annotate candidate genes corresponding to SNPs [80].
Based on the preliminarily selected SNPs and their DNA sequences, the SNPs of 265 Chinese Holstein cows from an independent herd were sequenced using the sequencing technology, and the SNP genotypes of 265 Chinese Holstein cows were obtained. The association between SNP and fatty acid content was analyzed by Bonferroni ANOVA with an alpha value of 0.05. Each SNP was compared on 3 genotypes (AA, AB, and BB) and important SNPs significantly correlated with fatty acid were further verified and identified.

3. Results

3.1. Milk Fatty Acid Content Trait Analysis

Table 2 shows the fat reference values in two units (g/100 g of total fat, percentage of total fatty acid). The average content of individual fatty acid (in units of g/100 g total fat) was 0.10 (C18:3,n3cis-9,12,15)–12.74 (C16:0). The coefficient of variation (CV) was 48.60% (C16:0)–90.55% (C4:0). The mean value of fatty acid groups was 1.33 (PUFA)–35.22 (TFA), and the CV was 48.55% (SFA)–58.07% (SCFA). The mean values of individual fatty acids in the percentage of total fatty acids ranged from 0.26 (C18:3,n3cis-9,12,15) to 36.37 (C16:0), and the coefficients of variation ranged from 9.69% (C16:0) to 92.01% (C4:0). The average content of the fatty acid groups ranged from 3.71 (PUFA) to 72.30 (SFA) with coefficients of variation ranging from 6.23% (SFA) to 37.76% (SCFA). The results indicate a wide range of milk fatty acid content of dairy cows from 36 dairy farms in 11 provinces in China; therefore, this dataset is suitable for model establishment.

3.2. FT-MIRS-Based Fatty Acid Prediction Model

Figure 1 illustrates the FT-MIRS absorbances of all milk samples for prediction model development. Figure 1a delineates raw spectra, and Figure 1b represents averaged spectra. All prediction models in this study were based on the first derivative (D1) or second derivative (D2) FT-MIRS data preprocessing methods, and preprocessed spectra are shown in Figure 1c,d. Table 3 summarizes the relevant information and model evaluation indicators of the optimal prediction models for 22 fatty acids, including preprocessing methods, number of characteristic waves, Rcv2, RMSEcv, Rt2, RMSEt, and RPDcv.
The model preprocessing method presents a certain regularity: among individual fatty acids, D1 was used more frequently, while D2 was used uniformly in fatty acid groups, suggesting that D1 was more suitable for individual fatty acids with low content than D2. Rcv2 ranged from 0.59 (C18:0) to 0.76 (C4:0), and Rt2 ranged from 0.44 (C18:3,n3 cis-9, 12, 15) to 0.74 (SCFA). Among individual fatty acids, Rcv2 and Rt2 were apt to decrease with the carbon chain lengthening, and Rt2 exhibited more significant variation. The Rcv2 of single unsaturated fatty acid is higher than that of saturated fatty acid in the same carbon chain. Among fatty acid groups, except MUFA and PUFA, Rcv2 and Rt2 manifested congruous regularity, SFA > TFA > UFA, SCFA > MCFA > LCFA, which further indicated that model performance was negatively correlated with carbon chain length. In addition, it was found that the overall levels of Rcv2 and Rt2 of fatty acid groups were slightly higher than those of single fatty acids, suggesting that fatty acid content and carbon chain length could influence model performance. The RPDcv ranged from 1.57 (C18:0) to 2.04 (C4:0), indicating that the model performed acceptably and could predict fatty acid content and be used in genetic parameter estimation and GWAS for subsequent dairy herds.

3.3. Genetic Parameter Estimates of Fatty Acid Content

3.3.1. Heritability, Repeatability, and Individual Breeding Value

Table 4 describes the estimates of variance components, heritability, repeatability, and individual breeding values of 22 fatty acids. Except for C17:0, all 22 fatty acids had medium heritability and medium to high repeatability. Among individual fatty acids, C12:0 (0.34) had the highest heritability, and C17:0(0.20) had the lowest. SFA (0.31) exhibited the highest heritability among fatty acid groups, and MCFA (0.27) showed the lowest. The heritability of individual fatty acids varied substantially, while fatty acid groups had similar heritability with a high overall level, consistent with repeatability. This indicates that the differentiating features of single fatty acids were more prominent, while those of fatty acid groups might be weakened by containing multiple fatty acids.
Regarding the individual breeding value of dairy cows, the value of fatty acids varied widely and the accuracy was estimated to be high (0.54–0.82). In total, 15 of the 22 fatty acids had accuracy above 0.7, denoting that the individual breeding value estimation method in this study could be used to screen dairy cows with high or low fatty acid content and could improve the breeding efficiency of fatty acid production traits.

3.3.2. Genetic and Phenotypic Correlations

Table 5 illustrates the estimates of genetic and phenotypic correlations among 27 traits (22 fatty acids and 5 conventional milk components), and the two correlations had similar trends. For further description of genetic findings, see Table 5:
Correlations between fatty acids: Except for C4:0 and C14:1, cis-9 (−0.01), other fatty acids were positively associated (0.04–1.00). And correlations between individual fatty acids and fatty acid groups were medium to high. The heritability of almost all fatty acid groups except LCFA was above 0.6. Correlations between individual fatty acids and fatty acid groups were primarily low to moderate, and C8:0, C16:0, C17:0, and C18:0 were moderately/highly associated with fatty acid groups. Furthermore, fatty acids with similar carbon chain length or similar chemical functional groups had generally high correlations, and the correlation between single unsaturated fatty acids, UFA, MUFA, PUFA, and saturated fatty acids or other fatty acid groups was generally low. Overall, genetic correlation among fatty acids had an evident regularity, involving carbon chain length, chemical functional group type, and saturation degree, which could be used as a reference in the breeding process of fatty acid production traits.

3.4. Screening and Identification of Fatty Acid-Related SNPs and Genes

3.4.1. Comparison and Filtering of the Three GWAS Methods

The GLM, MLM, and FarmCPU methods were applied to the GWAS of 22 fatty acid content, as shown in the quantile–quantile plot (Figure 2a). The distribution of 42,500 SNPs on 30 chromosomes is shown in Figure 2b. Taking C6:0 as an example (the results of other fatty acids were similar, see the link for details: https://github.com/YikaiFan0908/FA (accessed on 3 July 2024)), Figure 2a demonstrated many false positives among selected significant SNPs via the GLM. The MLM significantly improved on false positive control but did not select significant SNPs. FarmCPU combines the advantages of the first two methods, which can control false positives and have more robust SNP screening performance. Thus, the FarmCPU method was selected for GWAS and validation.

3.4.2. GWAS, Screening, and Identification of Fatty Acid Content Associated SNPs

Based on the new version of the ARS_UCD 1.2 genome, SNP genotypes of 1873 Chinese Holstein cows were detected by 50k SNP microarray. Then, SNPs associated with fatty acid content were identified by the FarmCPU model. Table 6 exhibits SNP data and validation of preliminarily selected SNPs. In this study, 39 SNPs screened from 20 chromosomes were prominently associated with 17 fatty acids (13 single fatty acids and 4 fatty acid groups), of which 8 SNPs were selected twice. Hence, the total number of SNPs was 47. Of the 17 fatty acids, 14 were significantly associated with 2–5 SNPs each. Five SNPs were correlated with C8:0, C14:1, and cis-9, and four SNPs were associated with C6:0 and C10:0. A total of 37 of the 47 SNPs were matched with corresponding genomes from reference genomes.
The analysis of the effect of 3 SNPs on the content of short-chain fatty acids showed the following: The C6:0 content of AA genotype cows with SNP (ARS-BFGL-NGS-22276) was significantly higher than that of AB and BB genotypes, showing AA > AB > BB, indicating that allele A has an up-regulation effect on C6:0 expression compared with B (Figure 3a). The AA genotype of SNP (ARS-BFGL-NGS-33001) had significantly lower C8:0 content than the BB genotype, denoting BB > AB > AA, indicating that the allele A had a down-regulation effect on C8:0 expression compared with the allele B (Figure 3b). The AB genotype of SNP (Bovine HD 0100019865) had significantly higher C10:0 content than the BB genotype, showing AB > AA > BB, suggesting that heterozygotes may have an up-regulation effect on C10:0 expression compared with homozygotes (Figure 3c).
Individual medium-chain fatty acids: 1, 2, 5, 3, and 3 SNPs were found to be significantly associated with C12:0, C14:0, C14:1, cis-9, C15:0, and C16:0, respectively. The 14 SNPs were distributed on 8 chromosomes, 7 of which were located in introns, 3 in intergenic regions, and 4 were not associated with corresponding genes. C15:0 and C16:0 had identical SNPs, BovineHD0400022351, BovineHD2200010872, and BovineHD3000019134, which were located in an unknown gene region, the intron of SYNPR, and the intron region of TRPC5. After ANOVA with the Bonferroni test, two SNPs were identified from 14 SNPs, Bovine HD 3000029498 (C14: 0, p-value Bonferroni = 3.56 × 10 2 ) and Bovine HD 1600000152 (C14:1,cis-9, p-value Bonferroni = 1.57 × 10 2 ), which were located between CASK (dist = 28,036) and NYX (dist = 6763) and between ENSBTAG00000050274 (dist = 75,663) and ADORA1(dist = 18,476), respectively.
Analysis of the effect of the two SNPs on the content of medium-chain fatty acids showed the following: The C14:0 content of AA genotype cows with SNP (Bovine HD 3000029498) was significantly higher than that of AB and BB genotypes, showing AA > AB > BB, indicating that allele A has an up-regulation effect on C14:0 expression compared with B (Figure 3d). The AA genotype of SNP (Bovine HD 1600000152) had significantly lower C14:1,cis-9 content than the AB and BB genotypes, denoting AB > BB > AA, indicating that homozygotes may result in the down-regulation of C14:1, cis-9 expression compared with heterozygotes (Figure 3e).
Individual long-chain fatty acids: 2, 3, 3, and 3 SNPs were found to be significantly associated with long-chain fatty acids C18:0, C18:1,n9 cis-9, C18:2,n6 cis-9/12, C18:3,n3 cis-9/12/15, respectively. These 11 SNPs were distributed on 9 chromosomes. Among them, 5 were located within intergenic regions, 2 in introns, 1 in exon, and 3 were unrelated to corresponding genes. Via ANOVA with the Bonferroni test, ARS-BFGL-NGS-15402 was found to have a considerable effect on C18:3, n3 cis-9/12/15 (p-value Bonferroni = 8.34 × 10 3 ) and was located in the intron of TTC29.
The post-validation effect analysis for the SNP showed the following: The C18:3, n3 cis-9/12/15 of cows with the AA genotype was significantly lower than that of cows with the BB genotype, and BB > AB > AA, manifesting that allele A had a down-regulation effect compared with allele B (Figure 3f).
Fatty acid groups: 1, 2, 3, and 3 SNPs were found to be significantly correlated with UFA, MUFA, SCFA, and LCFA by 50k SNP microarray, respectively. The 7 SNPs were distributed on 5 chromosomes, 4 located in introns, and 3 were not associated with corresponding genes. ARS-BFGL-NGS-27018 was significantly relevant to UFA and MUFA. After ANOVA with the Bonferroni test, Bovine HD 1300022078 was confirmed to have a remarkable effect on LCFA (p-value Bonferroni = 4.61 × 10 3 ) and was located within the intron of EYA2. The post-validation effect analysis revealed that the LCFA content of AB genotype cows was significantly lower than that of BB genotype cows, and BB > AA > AB, indicating that heterozygotes may have a down-regulation effect compared with homozygotes (Figure 3g).

4. Discussion

4.1. Reference Value of Fatty Acid Measurements and Prediction Model Construction

Individual fatty acids in milk are difficult to determine due to their low concentration. Measurements from various studies vary globally [9,10,21,81,82]. The reference values in this study were collected from 36 large-scale cattle farms with more than 1000 cows. Given China’s enormous population and copious milk demand, massive numbers of dairy cows are reared by large-scale cattle farms in large pens with the TMR feeding mode. Only minor cattle farms and individual farmers use grazing systems. Thus, reference value measurements in this study differ from those in countries where dairy cows graze. Even though the reference values of each fatty acid essentially conformed to the normal distribution, the overall CV for g/100 g total fat was high, with an average of 57.31%. Yet, the average CV for the percentage of total fatty acid was reduced to 26.02% and more consistent with other studies [24]. This suggests that the samples were diverse and representative and applied to the model establishment [83]. A larger variation will improve the generalization ability of the prediction model, and g/100 g total fat can be used to calculate the absolute content of fatty acids in milk, which is more valuable for evaluating cow production performance and milk quality.
This study established 22 fatty acid prediction models based on the most widely used PLSR algorithm, and only D1 or D2 was performed for spectral preprocessing. The purpose was to reduce the differentiation between prediction models during prediction caused by distinct preprocessing or machine learning algorithms, which would affect genetic parameter estimates or GWAS. Similar to the results of Soyeurt et al., Rcv2 and Rt2 both kept high levels [9,10,16,18,84,85], and it was found that the performance of the superior model was related to carbon chain length, saturation degree, and substance content. The model developed in this study has poor predictive ability for some of the LCFA, e.g., C18:0, C7:0. The main reason for this is the low content of LCFA and the high proportion of unsaturated fatty acids in LCFA, which are relatively chemically unstable [86]. For example, Tiplady et al. also reported that Rcv2 of SCFA, MCFA, and LCFA decreased with the lengthening of carbon chains [22]. Meanwhile, Soyeurt et al. revealed that saturated fatty acids performed better than unsaturated fatty acids [3]. Compared to Zhao’s study, ours involved 36 ranches in 11 provinces, with a wider generalization capacity and application of the model [87]. Thus, the model established in this study was superior and represented the characteristics of Chinese Holstein cows, which could be used to predict the fatty acid content of Chinese Holstein cows and analyze their genetic characteristics.

4.2. Fatty Acid Genetic Parameter Estimates and Comparison Analysis

Improving milk composition and quality-related traits via FT-MIRS is a general trend [11]. In this study, we used HiBLUP (https://www.hiblup.com/ (accessed on 3 July 2024)), a high-performance computing software developed by our laboratory, to estimate the genetic parameters of dairy cows for the first time. The HRM was constructed by making full use of genome and pedigree information and converged in a finite number of iterations. This study demonstrated moderate heritability, similar to Freitas et al. study. SFA (h2 = 0.305) showed the highest heritability among fatty acid groups [52,54,83]. Our repeatability estimates were higher than in previous studies. Tiplady et al., Lopez-Villalobos et al. and other studies reported low to medium repeatability of fatty acids with universally high breeding accuracy [22,37]. This indicated that our genetic analysis method and the predictions using FT-MIRS models could preliminarily screen dairy cows with high or low fatty acid content.
Furthermore, similar to the results of Lopez-Villalobos et al., phenotypic and genetic correlation exhibited identical trends. Milk protein was negatively associated with unsaturated fatty acids, while it was positively correlated with saturated fatty acids genetically. Lactose was negatively correlated with most fatty acids [37]. Similar to the study of Eskildesn C.E et al., there were high correlations between milk fat and fatty acids, and between different fatty acids in this study, which was affected by factors such as breed and feed [85]. Unlike previous studies, the correlations in this study were almost all positive. The reason might be that several models did not have high accuracy among the 22 fatty acid prediction models, leading to deviation. Also, this was possibly due to the low fatty acid production in the selected dairy herd with g/100 g of total fat as the unit, resulting in insignificant interaction between different fatty acids, suggesting room for breeding improvement. Much published work on milk fatty acids has found that correlations with FTIR data are improved when the fatty acid data are reported on the basis of g/dl milk instead of g/100 g fat [11,83]. Given overall positive correlations among fatty acids, we recommend selecting cows with high total fatty acid production, assuring high levels of single fatty acid and fatty acid groups. Subsequently, characteristic cows rich in a certain fatty acid can be selected according to genetic correlation and phenotypic correlation to establish the characteristic core group of dairy cows.

4.3. Fatty Acid-Related Molecular Markers and Corresponding Genes

Most existing research on fatty acid GWAS and SNP screening is based on the reference values determined by GC or other chemical methods [52,83,88]. Accurate fatty acid phenotype results can contribute to efficient SNP marker mining. However, determining sample reference values is time-consuming with a high cost, unable to obtain a large number of fatty acid phenotypes in a short time or to perform GWAS, breeding value estimation, and high-yield dairy cow selection in large herds. Nevertheless, GWAS based on the predicted values of fatty acid models established with FT-MIRS can effectively solve this problem, and the accuracy of the prediction model is required. In recent years, there are also reports on GWAS of fatty acids based on model predictions [51,54]. Tiplady et al. compared fatty acid GWAS results from direct measurement and predicted by the FT-MIRS model, showing that SNPs screened by the two methods were partially the same. This revealed that FT-MIRS model predictions have the potential to be used as alternative reference values to investigate genetic variation in fatty acids [22]. Thus far, there are no reports on GWAS using FT-MIRS model prediction in Chinese Holstein cows.
Researchers have previously conducted GWAS on the fatty acid profile in milk, and most studies have found that the two genes DGAT1 and SCD are significantly associated with milk fatty acids based on UMD_3.1 [22,24,43,49,51,88]. In addition, Tiplady et al. found that CCDC57 and GPAT4 may be important fatty acid-related candidate genes [22]; Cruz et al. considered PLBD1 and MGST1 to be important additional candidate genes in Holstein cattle [51]; Buitenhuis et al. proposed that ACSS3 is a dominant candidate gene for the QTL of C10:0 and C15:0 on chromosome 5 [88]; Li et al. identified 20 novel promising fatty acid candidate genes including HTR1B, CPM, and PRKG1 [24]. In an attempt to explore whether novel SNPs significantly associated with fatty acids can be identified, this study applied ARS_UCD 1.2 for analysis. Compared with the UMD_3.1, which is used in more studies, this version has been supplemented and updated with more SNP information and is more likely to mine new SNPs that differ from previous findings; for example, Freitas et al. discovered many new SNPs related to milk fatty acids on chromosomes 5, 13, and 14 based on ARS_UCD 1.2 [54]. Therefore, close to our expectations, this study screened and identified 7 novel SNPs significantly associated with fatty acids that had not been previously reported. Bioinformatics analysis indicated that these SNPs might directly or indirectly affect gene expression regulation. TMEM120B played a vital role in adipocyte differentiation [89,90]; CASK was involved in mammary gland development, milk yield, milk fat percentage, milk protein percentage, and coagulation traits [91,92]; EVC and EYA2 play an essential regulatory role in growth and development of dairy cows [93,94]; ADORA1 was related to immune system development and regulation [95,96]. Relevant regulatory mechanisms are worth further exploration. Moreover, since cow herds for verification and identification were derived from different pastures with a small sample size, only 7 of the 47 SNPs initially screened in this study were validated to be significant in another cattle herd. Yet, other SNPs were also assumed to be crucial for fatty acid content. Specifically, 8 SNPs were preliminarily selected, showing significant associations with two fatty acids, and 5 of them denoted a strong correlation with carbon chain length or fatty acid type. Meanwhile, Bovine HD 0100019865 was proved to have a significant impact on C10:0 content and a tendency to affect C12:0 content (p-value Bonferroni = 8.34 × 10 2 ) significantly. Hence, further validation and identification are required for the 47 SNPs selected in this study.

4.4. Limitations of the Study

Since cow herds for verification and identification were derived from different pastures with a small sample size, only 7 of the 47 SNPs initially screened in this study were validated to be significant in another cattle herd. Yet, other SNPs were also assumed to be crucial for fatty acid content. Hence, further validation and identification are required for the 47 SNPs selected in this study. In addition, the regulatory mechanisms of fatty acid regulation by the 7 novel SNPs screened in this study merit further exploration. The density of the gene chip is only 50k, and the chip density is low. Only one site is related to the DGAT1 gene, which may lead to missed SNP selection.
Moreover, the accuracy of several models needs to be improved, which could introduce bias in the results. Additionally, the general applicability of the model needs to be verified by methods such as cross-farm and other effect tests. Furthermore, the fatty acid yield in the groups of cows participating in the study was low, meaning that the interactions between the different fatty acids might not be significant enough.
Furthermore, this study was conducted on farms with TMR feeding, and the findings may be more applicable to pastures of the same type rather than to grazing dairy cows.

5. Conclusions

This study constructed quantitative prediction models for 14 individual fatty acids and 8 fatty acid groups using FT-MIRS, a D1 or D2 spectral preprocessing algorithm, and a PLSR machine learning algorithm. The model performance was favorable and could be adopted for the quantitative prediction of milk fatty acids. The multi-trait model based on HiBLUP can assess multiple genetic parameters, e.g., individual breeding value, heritability, repeatability, phenotypic correlation, and genetic correlation of dairy cattle. It was shown that Chinese dairy cattle had medium fatty acid heritability and moderate to high repeatability with highly accurate breeding value estimates. Genetic correlations between different fatty acids were predominantly positive. Seven novel SNPs significantly correlated with fatty acid levels were identified using the predicted fatty acid content values from the quantitative prediction models, 50k SNP microarray, DNA sequencing, and superior GWAS algorithm (FarmCPU). The seven SNPs were located in or adjacent to 9 genes that regulate fatty acid synthesis. Also, the prediction model of fatty acid content based on FT-MIRS has high accuracy and versatility. More new SNPs and genes can be identified using higher-density chips or deep resequencing (lower-density chips selected seven significant SNPs) that would be worthy of further probation. The above model and results lay a foundation and provide technical support for detecting milk fatty acid content and investigating genetic characteristics, as well as dairy cow breeding of fatty acid traits. It is hoped that, in the future, the method established in this study will be able to screen milk sources rich in high-quality fatty acids.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ani14192901/s1, Table S1: Distribution of samples; Folder S1: Corrected phenotypes.

Author Contributions

C.L., Y.F. and S.Z. conceived the project and designed the research; C.L., Y.F., D.W. and X.S. performed the data analysis; C.L., Y.F., D.W. and S.Z. wrote the manuscript; C.C., H.W., X.L., L.N. and X.R. performed edits and revisions to the paper; S.C., Q.Y., J.N., J.L. and Y.M. provided the funding acquisition. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Inter-Governmental International Science and Technology Cooperation Project of the State Key Research and Development Program, grant number 2021YFE0115500, Fundamental Research Funds for the Central Universities, grant number 2662023DKPY001, and Dairy Industry Innovation Team Genetic Resources Development and Utilization Cooperation Project of Hebei Province.

Institutional Review Board Statement

This study was reviewed and approved by the Animal Management and Ethics Committee of Huazhong Agricultural University, with the approval number HZAUCA-2019-004.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding authors on reasonable request.

Acknowledgments

The authors greatly appreciate the efforts of the Dairy Cattle Research Center, Shandong Academy of Agricultural Sciences (Jianbin Li and Gaozhan Cai), Ningxia Hui Autonomous Region Animal Husbandry Workstation (Wan Wen and Weiqi Li), Beijing Dairy Cattle Center (Lin Liu and Yang Zou), Jiangsu Province Dairy Herd Improvement Center (Yafei Cai and Zhenhai Xia), Inner Mongolia Saikexing Livestock Seed Industry and Breeding Biotechnology Research Institute Co., Ltd. (Wei Sun, Tiezhu Zhang and Hui Chi), Henan Province Province Dairy Herd Improvement Center (Zhen Zhang and Lei Yan), Shanghai Dairy Breeding Center Co., Ltd. (Pengpeng An and Zhen Liu), Guangdong Province Center for Agriculture Technology Extension (Yuanyi Yan and Jianchao Guo), Hubei Province Province Dairy Herd Improvement Center (Guiqiang Wang and Zeping Qin), Animal Husbandry and Veterinary Workstation of the Eighth Division of Xinjiang Production and Construction Corps (Kaibin He and Ye Xu), and Hebei Bovine Germplasm Resources Technology Innovation Center for providing the milk samples.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Fleming, A.; Schenkel, F.S.; Chen, J.; Malchiodi, F.; Bonfatti, V.; Ali, R.A.; Mallard, B.; Corredig, M.; Miglior, F. Prediction of milk fatty acid content with mid-infrared spectroscopy in Canadian dairy cattle using differently distributed model development sets. J. Dairy Sci. 2017, 100, 5073–5081. [Google Scholar] [CrossRef] [PubMed]
  2. Haug, A.; Hostmark, A.T.; Harstad, O.M. Bovine milk in human nutrition—A review. Lipids Health Dis. 2007, 6, 25. [Google Scholar] [CrossRef] [PubMed]
  3. Soyeurt, H.; Dehareng, F.; Gengler, N.; McParland, S.; Wall, E.; Berry, D.P.; Coffey, M.; Dardenne, P. Mid-infrared prediction of bovine milk fatty acids across multiple breeds, production systems, and countries. J. Dairy Sci. 2011, 94, 1657–1667. [Google Scholar] [CrossRef] [PubMed]
  4. Bhavadharini, B.; Dehghan, M.; Mente, A.; Rangarajan, S.; Sheridan, P.; Mohan, V.; Iqbal, R.; Gupta, R.; Lear, S.; Wentzel-Viljoen, E.; et al. Association of dairy consumption with metabolic syndrome, hypertension and diabetes in 147 812 individuals from 21 countries. BMJ Open Diabetes Res. Care 2020, 8, e000826. [Google Scholar] [CrossRef] [PubMed]
  5. Caredda, M.; Addis, M.; Ibba, I.; Leardi, R.; Scintu, M.F.; Piredda, G.; Sanna, G. Prediction of fatty acid content in sheep milk by Mid-Infrared spectrometry with a selection of wavelengths by Genetic Algorithms. LWT 2016, 65, 503–510. [Google Scholar] [CrossRef]
  6. Liu, X.-M.; Zhang, Y.; Zhou, Y.; Li, G.-H.; Zeng, B.-Q.; Zhang, J.-W.; Feng, X.-S. Progress in Pretreatment and Analysis of Fatty Acids in Foods: An Update since 2012. Sep. Purif. Rev. 2019, 50, 203–222. [Google Scholar] [CrossRef]
  7. Shingfield, K.J.; Chilliard, Y.; Toivonen, V.; Kairenius, P.; Givens, D.I. Trans fatty acids and bioactive lipids in ruminant milk. Adv. Exp. Med. Biol. 2008, 606, 3–65. [Google Scholar] [CrossRef]
  8. Wang, F.; Chen, M.; Luo, R.; Huang, G.; Wu, X.; Zheng, N.; Zhang, Y.; Wang, J. Fatty acid profiles of milk from Holstein cows, Jersey cows, buffalos, yaks, humans, goats, camels, and donkeys based on gas chromatography–mass spectrometry. J. Dairy Sci. 2022, 105, 1687–1700. [Google Scholar] [CrossRef]
  9. Rutten, M.J.; Bovenhuis, H.; Hettinga, K.A.; van Valenberg, H.J.; van Arendonk, J.A. Predicting bovine milk fat composition using infrared spectroscopy based on milk samples collected in winter and summer. J. Dairy Sci. 2009, 92, 6202–6209. [Google Scholar] [CrossRef]
  10. Soyeurt, H.; Dardenne, P.; Dehareng, F.; Lognay, G.; Veselko, D.; Marlier, M.; Bertozzi, C.; Mayeres, P.; Gengler, N. Estimating fatty acid content in cow milk using mid-infrared spectrometry. J. Dairy Sci. 2006, 89, 3690–3695. [Google Scholar] [CrossRef]
  11. Tiplady, K.M.; Lopdell, T.J.; Littlejohn, M.D.; Garrick, D.J. The evolving role of Fourier-transform mid-infrared spectroscopy in genetic improvement of dairy cattle. J. Anim. Sci. Biotechnol. 2020, 11, 39. [Google Scholar] [CrossRef] [PubMed]
  12. Brand, W.; Wells, A.T.; Smith, S.L.; Denholm, S.J.; Wall, E.; Coffey, M.P. Predicting pregnancy status from mid-infrared spectroscopy in dairy cow milk using deep learning. J. Dairy Sci. 2021, 104, 4980–4990. [Google Scholar] [CrossRef] [PubMed]
  13. Grelet, C.; Dardenne, P.; Soyeurt, H.; Fernandez, J.A.; Vanlierde, A.; Stevens, F.; Gengler, N.; Dehareng, F. Large-scale phenotyping in dairy sector using milk MIR spectra: Key factors affecting the quality of predictions. Methods 2021, 186, 97–111. [Google Scholar] [CrossRef]
  14. Du, C.; Nan, L.; Yan, L.; Bu, Q.; Ren, X.; Zhang, Z.; Sabek, A.; Zhang, S. Genetic Analysis of Milk Production Traits and Mid-Infrared Spectra in Chinese Holstein Population. Animals 2020, 10, 139. [Google Scholar] [CrossRef]
  15. Swinehart, D.F. The Beer-Lambert Law. J. Chem. Educ. 1962, 39, 333. [Google Scholar] [CrossRef]
  16. Soyeurt, H.; Dardenne, P.; Dehareng, F.; Bastin, C.; Gengler, N. Genetic parameters of saturated and monounsaturated fatty acid content and the ratio of saturated to unsaturated fatty acids in bovine milk. J. Dairy Sci. 2008, 91, 3611–3626. [Google Scholar] [CrossRef]
  17. Soyeurt, H.; Dehareng, F.; Mayeres, P.; Bertozzi, C.; Gengler, N. Variation of Δ9-desaturase activity in dairy cattle. J. Dairy Sci. 2008, 91, 3211–3224. [Google Scholar] [CrossRef]
  18. Bresolin, T.; Dórea, J.R.R. Infrared Spectrometry as a High-Throughput Phenotyping Technology to Predict Complex Traits in Livestock Systems. Front. Genet. 2020, 11, 923. [Google Scholar] [CrossRef]
  19. Savitzky, A.; Golay, M.J.E. Smoothing and Differentiation of Data by Simplified Least Squares Procedures. Anal. Chem. 1964, 36, 1627–1639. [Google Scholar] [CrossRef]
  20. Rinnan, Å.; Nørgaard, L.; Berg, F.v.d.; Thygesen, J.; Bro, R.; Engelsen, S.B. Chapter 2—Data Pre-processing. In Infrared Spectroscopy for Food Quality Analysis and Control; Sun, D.-W., Ed.; Academic Press: San Diego, CA, USA, 2009; pp. 29–50. [Google Scholar]
  21. De Marchi, M.; Penasa, M.; Cecchinato, A.; Mele, M.; Secchiari, P.; Bittante, G. Effectiveness of mid-infrared spectroscopy to predict fatty acid composition of Brown Swiss bovine milk. Animal 2011, 5, 1653–1658. [Google Scholar] [CrossRef]
  22. Tiplady, K.M.; Lopdell, T.J.; Sherlock, R.G.; Johnson, T.J.J.; Spelman, R.J.; Harris, B.L.; Davis, S.R.; Littlejohn, M.D.; Garrick, D.J. Comparison of the genetic characteristics of directly measured and Fourier-transform mid-infrared-predicted bovine milk fatty acids and proteins. J. Dairy Sci. 2022, 105, 9763–9791. [Google Scholar] [CrossRef] [PubMed]
  23. Soyeurt, H.; Bastin, C.; Colinet, F.G.; Arnould, V.M.-R.; Berry, D.P.; Wall, E.; Dehareng, F.; Nguyen, H.N.; Dardenne, P.; Schefers, J.; et al. Mid-infrared prediction of lactoferrin content in bovine milk: Potential indicator of mastitis. Animal 2012, 6, 1830–1838. [Google Scholar] [CrossRef] [PubMed]
  24. Li, C.; Sun, D.; Zhang, S.; Wang, S.; Wu, X.; Zhang, Q.; Liu, L.; Li, Y.; Qiao, L. Genome wide association study identifies 20 novel promising genes associated with milk fatty acid traits in Chinese Holstein. PLoS ONE 2014, 9, e96186. [Google Scholar] [CrossRef] [PubMed]
  25. Ferrand-Calmels, M.; Palhiere, I.; Brochard, M.; Leray, O.; Astruc, J.M.; Aurel, M.R.; Barbey, S.; Bouvier, F.; Brunschwig, P.; Caillat, H.; et al. Prediction of fatty acid profiles in cow, ewe, and goat milk by mid-infrared spectrometry. J. Dairy Sci. 2014, 97, 17–35. [Google Scholar] [CrossRef]
  26. Contla Hernandez, B.; Lopez-Villalobos, N.; Vignes, M. Identifying Health Status in Grazing Dairy Cows from Milk Mid-Infrared Spectroscopy by Using Machine Learning Methods. Animals 2021, 11, 2154. [Google Scholar] [CrossRef]
  27. Frizzarin, M.; Gormley, I.C.; Berry, D.P.; Murphy, T.B.; Casa, A.; Lynch, A.; McParland, S. Predicting cow milk quality traits from routinely available milk spectra using statistical machine learning methods. J. Dairy Sci. 2021, 104, 7438–7447. [Google Scholar] [CrossRef]
  28. Hanuš, O.; Samková, E.; Křížová, L.; Hasoňová, L.; Kala, R. Role of Fatty Acids in Milk Fat and the Influence of Selected Factors on Their Variability—A Review. Molecules 2018, 23, 1636. [Google Scholar] [CrossRef]
  29. Palmquist, D.L.; Denise Beaulieu, A.; Barbano, D.M. Feed and Animal Factors Influencing Milk Fat Composition. J. Dairy Sci. 1993, 76, 1753–1771. [Google Scholar] [CrossRef]
  30. Auldist, M.J.; Johnston, K.A.; White, N.J.; Fitzsimons, W.P.; Boland, M.J. A comparison of the composition, coagulation characteristics and cheesemaking capacity of milk from Friesian and Jersey dairy cows. J. Dairy Res. 2004, 71, 51–57. [Google Scholar] [CrossRef]
  31. Dewhurst, R.J.; Shingfield, K.J.; Lee, M.R.F.; Scollan, N.D. Increasing the concentrations of beneficial polyunsaturated fatty acids in milk produced by dairy cows in high-forage systems. Anim. Feed. Sci. Technol. 2006, 131, 168–206. [Google Scholar] [CrossRef]
  32. Stoop, W.M.; Bovenhuis, H.; Heck, J.M.; van Arendonk, J.A. Effect of lactation stage and energy status on milk fat composition of Holstein-Friesian cows. J. Dairy Sci. 2009, 92, 1469–1478. [Google Scholar] [CrossRef] [PubMed]
  33. Lehnert, K.; Ward, H.; Berry, S.D.; Ankersmit-Udy, A.; Burrett, A.; Beattie, E.M.; Thomas, N.L.; Harris, B.; Ford, C.A.; Browning, S.R.; et al. Phenotypic population screen identifies a new mutation in bovine DGAT1 responsible for unsaturated milk fat. Sci. Rep. 2015, 5, 8484. [Google Scholar] [CrossRef] [PubMed]
  34. Niero, G.; Visentin, G.; Ton, S.; De Marchi, M.; Penasa, M.; Cassandro, M. Phenotypic characterisation of milk technological traits, protein fractions, and major mineral and fatty acid composition of Burlina cattle breed. Ital. J. Anim. Sci. 2016, 15, 576–583. [Google Scholar] [CrossRef]
  35. Gottardo, P.; Penasa, M.; Righi, F.; Lopez-Villalobos, N.; Cassandro, M.; De Marchi, M. Fatty acid composition of milk from Holstein-Friesian, Brown Swiss, Simmental and Alpine Grey cows predicted by mid-infrared spectroscopy. Ital. J. Anim. Sci. 2017, 16, 380–389. [Google Scholar] [CrossRef]
  36. Soyeurt, H.; Gillon, A.; Vanderick, S.; Mayeres, P.; Bertozzi, C.; Gengler, N. Estimation of heritability and genetic correlations for the major fatty acids in bovine milk. J. Dairy Sci. 2007, 90, 4435–4442. [Google Scholar] [CrossRef]
  37. Lopez-Villalobos, N.; Spelman, R.J.; Melis, J.; Davis, S.R.; Berry, S.D.; Lehnert, K.; Sneddon, N.W.; Holroyd, S.E.; MacGibbon, A.K.; Snell, R.G. Genetic correlations of milk fatty acid contents predicted from milk mid-infrared spectra in New Zealand dairy cattle. J. Dairy Sci. 2020, 103, 7238–7248. [Google Scholar] [CrossRef]
  38. Soyeurt, H.; Misztal, I.; Gengler, N. Genetic variability of milk components based on mid-infrared spectral data. J. Dairy Sci. 2010, 93, 1722–1728. [Google Scholar] [CrossRef]
  39. Bittante, G.; Cecchinato, A. Genetic analysis of the Fourier-transform infrared spectra of bovine milk with emphasis on individual wavelengths related to specific chemical bonds. J. Dairy Sci. 2013, 96, 5991–6006. [Google Scholar] [CrossRef]
  40. Wang, Q.; Hulzebosch, A.; Bovenhuis, H. Genetic and environmental variation in bovine milk infrared spectra. J. Dairy Sci. 2016, 99, 6793–6803. [Google Scholar] [CrossRef]
  41. Rovere, G.; de Los Campos, G.; Tempelman, R.J.; Vazquez, A.I.; Miglior, F.; Schenkel, F.; Cecchinato, A.; Bittante, G.; Toledo-Alvarado, H.; Fleming, A. A landscape of the heritability of Fourier-transform infrared spectral wavelengths of milk samples by parity and lactation stage in Holstein cows. J. Dairy Sci. 2019, 102, 1354–1363. [Google Scholar] [CrossRef]
  42. Zaalberg, R.M.; Shetty, N.; Janss, L.; Buitenhuis, A.J. Genetic analysis of Fourier transform infrared milk spectra in Danish Holstein and Danish Jersey. J. Dairy Sci. 2019, 102, 503–510. [Google Scholar] [CrossRef] [PubMed]
  43. Bouwman, A.C.; Bovenhuis, H.; Visker, M.H.; van Arendonk, J.A. Genome-wide association of milk fatty acids in Dutch dairy cattle. BMC Genet. 2011, 12, 43. [Google Scholar] [CrossRef] [PubMed]
  44. Schopen, G.C.B.; Visker, M.H.P.W.; Koks, P.D.; Mullaart, E.; van Arendonk, J.A.M.; Bovenhuis, H. Whole-genome association study for milk protein composition in dairy cattle. J. Dairy Sci. 2011, 94, 3148–3158. [Google Scholar] [CrossRef] [PubMed]
  45. Boichard, D.; Boussaha, M.; Capitan, A.; Rocha, D.; Hozé, C.; Sanchez, M.P.; Tribout, T.; Letaief, R.; Croiseau, P.; Grohs, C.; et al. Experience from large scale use of the EuroGenomics custom SNP chip in cattle. In Proceedings of the 11th World Congress on Genetics Applied to Livestock Production, Auckland, New Zealand, 11–16 February 2018. [Google Scholar]
  46. Bouwman, A.C.; Visker, M.H.; van Arendonk, J.M.; Bovenhuis, H. Fine mapping of a quantitative trait locus for bovine milk fat composition on Bos taurus autosome 19. J. Dairy Sci. 2014, 97, 1139–1149. [Google Scholar] [CrossRef]
  47. Buitenhuis, B.; Poulsen, N.A.; Gebreyesus, G.; Larsen, L.B. Estimation of genetic parameters and detection of chromosomal regions affecting the major milk proteins and their post translational modifications in Danish Holstein and Danish Jersey cattle. BMC Genet. 2016, 17, 114. [Google Scholar] [CrossRef]
  48. Olsen, H.G.; Knutsen, T.M.; Kohler, A.; Svendsen, M.; Gidskehaug, L.; Grove, H.; Nome, T.; Sodeland, M.; Sundsaasen, K.K.; Kent, M.P.; et al. Genome-wide association mapping for milk fat composition and fine mapping of a QTL for de novo synthesis of milk fatty acids on bovine chromosome 13. Genet. Sel. Evol. 2017, 49, 20. [Google Scholar] [CrossRef]
  49. Palombo, V.; Milanesi, M.; Sgorlon, S.; Capomaccio, S.; Mele, M.; Nicolazzi, E.; Ajmone-Marsan, P.; Pilla, F.; Stefanon, B.; D’Andrea, M. Genome-wide association study of milk fatty acid composition in Italian Simmental and Italian Holstein cows using single nucleotide polymorphism arrays. J. Dairy Sci. 2018, 101, 11004–11019. [Google Scholar] [CrossRef]
  50. Pegolo, S.; Mach, N.; Ramayo-Caldas, Y.; Schiavon, S.; Bittante, G.; Cecchinato, A. Integration of GWAS, pathway and network analyses reveals novel mechanistic insights into the synthesis of milk proteins in dairy cows. Sci. Rep. 2018, 8, 566. [Google Scholar] [CrossRef]
  51. Cruz, V.A.R.; Oliveira, H.R.; Brito, L.F.; Fleming, A.; Larmer, S.; Miglior, F.; Schenkel, F.S. Genome-Wide Association Study for Milk Fatty Acids in Holstein Cattle Accounting for the DGAT1 Gene Effect. Animals 2019, 9, 997. [Google Scholar] [CrossRef]
  52. Iung, L.H.S.; Petrini, J.; Ramírez-Díaz, J.; Salvian, M.; Rovadoscki, G.A.; Pilonetto, F.; Dauria, B.D.; Machado, P.F.; Coutinho, L.L.; Wiggans, G.R.; et al. Genome-wide association study for milk production traits in a Brazilian Holstein population. J. Dairy Sci. 2019, 102, 5305–5314. [Google Scholar] [CrossRef]
  53. Zhou, C.; Li, C.; Cai, W.; Liu, S.; Yin, H.; Shi, S.; Zhang, Q.; Zhang, S. Genome-Wide Association Study for Milk Protein Composition Traits in a Chinese Holstein Population Using a Single-Step Approach. Front. Genet. 2019, 10, 72. [Google Scholar] [CrossRef] [PubMed]
  54. Freitas, P.H.F.; Oliveira, H.R.; Silva, F.F.; Fleming, A.; Miglior, F.; Schenkel, F.S.; Brito, L.F. Genomic analyses for predicted milk fatty acid composition throughout lactation in North American Holstein cattle. J. Dairy Sci. 2020, 103, 6318–6331. [Google Scholar] [CrossRef] [PubMed]
  55. Rosen, B.D.; Bickhart, D.M.; Schnabel, R.D.; Koren, S.; Elsik, C.G.; Tseng, E.; Rowan, T.N.; Low, W.Y.; Zimin, A.; Couldrey, C.; et al. De novo assembly of the cattle reference genome with single-molecule sequencing. GigaScience 2020, 9, giaa021. [Google Scholar] [CrossRef]
  56. Duggal, P.; Gillanders, E.M.; Holmes, T.N.; Bailey-Wilson, J.E. Establishing an adjusted p-value threshold to control the family-wide type 1 error in genome wide association studies. BMC Genom. 2008, 9, 516. [Google Scholar] [CrossRef]
  57. Nelder, J.A.; Wedderburn, R.W.M. Generalized Linear Models. J. R. Stat. Soc. Ser. A 1972, 135, 370–384. [Google Scholar] [CrossRef]
  58. Harville, D.A. Maximum Likelihood Approaches to Variance Component Estimation and to Related Problems. J. Am. Stat. Assoc. 1977, 72, 320–338. [Google Scholar] [CrossRef]
  59. Rutten, M.J.; Bovenhuis, H.; van Arendonk, J.A. The effect of the number of observations used for Fourier transform infrared model calibration for bovine milk fat composition on the estimated genetic parameters of the predicted data. J. Dairy Sci. 2010, 93, 4872–4882. [Google Scholar] [CrossRef]
  60. Narayana, S.G.; Schenkel, F.S.; Fleming, A.; Koeck, A.; Malchiodi, F.; Jamrozik, J.; Johnston, J.; Sargolzaei, M.; Miglior, F. Genetic analysis of groups of mid-infrared predicted fatty acids in milk. J. Dairy Sci. 2017, 100, 4731–4744. [Google Scholar] [CrossRef]
  61. Fleming, A.; Schenkel, F.S.; Malchiodi, F.; Ali, R.A.; Mallard, B.; Sargolzaei, M.; Jamrozik, J.; Johnston, J.; Miglior, F. Genetic correlations of mid-infrared-predicted milk fatty acid groups with milk production traits. J. Dairy Sci. 2018, 101, 4295–4306. [Google Scholar] [CrossRef]
  62. Hein, L.; Sorensen, L.P.; Kargo, M.; Buitenhuis, A.J. Genetic analysis of predicted fatty acid profiles of milk from Danish Holstein and Danish Jersey cattle populations. J. Dairy Sci. 2018, 101, 2148–2157. [Google Scholar] [CrossRef]
  63. Bionaz, M.; Loor, J.J. Gene networks driving bovine milk fat synthesis during the lactation cycle. BMC Genom. 2008, 9, 366. [Google Scholar] [CrossRef] [PubMed]
  64. Knutsen, T.M.; Olsen, H.G.; Tafintseva, V.; Svendsen, M.; Kohler, A.; Kent, M.P.; Lien, S. Unravelling genetic variation underlying de novo-synthesis of bovine milk fatty acids. Sci. Rep. 2018, 8, 2179. [Google Scholar] [CrossRef] [PubMed]
  65. Sabek, A.; Li, C.; Du, C.; Nan, L.; Ni, J.; Elgazzar, E.; Ma, Y.; Salem, A.Z.M.; Zhang, S. Effects of parity and days in milk on milk composition in correlation with β-hydroxybutyrate in tropic dairy cows. Trop. Anim. Health Prod. 2021, 53, 270. [Google Scholar] [CrossRef] [PubMed]
  66. Hou, H.; Tang, Y.; Zhao, J.; Debrah, A.A.; Shen, Z.; Li, C.; Du, Z. Authentication of organically produced cow milk by fatty acid profile combined with chemometrics: A case study in China. J. Food Compos. Anal. 2023, 120, 105297. [Google Scholar] [CrossRef]
  67. Libnau, F.; Kvalheim, O.M.; Christy, A.A.; Toft, J. Spectra of water in the near- and mid-infrared region. Vib. Spectrosc. 1994, 7, 243–254. [Google Scholar] [CrossRef]
  68. Kaylegian, K.E.; Houghton, G.E.; Lynch, J.M.; Fleming, J.R.; Barbano, D.M. Calibration of Infrared Milk Analyzers Modified Milk. J. Dairy Sci. 2006, 89, 2817–2832. [Google Scholar] [CrossRef]
  69. Wang, Q.; Bovenhuis, H. Validation strategy can result in an overoptimistic view of the ability of milk infrared spectra to predict methane emission of dairy cattle. J. Dairy Sci. 2019, 102, 6288–6295. [Google Scholar] [CrossRef]
  70. Macedo Mota, L.F.; Bisutti, V.; Vanzin, A.; Pegolo, S.; Toscano, A.; Schiavon, S.; Tagliapietra, F.; Gallo, L.; Ajmone Marsan, P.; Cecchinato, A. Predicting milk protein fractions using infrared spectroscopy and a gradient boosting machine for breeding purposes in Holstein cattle. J. Dairy Sci. 2023, 106, 1853–1873. [Google Scholar] [CrossRef]
  71. Mishra, P.; Biancolillo, A.; Roger, J.M.; Marini, F.; Rutledge, D.N. New data preprocessing trends based on ensemble of multiple preprocessing techniques. TrAC Trends Anal. Chem. 2020, 132, 116045. [Google Scholar] [CrossRef]
  72. Yin, L.; Zhang, H.; Tang, Z.; Yin, D.; Fu, Y.; Yuan, X.; Li, X.; Liu, X.; Zhao, S. HIBLUP: An integration of statistical models on the BLUP framework for efficient genetic evaluation using big genomic data. Nucleic Acids Res. 2023, 51, 3501–3512. [Google Scholar] [CrossRef]
  73. Aguilar, I.; Misztal, I.; Johnson, D.L.; Legarra, A.; Tsuruta, S.; Lawlor, T.J. Hot topic: A unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score. J. Dairy Sci. 2010, 93, 743–752. [Google Scholar] [CrossRef] [PubMed]
  74. Christensen, O.F.; Lund, M.S. Genomic prediction when some animals are not genotyped. Genet. Sel. Evol. 2010, 42, 2. [Google Scholar] [CrossRef] [PubMed]
  75. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef]
  76. Yin, L.; Zhang, H.; Tang, Z.; Xu, J.; Yin, D.; Zhang, Z.; Yuan, X.; Zhu, M.; Zhao, S.; Li, X.; et al. rMVP: A Memory-efficient, Visualization-enhanced, and Parallel-accelerated Tool for Genome-wide Association Study. Genom. Proteom. Bioinform. 2021, 19, 619–628. [Google Scholar] [CrossRef] [PubMed]
  77. Price, A.L.; Patterson, N.J.; Plenge, R.M.; Weinblatt, M.E.; Shadick, N.A.; Reich, D. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 2006, 38, 904–909. [Google Scholar] [CrossRef] [PubMed]
  78. Yu, J.; Pressoir, G.; Briggs, W.H.; Vroh Bi, I.; Yamasaki, M.; Doebley, J.F.; McMullen, M.D.; Gaut, B.S.; Nielsen, D.M.; Holland, J.B.; et al. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat. Genet. 2006, 38, 203–208. [Google Scholar] [CrossRef]
  79. Liu, X.; Huang, M.; Fan, B.; Buckler, E.S.; Zhang, Z. Iterative Usage of Fixed and Random Effect Models for Powerful and Efficient Genome-Wide Association Studies. PLoS Genet. 2016, 12, e1005767. [Google Scholar] [CrossRef]
  80. Wang, K.; Li, M.; Hakonarson, H. ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010, 38, e164. [Google Scholar] [CrossRef]
  81. Coppa, M.; Ferlay, A.; Leroux, C.; Jestin, M.; Chilliard, Y.; Martin, B.; Andueza, D. Prediction of milk fatty acid composition by near infrared reflectance spectroscopy. Int. Dairy J. 2010, 20, 182–189. [Google Scholar] [CrossRef]
  82. Ho, P.N.; Marett, L.C.; Wales, W.J.; Axford, M.; Oakes, E.M.; Pryce, J.E. Predicting milk fatty acids and energy balance of dairy cows in Australia using milk mid-infrared spectroscopy. Anim. Prod. Sci. 2020, 60, 164–168. [Google Scholar] [CrossRef]
  83. Wang, Z.; Zhu, B.; Niu, H.; Zhang, W.; Xu, L.; Xu, L.; Chen, Y.; Zhang, L.; Gao, X.; Gao, H.; et al. Genome wide association study identifies SNPs associated with fatty acid composition in Chinese Wagyu cattle. J. Anim. Sci. Biotechnol. 2019, 10, 27. [Google Scholar] [CrossRef] [PubMed]
  84. Afseth, N.K.; Martens, H.; Randby, A.; Gidskehaug, L.; Narum, B.; Jørgensen, K.; Lien, S.; Kohler, A. Predicting the fatty acid composition of milk: A comparison of two Fourier transform infrared sampling techniques. Appl. Spectrosc. 2010, 64, 700–707. [Google Scholar] [CrossRef]
  85. Eskildsen, C.E.; Rasmussen, M.A.; Engelsen, S.B.; Larsen, L.B.; Poulsen, N.A.; Skov, T. Quantification of individual fatty acids in bovine milk by infrared spectroscopy and chemometrics: Understanding predictions of highly collinear reference variables. J. Dairy Sci. 2014, 97, 7940–7951. [Google Scholar] [CrossRef]
  86. Gupta, M.K. Chapter 2—Basic Oil Chemistry. In Practical Guide to Vegetable Oil Processing, 2nd ed.; Gupta, M.K., Ed.; AOCS Press: Champaign, IL, USA, 2017; pp. 7–25. [Google Scholar]
  87. Zhao, X.; Song, Y.; Zhang, Y.; Cai, G.; Xue, G.; Liu, Y.; Chen, K.; Zhang, F.; Wang, K.; Zhang, M.; et al. Predictions of Milk Fatty Acid Contents by Mid-Infrared Spectroscopy in Chinese Holstein Cows. Molecules 2023, 28, 666. [Google Scholar] [CrossRef]
  88. Buitenhuis, B.; Janss, L.L.G.; Poulsen, N.A.; Larsen, L.B.; Larsen, M.K.; Sørensen, P. Genome-wide association and biological pathway analysis for milk-fat composition in Danish Holstein and Danish Jersey cattle. BMC Genom. 2014, 15, 1112. [Google Scholar] [CrossRef]
  89. Batrakou, D.G.; de Las Heras, J.I.; Czapiewski, R.; Mouras, R.; Schirmer, E.C. TMEM120A and B: Nuclear Envelope Transmembrane Proteins Important for Adipocyte Differentiation. PLoS ONE 2015, 10, e0127712. [Google Scholar] [CrossRef]
  90. Crouse, W.L.; Das, S.K.; Le, T.; Keele, G.; Holl, K.; Seshie, O.; Craddock, A.L.; Sharma, N.K.; Comeau, M.E.; Langefeld, C.D.; et al. Transcriptome-wide analyses of adipose tissue in outbred rats reveal genetic regulatory mechanisms relevant for human obesity. Physiol. Genom. 2022, 54, 206–219. [Google Scholar] [CrossRef]
  91. Bangar, Y.C.; Magotra, A.; Chauhan, A.; Yadav, A.S. Genetic polymorphisms of kappa casein gene and its association with milk and composition traits in cows: An updated meta-analysis. Meta Gene 2021, 30, 100948. [Google Scholar] [CrossRef]
  92. Malik, S.; Sidhu, N.S.; Kumar, S.; Kumar, A.; Rani, R. Kappa-casein alleles in Zebu and cross-bred (1/2 Friesian, 1/4 Jersey, 1/4 Hariana) cattle from India using polymerase chain reaction and sequence-specific oligonucleotide probes (PCR-SSOP). Genet. Anal. Biomol. Eng. 1997, 14, 61–63. [Google Scholar] [CrossRef]
  93. Hudson, N.J.; Lyons, R.E.; Reverter, A.; Greenwood, P.L.; Dalrymple, B.P. Inferring the in vivo cellular program of developing bovine skeletal muscle from expression data. Gene Expr. Patterns 2013, 13, 109–125. [Google Scholar] [CrossRef]
  94. Zhang, H.; Takeda, H.; Tsuji, T.; Kamiya, N.; Rajderkar, S.; Louie, K.; Collier, C.; Scott, G.; Ray, M.; Mochida, Y.; et al. Generation of Evc2/Limbin global and conditional KO mice and its roles during mineralized tissue formation. Genesis 2015, 53, 612–626. [Google Scholar] [CrossRef] [PubMed]
  95. Lin, X.; Wang, Z.Y.; Xue, G.; Qin, X.J.; Wu, J.F.; Zhang, G. ADORA1 is a diagnostic-related biomarker and correlated with immune infiltrates in papillary thyroid carcinoma. J. Cancer 2021, 12, 3997–4010. [Google Scholar] [CrossRef] [PubMed]
  96. Seo, J.; Osorio, J.S.; Loor, J.J. Purinergic signaling gene network expression in bovine polymorphonuclear neutrophils during the peripartal period. J. Dairy Sci. 2013, 96, 7675–7683. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The spectra of cow’s milk(lines of different colors represent different samples). (a) The original spectra of all reference samples. (b) The average spectra of all reference samples. (c) The D1 preprocessing spectra of all reference samples. (d) The D2 preprocessing pretreated spectra of all reference samples.
Figure 1. The spectra of cow’s milk(lines of different colors represent different samples). (a) The original spectra of all reference samples. (b) The average spectra of all reference samples. (c) The D1 preprocessing spectra of all reference samples. (d) The D2 preprocessing pretreated spectra of all reference samples.
Animals 14 02901 g001
Figure 2. Multraits-QQplot and Circular Manhattan for C6:0. (a) The quantile–quantile plot based on the GLM, MLM, and FarmCPU methods. The red line represents x = y. The grey shaded place represents the 0.95 confidence interval (b) The cyclic Manhattan plot based on the GLM, MLM and FarmCPU methods combined with chip density.
Figure 2. Multraits-QQplot and Circular Manhattan for C6:0. (a) The quantile–quantile plot based on the GLM, MLM, and FarmCPU methods. The red line represents x = y. The grey shaded place represents the 0.95 confidence interval (b) The cyclic Manhattan plot based on the GLM, MLM and FarmCPU methods combined with chip density.
Animals 14 02901 g002
Figure 3. Seven validated SNPs significantly related to fatty acid content.‘*’ means p-Value Bonferroni < 0.05, and ‘**’ means p-Value Bonferroni < 0.01. (a) Difference and significance of C6:0 among three genotypes of ARS-BFGL-NGS-22276 gene; (b) Difference and significance of C8:0 among three genotypes of ARS-BFGL-NGS-33001 gene; (c) Difference and significance of C10:0 among three genotypes of Bovine HD 0100019865 gene; (d) Difference and significance of C14:0 among three genotypes of Bovine HD 3000029498 gene; (e) Difference and significance of C14:1,cis-9 among three genotypes of Bovine HD 1600000152 gene; (f) Difference and significance of C18:3,n3 cis-9/12/15 among three genotypes of ARS-BFGL-NGS-15402 gene; (g) Difference and significance of LCFA among three genotypes of ARS-BFGL-NGS-15402 gene.
Figure 3. Seven validated SNPs significantly related to fatty acid content.‘*’ means p-Value Bonferroni < 0.05, and ‘**’ means p-Value Bonferroni < 0.01. (a) Difference and significance of C6:0 among three genotypes of ARS-BFGL-NGS-22276 gene; (b) Difference and significance of C8:0 among three genotypes of ARS-BFGL-NGS-33001 gene; (c) Difference and significance of C10:0 among three genotypes of Bovine HD 0100019865 gene; (d) Difference and significance of C14:0 among three genotypes of Bovine HD 3000029498 gene; (e) Difference and significance of C14:1,cis-9 among three genotypes of Bovine HD 1600000152 gene; (f) Difference and significance of C18:3,n3 cis-9/12/15 among three genotypes of ARS-BFGL-NGS-15402 gene; (g) Difference and significance of LCFA among three genotypes of ARS-BFGL-NGS-15402 gene.
Animals 14 02901 g003
Table 1. Eight classified fatty acid groups, according to hydrocarbon chain saturation and carbon chain length.
Table 1. Eight classified fatty acid groups, according to hydrocarbon chain saturation and carbon chain length.
Trait* Fatty Acids Included
TFASFA, UFA/SCFA, MCFA, LCFA
SFAC4:0, C6:0, C8:0, C10:0, C11:0, C12:0, C13:0, C14:0, C15:0, C16:0, C17:0, C18:0, C20:0, C21:0, C22:0, C23:0, C24:0
UFAMUFA, PUFA
MUFAC14:1, C15:1, C16:1, C17:1, C18:1, C20:1, C22:1, C24:1
PUFAC18:2, C18:3, C20:2, C20:3, C20:4, C20:5, C22:2, C22:6
SCFAC4:0, C6:0, C8:0, C10:0
MCFAC11:0, C12:0, C13:0, C14:0, C14:1, C15:0, C15:1,C16:0, C16:1
LCFAC17:0, C17:1, C18:0, C18:1, C18:2, C18:3, C20:0, C20:1, C20:2,
C20:3, C20:4, C20:5, C21:0, C22:0, C22:1, C22:2, C22:6, C23:0,
C24:0, C24:1
* TFA = total fatty acids; SFA = saturated fatty acid; UFA = unsaturated fatty acid; MUFA = monounsaturated fatty acids; PUFA = polyunsaturated fatty acids; SCFA = short-chain fatty acids; MCFA = mid-chain fatty acids; LCFA = long-chain fatty acids.
Table 2. Descriptive statistics of fatty acid traits.
Table 2. Descriptive statistics of fatty acid traits.
Trait1 g/100 g of Total Fat1 Percentage of Total Fatty Acid, %
2 Mean2 SD2 Max2 Min2 CV, %2 Mean2 SD2 Max2 Min2 CV, %
C4:01.181.067.590.0990.553.363.0932.441.1492.01
C6:00.590.321.580.0854.221.660.312.320.6018.84
C8:00.380.211.060.0554.381.060.211.540.3419.76
C10:01.000.573.260.0957.472.800.735.020.8126.00
C12:01.130.653.720.1556.973.210.805.881.0724.80
C14:03.771.959.490.5251.5610.721.6915.224.9915.78
C14:1,cis-90.310.201.210.0362.420.890.322.170.2036.13
C15:00.380.241.690.0561.681.080.332.580.3830.36
C16:012.746.1931.061.6148.6036.373.5349.2124.899.69
C17:00.180.100.620.0255.500.510.110.970.2720.98
C18:03.922.1011.500.5253.5811.152.3522.366.1921.09
C18:1,n9 cis-96.534.2930.720.0565.7917.986.7337.940.4537.42
C18:2,n6cis-9,121.000.552.840.1255.022.790.554.371.4219.56
C18:3,n3cis-9,12,150.100.070.340.0072.920.260.130.880.0450.98
3 TFA35.2217.1480.974.5248.67——————————
3 SFA25.3912.3359.803.4648.5572.304.5090.8951.576.23
3 MUFA8.524.6534.660.4854.6524.044.4443.554.8918.46
3 PUFA1.330.723.690.1554.533.710.686.712.1418.42
3 UFA9.855.2837.040.8953.6627.774.5248.449.1116.26
3 SCFA3.141.8210.140.3558.078.883.3537.734.7237.76
3 MCFA18.949.3247.662.5249.2053.975.5369.2236.7310.25
3 LCFA13.146.9344.671.6252.7437.165.7957.0422.9415.58
1 The value of fatty acid traits was determined using gas chromatography (GC). 2 Mean = mean value; SD = standard deviation; Max = maximum; Min = minimum; CV = coefficient of variation. 3 TFA = total fatty acid; SFA = saturated fatty acid; UFA = unsaturated fatty acid; MUFA = monounsaturated fatty acid; PUFA = polyunsaturated fatty acid; SCFA = short-chain fatty acid; MCFA = mid-chain fatty acid; LCFA = long-chain fatty acid.
Table 3. Model characteristics and metrics for cross-validation and test sets.
Table 3. Model characteristics and metrics for cross-validation and test sets.
Trait1 Preprocessing MethodsNumber of Characteristic Wavelengths2 Rcv22 RMSEcv2 Rt22 RMSEt2 RPDcv
C4:0D2 + D25850.760.480.650.742.04
C6:0D1 + D16570.640.190.570.221.67
C8:0D2 + D25320.760.100.720.122.03
C10:0D1 + D16470.680.310.650.371.77
C12:0D1 + D15640.670.360.620.441.74
C14:0D1 + D16030.641.100.581.441.68
C14:1,cis-9D1 + D15760.670.110.620.141.73
C15:0D1 + D26880.710.110.600.191.85
C16:0D1 + D17090.643.710.524.251.67
C17:0D2 + D24560.640.060.490.071.67
C18:0D1 + D15100.591.300.601.451.57
C18:1,n9 cis-9D1 + D25610.672.320.583.211.74
C18:2,n6 cis-9,12D1 + D16120.650.320.500.391.69
C18:3,n3 cis-9,12,15D1 + D26930.690.040.440.061.81
3 TFAD2 + D25110.709.240.6610.391.82
3 SFAD2 + D25110.716.500.677.521.86
3 UFAD2 + D24690.653.130.653.071.69
3 MUFAD2 + D24590.652.780.652.681.68
3 PUFAD2 + D24910.680.400.620.471.77
3 SCFAD2 + D25610.720.970.740.901.90
3 MCFAD2 + D24720.665.180.675.881.73
3 LCFAD2 + D25680.624.120.624.671.62
1 D1 = first derivative; D2 = second derivative; D1+D1 = FT-MIRS data were preprocessed twice with D1 employed; D2+D2 = FT-MIRS data were preprocessed twice with D2 employed; D1+D2 = FT-MIRS data were preprocessed once with D1, then with D2. 2 Rcv2 = cross-validation coefficient of determination; Rt2 = test set coefficient of determination; RMSEcv = root mean square error in cross-validation; RMSEt = root mean square error in test set; RPDcv = residual predictive deviation of cross-validation. 3 TFA = total fatty acid; SFA = saturated fatty acid; UFA = unsaturated fatty acid; MUFA = monounsaturated fatty acid; PUFA = polyunsaturated fatty acid; SCFA = short-chain fatty acid; MCFA = mid-chain fatty acid; LCFA = long-chain fatty acid.
Table 4. Variance component estimates and estimated breeding values for FT-MIRS-predicted fatty acid traits.
Table 4. Variance component estimates and estimated breeding values for FT-MIRS-predicted fatty acid traits.
Fatty AcidVariance Component EstimatesEstimated Breeding Values
1 σu21 σc21 σe21 σT21 h21 t2 Mean2 Min2 Max2 Acc
C4:00.140.200.170.510.280.67−0.03−0.900.890.73
C6:00.020.020.030.070.310.59−0.01−0.310.590.71
C8:00.010.010.020.030.230.49−0.01−0.180.230.67
C10:00.080.060.100.240.330.59−0.02−0.601.100.72
C12:00.090.070.110.270.340.59−0.03−0.650.760.73
C14:00.540.690.661.890.290.65−0.05−1.952.250.76
C14:1,cis-90.010.010.020.040.250.480.00−0.200.180.54
C15:00.010.010.020.040.290.540.00−0.230.270.70
C16:04.016.264.8015.060.270.68−0.24−5.739.130.79
C17:00.000.000.000.010.200.330.00−0.100.080.63
C18:00.430.700.521.650.260.69−0.04−2.003.150.78
C18:1,n9 cis-92.082.872.487.420.280.67−0.08−3.333.650.69
C18:2,n6 cis-9,120.070.080.090.240.310.62−0.02−0.630.670.70
C18:3,n3 cis-9,12,150.000.000.000.000.220.500.00−0.050.060.57
3 TFA29.0235.9937.39102.390.280.64−0.26−16.0626.690.82
3 SFA18.9118.7724.2761.940.310.61−0.21−10.2023.650.82
3 UFA4.003.975.5413.510.300.59−0.11−5.607.000.78
3 MUFA3.303.274.6111.170.300.59−0.10−5.356.790.78
3 PUFA0.070.070.090.220.300.60−0.01−0.640.790.74
3 SCFA0.470.650.551.680.280.67−0.06−1.542.450.76
3 MCFA8.7813.7910.1432.710.270.69−0.20−8.189.770.74
3 LCFA5.818.786.7321.330.270.68−0.19−7.479.660.74
1 Variance component estimates: σu2 = additive genetic variance; σc2 = permanent environment variance; σe2 = residual variance; σT2 = total variance (σu2 + σc2 + σe2); h2 = heritability estimate; t = repeatability estimate. 2 Estimated breeding values: Using 0.00 as the population mean; Mean = the mean of all individual breeding values; Min = the minimum individual breeding value; Max = the maximum individual breeding value; Acc = the accuracy of breeding value estimation. 3 TFA = total fatty acid; SFA = saturated fatty acid; UFA = unsaturated fatty acid; MUFA = monounsaturated fatty acid; PUFA = polyunsaturated fatty acid; SCFA = short-chain fatty acid; MCFA = mid-chain fatty acid; LCFA = long-chain fatty acid.
Table 5. Genetic (below the diagonal) and phenotypic (above the diagonal) correlations for FT-MIRS-predicted milk traits.
Table 5. Genetic (below the diagonal) and phenotypic (above the diagonal) correlations for FT-MIRS-predicted milk traits.
FatProteinLactoseTotal SolidsUreaC4:0C6:0C8:0C10:0C12:0C14:0C14:1,
cis-9
C15:0C16:0C17:0C18:0C18:1,n9 cis-9C18:2,n6 cis-9,12C18:3,n3 cis-9,12,151 TFA1 SFA1 UFA1 MUFA1 PUFA1 SCFA1 MCFA1 LCFA
Fat 0.42−0.060.930.080.090.220.180.200.200.140.170.150.270.270.230.180.150.160.190.180.170.180.090.210.200.22
Protein0.66 −0.140.660.14−0.070.120.090.210.280.160.220.230.190.110.00−0.160.000.010.030.08−0.04−0.03−0.100.120.23−0.07
Lactose0.13−0.11 0.080.050.02−0.23−0.10−0.17−0.18−0.04−0.17−0.03−0.14−0.03−0.08−0.14−0.28−0.22−0.09−0.08−0.10−0.10−0.14−0.11−0.17−0.06
Total solids0.980.740.18 0.140.100.210.180.250.260.190.230.210.280.240.200.110.090.130.160.160.120.130.020.230.220.16
Urea0.200.230.100.26 0.20−0.150.030.020.02−0.02−0.02−0.16−0.12−0.17−0.02−0.10−0.170.01−0.14−0.09−0.14−0.16−0.090.07−0.05−0.21
C4:00.210.07−0.020.240.37 0.090.400.240.140.140.010.010.100.030.330.180.060.250.220.270.130.120.170.550.240.11
C6:00.260.12−0.100.28−0.240.07 0.380.750.760.710.590.640.670.210.500.510.750.570.280.290.170.190.210.270.410.41
C8:00.320.23−0.160.350.060.680.43 0.460.420.380.240.320.350.210.460.280.300.400.460.500.290.280.370.590.450.25
C10:00.300.24−0.090.34−0.050.290.870.60 0.870.720.630.540.720.100.500.400.590.650.330.390.170.170.310.460.540.30
C12:00.260.28−0.060.32−0.050.160.860.510.94 0.750.640.540.700.120.430.310.580.550.260.320.110.100.240.320.500.22
C14:00.280.24−0.020.34−0.100.270.880.610.930.92 0.610.700.610.110.480.360.550.540.310.350.160.160.190.320.510.32
C14:1,cis−90.250.20−0.100.28−0.06−0.010.800.350.820.840.81 0.600.650.150.310.420.550.430.270.270.200.210.160.220.390.31
C15:00.220.280.010.26−0.240.100.650.420.640.660.800.65 0.530.270.310.400.560.430.370.370.230.240.210.250.560.43
C16:00.400.21−0.040.42−0.270.240.890.610.880.840.940.790.79 0.280.470.420.600.520.370.380.250.250.290.290.450.41
C17:00.320.15−0.030.31−0.250.160.300.340.260.250.330.320.380.49 0.270.190.220.090.680.610.610.600.510.210.230.37
C18:00.380.090.010.390.000.500.740.680.740.680.770.660.510.820.53 0.410.450.610.440.440.410.410.350.440.350.35
C18:1,n9 cis-90.28−0.110.030.26−0.130.150.580.280.430.350.550.510.580.690.290.56 0.660.540.270.190.340.340.270.200.310.60
C18:2,n6 cis-9,120.17−0.04−0.110.17−0.250.040.860.350.700.680.780.790.680.830.240.680.80 0.610.250.200.210.210.300.200.370.51
C18:3,n3 cis-9,12,150.270.03−0.160.29−0.020.400.770.690.840.720.850.750.580.870.340.820.700.82 0.350.380.260.250.410.420.480.41
1 TFA0.260.11−0.090.26−0.150.410.380.680.470.390.520.330.470.630.850.660.350.300.57 0.970.850.840.800.490.480.38
1 SFA0.250.13−0.090.26−0.130.420.360.680.470.390.510.310.450.600.840.650.290.260.551.00 0.750.730.770.560.520.29
1 UFA0.18−0.05−0.060.15−0.150.280.220.490.240.160.290.200.280.440.870.560.340.210.420.940.93 1.000.750.290.330.41
1 MUFA0.17−0.07−0.060.14−0.160.260.210.470.220.140.270.200.270.430.870.560.350.210.410.930.921.00 0.710.290.320.42
1 PUFA0.11−0.05−0.140.11−0.110.310.240.540.360.270.370.210.320.470.770.490.280.190.490.920.920.910.89 0.390.410.38
1 SCFA0.330.26−0.090.370.110.810.310.850.520.400.550.190.430.550.480.630.230.200.590.750.760.550.520.62 0.470.26
1 MCFA0.350.25−0.070.38−0.150.470.580.740.680.590.780.460.770.820.560.640.580.550.760.790.780.590.560.670.82 0.44
1 LCFA0.310.01−0.010.29−0.300.190.560.420.460.370.610.440.710.760.440.530.920.730.700.550.490.470.470.450.430.80
1 TFA = total fatty acid; SFA = saturated fatty acid; UFA = unsaturated fatty acid; MUFA = monounsaturated fatty acid; PUFA = polyunsaturated fatty acid; SCFA = short-chain fatty acid; MCFA = mid-chain fatty acid; LCFA = long-chain fatty acid. Correlations between fatty acids and conventional milk components: There was a low to moderate positive correlation between 22 fatty acids and milk fat or total solids (0.11–0.42). Milk protein was positively associated with short- or medium-chain fatty acids (0.07–0.28), weakly positively or negatively correlated with long-chain fatty acids, weakly negatively correlated with fatty acid groups UFA, MUFA, and PUFA, and moderately positively related to SCFA and MCFA. Except for urea nitrogen and C4:0 (0.37), lactose or urea nitrogen showed a weak positive correlation or weak to low negative correlation (−0.30–0.11) with fatty acids. To conclude, milk fat/total solids and fatty acids had congruous variation trends, while lactose/urea nitrogen had an opposite variation trend to fatty acids. Milk protein displayed a consistent variation trend with short- or medium-chain fatty acids, showing little correlation with long-chain or unsaturated fatty acids.
Table 6. SNP loci significantly related to fatty acid content screened by the FarmCPU method.
Table 6. SNP loci significantly related to fatty acid content screened by the FarmCPU method.
NameSNPChromPositionGenetic VariationEffectRaw
p-Value
p-Value
Bonferroni
GeneFunctional
Region
C4:0BTB-018652886g.39109137A > C−0.091298053 1.08 × 10 6 0.873LCORL (dist = 1,552,031), SLIT2 (dist = 670,758)Intergenic
C4:0BovineHD220001329122g.45609641T > C0.109639913 1.33 × 10 8 0.114WNT5A (dist = 51,524), CACNA2D3 (dist = 314,894)Intergenic
C6:0BovineHD01000395191g.156602608T > G0.034016364 6.84 × 10 7 0.141KCNH8Intronic
C6:0BovineHD140002311314g.79488283A > G0.04046719 6.23 × 10 7 0.439ENSBTAG00000046821 (dist = 838,481),
\SNX16 (dist = 1,274,216)
Intergenic
C6:0ARS-BFGL-NGS-2227617g.53567893A > G0.03801704 7.02 × 10 8 8.80 × 10 3 TMEM120BUpstream
C6:0BovineHD210001455321g.50065194A > G0.056103731 8.20 × 10 9 0.607FBXO33 (dist = 755,051), LRFN5 (dist = 1,263,846)Intergenic
C8:0BovineHD01000395191g.156602608T > G0.018626629 4.76 × 10 8 0.758KCNH8Intronic
C8:0ARS-BFGL-NGS-330016g.103480813A > G−0.017966404 4.94 × 10 7 3.02 × 10 2 EVCExonic
C8:0BovineHD08000313998g.103616676T > G0.021357656 1.88 × 10 8 0.892AKNA (dist = 11,743), WHRN (dist = 4323)Intergenic
C8:0BovineHD120001749912g.63379103A > C0.018193801 9.07 × 10 8 0.104ENSBTAG00000051320 (dist = 61,739),
NONE (dist = NONE)
Intergenic
C8:0BovineHD290000213629g.7493345A > C−0.0197566 1.96 × 10 8 0.146CTSC (dist = 76,111), RAB38 (dist = 7945)Intergenic
C10:0BovineHD01000198651g.69045145T > C−0.053724787 5.00 × 10 7 9.66 × 10 3 ENSBTAG00000053238Intronic
C10:0Hapmap48014-BTA-598055g.15293572T > C0.048419799 4.36 × 10 7 0.413ALX1 (dist = 354,357), RASSF9 (dist = 174,821)Intergenic
C10:0BovineHD290000937929g.30870766A > G−0.053948553 1.96 × 10 7 0.793ENSBTAG00000052770 (dist = 293,780),
ETS1 (dist = 942,420)
Intergenic
C10:0BovineHD3000029498Xg.102101013A > G−0.05687289 4.30 × 10 7 0.971CASK (dist = 28,036), NYX (dist = 6763)Intergenic
C12:0BovineHD01000198651g.69045145T > C−0.067241259 2.27 × 10 7 8.34 × 10 2 ENSBTAG00000053238Intronic
C14:0BovineHD150001884215g.64979528T > C0.202541955 1.07 × 10 7 0.997CATIntronic
C14:0BovineHD3000029498Xg.102101013A > G−0.199760831 1.07 × 10 6 3.56 × 10 2 CASK (dist = 28,036), NYX (dist = 6763)Intergenic
C14:1,cis-9BovineHD130001444713g.49614540T > C0.023699371 1.28 × 10 7 0.351BMP2 (dist = 428,961),
ENSBTAG00000007199 (dist = 932,496)
Intergenic
C14:1,cis-9ARS-BFGL-NGS-7589014g.65908412A > G0.558449636 9.26 × 10 7 ————
C14:1,cis-9BovineHD160000015216g.896630T > C−0.019590243 1.13 × 10 6 1.57 × 10 7 ENSBTAG00000050274 (dist = 75,663),
ADORA1 (dist = 18,476)
Intergenic
C14:1,cis-9Hapmap27983-BTA-16347316g.17975275T > C0.194892692 2.79 × 10 7 ——————
C14:1,cis-9BovineHD3000036571Xg.121649547T > C0.012453536 4.56 × 10 7 0.781CNKSR2Intronic
C15:0BovineHD04000223514g.80194206A > C0.04267447 4.33 × 10 9 0.801————
C15:0BovineHD220001087222g.38049578A > G0.025775523 8.15 × 10 8 0.392SYNPRIntronic
C15:0BovineHD3000019134Xg.60227886T > C0.024999374 5.98 × 10 7 0.124TRPC5Intronic
C16:0BovineHD04000223514g.80194206A > C0.04267447 4.33 × 10 9 0.650————
C16:0BovineHD220001087222g.38049578A > G0.025775523 8.15 × 10 8 0.771SYNPRIntronic
C16:0BovineHD3000019134Xg.60227886T > C0.024999374 5.98 × 10 7 0.984TRPC5Intronic
C18:0BovineHD220000823422g.28238616A > G0.268793511 7.04 × 10 7 0.247————
C18:0BovineHD3000036571Xg.121649547T > C0.217017144 9.08 × 10 7 0.397CNKSR2intronic
C18:1,n9 cis-9ARS-BFGL-NGS-1005635g.98601044T > G0.663309395 1.24 × 10 8 0.761ENSBTAG00000001336 (dist = 4735),
ENSBTAG00000054460 (dist = 18,399)
Intergenic
C18:1,n9 cis-9ARS-USDA-AGIL-chr26-44746016-00054826g.44408705T > G0.552863575 1.13 × 10 6 0.902————
C18:1,n9 cis-9BTB-0189289029g.6982262T > C0.332464096 3.06 × 10 7 0.374TYR (dist = 546,774), GRM5 (dist = 25,615)Intergenic
C18:2,n6 cis-9,12BTB-003870339g.28842247T > C0.071584274 5.81 × 10 7 0.454HSF2 (dist = 39,233),
ENSBTAG00000053171 (dist = 266,891)
Intergenic
C18:2,n6 cis-9,12BovineHD120000697012g.23185693T > C−0.102781399 5.56 × 10 7 0.839————
C18:2,n6 cis-9,12BovineHD3000011570Xg.35028719A > G0.079096988 2.01 × 10 7 0.230ENSBTAG00000048998 (dist = 65,578),
GABRQ (dist = 17,494)
Intergenic
C18:3,n3 cis-9,12,15ARS-BFGL-NGS-349908g.11469311T > C0.006830786 9.84 × 10 7 0.754TMEM215 (dist = 133,759), NDUFB6 (dist = 35,314)Intergenic
C18:3,n3 cis-9,12,15AX-275557248g.83221868T > G−0.006388075 1.11 × 10 6 0.714CDC14BExonic
C18:3,n3 cis-9,12,15ARS-BFGL-NGS-1540217g.11368599T > C0.007664341 1.66 × 10 7 8.34 × 10 3 TTC29Intronic
1 UFAARS-BFGL-NGS-27018Xg.29070096T > C0.671700039 3.75 × 10 7 0.293————
1 MUFAARS-BFGL-NGS-27018Xg.29070096T > C0.671700039 3.75 × 10 7 0.330————
1 SCFABovineHD03000289703g.100601230A > G−0.185365212 1.72 × 10 7 0.465TESK2Intronic
SCFABovineHD07000281287g.94179851A > G−0.27438426 1.39 × 10 7 0.530MCTP1Intronic
1 LCFABovineHD130002207813g.75585136A > G0.580890257 4.50 × 10 7 4.61 × 10 3 EYA2Intronic
LCFAARS-USDA-AGIL-chr24-46907845-00053424g.46441847A > C−0.961849131 2.07 × 10 10 0.267ST8SIA5Intronic
LCFAARS-BFGL-NGS-9359524g.52964581A > G2.190506381 1.76 × 10 7 0.197————
1 UFA = unsaturated fatty acid; MUFA = monounsaturated fatty acid; SCFA = short-chain fatty acid; LCFA = long-chain fatty acid. Subsequently, 47 SNPs from the above preliminary selected SNPs were genotyped in 265 Chinese Holstein cows from another dairy herd by DNA sequencing and were analyzed and validated by Bonferroni ANOVA. Seven important SNPs related to fatty acid content were further identified from the forty-seven SNPs. The specific results are as follows: Single short-chain fatty acids: 2, 4, 5, and 4 SNPs were found to be significantly associated with C4:0, C6:0, C8:0, and C10:0, respectively. These 15 SNPs were distributed on 11 chromosomes, of which 10 were located between genes, 3 in introns, 1 in exon, and 1 in the upstream of a gene. BovineHD0100039519 was significantly correlated with C6:0 and C8:0 and located in the intron of the KCNH8. Three SNPs, ARS-BFGL-NGS-22276 (C6:0, p-value Bonferroni = 8.80 × 10 3 ), ARS-BFGL-NGS-33001 (C8:0, p-value Bonferroni = 3.02 × 10 2 ), and Bovine HD 0100019865(C10:0, p-value Bonferroni = 9.66 × 10 3 ), were identified having significant effects on fatty acid content from 15 SNPs. The three SNPs were located upstream of the TMEM120B, the exon of the EVC, and the intron of the ENSBTAG00000053238, respectively.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, C.; Fan, Y.; Wang, D.; Chu, C.; Shen, X.; Wang, H.; Luo, X.; Nan, L.; Ren, X.; Chen, S.; et al. The Genetic Characteristics of FT-MIRS-Predicted Milk Fatty Acids in Chinese Holstein Cows. Animals 2024, 14, 2901. https://doi.org/10.3390/ani14192901

AMA Style

Li C, Fan Y, Wang D, Chu C, Shen X, Wang H, Luo X, Nan L, Ren X, Chen S, et al. The Genetic Characteristics of FT-MIRS-Predicted Milk Fatty Acids in Chinese Holstein Cows. Animals. 2024; 14(19):2901. https://doi.org/10.3390/ani14192901

Chicago/Turabian Style

Li, Chunfang, Yikai Fan, Dongwei Wang, Chu Chu, Xiong Shen, Haitong Wang, Xuelu Luo, Liangkang Nan, Xiaoli Ren, Shaohu Chen, and et al. 2024. "The Genetic Characteristics of FT-MIRS-Predicted Milk Fatty Acids in Chinese Holstein Cows" Animals 14, no. 19: 2901. https://doi.org/10.3390/ani14192901

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop