Next Article in Journal
Behavior, Intake, Digestion and Milk Yield of Early Lactation Holstein Dairy Cows with Two Levels of Environmental Exposure and Feeding Strategy
Previous Article in Journal
The Effects of Enrichment on Zoo-Housed Scarlet Ibis Behavior
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genomic Selection for Weaning Weight in Alpine Merino Sheep Based on GWAS Prior Marker Information

1
Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
2
Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
*
Authors to whom correspondence should be addressed.
Animals 2024, 14(13), 1904; https://doi.org/10.3390/ani14131904
Submission received: 23 May 2024 / Revised: 19 June 2024 / Accepted: 24 June 2024 / Published: 27 June 2024

Abstract

:

Simple Summary

This study compared the accuracy of estimating the breeding value of weaning weight traits in Alpine Merino sheep using a combination of genome-wide association analysis and genome selection with the best linear unbiased prediction method. The test population was randomly divided into two groups. One group was used for discovering marker information through GWAS analysis and screening the most significant loci, and the other group used different prior marker information to estimate genetic parameters and compare the prediction accuracy of genomic breeding values. The study demonstrates that integrating the genome selection method with prior marker information from GWAS can enhance the prediction accuracy of genome breeding value. This research offers technical support for future breeding efforts of Alpine Merino sheep and the enhancement of genomic breeding value accuracy.

Abstract

This study aims to compare the accuracy of genomic estimated breeding values (GEBV) estimated using a genomic best linear unbiased prediction (GBLUP) method and GEBV estimates incorporating prior marker information from a genome-wide association study (GWAS) for the weaning weight trait in highland Merino sheep. The objective is to provide theoretical and technical support for improving the accuracy of genomic selection. The study used a population of 1007 highland Merino ewes, with the weaning weight at 3 months as the target trait. The population was randomly divided into two groups. The first group was used for GWAS analysis to identify significant markers, and the top 5%, top 10%, top 15%, and top 20% markers were selected as prior marker information. The second group was used to estimate genetic parameters and compare the accuracy of GEBV predictions using different prior marker information. The accuracy was obtained using a five-fold cross-validation. Finally, both groups were subjected to cross-validation. The study’s findings revealed that the heritability of the weaning weight trait, as calculated using the GBLUP model, ranged from 0.122 to 0.394, with corresponding prediction accuracies falling between 0.075 and 0.228. By incorporating prior marker information from GWAS, the heritability was enhanced to a range of 0.125 to 0.407. The inclusion of the top 5% to top 20% significant SNPs from GWAS results as prior information into GS showed potential for improving the accuracy of predicting genomic breeding value.

1. Introduction

Body weight traits are important indicators of economic traits in sheep and play a significant role in animal husbandry. They serve as core indicators for assessing the meat potential of sheep and are closely tied to the health status of individual animals. Additionally, body weight traits are closely associated with yield levels and greatly influence the economic benefits of farmers [1]. A deep understanding and mastery of the genetic patterns related to weight traits can lead to breakthroughs in genetic breeding efforts, thus advancing the sustainable and healthy progression of the sheep industry. However, traditional genetic breeding methods have progressed slowly in the study of weight traits, failing to meet the demands of modern animal husbandry development. The use of modern breeding technology offers a novel approach and methodology for developing high-quality sheep breeds at the molecular level. For instance, the application of modern breeding technology not only expedites the breeding process of Alpine Merino sheep but also enhances breeding accuracy and efficiency. This enables farmers to swiftly identify individuals with exceptional body weight traits and further develop high-quality sheep breeds that align better with market demands. By combining modern breeding techniques with traditional genetic breeding methods, we can gain a deeper understanding of and effectively utilize the genetic mechanisms related to weight traits, ultimately contributing to the sustainable advancement of animal husbandry.
The development and application of molecular markers, such as single-nucleotide polymorphisms (SNPs), have enabled the integration of functionally validated genetic markers with the genomic best linear unbiased prediction (GBLUP) method for molecular marker-assisted selection of breeding values. This not only enhances the accuracy of selection but also allows for early selection, effectively reducing the breeding time interval [2]. However, the majority of economic traits in poultry are quantitative traits governed by multiple minor-effect genes, making it challenging to identify loci with significant effects [3]. Genomic selection (GS), a high-density marker technique spanning the entire genome, was initially introduced by Meuwissen in 2001 [4]. Particularly beneficial for traits with low heritability and those difficult to measure, GS offers advantages such as shortened generation intervals, decreased breeding expenses [5,6,7], and enhanced precision in genomic estimated breeding value (GEBV) [8]. The GS approach encompasses direct methods like GBLUP and indirect methods like Bayes techniques. Though the latter boasts high accuracy in calculations, it is time-consuming and demands considerable computational resources, limiting its practical application [9]. In recent years, the reduced costs of high-throughput sequencing have led to the widespread adoption of GBLUP in livestock and poultry breeding, including pigs [10,11], cattle [12], sheep [13], and chickens [14], due to its efficient computation and user-friendly nature. However, the assumption in the GBLUP model that all SNPs have an equal and effective impact on traits is considered unrealistic. The genetic basis and complexity of various quantitative traits vary, leading to different genetic markers having differing effects. As a result, there remains potential for enhancement in the GBLUP methodology [15,16]. Previous studies have demonstrated that incorporating data from genome-wide association studies (GWAS) can enhance the accuracy of GS for important economic traits in livestock and poultry [17,18]. GWAS play a crucial role in identifying candidate loci associated with economic traits in poultry [19,20]. Various models, such as the General Linear Model (GLM), Mixed Linear Model (MLM), and FarmCPU model, are available for GWAS analysis. In this study, the FarmCPU model within the rMVP software package (https://github.com/xiaolei-lab/rMVP (accessed on 15 May 2023)) was employed for analysis. By combining the strengths of MLM and GLM, this model addresses the limitations between the genetic relationship and population structure relationship through iteration. This approach helps to distinguish genetic trait genes from target trait genes in selection, leading to a more effective reduction in errors.
The Alpine Merino sheep is a newly developed breed that has been successfully bred over a period of 20 years using advanced modern breeding techniques. It was created by crossing Australian Merino sheep as the male parent with Gansu Alpine Merino sheep as the female parent. The breeding process involved three main stages: hybridization improvement, cross fixation, and breeding improvement. The Alpine Merino sheep typically inhabits the highland grassland area of the Qilian Mountains, at altitudes ranging from 2400 to 4070 m throughout the year. Known for its large body size, abundant wool production, soft and delicate wool texture, good elasticity and warmth, and fresh and flavorful meat, the Alpine Merino sheep is recognized for its excellent quality. The breeding of this new breed not only enhanced the genetic resources of Chinese native sheep breeds but also played a vital role in boosting local economic growth in cold and arid mountainous regions. In sheep breeding, body weight serves as a crucial indicator of growth and development, impacting both wool quality and meat production. The early growth rate of sheep holds significant economic value. Though the use of GWAS and GS in animal breeding is well established, only a limited number of studies have utilized GWAS analysis results as prior information for GS prediction accuracy. This study aimed to compare the accuracy of breeding value estimation for Alpine Merino sheep by using weaning body weight (WW) as the target trait. The research integrated genome selection with GWAS analysis and compared it to the best linear unbiased prediction method for the genome. The findings are intended to offer technical support for future Alpine Merino sheep breeding and improve the accuracy of genomic breeding value estimation.

2. Materials and Methods

2.1. Materials

The experimental animals in this study were chosen from the Gansu Sheep Breeding Technology Promotion Station in Huangcheng Town, Sunan Yugu Autonomous County, Zhangye City, Gansu Province. The station maintains a strictly standardized breeding and management system to ensure that all animals are fed and cared for consistently. A total of 1007 Alpine Merino ewes were included in the study, with their body weight measured at three months of age. To minimize experimental error, the animals underwent a 12 h fasting period under natural conditions before being weighed on a calibrated automatic scale. Weight measurements were recorded to two decimal places in kilograms. The population was then randomly divided into two groups using the RAND function, with each group serving as the other’s control. One group was designated as the discovery population for GWAS analysis, from which the top 5%, top 10%, top 15%, and top 20% most significant SNPs were selected as a priori marker information. The other group was then merged with the prior marker information from various sets to calculate genetic parameters and evaluate the prediction accuracy of genome breeding value. The accuracy was assessed using a five-fold cross-validation method, and the two groups were cross-validated to compare the prediction accuracy of the two strategies.
At 3 months of age, each sheep had 3 mL blood samples collected via the jugular vein method and stored in a vacuum sampling container with ethylenediaminetetraacetic acid (EDTA) for genotyping. Genotyping was performed using the fine wool sheep 50 K liquid chip (MolBreeding Biotech Ltd., Shijiazhuang, China). The reference genome used was the Oar v4.0 (GCF 000298735.2) version published on NCBI. After quality control and filtering using PLINK software (version: v1.9b4) and Beagle software (version 5.0), a total of 1007 individuals and 41,956 SNPs were retained for subsequent correlation analysis. The study applied filters such as --geno, --mind, --maf, and --hwe to enhance the quality of each SNP locus. The specific quality control process involved eliminating 534 SNP loci based on locus detection rate, retaining all individuals based on genotype deletion rate, removing 285 SNP loci based on the minor allele frequency threshold, and excluding 3434 SNP loci based on Hardy–Weinberg equilibrium conditions.

2.2. GWAS Model Analysis Methods

Two experimental groups were utilized in this study: an a priori information discovery group and an a priori information verification group. GWAS analysis was conducted using the a priori information discovery population. The FarmCPU model within rMVP software (version 1.0.7) was employed for GWAS analysis of WW. This model offers distinct advantages in addressing population structure and genetic relationships in GWAS, combining fixed effects and random effects to enhance analysis. The fixed effect accounts for population structure and other confounding factors, and the random effect considers genetic correlations within the population, reducing false positives and improving true association detection [21]. Previous research has demonstrated a significant impact of flock and birth type on weaning weight. Therefore, this study incorporated these factors as fixed effects. The FarmCPU model is outlined as follows:
y i = M i 1 b 1 + M i 2 b 2 + + M i n b n + Z i j u j + e i ,
y i = V i + e i ,
In Equations (1) and (2), Equation (1) represents a fixed effect model, and Equation (2) represents a random effect model. Here, y i denotes the phenotypic observation value of the i individual, M i n signifies the classification result of n potential related loci in the model, b n is the corresponding effect value to the locus, Z i j represents the classification result of the j marker of the i individual, u j is the effect value of Z i j , V i stands for the total genetic effect of the i individual, and e i is the residual vector following a normal distribution of e ~ N (0, σ e 2 ). Formulas (1) and (2) are used interchangeably in the execution of the FarmCPU model. Each SNP is considered as a fixed factor for regression analysis, and a significance test is conducted to determine the p value of each SNP. The top 5%, top 10%, top 15%, and top 20% SNPs with smaller p values are then selected, and these SNPs are further weighted based on their heritability to create another set of prior marker information.

2.3. GS Model Analysis Method

In this study, GS analysis was conducted using the GBLUP model in ASReml software (version 4.1.0.176) [22]. The GBLUP model, a method for genome selection analysis, is based on a direct approach. It considers the genetic relatedness between individuals by estimating the genetic variance–covariance matrix (also known as the genome relationship matrix) to predict the estimated breeding value of the individual [23]. The prior label information obtained in this study is validated using a prior information verification group. The GBLUP model utilized in this study is outlined as follows:
y = X b + Z u + e ,
In Formula (3), y represents the phenotypic value vector of the individual, and b represents the fixed effect vector. The model includes birth type and herd as fixed factors for analysis. u represents the random effect, following a normal distribution with a mean of 0 and variance of G σ a 2 , denoted as u ~N(0, G σ a 2 ), where σ e 2 is the genetic variance; G is the genetic relationship matrix between individuals; X and Z are the incidence matrices of b and u ; and e represents the random residual vector, following a normal distribution e N(0, I σ e 2 ). The kinship matrix utilized in GBLUP was proposed by VanRaden [22].
G = M M 2 i = 1 m p i ( 1 p i ) ,
In Equation (4), M represents the m × n standardized genotype matrix, where m is the number of markers, and n is the number of genotyped individuals, with p i being the minimum allele frequency of the i locus.
The method for constructing the genetic relationship matrix in GS, combined with GWAS prior information, involves creating two matrices: G1, using significant prior marker information from GWAS results, and G2, using the remaining SNP sites. The genetic variance of the WW trait in G1 and G2 is then calculated, and a new matrix G3 is formed by weighting the proportion of explained genetic variance. The equation of G3 is as follows:
G 3 = γ G 1 + ( 1 γ ) G 2 ,
In Formula (5), γ = σ G 1 2 σ G 1 2 + σ G 2 2 , with σ G 1 2 and σ G 2 2 representing the genetic variances of G1 and G2 matrices, respectively.

2.4. Methods for Evaluating Predictive Accuracy of GS

The current genome prediction selection accuracy relies on calculating the Pearson correlation coefficient between GEBV and TBV, a method introduced by Meuwissen in 2001 [4]. This study utilized a five-fold cross-validation approach to assess the accuracy of genomic breeding values [24]. The GS analysis population was randomly divided into 5 groups, with 4 combined as the reference group and the remaining one used for validation. The prediction accuracy was determined by dividing the Pearson correlation coefficient between the obtained genomic breeding value and the phenotypic value by the square of heritability. The average value served as the benchmark for evaluating prediction accuracy. The calculation formula is detailed as follows:
R ( y , y p r e ) = i = 1 N ( y i y ¯ ) ( y i , p r e y p r e ¯ ) i = 1 N ( y i y ¯ ) 2 i = 1 N ( y i , p r e y p r e ¯ ) 2 ,
In Formula (6), y and y p r e represent the predicted value (GEBV) and observed value (TBV), respectively.

3. Results

3.1. Statistics and Distribution of Phenotypic Data

The WW of 1007 Alpine Merino sheep was meticulously documented. To assess the precision of the GBLUP model versus genomic prediction with GWAS information, the phenotype data of the 1007 ewes were randomly split into two groups using the RAND function. Group 1 consisted of 503 ewes, and group 2 consisted of 504 ewes. Each group served as the control group for comparative analysis. The results of descriptive statistical analysis can be found in Table 1.

3.2. Genotype Data

A total of 41,956 SNPs suitable for GWAS and GS were derived through quality control and genotype imputation of data from the 50 K Illumina chip for Alpine Merino sheep. Figure 1 displays the map of marker densities across 26 autosomes.
The population structure analysis of group 1 and group 2 was conducted separately, with the results displayed in Figure 2a,c, revealing distinct stratification within the two experimental groups. Additionally, linkage disequilibrium analysis was performed on Alpine Merino sheep, with the results presented in Figure 2b,d.

3.3. GWAS Analysis Results

Utilizing the FarmCPU model, this study examined the relationship between group 1 and group 2 based on measured WW traits. The results, including a Manhattan plot and QQ plot, can be seen in Figure 3. Additionally, a subset of the most significant SNPs, comprising the top 5%, top 10%, top 15%, and top 20%, was identified as a set of prior marker information. Specifically, the top 5% yielded 2000 SNPs, and the top 10% had 4000 SNPs. The top 15% and top 20% contained 6000 and 8000 SNPs, respectively. The remaining SNPs were distributed across the chromosomes.

3.4. GS Analysis Results

Despite the questionable hypothesis underlying the GBLUP model, it still serves as an effective method for selection prediction based on genotype data. In the absence of GWAS results, Table 2 presents the genetic parameters of group 1, with the G matrix yielding a genetic variance of 1.135, an environmental variance of 8.153, and a heritability estimate of 0.122. The prediction accuracy from five-fold cross-validation was determined to be 0.075.
With GWAS results based on group 2, the top 5% most significant SNPs from GWAS were utilized to construct the genetic relationship matrix G1, resulting in a genetic variance of 1.022, an environmental variance of 8.275, and a heritability of 0.110. The remaining 95% of SNPs were employed to create G2, which showed a genetic variance of 1.124, an environmental variance of 8.166, and a heritability of 0.121. The weighted values for G1 and G2 were calculated as 0.476 and 0.533, respectively. By combining GWAS prior marker information to form the genetic relationship matrix G3, the genetic variance was determined to be 1.161, the environmental variance to be 8.121, and the heritability to be 0.125. The prediction accuracy was 0.073, resulting in a decrease of 2.67% in accuracy.
The study focused on the top 10% of SNPs to create the genetic relationship matrix G1, which revealed a genetic variance of 1.369, an environmental variance of 7.930, and a heritability of 0.147. The remaining 90% of SNPs were used to form the genetic relationship matrix G2, showing a genetic variance of 1.083, an environmental variance of 8.207, and a heritability of 0.117. The weighted values for G1 and G2 were determined as 0.558 and 0.442, respectively. Additionally, the genetic variance and environmental variance were calculated as 1.319 and 7.962, respectively, using the genetic relationship matrix G3 constructed with prior marker information from GWAS. The heritability was found to be 0.142, with a prediction accuracy of 0.090, representing a 20.00% improvement in accuracy.
The genetic parameters of group 2 are presented in Table 3. Using the G matrix, the genetic variance was calculated to be 3.932, and the environmental variance 6.045, resulting in a heritability of 0.394. The prediction accuracy from a five-fold cross-validation was 0.228. Based on the GWAS analysis of group 1, the top 5% most significant SNPs were selected to form the genetic relationship matrix G1. The genetic and environmental variances for G1 were 2.853 and 7.063, respectively, with a heritability of 0.288. The remaining 95% of SNPs were used to create the genetic relationship matrix G2, which explained a genetic variance of 3.849 and an environmental variance of 6.131, resulting in a heritability of 0.386. The weights assigned to G1 and G2 were 0.427 and 0.573, respectively. By combining GWAS marker information, the genetic relationship matrix G3 was formed, with a genetic variance of 3.922 and an environmental variance of 6.011. The heritability for G3 was 0.395, and the prediction accuracy improved to 0.236, representing a 3.51% increase in accuracy.
The study selected the top 20% of SNPs to construct the genetic relationship matrix G1, resulting in a calculated genetic variance of 3.430 and an environmental variance of 6.506, with a heritability of 0.345. The remaining 80% of SNPs were used to create matrix G2, which explained a genetic variance of 3.849 and an environmental variance of 6.135, yielding a heritability of 0.386. The weights for G1 and G2 were determined to be 0.472 and 0.528, respectively. Additionally, using the genetic relationship matrix G3, which combined GWAS prior marker information, the study found a genetic variance of 3.899 and an environmental variance of 6.036, resulting in a heritability of 0.392 and a prediction accuracy of 0.226. However, the accuracy decreased by 0.88%. The statistical figure of prediction accuracy is shown in Figure 4.

4. Discussion

In addition to being influenced by minor polygenes, the weight trait is also impacted by major genes that can have a more significant effect [25]. Currently, the GWAS has become a crucial method for identifying candidate loci related to economic traits in livestock and poultry. Population stratification is a known factor that can influence GWAS results, as it may lead to false positives [26]. PCA is commonly utilized in population-level studies [27]. The PCA analysis of group 1 and group 2 revealed stratification between them. Therefore, the mixed linear model is employed to enhance data accuracy and reduce errors. Various software packages and models are available for GWAS, including the widely used GLM, MLM, FarmCPU model, and others. In this study, the FarmCPU model from the rMVP package was utilized for analysis. FarmCPU, an extension of the compressed MLM, integrates fixed effects and random effects in GWAS analysis [28]. The fixed effect accounts for population structure and other confounding factors, and the random effect considers genetic correlations within the population, reducing false positives and improving true association detection [21]. By combining the strengths of MLM and GLM and addressing their limitations through iteration, FarmCPU can effectively handle highly polygenic traits with numerous genetic variations influencing the phenotype [29]. Overall, utilizing the FarmCPU statistical model for GWAS analysis of WW traits can enhance the accuracy and efficiency in identifying genetic associations by integrating fixed effects and random effects.
GS studies that incorporate prior information face the challenge of effectively utilizing this information. In this study, the GBLUP model was chosen for GS analysis. Though assuming all SNPs have the same effect and distribution is unrealistic in the GBLUP model, it has the advantage of accommodating multiple G matrices and allowing for their weighting [30]. Various researchers have enhanced the GS model by integrating prior information, with advancements such as genomic features BLUP [31] and BayesRC [32]. For instance, Zhang et al. [33] introduced TABLUP in 2010, which optimizes the genome relationship matrix and constructs a trait-specific relationship matrix to improve prediction accuracy. On the other hand, the BayesB-based method is computationally intensive and complex, offering limited benefits to GBLUP accuracy. In 2015, a method based on rrBLUP was proposed, which calculates the effect of each marker, sorts them based on their magnitude, and constructs two matrices for weighted addition, resulting in a significant enhancement in prediction accuracy [34,35].
In this study, a genome-wide significance level of 1 (1/Nsnp) was utilized, and the Bonferroni correction method was applied for multiple tests to identify significant loci. The GWAS analysis of weaning weight traits demonstrated a well-fitted Q-Q diagram, suggesting the model’s validity and the reliability of the results. A total of five significantly associated SNPs were identified, with candidate genes linked to body weight traits found on chromosomes 6, 8, 12, etc. These findings align with previous studies by Yuan et al. [36] and Liu et al. [37], which highlighted the role of FAM184B, NCAPG, and LCORL genes on chromosome 6 in sheep growth, carcass traits, and meat quality. SNP loci were sorted based on the p value from GWAS analysis, ranging from the top 5% to top 20% most significant SNPs, which were then integrated into the GBLUP model as prior information. Cross-validation was employed to predict the GS of the second group using the GWAS results of the first group, and vice versa. The prediction accuracy of the GBLUP model-based GS method in both sample groups ranged from 0.075 to 0.228. This was not a significant difference compared to previous studies on pig carcass weight [38], sheep carcass traits [39], and Hanwoo beef cattle weight [40]. When constructing the genome correlation matrix (G matrix) using GWAS results as prior information, the prediction accuracy ranged from 0.073 to 0.236. Notably, utilizing GWAS results from the second group as prior information for predicting the first group led to varying SNP selections based on different GWAS result percentiles, impacting prediction accuracy differently. Overall, increasing the number of prior markers initially increased genetic variance, heritability explained by prior marker information, and GS prediction accuracy before reaching a peak and then declining. These findings align with previous studies by Li [35], Schrag [41], and Gao [42], suggesting that incorporating GWAS prior information enhances GS prediction accuracy. The limited number of prior information sites identified using 50 K liquid chip data may necessitate more accurate screening of prior information through future data resequencing or due to the small sample size. Despite other potential interference factors, the G matrix created from GWAS prior information can enhance the prediction accuracy of GS compared to traditional breeding methods, thereby positively impacting the overall accuracy of genome selection.
Although GS has some limitations, its accuracy can be enhanced by utilizing GWAS analysis results to construct the G matrix. This improvement positively impacts the precision of genome selection, which is widely utilized in practical production. Ongoing model enhancements are being made [43], considering that different populations have distinct breeding objectives and genetic backgrounds. Therefore, when applying genome selection in production, it is crucial to choose prior information and a GS model suitable for the population biology, adjusting relevant parameters as needed. Future studies should explore other methods that combine GWAS and genome selection to further optimize prediction accuracy for handling larger-scale data analysis.

5. Conclusions

This study focused on Alpine Merino ewes, utilizing the top 5% to 20% SNPs with small p values from GWAS analysis as prior information. By integrating this data with the GS model, the research analyzed genetic parameter estimation and prediction accuracy of weaning weight traits. Incorporating the top 5% to top 20% significant SNPs from GWAS results as prior information into GS can improve the accuracy of predicting genomic breeding value. This improvement in accuracy can offer valuable support for more precise prediction of the genomic breeding value of alpine Merino sheep.

Author Contributions

Conceptualization, H.W., Y.Y. and T.G.; methodology, H.W. and C.L.; software, H.W. and C.Y.; validation, H.W., C.L., X.A. and R.Z.; formal analysis, H.W. and C.L. investigation, H.W. and C.L.; resources, J.L.; data curation, C.L. and Y.Y.; writing—original draft preparation, H.W. and C.L.; writing—review and editing, H.W. and Y.Y.; visualization, C.L. and J.L.; supervision, C.Y. and J.L.; project administration, Y.Y.; funding acquisition, Y.Y. and T.G. All authors have read and agreed to the published version of the manuscript.

Funding

National Key R&D Program (2021YFD1300901); Gansu Provincial Science and Technology Major Project (22ZD6NA037); Science and Technology Innovation 2020 Project-Agricultural Biological Breeding Major Project; Major Projects of Gansu Provincial Science and Technology Department in 2021 (21ZD11NM001); Natural Science Foundation of Gansu Province (21JR7RA033); Inner Mongolia Autonomous Region Science and Technology Plan Project (2021GG320); Lanzhou Talent Innovation and Entrepreneurship Project (2020-RC-130).

Institutional Review Board Statement

All experimental protocols and procedures were approved by the Institutional Animal Care and Use Committee of Lanzhou Institute of Husbandry and Pharmaceutical Science of the Chinese Academy of Agricultural Sciences (Approval No. NKMYD201805; Approval Date: 18 October 2018). Additionally, we collected and analyzed the blood samples of the studied animals in strict accordance with the People’s Republic of China’s Guidelines for the Care and Use of Laboratory Animals.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data will be available upon request from the corresponding author.

Acknowledgments

The Gansu Breeding Technology Extension Station Sheep in Huangcheng, Sunan Yugur Autonomous County, Zhangye City, Gansu Province, which provided a batch of Alpine Merino Sheep ewes for unified feeding and management, is primarily responsible for the success of this experiment.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Gebreselassie, G.; Berihulay, H.; Jiang, L.; Ma, Y. Review on genomic regions and candidate genes associated with economically important production and reproduction traits in sheep (Ovies aries). Animals 2019, 10, 33. [Google Scholar] [CrossRef] [PubMed]
  2. Meuwissen, T.; Hayes, B.; Goddard, M. Genomic selection: A paradigm shift in animal breeding. Anim. Front. 2016, 6, 6–14. [Google Scholar] [CrossRef]
  3. Heffner, E.L.; Sorrells, M.E.; Jannink, J.L. Genomic selection for crop improvement. Crop Sci. 2009, 49, 1–12. [Google Scholar] [CrossRef]
  4. Meuwissen, T.H.; Hayes, B.J.; Goddard, M.E. Prediction of total genetic value using genome-wide dense marker maps. Genetics 2001, 157, 1819–1829. [Google Scholar] [CrossRef] [PubMed]
  5. Hayes, B.J.; Visscher, P.M.; Goddard, M.E. Increased accuracy of artificial selection by using the realized relationship matrix. Genet. Res. 2009, 91, 47–60. [Google Scholar] [CrossRef] [PubMed]
  6. Habier, D.; Tetens, J.; Seefried, F.-R.; Lichtner, P.; Thaller, G. The impact of genetic relationship information on genomic breeding values in German Holstein cattle. Genet. Sel. Evol. 2010, 42, 5. [Google Scholar] [CrossRef] [PubMed]
  7. Schaeffer, L.R. Strategy for applying genome-wide selection in dairy cattle. J. Anim. Breed. Genet. 2006, 123, 218–223. [Google Scholar] [CrossRef] [PubMed]
  8. Goddard, M.E.; Hayes, B.J. Genomic selection. J. Anim. Breed. Genet. 2007, 124, 323–330. [Google Scholar] [CrossRef] [PubMed]
  9. Yin, L.L.; Ma, Y.L.; Xiang, T.; Zhu, M.J.; Yu, M.; Li, X.Y.; Liu, X.L.; Zhao, S.H. The progress and prospect of genomic selection models. Acta Vet. Zootech. Sin. 2019, 50, 233–242. [Google Scholar]
  10. Esfandyari, H.; Sørensen, A.C.; Bijma, P. A crossbred reference population can improve the response to genomic selection for crossbred performance. Genet. Sel. Evol. 2015, 47, 76. [Google Scholar] [CrossRef]
  11. Song, H.; Ye, S.; Jiang, Y.; Zhang, Z.; Zhang, Q.; Ding, X. Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs. Genet. Sel. Evol. 2019, 51, 58. [Google Scholar] [CrossRef]
  12. Lourenco, D.A.; Tsuruta, S.; Fragomeni, B.O.; Masuda, Y.; Aguilar, I.; Legarra, A.; Bertrand, J.K.; Amen, T.S.; Wang, L.; Moser, D.W.; et al. Genetic evaluation using single-step genomic best linear unbiased predictor in American Angus. J. Anim. Sci. 2015, 93, 2653–2662. [Google Scholar] [CrossRef]
  13. Zhao, Z.D.; Zhang, L. Applications of genome selection in sheep breeding. Yi Chuan 2019, 41, 293–303. [Google Scholar]
  14. Wolc, A.; Zhao, H.H.; Arango, J.; Settar, P.; Fulton, J.E.; O’sullivan, N.P.; Preisinger, R.; Stricker, C.; Habier, D.; Fernando, R.L. Response and inbreeding from a genomic selection experiment in layer chickens. Genet. Sel. Evol. 2015, 47, 59. [Google Scholar] [CrossRef] [PubMed]
  15. Goddard, M.E.; Kemper, K.E.; MacLeod, I.M.; Chamberlain, A.J.; Hayes, B.J. Genetics of complex traits: Prediction of phenotype, identification of causal polymorphisms and genetic architecture. Proc. Biol. Sci. 2016, 283. [Google Scholar] [CrossRef] [PubMed]
  16. Yang, J.; Zeng, J.; Goddard, M.E.; Wray, N.R.; Visscher, P.M. Concepts, estimation and interpretation of SNP-based heritability. Nat. Genet. 2017, 49, 1304–1310. [Google Scholar] [CrossRef]
  17. Lopes, M.S.; Bovenhuis, H.; van Son, M.; Nordbø, Ø.; Grindflek, E.H.; Knol, E.F.; Bastiaansen, J.W. Using markers with large effect in genetic and genomic predictions. J. Anim. Sci. 2017, 95, 59–71. [Google Scholar] [CrossRef]
  18. Zhang, Z.; Ober, U.; Erbe, M.; Zhang, H.; Gao, N.; He, J.; Li, J.; Simianer, H. Improving the accuracy of whole genome prediction for complex traits using the results of genome wide association studies. PLoS ONE 2014, 9, e93017. [Google Scholar] [CrossRef]
  19. Yurchenko, A.A.; Deniskova, T.E.; Yudin, N.S.; Dotsev, A.V.; Khamiruev, T.N.; Selionova, M.I.; Egorov, S.V.; Reyer, H.; Wimmers, K.; Brem, G.; et al. High-density genotyping reveals signatures of selection related to acclimation and economically important traits in 15 local sheep breeds from Russia. BMC Genom. 2019, 20, 294. [Google Scholar] [CrossRef]
  20. Ghasemi, M.; Zamani, P.; Vatankhah, M.; Abdoli, R. Genome-wide association study of birth weight in sheep. Animal 2019, 13, 1797–1803. [Google Scholar] [CrossRef] [PubMed]
  21. Liu, X.; Huang, M.; Fan, B.; Buckler, E.S.; Zhang, Z. Iterative Usage of Fixed and Random Effect Models for Powerful and Efficient Genome-Wide Association Studies. PLoS Genet. 2016, 12, e1005767. [Google Scholar] [CrossRef] [PubMed]
  22. VanRaden, P.M. Efficient methods to compute genomic predictions. J. Dairy. Sci. 2008, 91, 4414–4423. [Google Scholar] [CrossRef]
  23. Zhang, J.; Liu, F.; Reif, J.C.; Jiang, Y. On the use of GBLUP and its extension for GWAS with additive and epistatic effects. G3 Genes Genomes Genet. 2021, 11, jkab122. [Google Scholar] [CrossRef] [PubMed]
  24. Browne, N. Cross-Validation Methods. J. Math. Psychol. 2000, 44, 108–132. [Google Scholar] [CrossRef] [PubMed]
  25. Mehrban, H.; Lee, D.H.; Moradi, M.H.; IlCho, C.; Naserkheil, M.; Ibáñez-Escriche, N. Predictive performance of genomic selection methods for carcass traits in Hanwoo beef cattle: Impacts of the genetic architecture. Genet. Sel. Evol. 2017, 49, 1. [Google Scholar] [CrossRef] [PubMed]
  26. Tian, C.; Gregersen, P.K.; Seldin, M.F. Accounting for ancestry: Population substructure and genome-wide association studies. Hum. Mol. Genet. 2008, 17, R143–R150. [Google Scholar] [CrossRef] [PubMed]
  27. Ma, S.; Shi, G. On rare variants in principal component analysis of population stratification. BMC Genet. 2020, 21, 34. [Google Scholar] [CrossRef] [PubMed]
  28. Alemu, A.; El Baouchi, A.; El Hanafi, S.; Kehel, Z.; Eddakhir, K.; Tadesse, W. Genetic analysis of grain protein content and dough quality traits in elite spring bread wheat (Triticum aestivum) lines through association study. J. Cereal Sci. 2021, 100, 103214. [Google Scholar] [CrossRef]
  29. Kusmec, A.; Schnable, P.S. Farm CPU pp: Efficient large-scale genomewide association studies. Plant Direct. 2018, 2, e00053. [Google Scholar] [CrossRef]
  30. Ren, D.; An, L.; Li, B.; Qiao, L.; Liu, W. Efficient weighting methods for genomic best linear-unbiased prediction (BLUP) adapted to the genetic architectures of quantitative traits. Heredity 2020, 126, 320–334. [Google Scholar] [CrossRef]
  31. Edwards, S.M.; Sørensen, I.F.; Sarup, P.; Mackay, T.F.; Sørensen, P. Genomic Prediction for Quantitative Traits Is Improved by Mapping Variants to Gene Ontology Categories in Drosophila melanogaster. Genetics 2016, 203, 1871–1883. [Google Scholar] [CrossRef] [PubMed]
  32. MacLeod, I.; Bowman, P.; Vander Jagt, C.; Haile-Mariam, M.; Kemper, K.; Chamberlain, A.; Schrooten, C.; Hayes, B.J.; Goddard, M. Exploiting biological priors and sequence variants enhances QTL discovery and genomic prediction of complex traits. BMC Genom. 2016, 17, 144. [Google Scholar] [CrossRef] [PubMed]
  33. Zhang, Z.; Liu, J.; Ding, X.; Bijma, P.; de Koning, D.-J.; Zhang, Q. Best linear unbiased prediction of genomic breeding values using a trait-specific marker-derived relationship matrix. PLoS ONE 2010, 5, e12648. [Google Scholar] [CrossRef] [PubMed]
  34. Zhang, Z.; Erbe, M.; He, J.; Ober, U.; Gao, N.; Zhang, H.; Simianer, H.; Li, J. Accuracy of whole-genome prediction using a genetic architecture-enhanced variance-covariance matrix. G3 Genes Genomes Genet. 2015, 5, 615–627. [Google Scholar] [CrossRef] [PubMed]
  35. Li, C.; Li, J.; Wang, H.; Zhang, R.; An, X.; Yuan, C.; Guo, T.; Yue, Y. Genomic Selection for Live Weight in the 14th Month in Alpine Merino Sheep Combining GWAS Information. Animals 2023, 13, 3516. [Google Scholar] [CrossRef] [PubMed]
  36. Yuan, Z.; Sunduimijid, B.; Xiang, R.; Behrendt, R.; Knight, M.I.; Mason, B.A.; Reich, C.M.; Prowse-Wilkins, C.; Vander Jagt, C.J.; Chamberlain, A.J.; et al. Expression quantitative trait loci in sheep liver and muscle contribute to variations in meat traits. Genet. Sel. Evol. 2021, 53, 8. [Google Scholar] [CrossRef] [PubMed]
  37. Liu, Z.; Bai, C.; Shi, L.; He, Y.; Hu, M.; Sun, H.; Peng, H.; Lai, W.; Jiao, S.; Zhao, Z.; et al. Detection of selection signatures in South African Mutton Merino sheep using whole-genome sequencing data. Anim. Genet. 2022, 53, 224–229. [Google Scholar] [CrossRef] [PubMed]
  38. Iqbal, A.; Choi, T.-J.; Kim, Y.-S.; Lee, Y.-M.; Zahangir Alam, M.; Jung, J.-H.; Choe, H.-S.; Kim, J.-J. Comparison of genomic predictions for carcass and reproduction traits in Berkshire, Duroc and Yorkshire populations in Korea. Anim. Biosci. 2019, 32, 1657–1663. [Google Scholar] [CrossRef] [PubMed]
  39. Bolormaa, S.; Gore, K.; van der Werf, J.H.J.; Hayes, B.J.; Daetwyler, H.D. Design of a low-density SNP chip for the main Australian sheep breeds and its effect on imputation and genomic prediction accuracy. Anim. Genet. 2015, 46, 544–556. [Google Scholar] [CrossRef]
  40. Naserkheil, M.; Lee, D.H.; Mehrban, H. Improving the accuracy of genomic evaluation for linear body measurement traits using single-step genomic best linear unbiased prediction in Hanwoo beef cattle. BMC Genet. 2020, 21, 144. [Google Scholar] [CrossRef]
  41. Schrag, T.A.; Westhues, M.; Schipprack, W.; Seifert, F.; Thiemann, A.; Scholten, S.; Melchinger, A.E. Beyond Genomic Prediction: Combining Different Types of omics Data Can Improve Prediction of Hybrid Performance in Maize. Genetics 2018, 208, 1373–1385. [Google Scholar] [CrossRef] [PubMed]
  42. Gao, N.; Teng, J.; Ye, S.; Yuan, X.; Huang, S.; Zhang, H.; Zhang, X.; Li, J.; Zhang, Z. Genomic Prediction of Complex Phenotypes Using Genic Similarity Based Relatedness Matrix. Front. Genet. 2018, 9, 364. [Google Scholar] [CrossRef] [PubMed]
  43. Yin, L.; Zhang, H.; Zhou, X.; Yuan, X.; Zhao, S.; Li, X.; Liu, X. KAML: Improving genomic prediction accuracy of complex traits using machine learning determined parameters. Genome Biol. 2020, 21, 146. [Google Scholar] [CrossRef]
Figure 1. Chromosome-specific SNP density in 1 Mb genomic intervals. The number of SNPs is represented in a green to red scale.
Figure 1. Chromosome-specific SNP density in 1 Mb genomic intervals. The number of SNPs is represented in a green to red scale.
Animals 14 01904 g001
Figure 2. Analysis of group 1 and group 2 population structure and attenuation of LD in the Atlas. (a) Group 1 PCA diagram; (b) group 1 LD attenuation diagram; (c) group 2 PCA diagram; (d) group 2 LD attenuation diagram.
Figure 2. Analysis of group 1 and group 2 population structure and attenuation of LD in the Atlas. (a) Group 1 PCA diagram; (b) group 1 LD attenuation diagram; (c) group 2 PCA diagram; (d) group 2 LD attenuation diagram.
Animals 14 01904 g002
Figure 3. GWAS analysis results. (a) Group 1 Manhattan plot; (b) group 1 QQ plot; (c) group 2 Manhattan plot; (d) group 2 QQ chart.
Figure 3. GWAS analysis results. (a) Group 1 Manhattan plot; (b) group 1 QQ plot; (c) group 2 Manhattan plot; (d) group 2 QQ chart.
Animals 14 01904 g003
Figure 4. (a) Prediction accuracy statistics chart of group 1; (b) prediction accuracy statistics chart of group 2.
Figure 4. (a) Prediction accuracy statistics chart of group 1; (b) prediction accuracy statistics chart of group 2.
Animals 14 01904 g004
Table 1. Descriptive statistics of WW.
Table 1. Descriptive statistics of WW.
ItemNumberMeanSdMedianTrimmedMedMinMaxSe
All100728.273.1928.0028.212.9718.0037.600.10
Group 150328.123.1828.0028.072.9718.0037.600.14
Group 250428.423.1928.2028.343.2619.0037.000.14
Sd stands for standard deviation, Med indicates the median, Min indicates the minimum, and Max indicates the maximum.
Table 2. Genetic parameters of group 1.
Table 2. Genetic parameters of group 1.
Prior Marker InformationMatrixGenetic VarianceEnvironmental
Variance
HeritabilityWeightPrediction AccuracyPromotion
-G1.1358.1530.122-0.075-
Top 5%G11.0228.2750.1100.476--
G21.1248.1660.1210.533--
G31.1618.1210.125-0.073-2.67%
Top 10%G11.3697.9300.1470.558--
G21.0838.2070.1170.442--
G31.3197.9620.142-0.090+20.00%
Top 15%G11.0858.2070.1170.492--
G21.1218.1690.1210.508--
G31.1438.1360.123-0.078+4.00%
Top 20%G11.2828.0090.1380.546--
G21.0688.2230.1150.454--
G31.2288.0500.132-0.082+9.33%
Table 3. Genetic parameters of group 2.
Table 3. Genetic parameters of group 2.
Prior Marker InformationMatrixGenetic VarianceEnvironmental
Variance
HeritabilityWeightPrediction AccuracyPromotion
-G3.9326.0450.394-0.228-
Top 5%G12.8537.0630.2880.427--
G23.8496.1310.3860.573--
G33.9226.0110.395-0.236+3.51%
Top 10%G13.3376.5900.3360.469--
G23.7896.1870.3800.531--
G34.0495.8940.407-0.235+3.07%
Top 15%G13.4066.5290.3430.472--
G23.8246.1560.3830.528--
G33.9376.0000.396-0.2280
Top 20%G13.4306.5060.3450.472--
G23.8496.1350.3860.528--
G33.8996.0360.392-0.226−0.88%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, H.; Li, C.; Li, J.; Zhang, R.; An, X.; Yuan, C.; Guo, T.; Yue, Y. Genomic Selection for Weaning Weight in Alpine Merino Sheep Based on GWAS Prior Marker Information. Animals 2024, 14, 1904. https://doi.org/10.3390/ani14131904

AMA Style

Wang H, Li C, Li J, Zhang R, An X, Yuan C, Guo T, Yue Y. Genomic Selection for Weaning Weight in Alpine Merino Sheep Based on GWAS Prior Marker Information. Animals. 2024; 14(13):1904. https://doi.org/10.3390/ani14131904

Chicago/Turabian Style

Wang, Haifeng, Chenglan Li, Jianye Li, Rui Zhang, Xuejiao An, Chao Yuan, Tingting Guo, and Yaojing Yue. 2024. "Genomic Selection for Weaning Weight in Alpine Merino Sheep Based on GWAS Prior Marker Information" Animals 14, no. 13: 1904. https://doi.org/10.3390/ani14131904

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop