Article

Genomic Selection for End-Use Quality and Processing Traits in Soft White Winter Wheat Breeding Program with Machine and Deep Learning Models

1 Department of Crop and Soil Sciences, Washington State University, Pullman, WA 99164, USA
2 USDA-ARS Western Wheat Quality Laboratory, E-202 Food Quality Building, Washington State University, Pullman, WA 99164, USA
* Author to whom correspondence should be addressed.
Biology 2021, 10(7), 689; https://doi.org/10.3390/biology10070689
Submission received: 30 May 2021 / Revised: 13 July 2021 / Accepted: 17 July 2021 / Published: 20 July 2021
(This article belongs to the Special Issue Genetic Improvement and Breeding of Wheat)

Simple Summary

Wheat (Triticum aestivum L.) breeding programs mainly focus on improving grain yield, biotic and abiotic stress tolerance, and end-use quality traits. End-use quality and processing traits are combinations of various predefined parameters. Genomic selection (GS) opens the potential for selecting improved end-use quality lines. This study explored the potential of machine and deep learning-based GS models for predicting end-use quality traits in wheat. Deep learning models were superior to traditional statistical and Bayesian models under all prediction scenarios. The high accuracy observed for end-use quality traits in this study supports predicting them in early generations, leading to the advancement of superior genotypes to more extensive grain yield trials.

Abstract

Breeding for grain yield, biotic and abiotic stress resistance, and end-use quality are important goals of wheat breeding programs. Screening for end-use quality traits is usually secondary to grain yield due to high labor needs, cost of testing, and the large seed requirements for phenotyping. Genomic selection provides an alternative by predicting performance from genome-wide markers under forward and across-location predictions, where a previous year’s dataset can be used to build the models. Given the large datasets generated in breeding programs, we explored the potential of machine and deep learning models to predict fourteen end-use quality traits in a winter wheat breeding program. The population used consisted of 666 wheat genotypes screened for five years (2015–19) at two locations (Pullman and Lind, WA, USA). Nine different models, including two machine learning models (random forest and support vector machine) and two deep learning models (convolutional neural network and multilayer perceptron), were explored for cross-validation, forward, and across-location predictions. The prediction accuracies for different traits varied from 0.45–0.81, 0.29–0.55, and 0.27–0.50 under cross-validation, forward, and across-location predictions, respectively. In general, forward prediction accuracies increased over time as the training data size grew, a trend that was most evident for the machine and deep learning models. Deep learning models were superior to the traditional ridge regression best linear unbiased prediction (RRBLUP) and Bayesian models under all prediction scenarios. The high accuracy observed for end-use quality traits in this study supports predicting them in early generations, leading to the advancement of superior genotypes to more extensive grain yield trials. Furthermore, the superior performance of machine and deep learning models strengthens the case for including them in large-scale breeding programs for predicting complex traits.

1. Introduction

Wheat (Triticum aestivum L.) breeding programs mainly focus on improving grain yield, biotic and abiotic stress tolerance, and end-use quality traits. Hexaploid wheat is classified into hard and soft wheat classes based on kernel texture, milling quality, protein strength, and water absorption [1,2,3]. Washington State was ranked fourth in U.S. wheat production in 2020. About 80% of wheat grown in eastern Washington is soft white wheat (SWW), one of the six classes grown in the USA. SWW is the smallest wheat class and is consistently in demand from overseas markets owing to its end-use quality attributes. More than 85% of the SWW produced in the Pacific Northwest (PNW) region is exported to markets in countries like Japan, Korea, The Philippines, and Indonesia [4].
End-use quality and processing traits are combinations of various predefined parameters [5]. Milling traits quantify the flour and break flour extracted from the grain, expressed as flour yield and break flour yield [6]. Thermogravimetric ovens are used to measure flour ash. Lower flour ash is preferred because higher ash levels indicate greater bran contamination, which reduces the functionality of most doughs and batters [6]. The sugar snap cookie test is an essential test for evaluating SWW to meet the product performance expectations of overseas markets. Cookies are baked for lines within the breeding program, and SWW lines with a cookie diameter above 9.3 cm are preferred [7].
Grain characteristics commonly measured in SWW include kernel hardness, kernel size, kernel weight, test weight, and grain protein content. Kernel weight, kernel size, and kernel texture (hardness) are measured with a single kernel characterization system (SKCS). Lower values from the SKCS indicate softness; thus, SKCS values are negatively correlated with break flour yield. However, the two measures of kernel texture are not entirely correlated, because SKCS captures only kernel resistance to crushing, whereas break flour yield also reflects particle size, sieving, and grain structure [8]. High gluten strength or viscoelastic strength is required for bread baking, whereas confectionery products require less gluten strength and lower water absorption. Gluten strength and water absorption capacity are measured using sodium dodecyl sulfate sedimentation and water solvent retention capacity tests. Lower water absorption in SWW contributes to better cookie spread [2,5].
Major genes influencing end-use quality traits are typically already fixed in most breeding programs, especially within different market classes. Until now, marker-assisted selection has been used for major genes controlling end-use quality, namely, low molecular weight glutenins, high molecular weight glutenins, granule bound starch synthase I (amylose composition), and puroindolines (kernel hardness) [2,9]. These molecular markers only aid in differentiating wheat classes earlier in the breeding program; they do not provide a complete profile of the different end-use quality traits. Previous linkage mapping and genome-wide association studies in SWW have shown that a large number of small-effect QTLs control most end-use quality traits in addition to the already fixed genes [4,10,11]. Similarly, 299 small-effect QTLs were identified using multi-locus genome-wide association studies for nine end-use quality traits in hard wheat [12]. Kristensen et al. [13] were unable to identify significant QTLs for Zeleny sedimentation, grain protein content, test weight, thousand kernel weight, and falling number in wheat and suggested genomic selection as the best alternative for predicting such quantitative traits.
Genomic selection (GS) opens up the potential for selecting improved end-use quality lines given the small effects of these loci, the limited seed available early in the breeding pipeline for conducting tests, and the time constraints in winter wheat before sowing the next cycle [14]. GS uses genotypic and phenotypic data from previous breeding lines or populations to train predictive statistical models. These trained models are subsequently used to predict the genomic estimated breeding values (GEBVs) of genotyped lines [15]. GS has shown the potential to enhance genetic gain by reducing breeding cycle time and improving selection accuracy [16,17,18]. This is especially important for winter wheat end-use quality traits, as phenotyping requires more than three months and data from the quality lab are often not available between harvest and planting. This results in either a one-year increase in the breeding cycle or the passage of undesirable lines into the next growing season. Furthermore, phenotyping requires a large amount of seed and is costly, so large-scale testing is often not conducted until later generations. Currently, the cost of genotyping 10,000 lines with high-density genotyping-by-sequencing is equivalent to the cost of phenotyping 200 lines for end-use quality and processing traits [5]. Therefore, GS is currently the best option for breeding for end-use quality traits after considering time, cost, and seed amount.
Genomic selection has been explored in several hard wheat end-use quality studies using traditional genomic best linear unbiased prediction (GBLUP), Bayes A, Bayes B, Bayes C, and Bayes Cpi, with mixed results, where one model performed best for one trait and not for another [19,20]. Machine and deep learning models have opened up an entirely new platform for plant breeders, and exploring them in breeding programs could accelerate the pace of genetic gain. Deep learning models have shown higher prediction accuracies for different complex traits in wheat [21], rice (Oryza sativa L.) [22], soybean (Glycine max L.) [23], and maize (Zea mays L.) [24]. Sandhu et al. [21] showed that two deep learning models, namely, the convolutional neural network (CNN) and multilayer perceptron (MLP), gave 1–5% higher prediction accuracy than BLUP-based models. Ma et al. [25] and Montesinos-López et al. [26,27] obtained similar results when predicting quantitative traits in wheat and suggested that deep learning models should be explored due to their better prediction accuracies. To the best of our knowledge, this is the first study exploring the potential of deep learning models for predicting end-use quality traits in wheat.
This study explored the potential of GS for end-use quality traits using multi-environment data from 2015–19 in a soft white winter wheat breeding program. We explored nine different GS models, spanning BLUP-based, Bayesian, and machine and deep learning approaches, to predict fourteen different end-use quality traits. The main objectives of this study were to (1) optimize machine and deep learning models for predicting end-use quality traits, (2) compare the prediction ability of the nine GS models for fourteen end-use quality traits using cross-validation approaches, and (3) assess the potential of GS for forward and across-location predictions using previous years’ training data in the breeding program.

2. Materials and Methods

2.1. Germplasm

A total of 666 soft white winter wheat lines were evaluated for five years (2015–19) at two locations, Pullman and Lind, WA, USA. These 666 genotypes consist of F4:5-derived lines, doubled haploid lines, and lines in preliminary and advanced yield trials, screened as part of the Washington State University winter wheat breeding program. The F4:5-derived and doubled haploid lines were screened for agronomic and disease resistance traits, and the superior genotypes were tested for end-use quality. Lines in preliminary and advanced yield trials were selected for superior yield and later advanced to end-use quality phenotyping. Some genotypes were replicated at a single location per year, whereas others were un-replicated, creating an unbalanced dataset [28]. As this was a breeding population under continuous selection, very few lines other than the check cultivars were common across all years.

2.2. Phenotyping

Fourteen different end-use quality and processing traits were measured, and data were obtained from the USDA-ARS Western Wheat & Pulse Quality Laboratory, Pullman, WA. All these traits were measured following the guidelines of the American Association of Cereal Chemists International (AACCI 2008). These fourteen traits were divided into four categories: milling traits, baking parameters, grain characteristics, and flour parameters. The complete summary of each trait, number of observations, mean, standard error, and heritability is provided in Table 1 and Table 2.
Grain characteristics, namely kernel size (KSIZE), kernel weight (KWT), and kernel hardness (KHRD), were determined using 200 seeds per sample with an SKCS 4100 (Perten Instruments, Springfield, IL, USA) (AACC Approved Method 55-31.01). Grain protein content (GPC) was measured using an NIR analyzer (Perten Instruments, Springfield, IL, USA) (AACC Approved Method 39-10.01). Test weight (TWT) was obtained as weight/volume following AACC Approved Method 55-10.01.
Three milling traits, namely flour yield (FYELD), break flour yield (BKYELD), and milling score (MSCOR), were obtained using a Quadrumat Senior experimental mill (Brabender, South Hackensack, NJ, USA). FYELD was determined as the ratio of total flour weight (mids + break flour) to the initial sample weight using a single pass through the Quadrumat break roll unit. BKYELD was estimated as the percent of milled product passing through a 94-mesh screen per unit grain weight. Flour ash (FASH) was obtained using AACC Approved Method 08-01.01. MSCOR was calculated using the formula MSCOR = (100 − (0.5(16 − 13.0 + (80 − FYELD) + 50(FASH − 0.30))) × 1.274) − 21.602, showing that this trait is a function of FYELD and FASH content. To evaluate baking parameters, cookie diameter (CODI) was measured using AACC Approved Method 10-52.02.
Four different flour parameters, namely flour protein (FPROT), solvent retention capacity in water (FSRW), flour swelling volume (FSV), and flour sodium dodecyl sulfate sedimentation (FSDS), were measured on the extracted flour. FPROT was measured following AACC Approved Method 39-11.01. FSRW measures the water retention capacity of gluten, gliadins, starch, and arabinoxylans using AACC Approved Method 56-11.02. The FSDS test was used to measure the strength of gluten following AACC Approved Method 56-60.01. The FSV test assesses starch composition following AACC Approved Method 56-21.01 [29].

2.3. Statistical Analysis

Due to the unbalanced nature of the dataset, adjusted means for the within-environment analysis were calculated from the residuals obtained with the lme4 R package. The model is represented as
Y_{ij} = Block_i + Check_j + e_{ij}
where Y_ij is the raw phenotype; Check_j is the effect of the jth replicated check cultivar; Block_i is the fixed effect of the ith block; and e_ij is the residual [30,31]. Block was treated as fixed because we wanted to remove that component of variation before exploring the genetic variation. Residuals from the model were used to calculate the adjusted means (line effects).
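As an illustration, a minimal Python sketch of this adjustment is given below (the study fit the model with the lme4 R package); the data-frame column names (phenotype, block, check) are hypothetical placeholders for the trial layout.

```python
# Illustrative sketch only: the study fit this model with the lme4 R package.
# Column names ("phenotype", "block", "check") are hypothetical placeholders;
# unreplicated test lines are assumed to share a common "check" level.
import pandas as pd
import statsmodels.formula.api as smf

def within_environment_adjusted_means(df: pd.DataFrame) -> pd.Series:
    """Fit Y_ij = Block_i + Check_j + e_ij with fixed effects and return the
    residuals, which serve as the adjusted means (line effects)."""
    fit = smf.ols("phenotype ~ C(block) + C(check)", data=df).fit()
    return fit.resid
```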
Adjusted means across environments were calculated following the method implemented in Sandhu et al. [18,32], using the model
Y_{ijk} = \mu + Block_i + Check_j + Env_k + Block_i \times Env_k + Check_j \times Env_k + e_{ijk}
where Y_ijk is the raw phenotype; Block_i, Check_j, and Env_k are the fixed effects of the ith block, jth check, and kth environment; and e_ijk is the residual. Residuals from the model were used to calculate the adjusted means (line effects).
Best linear unbiased predictors (BLUPs) for individuals within and across environments were used to obtain the variance components for estimating broad-sense heritability. Heritability was estimated as
H_C^2 = 1 - \frac{\bar{v}_{\Delta}^{\mathrm{BLUP}}}{2 \hat{\sigma}_g^2}
where H²_C is the Cullis heritability; σ²_g is the genotypic variance; and v̄_Δ^BLUP is the mean variance of a difference of two genotype BLUPs [33].
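A minimal sketch of this calculation is given below, assuming the genotypic variance and the mean variance of a difference of two genotype BLUPs have already been extracted from the fitted mixed model.

```python
def cullis_heritability(mean_var_blup_diff: float, var_genotype: float) -> float:
    """Cullis broad-sense heritability: H2_C = 1 - v_bar / (2 * sigma2_g),
    where v_bar is the mean variance of a difference of two genotype BLUPs."""
    return 1.0 - mean_var_blup_diff / (2.0 * var_genotype)

# Hypothetical variance components for illustration:
# cullis_heritability(0.8, 2.5)  # -> 0.84
```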

2.4. Genotyping

The whole population was genotyped using genotyping-by-sequencing (GBS) through the North Carolina State University (NCSU) Genomic Sciences Laboratory, Raleigh, NC, using the restriction enzymes PstI and MspI [34]. An LGC Biosearch Technologies Oktopure robotic platform with sbeadex magnetic microparticle reagent kits was used to extract DNA from the leaves of ten-day-old seedlings. Thermo Fisher (Waltham, MA, USA) Quant-iT PicoGreen assays were used to quantify the DNA, and the samples were normalized to 20 ng/µL. The restriction enzymes PstI and MspI were used for sample fragmentation, and the digested samples were ligated to barcode adapters with T4 ligase. The pooled samples were amplified by PCR following Poland et al. [34], and sequencing was performed at the NCSU Genomic Sciences Laboratory. The Burrows-Wheeler Aligner (BWA) 0.7.17 was used to align the sequences to the Chinese Spring (IWGSC) RefSeq v1.0 reference genome [35]. TASSEL v5 was used for SNP discovery and calling [36]. A quality filtering pipeline implemented in R removed markers with a minor allele frequency below 5%, more than 30% missing data, or heterozygosity above 15%. Missing SNP calls were imputed using the expectation-maximization algorithm in the R package rrBLUP [37]. After filtering, 40,518 SNPs remained and were used for population structure analysis and genomic prediction [28].
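A minimal Python sketch of these marker-filtering thresholds is shown below; the study implemented the pipeline in R and imputed missing calls with the EM algorithm in rrBLUP, and the {-1, 0, 1} marker coding with NaN for missing calls is an assumption for illustration.

```python
import numpy as np

def marker_filter(geno: np.ndarray, min_maf=0.05, max_missing=0.30, max_het=0.15):
    """Return a boolean mask over markers (columns) of an individuals x markers
    matrix coded -1/0/1 with np.nan for missing calls, applying the thresholds
    described in the text (MAF >= 5%, missing <= 30%, heterozygosity <= 15%)."""
    not_missing = ~np.isnan(geno)
    missing = 1.0 - not_missing.mean(axis=0)
    het = (geno == 0).sum(axis=0) / not_missing.sum(axis=0)
    p = (np.nanmean(geno, axis=0) + 1.0) / 2.0      # frequency of the +1 allele
    maf = np.minimum(p, 1.0 - p)
    return (maf >= min_maf) & (missing <= max_missing) & (het <= max_het)
```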

2.5. Genomic Selection Models

We explored the performance of five parametric and four non-parametric models for all fourteen traits evaluated in this study. The parametric models were RRBLUP, Bayes A, Bayes B, Bayes C, and Bayes Lasso. The non-parametric models included two machine learning and two deep learning models. Complete information on these models and the optimization process is provided below.

2.5.1. Ridge Regression Best Linear Unbiased Prediction (RRBLUP)

RRBLUP was included as the benchmark for comparison with the other models due to its frequent use in wheat breeding and ease of implementation. The model assumes that all markers contribute to the trait and that marker effects have a constant variance. Marker effects and variances are estimated with restricted maximum likelihood (REML) based on the phenotypic and marker data [37]. The RRBLUP model was implemented with the R package rrBLUP using the mixed.solve function. The model can be represented as
y = µ + Zu + e
where µ is the overall mean; y is the vector of adjusted means; u is a vector of normally distributed random marker effects with constant variance, u ~ N(0, Iσ²_u); Z is the N × M marker matrix; and e is the residual error, distributed as e ~ N(0, Iσ²_e). The solution of the mixed model equations can be written as
\hat{u} = Z^T (Z Z^T + \lambda I)^{-1} y
where û, Z, and y are defined above; I is an identity matrix; and λ = σ²_e/σ²_u is the ridge regression parameter [37].
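A numpy sketch of this closed-form solution is shown below; mixed.solve in rrBLUP estimates λ by REML, whereas here it is supplied directly, and centering y by its mean is a simplifying assumption.

```python
import numpy as np

def rrblup_marker_effects(Z: np.ndarray, y: np.ndarray, lam: float) -> np.ndarray:
    """Ridge-regression BLUP marker effects, u = Z'(ZZ' + lambda*I)^-1 (y - mean(y)).
    Z is an N x M marker matrix and y the vector of adjusted means; lam is
    sigma2_e / sigma2_u, which rrBLUP's mixed.solve would estimate by REML."""
    n = Z.shape[0]
    V = Z @ Z.T + lam * np.eye(n)
    return Z.T @ np.linalg.solve(V, y - y.mean())

# GEBVs of newly genotyped lines with marker matrix Z_new:
# gebv = y.mean() + Z_new @ rrblup_marker_effects(Z, y, lam)
```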

2.5.2. Bayesian Models

We implemented four different Bayesian models, namely, Bayes Lasso, Bayes A, Bayes B, and Bayes C. These models assume different prior distributions for estimating marker effects and variances. Bayes A applies a scaled inverse chi-squared prior for the marker variances. Bayes B provides a more realistic scenario for breeding by assuming that not all markers contribute to the phenotype; it applies a mixture prior with a high probability mass at zero, with the remaining effects following a scaled-t distribution. Bayes C uses a mixture prior with a point mass at zero and a Gaussian slab, and Bayes Lasso uses a double exponential (Laplace) prior [38]. All the Bayesian models were implemented in the BGLR R package using the model equation
y_i = \mu + \sum_{j=1}^{p} x_{ij} \beta_j + \epsilon_i
where µ is the overall mean; y_i is the adjusted mean of the ith line; x_ij is the genotype of the ith line at the jth marker; β_j is the jth marker effect; and ϵ_i is the residual. Each Bayesian model has its own conditional prior distribution. Analysis was performed with 30,000 Markov chain Monte Carlo iterations and 10,000 burn-in iterations [38].
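The Bayesian models were fit with BGLR in R; as a rough Python stand-in for the general whole-genome regression setup, the sketch below uses scikit-learn's BayesianRidge on a toy marker matrix. It does not reproduce the variable-selection (point-mass) priors of Bayes B or Bayes C.

```python
import numpy as np
from sklearn.linear_model import BayesianRidge

# Toy data for illustration only: 200 lines x 500 markers, 10 causal markers.
rng = np.random.default_rng(0)
Z = rng.choice([-1.0, 0.0, 1.0], size=(200, 500))
y = Z[:, :10] @ rng.normal(size=10) + rng.normal(scale=0.5, size=200)

# BayesianRidge places Gaussian priors with estimated precisions on all marker
# effects, closer in spirit to RRBLUP than to the BGLR mixture priors used in
# the study.
model = BayesianRidge().fit(Z[:150], y[:150])
gebv = model.predict(Z[150:])
```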

2.5.3. Random Forests (RF)

RF builds a large collection of identically distributed regression trees and averages their predictions for the final prediction. Different bootstrap samples are drawn from the training set, and the best feature subsets are identified for splitting the tree nodes. The main criterion for splitting at a node is minimizing the loss function within each bootstrapped sample [39]. The model is represented as
\hat{y}_i = \frac{1}{B} \sum_{b=1}^{B} T_b(x_i)
where ŷ_i is the predicted value of the individual with genotype x_i; T_b is the regression tree grown on the bth bootstrap sample; and B is the number of bootstrap samples (trees). The main steps of model fitting include
  • Bootstrap sampling (b = 1, …, B) to select genotypes with replacement from the training set; an individual can appear once or several times in a sample.
  • Selecting the best set of features (SNP_j, j = 1, …, J) to minimize the mean squared error (MSE), using the max features setting of the random forest regression library.
  • Splitting at each node of the tree on the SNP_j genotype that lowers the MSE.
  • Repeating the above steps until a maximum tree depth or minimum node size is reached. The final predicted value for an individual with genotype x_i is the average of the values from the set of trees in the forest.
The important hyperparameters for model training include the depth of the trees, the importance of each feature, the number of features sampled at each split, and the number of trees. Randomized grid search cross-validation was used for hyperparameter optimization. The hyperparameter combinations tried included max depth (40, 60, 80, 100), max features (auto, sqrt), and number of trees (200, 300, 500, 1000) [40]. The scikit-learn random forest regression library in Python 3.7 was used for the analysis [41].
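A minimal scikit-learn sketch of this randomized search over the stated grid follows; 1.0 stands in for the "auto" max_features value (all features), which recent scikit-learn releases no longer accept for regressors, and the toy marker matrix is a placeholder for the real genotype data.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import RandomizedSearchCV

param_grid = {
    "max_depth": [40, 60, 80, 100],
    "max_features": [1.0, "sqrt"],          # 1.0 replaces the deprecated "auto"
    "n_estimators": [200, 300, 500, 1000],
}

rng = np.random.default_rng(1)
Z = rng.choice([-1.0, 0.0, 1.0], size=(200, 500))   # toy lines x markers matrix
y = Z[:, :10] @ rng.normal(size=10) + rng.normal(scale=0.5, size=200)

search = RandomizedSearchCV(
    RandomForestRegressor(random_state=0),
    param_distributions=param_grid,
    n_iter=10, cv=5, scoring="neg_mean_squared_error", random_state=0,
)
search.fit(Z, y)
best_rf = search.best_estimator_          # tuned model used for prediction
```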

2.5.4. Support Vector Machine (SVM)

SVM uses a non-linear kernel to map the predictor space into a high-dimensional feature space for studying the relationship between marker genotypes and phenotypes. The model is represented as
f(x) = w \cdot x + b
where f(x) is the learning function; b is a constant reflecting the maximum allowed bias; w is the unknown weight vector; and x is the marker set. The learning function is obtained by minimizing the loss
C \sum_{i=1}^{n} L(e_i) + \frac{1}{2} \lVert w \rVert^2
where C is a positive regularization parameter; ‖w‖² represents model complexity; e_i = y_i − f(x_i) is the error associated with the ith training data point; and L is the loss function [42].
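A brief scikit-learn sketch of support vector regression with a non-linear (RBF) kernel follows; the kernel choice, C, and epsilon values are illustrative, not the tuned settings from the study.

```python
import numpy as np
from sklearn.svm import SVR

rng = np.random.default_rng(2)
Z = rng.choice([-1.0, 0.0, 1.0], size=(200, 500))   # toy lines x markers matrix
y = Z[:, :10] @ rng.normal(size=10) + rng.normal(scale=0.5, size=200)

# C is the positive regularization parameter from the loss above; the RBF kernel
# provides the non-linear mapping to a high-dimensional feature space.
svr = SVR(kernel="rbf", C=1.0, epsilon=0.1).fit(Z[:150], y[:150])
gebv = svr.predict(Z[150:])
```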

2.5.5. Multilayer Perceptron (MLP)

MLP is a feed-forward deep learning model that uses three types of layers, namely input, hidden, and output layers, to map the genotype–phenotype relationship. These layers are connected by a dense network of neurons, where each neuron has its characteristic weight. MLP uses a combination of neurons, activation functions, learning rate, hidden layers, and regularization to predict phenotypes. The input layer corresponds to the SNP genotypes, while neurons connect successive hidden layers with associated strengths (weights). The output of the ith hidden layer is represented as
Z_i = b^{(i-1)} + W_i \, f^{(i-1)}(x)
where Z_i is the output of the ith hidden layer; b^(i−1) is the bias term used in estimating the neuron weights; f^(i−1) represents the activation function applied to the previous layer; and W_i is the weight matrix associated with the neurons. This process is repeated up to the output layer.
Grid search cross-validation and Keras’s built-in functions were used to optimize the hyperparameters, and the hyperparameters giving the lowest MSE were used for prediction [43]. Regularization, dropout, and early stopping were applied to control overfitting. Further information on hyperparameter optimization and the deep learning models can be found in [21,31].
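A minimal Keras sketch of such an MLP with L2 regularization, dropout, and early stopping is shown below; the layer sizes, dropout rate, and learning rate are illustrative placeholders rather than the tuned hyperparameters from the study.

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

rng = np.random.default_rng(3)
Z = rng.choice([-1.0, 0.0, 1.0], size=(200, 500)).astype("float32")  # toy markers
y = (Z[:, :10] @ rng.normal(size=10)).astype("float32")

model = keras.Sequential([
    keras.Input(shape=(Z.shape[1],)),
    layers.Dense(256, activation="relu",
                 kernel_regularizer=keras.regularizers.l2(1e-4)),
    layers.Dropout(0.2),
    layers.Dense(64, activation="relu"),
    layers.Dense(1),                       # predicted GEBV
])
model.compile(optimizer=keras.optimizers.Adam(1e-3), loss="mse")
stop = keras.callbacks.EarlyStopping(patience=10, restore_best_weights=True)
model.fit(Z, y, validation_split=0.2, epochs=200, batch_size=32,
          callbacks=[stop], verbose=0)
```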

2.5.6. Convolutional Neural Network (CNN)

CNN is a special type of deep learning model that accounts for spatial patterns among the input features. Details of the CNN model, its implementation, and hyperparameter optimization can be found in previous publications [21,31]. A combination of input, convolutional, pooling, dense, flatten, dropout, and output layers was used for prediction. As with MLP, hyperparameters were optimized using grid search cross-validation to select the number of filters, activation function, solver, batch size, and learning rate. Regularization, dropout, and early stopping were applied to control overfitting. All deep learning algorithms were implemented using the scikit-learn and Keras libraries [44,45].
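For comparison, a minimal Keras sketch of a 1D CNN over the ordered SNP sequence is given below; the filter counts, kernel size, and dropout rate are illustrative placeholders.

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

rng = np.random.default_rng(4)
Z = rng.choice([-1.0, 0.0, 1.0], size=(200, 500)).astype("float32")
y = (Z[:, :10] @ rng.normal(size=10)).astype("float32")
Z3d = Z[..., np.newaxis]                   # (lines, markers, 1 channel)

model = keras.Sequential([
    keras.Input(shape=(Z.shape[1], 1)),
    layers.Conv1D(32, kernel_size=5, activation="relu"),
    layers.MaxPooling1D(pool_size=2),
    layers.Flatten(),
    layers.Dropout(0.2),
    layers.Dense(64, activation="relu"),
    layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")
stop = keras.callbacks.EarlyStopping(patience=10, restore_best_weights=True)
model.fit(Z3d, y, validation_split=0.2, epochs=100, batch_size=32,
          callbacks=[stop], verbose=0)
```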

2.6. Prediction Accuracy and Cross-Validation Scheme

Prediction accuracy was evaluated using five-fold cross-validation, where 20% of the data were used for testing and the remaining 80% for training within each environment. One hundred replications were performed to assess each model’s performance; each replicate consisted of five iterations in which the data were split into five different groups. Prediction accuracy was reported as the Pearson correlation coefficient between the observed phenotypes and the GEBVs. Separate analyses were performed for the two locations using the cross-validation approach to assess model performance.
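The sketch below illustrates this scheme for any scikit-learn-style estimator (the Keras models would need a thin fit/predict wrapper); the replicate and fold counts mirror the description above.

```python
import numpy as np
from sklearn.model_selection import KFold

def cv_accuracy(model, Z, y, n_splits=5, n_reps=100, seed=0):
    """Repeated five-fold cross-validation; accuracy is the Pearson correlation
    between observed phenotypes and GEBVs in each 20% held-out fold."""
    accs = []
    for rep in range(n_reps):
        kf = KFold(n_splits=n_splits, shuffle=True, random_state=seed + rep)
        for train, test in kf.split(Z):
            model.fit(Z[train], y[train])
            gebv = model.predict(Z[test])
            accs.append(np.corrcoef(y[test], gebv)[0, 1])
    return float(np.mean(accs))
```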
Independent or forward predictions were performed by training the models on previous years’ data and predicting future environments (i.e., 2015 data from Lind were used to predict 2016; 2015 and 2016 data to predict 2017, and so on for both locations). Finally, we predicted the 2019 environment at each location by training on the whole dataset from the other location (i.e., 2015–19 data from Lind were used to predict 2019 in Pullman). Forward prediction represents the realistic prediction scenario in breeding programs, where previous data are used to predict future environments. Due to the computational burden, all GS models were run on the Kamiak high-performance computing cluster (https://hpc.wsu.edu/, accessed on 10 April 2021).
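A short sketch of the forward-prediction split follows; the years vector giving each line's trial year is a hypothetical layout of the data.

```python
import numpy as np

def forward_prediction_accuracy(model, Z, y, years, target_year):
    """Train on all observations from years before target_year and report the
    Pearson correlation between observed and predicted values in that year."""
    train, test = years < target_year, years == target_year
    model.fit(Z[train], y[train])
    return np.corrcoef(y[test], model.predict(Z[test]))[0, 1]
```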

3. Results

3.1. Phenotypic Data Summary

Table 1 provides the number of lines screened for end-use quality traits across years at Lind and Pullman. In total, 1335 lines were phenotypically screened for end-use quality traits across the five years (2015–19) at the two locations (Table 1). Overall, Pullman had more lines than Lind in each year. Summary statistics, including the mean, minimum, maximum, standard error, and heritability, are provided for all fourteen end-use quality traits (Table 2). Broad-sense heritability ranged from 0.56 to 0.93 across traits. All traits were highly heritable except GPC and FPROT (Table 2).
Significant positive and negative correlations were observed among the traits (Figure 1). Positive correlations were observed between FYELD and BKYELD, KSIZE and KWT, GPC and FPROT, FSDS and FPROT, GPC and FSDS, and FSRW and KHRD (Figure 1). Similarly, negative correlations were seen between FASH and MSCOR, CODI and KHRD, GPC and FSV, FSDS and CODI, and CODI and FSRW (Figure 1). Most traits were not strongly correlated with each other, suggesting that a single quality trait cannot substitute for the others; hence, measurements of all of them are required for selection decisions.

3.2. Cross-Validation Genomic Selection Accuracy and Model Comparison

Complete datasets across the years from Pullman and Lind were used to predict the fourteen end-use quality traits with the nine different models (Table 3, Figure 2). Five-fold cross-validation was performed to compare the models at both locations. Prediction accuracy at Pullman varied from 0.52–0.81 across traits and models. The highest prediction accuracies were 0.81 for KWT and KSIZE with the RF and MLP models at Pullman (Figure 2). The lowest prediction accuracies at Pullman were for GPC, FASH, FPROT, and FSRW across the different GS models (Table 3). The highest prediction accuracy for each trait is bolded for comparison with other models (Table 3). For the fourteen end-use quality traits evaluated at Pullman, the deep learning models, namely MLP and CNN, performed best for eight traits, demonstrating their potential for incorporation into breeding programs (Table 3). RF and SVM performed best for three and four of the fourteen traits, respectively, while RRBLUP performed best for only one trait (Table 3 and Figure 2).
Prediction accuracies (0.45–0.70) within the Lind dataset were lower than at Pullman for all traits (Table 3). Similar to Pullman, the highest cross-validation prediction accuracy (0.70) was obtained for KWT at Lind. The lowest prediction accuracies were obtained for GPC, FPROT, and FSRW using the Bayesian models (Table 3). Machine and deep learning models performed best for twelve of the fourteen end-use quality traits (Figure 2). Table 3 provides the average performance of all models, and the machine and deep learning models outperformed RRBLUP and all the Bayesian models. On average, machine and deep learning models performed 10% and 5% better than the Bayesian and RRBLUP models, respectively.

3.3. Forward Predictions

Forward predictions were assessed to examine how training set size affects the ability to predict phenotypes in future years. Figure 3 and Figure 4 show the results for forward predictions at Pullman and Lind when combined data from the previous years were used to predict the phenotypes. The x-axis represents the year for which predictions were made while training the models on all previous years’ phenotypic data (Figure 3 and Figure 4; Table S1). Prediction accuracy increased gradually for all traits as the training data size increased, and the same trend was observed at both locations (Figure 3 and Figure 4). The largest improvements in prediction accuracy were observed for GPC, FPROT, FASH, and FSDS, owing to the complex nature of these traits and demonstrating the importance of training set size. Similar to the cross-validation prediction accuracies (Table 3), the highest forward prediction accuracies were obtained with machine and deep learning models, especially as the training data size increased (Figure 3 and Figure 4). The Bayesian models performed worst for all traits at both locations, even when the training data size was increased.
Forward predictions in 2019 were, on average, 32% and 29% greater than forward predictions in 2016 for Pullman and Lind, respectively (Figure 3 and Figure 4). The largest improvement in forward prediction from 2016 to 2019 was from 0.35 to 0.55 for CODI, while the smallest was from 0.26 to 0.29 for KWT (Figure 3). The greatest improvement was seen for MLP and CNN, demonstrating that as the training data size increases, deep learning models yield the largest gains in prediction accuracy. Furthermore, cross-validation prediction accuracies were, on average, 34% and 32% higher than the 2019 forward predictions for Pullman and Lind, respectively (Table 3, Figure 3 and Figure 4), suggesting that cross-validation scenarios over-inflate prediction accuracies.

3.4. Across Location Predictions

Across-location predictions were performed where data from Lind were used to train the models for predicting performance in Pullman and vice versa. Owing to the Bayesian models’ poor performance and computational burden in the cross-validation and forward predictions, these models were excluded from the across-location predictions. Figure 5 and Table 4 show the prediction accuracies for all fourteen end-use quality traits when predictions were made for 2019_Pullman by models trained on the whole Lind dataset and vice versa. The across-location prediction accuracies were, on average, 16% and 47% lower than the forward and cross-validation prediction accuracies, respectively, demonstrating the importance of including genotype-by-environment interaction components in GS models for across-location and across-environment predictions.
Deep learning models performed best for across-location prediction compared to RRBLUP and the machine learning models (Table 4 and Figure 5). The highest prediction accuracy was 0.50 for FYELD with an MLP model when predicting 2019_Pullman (Table 4). The lowest prediction accuracies were for MSCOR, GPC, and FSV with the RRBLUP model when predicting 2019_Pullman (Table 4). Of the four models used, deep learning models best predicted twelve end-use quality traits under the 2019_Pullman scenario, while RF performed best for the remaining two traits (Table 4). In the 2019_Lind predictions, the highest accuracy was 0.50 for FYELD with the MLP model, and the lowest accuracies were for GPC and MSCOR with the RRBLUP model. Similar to 2019_Pullman, deep learning models performed best for eleven of the fourteen traits evaluated in 2019_Lind.

4. Discussion

Selection for end-use quality traits is often more difficult than selection for grain yield, disease resistance, and agronomic performance due to the cost, labor, and seed quantity requirements [46]. Phenotyping for quality traits is usually delayed until later generations, resulting in small population sizes and unbalanced datasets [16]. This study explored the potential of GS, especially machine and deep learning models, for predicting fourteen different end-use quality traits using five years (2015–19) of phenotyping data from a winter wheat breeding program. Prediction accuracy in this study varied from 0.27–0.81, demonstrating the potential for implementation in the breeding program. We observed that forward and across-location prediction accuracies could be increased using deep and machine learning models without explicitly accounting for genotype-by-environment interaction, environmental covariates, or kernel matrices as in traditional GS models. Furthermore, QTLs or major genes controlling quality traits are typically already fixed within a particular market class or breeding program; hence, GS is the best substitute for marker-assisted selection, exploring different combinations of QTL to achieve the best variety [47].
The broad-sense heritability of the end-use quality traits evaluated varied from 0.56 to 0.93, with the majority above 0.80. Similar heritability values were obtained by Michel et al. [48], Jernigan et al. [49], and Kristensen et al. [50] for different baking and flour yield parameters of winter wheat. These intermediate to high heritability estimates suggest that most of the variation in these traits is attributed to genetic factors and is considerably less affected by environment and genotype-by-environment interactions [51]. Therefore, GS is the best option for predicting these traits because the models capture most of the additive genetic variation, as reflected in the intermediate to high prediction accuracies observed for the different quality traits in this study. We observed that only a few grain and flour assessment traits were correlated. These low correlations among most end-use quality traits strengthen the conclusion that no single quality parameter can guide final variety selection; many are needed [1]. Only three end-use quality traits, namely GPC, FPROT, and FSV, had intermediate heritability values, as also reported in previous studies, owing to their complex, polygenic inheritance [32,52]. Similarly, the comparatively low prediction accuracies obtained for these traits support the inclusion of genotype-by-environment interaction or environmental covariates for their prediction [53].
Cross-validation prediction accuracies were, on average, 34% and 32% higher than the 2019 forward predictions for Pullman and Lind, respectively. The higher cross-validation prediction accuracies compared to forward and across-location predictions suggest the importance of including larger training sets, genotype-by-environment interactions, and environmental covariates to exploit the maximum variation when making predictions [54]. The higher accuracies obtained in cross-validation show that most of those values are over-inflated, and caution is required before using such large values to justify adopting GS in a breeding program [55]. Cross-validation approaches include training and testing sets from the same environment and thus account for environmental variation in prediction. Moreover, most lines evaluated in breeding programs are closely related or full sibs, which confounds cross-validation approaches: full sibs may appear in both the training and testing sets, inflating prediction accuracies [56]. The relationship between individuals in the training and testing sets profoundly affects model performance, with closer relationships resulting in higher accuracy. Forward and across-location predictions are therefore the best methods for evaluating the value of GS implementation in a breeding program [57,58].
The continuous increase in forward prediction accuracy with all nine models demonstrates the importance of a large training population and more environments for training the GS model [59]. He et al. [60] and Battenfield et al. [16] observed a similar increase in forward prediction for spring wheat end-use quality traits. Similarly, Meuwissen et al. [61] suggested updating the GS model with a large training population every cycle to increase prediction accuracy; they observed a rise in genetic gain for fertility, longevity, milk production, and other traits in dairy cattle by following this approach. Deep learning models showed the greatest improvement in forward prediction accuracy when more training data and new environments were included, supporting the importance of big data for their best performance [62]. Furthermore, across-location predictions were best with deep learning models, which could be attributed to these models capturing genetic, environmental, and genotype-by-environment interaction components without explicit fitting [63]. Across-location prediction could be further improved by including genotype-by-environment interactions or environmental covariates, such as weather or soil parameters, in the GS models to make across-location and across-environment selections [53,64].
We observed differences in model prediction accuracies under all scenarios evaluated in this study, with machine and deep learning models performing better than the Bayesian and RRBLUP models. This difference in model performance is attributed to the different genetic architecture of each trait, which depends on the heritability and the number of QTLs controlling that trait [65]. Similar results were obtained in various other studies showing the superiority of machine learning models over conventional Bayesian models in wheat [66,67,68]. Hu et al. [69] showed that random forest outperformed the Bayesian and RRBLUP models for predicting thousand kernel weight, grain protein content, and sedimentation volume in wheat under a forward prediction scenario, further strengthening our finding that machine and deep learning models should be explored under such conditions. Furthermore, we observed that highly heritable traits in this study had higher prediction accuracies than moderately heritable traits, suggesting that in addition to genetic architecture, the heritability of a trait also plays an important role in final prediction accuracy [52,70].
Machine and deep learning models performed better than all Bayesian and RRBLUP models under cross-validation, forward, and across-location predictions. The higher prediction accuracy of deep and machine learning models is attributed to their flexibility in deciphering complex interactions between responses and predictors, capturing different trends present in the datasets, compared with the purely additive variation modeled in conventional GS models [71]. Deep and machine learning models explore the whole feature space during training using different sets of neurons, activation functions, and various other hyperparameters to identify the pattern giving the best predictions, whereas Bayesian models rely on a pre-selected prior distribution for final predictions. Furthermore, most traits were predicted best by different machine and deep learning models, reflecting the genetic architecture of each trait. Some studies in wheat reported that all models give the same prediction accuracy irrespective of the model used, while others support the superiority of different models for different traits [72,73]. Ma et al. [25] and Montesinos-López et al. [26] obtained similar results for predicting quantitative traits in wheat and suggested that deep learning models should be explored due to their better prediction accuracies.
It is often believed that machine and deep learning models require very large training datasets, which are seldom available for end-use quality traits evaluated at later stages of the breeding process. However, this and other studies have shown that even small datasets can give equivalent or superior performance to traditional parametric GS models [21,25,74]. Moreover, Bellot et al. [75] used a training set of 100,000 individuals and showed no advantage of deep learning models over conventional GS models. Montesinos-López et al. [76] and Liu et al. [23] showed the superiority of different deep learning algorithms over conventional GS models using population sizes of 268 wheat and 4294 soybean lines, respectively. These results provide evidence that training set size plays a minor role in prediction compared to the genetic architecture of the trait. The main issue with using a small dataset for deep learning models is overfitting, which results in the model’s failure to learn the true pattern in the dataset [71]. Herein, we used regularization and dropout, which remove a certain number of neurons during model training, to avoid overfitting [44,77].

5. Conclusions

We assessed the potential of machine and deep learning genomic selection models for predicting fourteen different end-use quality traits at two locations in a soft white winter wheat breeding program. Cross-validation, forward, and across-location prediction scenarios were evaluated to compare the models and assess the utility of this approach in the breeding program. Owing to limited seed availability, time constraints, and the associated cost, phenotyping for quality traits is delayed to later generations. However, the high accuracy of the prediction models observed in this study suggests that selections can be performed earlier in the breeding process. Machine and deep learning models performed better than the Bayesian and RRBLUP genomic selection models and can be adopted for use in plant breeding programs, regardless of dataset size. Furthermore, the increase in forward prediction accuracy with the addition of more lines to the training set indicates that genomic selection models should be updated every year for the best prediction accuracy. Overall, this and previous studies show the benefit of implementing genomic selection with machine and deep learning models for different complex traits in large-scale breeding programs using phenotypic data collected in previous years.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/biology10070689/s1, Table S1: Forward prediction accuracies for both locations with all the nine models.

Author Contributions

Conceptualization: K.S.S., M.A., and A.H.C.; writing original draft: K.S.S.; data analysis: K.S.S.; genotyping curation and filtration: M.A.; review and editing: K.S.S., M.A., C.F.M., and A.H.C.; resources: C.F.M. and A.H.C.; supervision and funding: A.H.C. All authors have read and agreed to the published version of the manuscript.

Funding

This project was supported by the Agriculture and Food Research Initiative Competitive Grant 2017-67007-25939 (WheatCAP), the Washington Grain Commission, the O.A. Vogel Wheat Research Fund from Washington State University, Hatch project 1014919, and USDA-ARS CRIS Project 2090-43440008-00D.

Institutional Review Board Statement

Not applicable.

Acknowledgments

The authors would like to thank Kerry Balow, Adrienne Burke, Gary Shelton, and Kyall Hagemeyer for assisting in population development, genotyping, and field plot maintenance. Phenotypic data were generated by the staff of the USDA ARS Western Wheat & Pulse Quality Lab for which we are deeply grateful.

Conflicts of Interest

The authors declare that the research was conducted in the absence of any financial or commercial interests.

References

  1. Souza, E.J.; Guttieri, M.J.; Graybosch, R.A. Breeding wheat for improved milling and baking quality. J. Crop Prod. 2002, 5, 39–74. [Google Scholar] [CrossRef]
  2. Kiszonas, A.M.; Fuerst, E.P.; Morris, C.F. A comprehensive survey of soft wheat grain quality in U.S. germplasm. Cereal Chem. J. 2013, 90, 47–57. [Google Scholar] [CrossRef]
  3. Bhave, M.; Morris, C.F. Molecular genetics of puroindolines and related genes: Allelic diversity in wheat and other grasses. Plant Mol. Biol. 2008, 66, 205–219. [Google Scholar] [CrossRef] [PubMed]
  4. Kiszonas, A.M.; Morris, C.F. Wheat breeding for quality: A historical review. Cereal Chem. 2018, 95, 17–34. [Google Scholar] [CrossRef] [Green Version]
  5. Guzman, C.; Peña, R.J.; Singh, R.; Autrique, E.; Dreisigacker, S.; Crossa, J.; Rutkoski, J.; Poland, J.; Battenfield, S. Wheat quality improvement at CIMMYT and the use of genomic selection on it. Appl. Transl. Genom. 2016, 11, 3–8. [Google Scholar] [CrossRef] [Green Version]
  6. Morris, C.F.; Li, S.; King, G.E.; Engle, D.A.; Burns, J.W.; Ross, A.S. A comprehensive genotype and environment assessment of wheat grain ash content in oregon and washington: Analysis of variation. Cereal Chem. J. 2009, 86, 307–312. [Google Scholar] [CrossRef]
  7. Kiszonas, A.M.; Fuerst, E.P.; Morris, C.F. Modeling end-use quality in U.S. soft wheat germplasm. Cereal Chem. J. 2015, 92, 57–64. [Google Scholar] [CrossRef]
  8. Campbell, G.M.; Fang, C.; Muhamad, I.I. On predicting roller milling performance VI: Effect of kernel hardness and shape on the particle size distribution from First Break milling of wheat. Food Bioprod. Process. 2007, 85, 7–23. [Google Scholar] [CrossRef]
  9. Gale, K.R. Diagnostic DNA markers for quality traits in wheat. J. Cereal Sci. 2005, 41, 181–192. [Google Scholar] [CrossRef]
  10. Carter, A.H.; Garland-Campbell, K.; Morris, C.F.; Kidwell, K.K. Chromosomes 3B and 4D are associated with several milling and baking quality traits in a soft white spring wheat (Triticum aestivum L.) population. Theor. Appl. Genet. 2012, 124, 1079–1096. [Google Scholar] [CrossRef] [PubMed]
  11. Jernigan, K.L.; Godoy, J.V.; Huang, M.; Zhou, Y.; Morris, C.F.; Garland-Campbell, K.A.; Zhang, Z.; Carter, A.H. Genetic dissection of end-use quality traits in adapted soft white winter wheat. Front. Plant Sci. 2018, 9, 1–15. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Yang, Y.; Chai, Y.; Zhang, X.; Lu, S.; Zhao, Z.; Wei, D.; Chen, L.; Hu, Y.G. Multi-locus GWAS of quality traits in bread wheat: Mining more candidate genes and possible regulatory network. Front. Plant Sci. 2020, 11, 1091. [Google Scholar] [CrossRef] [PubMed]
  13. Kristensen, P.S.; Jahoor, A.; Andersen, J.R.; Cericola, F.; Orabi, J.; Janss, L.L.; Jensen, J. Genome-wide association studies and comparison of models and cross-validation strategies for genomic prediction of quality traits in advanced winter wheat breeding lines. Front. Plant Sci. 2018, 9, 69. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Crossa, J.; Pérez-Rodríguez, P.; Cuevas, J.; Montesinos-López, O.; Jarquín, D.; de los Campos, G.; Burgueño, J.; González-Camacho, J.M.; Pérez-Elizalde, S.; Beyene, Y.; et al. Genomic selection in plant breeding: Methods, models, and perspectives. Trends Plant Sci. 2017, 22, 961–975. [Google Scholar] [CrossRef] [PubMed]
  15. Meuwissen, T.H.E.; Hayes, B.J.; Goddard, M.E. Prediction of total genetic value using genome-wide dense marker Maps. Genetics 2001, 157, 1819–1829. [Google Scholar] [CrossRef] [PubMed]
  16. Battenfield, S.D.; Guzmán, C.; Chris Gaynor, R.; Singh, R.P.; Peña, R.J.; Dreisigacker, S.; Fritz, A.K.; Poland, J.A. Genomic selection for processing and end-use quality traits in the CIMMYT spring bread wheat breeding program. Plant Genome 2016, 9. [Google Scholar] [CrossRef] [Green Version]
  17. Juliana, P.; Poland, J.; Huerta-Espino, J.; Shrestha, S.; Crossa, J.; Crespo-Herrera, L.; Toledo, F.H.; Govindan, V.; Mondal, S.; Kumar, U.; et al. Improving grain yield, stress resilience and quality of bread wheat using large-scale genomics. Nat. Genet. 2019, 51, 1530–1539. [Google Scholar] [CrossRef]
  18. Sandhu, K.S.; Mihalyov, P.D.; Lewien, M.J.; Pumphrey, M.O.; Carter, A.H. Combining genomic and phenomic information for predicting grain protein content and grain yield in spring wheat. Front. Plant Sci. 2021, 12, 170. [Google Scholar] [CrossRef]
  19. Heffner, E.L.; Jannink, J.-L.; Sorrells, M.E. Genomic selection accuracy using multifamily prediction models in a wheat breeding program. Plant Genome 2011, 4, 65. [Google Scholar] [CrossRef] [Green Version]
  20. Heffner, E.L.; Jannink, J.L.; Iwata, H.; Souza, E.; Sorrells, M.E. Genomic selection accuracy for grain quality traits in biparental wheat populations. Crop Sci. 2011, 51, 2597–2606. [Google Scholar] [CrossRef] [Green Version]
  21. Sandhu, K.S.; Lozada, D.N.; Zhang, Z.; Pumphrey, M.O.; Carter, A.H. Deep learning for predicting complex traits in spring wheat breeding program. Front. Plant Sci. 2021, 11, 613325. [Google Scholar] [CrossRef] [PubMed]
  22. Chu, Z.; Yu, J. An end-to-end model for rice yield prediction using deep learning fusion. Comput. Electron. Agric. 2020, 174, 105471. [Google Scholar] [CrossRef]
  23. Liu, Y.; Wang, D.; He, F.; Wang, J.; Joshi, T.; Xu, D. Phenotype prediction and genome-wide association study using deep convolutional neural network of soybean. Front. Genet. 2019, 10, 1091. [Google Scholar] [CrossRef]
  24. Khaki, S.; Wang, L. Crop yield prediction using deep neural networks. Front. Plant Sci. 2019, 10, 621. [Google Scholar] [CrossRef] [Green Version]
  25. Ma, W.; Qiu, Z.; Song, J.; Li, J.; Cheng, Q.; Zhai, J.; Ma, C. A deep convolutional neural network approach for predicting phenotypes from genotypes. Planta 2018, 248, 1307–1318. [Google Scholar] [CrossRef]
  26. Montesinos-López, O.A.; Martín-Vallejo, J.; Crossa, J.; Gianola, D.; Hernández-Suárez, C.M.; Montesinos-López, A.; Juliana, P.; Singh, R. New deep learning genomic-based prediction model for multiple traits with binary, ordinal, and continuous phenotypes. G3 Genes Genomes Genet. 2019, 9, 1545–1556. [Google Scholar] [CrossRef] [Green Version]
  27. Montesinos-López, O.A.; Montesinos-López, A.; Crossa, J.; Gianola, D.; Hernández-Suárez, C.M.; Martín-Vallejo, J. Multi-trait, multi-environment deep learning modeling for genomic-enabled prediction of plant traits. G3 Genes Genomes Genet. 2018, 8, 3829–3840. [Google Scholar] [CrossRef] [Green Version]
  28. Aoun, M.; Carter, A.H.; Ward, B.P.; Morris, C.F. Genome-wide association mapping of the ‘super soft’ kernel texture in white winter wheat. Theor. Appl. Genet. 2021, 134, 2547–2559. [Google Scholar] [CrossRef]
  29. AACC. Approved Methods of Analysis. Available online: http://methods.aaccnet.org/ (accessed on 19 April 2021).
  30. Bates, D.; Mächler, M.; Bolker, B.M.; Walker, S.C. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 2015, 67. [Google Scholar] [CrossRef]
  31. Sandhu, K.S.; Patil, S.S.; Pumphrey, M.O.; Carter, A.H. Multi-trait machine and deep learning models for genomic selection using spectral information in a wheat breeding program. bioRxiv 2021, 2021, 439532. [Google Scholar] [CrossRef]
  32. Sandhu, K.S.; Mihalyov, P.D.; Lewien, M.J.; Pumphrey, M.O.; Carter, A.H. Genome-wide association studies and genomic selection for grain protein content stability in a nested association mapping population of spring wheat. bioRxiv 2021, 2021, 440064. [Google Scholar] [CrossRef]
  33. Cullis, B.R.; Smith, A.B.; Coombes, N.E. On the design of early generation variety trials with correlated data. J. Agric. Biol. Environ. Stat. 2006, 11, 381–393. [Google Scholar] [CrossRef]
  34. Poland, J.A.; Brown, P.J.; Sorrells, M.E.; Jannink, J.L. Development of high-density genetic maps for barley and wheat using a novel two-enzyme genotyping-by-sequencing approach. PLoS ONE 2012, 7. [Google Scholar] [CrossRef] [Green Version]
  35. Appels, R.; Eversole, K.; Feuillet, C.; Keller, B.; Rogers, J.; Stein, N.; Pozniak, C.J.; Choulet, F.; Distelfeld, A.; Poland, J.; et al. Shifting the limits in wheat research and breeding using a fully annotated reference genome. Science 2018, 361. [Google Scholar] [CrossRef] [Green Version]
  36. Bradbury, P.J.; Zhang, Z.; Kroon, D.E.; Casstevens, T.M.; Ramdoss, Y.; Buckler, E.S. TASSEL: Software for association mapping of complex traits in diverse samples. Bioinformatics 2007, 23, 2633–2635. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Significant phenotypic correlations between the end-use quality traits evaluated across two locations in Washington and five years, computed using best linear unbiased predictors. All abbreviations are defined in the text and Table 2.
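For readers who want to reproduce a correlation matrix like the one summarised in Figure 1, the sketch below shows one way the calculation could be set up. It is a minimal illustration only: the DataFrame `blups` (one column per trait abbreviation, one row per genotype) and the helper `significant_correlations` are hypothetical names, not objects from the authors' analysis scripts.

```python
# Minimal sketch: pairwise Pearson correlations among trait BLUPs, keeping only
# coefficients significant at p < 0.05, as displayed in Figure 1. The DataFrame
# `blups` (columns FYELD, BKYELD, ..., FSRW; one row per genotype) is hypothetical.
import numpy as np
import pandas as pd
from scipy.stats import pearsonr

def significant_correlations(blups: pd.DataFrame, alpha: float = 0.05) -> pd.DataFrame:
    traits = list(blups.columns)
    out = pd.DataFrame(np.nan, index=traits, columns=traits)  # diagonal left blank
    for i, t1 in enumerate(traits):
        for t2 in traits[i + 1:]:
            pair = blups[[t1, t2]].dropna()  # drop genotypes missing either trait
            r, p = pearsonr(pair[t1], pair[t2])
            if p < alpha:
                out.loc[t1, t2] = out.loc[t2, t1] = round(r, 2)
    return out

# Example usage with the hypothetical `blups` table:
# corr = significant_correlations(blups)
# print(corr.loc["FYELD", "BKYELD"])
```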
Figure 2. Genomic selection cross-validation prediction accuracies for fourteen end-use quality traits evaluated with nine different models. Results are shown separately for the two locations, and each trait is plotted in its own facet.
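As a rough illustration of how the cross-validated accuracies in Figure 2 can be computed, the sketch below uses scikit-learn's ridge regression as a stand-in for the RRBLUP and machine/deep learning models and reports the Pearson correlation between predicted and observed values across folds. The marker matrix `X`, trait vector `y`, and the penalty value are hypothetical placeholders, not the data or tuned hyperparameters used in this study.

```python
# Minimal sketch: five-fold cross-validated genomic prediction accuracy, with
# ridge regression standing in for RRBLUP. `X` (genotypes x markers, coded
# e.g. -1/0/1) and `y` (trait BLUPs) are hypothetical inputs.
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import KFold

def cv_accuracy(X: np.ndarray, y: np.ndarray, n_splits: int = 5, seed: int = 42) -> float:
    accuracies = []
    kfold = KFold(n_splits=n_splits, shuffle=True, random_state=seed)
    for train_idx, test_idx in kfold.split(X):
        model = Ridge(alpha=1.0)  # placeholder penalty, not a tuned value
        model.fit(X[train_idx], y[train_idx])
        pred = model.predict(X[test_idx])
        # Prediction accuracy = Pearson correlation of predicted vs. observed values.
        accuracies.append(np.corrcoef(pred, y[test_idx])[0, 1])
    return float(np.mean(accuracies))

# Example usage with simulated data:
# rng = np.random.default_rng(0)
# X = rng.choice([-1, 0, 1], size=(666, 5000)).astype(float)
# y = X[:, :50] @ rng.normal(size=50) + rng.normal(size=666)
# print(cv_accuracy(X, y))
```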
Figure 3. Genomic selection forward prediction accuracies for Pullman, WA, where all datasets from previous years were used to predict fourteen end-use quality traits with nine different models. The x-axis shows the year being predicted, with all previous years used as the training set. All abbreviations are defined in the text and Table 2.
Figure 4. Genomic selection forward prediction accuracies for Lind, WA, where all datasets from previous years were used to predict fourteen end-use quality traits with nine different models. The x-axis shows the year being predicted, with all previous years used as the training set. All abbreviations are defined in the text and Table 2.
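The forward-prediction scenario in Figures 3 and 4, in which all earlier years form the training set and the following year is predicted, could be organised roughly as sketched below. The arrays `X`, `y`, and `year` are hypothetical, and ridge regression again stands in for the nine models actually compared.

```python
# Minimal sketch: forward prediction within one location. All genotypes
# phenotyped before `target_year` form the training set; the `target_year`
# genotypes form the test set. `X`, `y`, and `year` are hypothetical arrays.
import numpy as np
from sklearn.linear_model import Ridge

def forward_accuracy(X: np.ndarray, y: np.ndarray, year: np.ndarray, target_year: int) -> float:
    train = year < target_year
    test = year == target_year
    model = Ridge(alpha=1.0)  # stand-in for the RRBLUP/Bayesian/ML/DL models
    model.fit(X[train], y[train])
    pred = model.predict(X[test])
    return float(np.corrcoef(pred, y[test])[0, 1])

# Example: predict 2019 from the 2015-2018 data, as in Figures 3 and 4.
# acc_2019 = forward_accuracy(X, y, year, target_year=2019)
```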
Figure 5. Genomic selection across-environment prediction accuracies for fourteen end-use quality traits evaluated with four different models. 2019_Pullman_Lind denotes the scenario where 2019_Pullman was predicted using the Lind datasets as the training set, and vice versa for 2019_Lind_Pullman. Results are shown separately for both scenarios, and each trait is plotted in its own facet.
Table 1. Total number of lines screened across each year at two locations in Washington and phenotyped for end-use quality traits.
Location | Year | Lines Screened for Quality
Lind | 2015 | 122
Lind | 2016 | 114
Lind | 2017 | 115
Lind | 2018 | 71
Lind | 2019 | 106
Pullman | 2015 | 183
Pullman | 2016 | 128
Pullman | 2017 | 181
Pullman | 2018 | 137
Pullman | 2019 | 178
Total | | 1335
Table 2. Summary of the fourteen end-use quality traits evaluated for genomic selection analysis using nine different prediction models.
Abbreviation | Trait | Units | No. of Genotypes | Mean | Min | Max | S.E. | H2 | h2
Milling traits
FYELD | Flour yield | percent | 666 | 69.9 | 58.0 | 75.8 | 0.09 | 0.91 | 0.75
BKYELD | Break flour yield | percent | 666 | 48.1 | 33.9 | 56.6 | 0.14 | 0.93 | 0.72
MSCOR | Milling score | unitless | 646 | 85.6 | 69.1 | 98.8 | 0.10 | 0.81 | 0.77
Grain characteristics
TWT | Test weight | kg/hL | 666 | 61.8 | 54.6 | 65.9 | 0.06 | 0.92 | 0.66
GPC | Grain protein content | percent | 666 | 10.73 | 7.2 | 14.8 | 0.05 | 0.56 | 0.50
KHRD | Kernel hardness | unitless | 666 | 23.0 | −10.2 | 52.4 | 0.4 | 0.93 | 0.64
KWT | Kernel weight | mg | 666 | 39.3 | 26.5 | 54.6 | 0.17 | 0.86 | 0.75
KSIZE | Kernel size | mm | 666 | 2.76 | 2.3 | 3.3 | 0.005 | 0.83 | 0.77
Baking parameters
CODI | Cookie diameter | cm | 622 | 9.2 | 7.8 | 10.0 | 0.008 | 0.89 | 0.82
Flour parameters
FPROT | Flour protein | percent | 666 | 8.93 | 6.3 | 13.0 | 0.04 | 0.57 | 0.46
FASH | Flour ash | percent | 646 | 0.39 | 0.21 | 0.54 | 0.001 | 0.88 | 0.73
FSV | Flour swelling volume | mL/g | 665 | 19.06 | 14.0 | 26.3 | 0.05 | 0.63 | 0.59
FSDS | Flour SDS sedimentation | g/mL | 666 | 10.1 | 3.5 | 18.3 | 0.09 | 0.92 | 0.85
FSRW | Water solvent retention capacity | percent | 666 | 54.18 | 43.4 | 72.6 | 0.09 | 0.85 | 0.77
S.E. is standard error, H2 is broad sense heritability, h2 is narrow sense heritability.
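Table 2 reports both broad-sense (H2) and narrow-sense (h2) heritability. As a reminder of what these quantities measure, the expressions below give one common entry-mean formulation based on variance components; they are illustrative assumptions and not necessarily the exact estimators used in this study.

```latex
% Illustrative entry-mean definitions (assumed, not the study's exact estimators).
% \sigma^2_G: total genotypic variance; \sigma^2_A: additive (marker-based) genetic variance;
% \sigma^2_\varepsilon: residual variance; n: number of location-year means per genotype;
% \sigma^2_P: phenotypic variance on the same entry-mean basis.
H^2 = \frac{\sigma^2_G}{\sigma^2_G + \sigma^2_\varepsilon / n}, \qquad
h^2 = \frac{\sigma^2_A}{\sigma^2_P}
```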
Table 3. Genomic selection cross-validation prediction accuracies for the fourteen end-use quality traits evaluated with nine different models at two locations in Washington. The highest accuracy for each trait is shown in bold.
Location | Trait | RRBLUP | BayesA | BayesB | BayesC | Bayes Lasso | RF | SVM | MLP | CNN
Pullman | FYELD | 0.71 b | 0.61 d | 0.64 c | 0.64 c | 0.63 c | 0.76 a | 0.76 a | 0.75 a | 0.74 a
Pullman | BKYELD | 0.70 b | 0.62 d | 0.64 c | 0.64 cd | 0.64 cd | 0.75 a | 0.75 a | 0.76 a | 0.75 a
Pullman | MSCOR | 0.58 c | 0.52 d | 0.52 d | 0.53 d | 0.52 d | 0.60 abc | 0.60 bc | 0.63 a | 0.61 ab
Pullman | TWT | 0.67 c | 0.67 c | 0.66 c | 0.66 c | 0.66 c | 0.68 abc | 0.67 bc | 0.70 ab | 0.70 a
Pullman | GPC | 0.55 b | 0.54 bc | 0.54 bc | 0.53 c | 0.53 c | 0.59 a | 0.60 a | 0.60 a | 0.60 a
Pullman | KHRD | 0.71 a | 0.67 bcd | 0.67 cd | 0.68 bcd | 0.67 d | 0.70 ab | 0.69 abcd | 0.70 ab | 0.69 abc
Pullman | KWT | 0.76 b | 0.77 b | 0.75 b | 0.75 b | 0.75 b | 0.81 a | 0.80 a | 0.80 a | 0.75 b
Pullman | KSIZE | 0.77 b | 0.75 bc | 0.74 c | 0.75 bc | 0.77 b | 0.76 bc | 0.76 bc | 0.80 a | 0.81 a
Pullman | CODI | 0.67 bc | 0.67 bc | 0.67 c | 0.68 bc | 0.67 c | 0.69 ab | 0.69 abc | 0.69 ab | 0.71 a
Pullman | FPROT | 0.58 c | 0.58 c | 0.58 bc | 0.55 d | 0.55 d | 0.61 a | 0.58 c | 0.62 a | 0.60 ab
Pullman | FASH | 0.55 d | 0.56 cd | 0.59 ab | 0.58 ab | 0.59 ab | 0.58 abc | 0.59 a | 0.59 a | 0.59 bc
Pullman | FSV | 0.55 b | 0.54 b | 0.53 b | 0.53 b | 0.53 b | 0.59 a | 0.60 a | 0.60 a | 0.60 a
Pullman | FSDS | 0.67 de | 0.67 bcde | 0.66 e | 0.66 e | 0.67 cde | 0.69 abcd | 0.69 abc | 0.70 ab | 0.70 a
Pullman | FSRW | 0.58 b | 0.52 c | 0.52 c | 0.52 c | 0.52 c | 0.60 ab | 0.60 ab | 0.61 a | 0.62 a
Lind | FYELD | 0.64 b | 0.55 c | 0.58 c | 0.56 c | 0.58 c | 0.68 a | 0.69 a | 0.67 ab | 0.67 a
Lind | BKYELD | 0.63 b | 0.55 c | 0.57 c | 0.56 c | 0.57 c | 0.67 a | 0.68 a | 0.69 a | 0.69 a
Lind | MSCOR | 0.48 c | 0.49 bc | 0.53 a | 0.50 b | 0.52 a | 0.50 b | 0.52 a | 0.52 a | 0.50 ab
Lind | TWT | 0.61 ab | 0.61 ab | 0.60 b | 0.61 ab | 0.60 b | 0.61 ab | 0.61 ab | 0.63 ab | 0.64 a
Lind | GPC | 0.51 b | 0.51 b | 0.51 b | 0.47 b | 0.47 b | 0.54 a | 0.52 a | 0.55 a | 0.53 a
Lind | KHRD | 0.58 a | 0.56 bc | 0.56 bc | 0.57 ab | 0.54 c | 0.56 bc | 0.57 abc | 0.57 abc | 0.57 abc
Lind | KWT | 0.65 bc | 0.65 bc | 0.63 c | 0.63 c | 0.63 c | 0.70 a | 0.66 ab | 0.69 a | 0.63 bc
Lind | KSIZE | 0.66 bc | 0.64 c | 0.62 c | 0.63 c | 0.66 bc | 0.64 c | 0.64 c | 0.69 a | 0.68 ab
Lind | CODI | 0.56 b | 0.54 b | 0.54 b | 0.56 b | 0.55 b | 0.57 ab | 0.58 ab | 0.58 ab | 0.58 a
Lind | FPROT | 0.48 c | 0.48 c | 0.46 d | 0.46 d | 0.46 d | 0.51 b | 0.53 ab | 0.53 ab | 0.54 a
Lind | FASH | 0.51 c | 0.44 d | 0.44 d | 0.45 d | 0.44 d | 0.54 ab | 0.53 b | 0.56 a | 0.53 b
Lind | FSV | 0.48 b | 0.47 bc | 0.46 c | 0.45 c | 0.46 c | 0.54 a | 0.54 a | 0.53 a | 0.53 a
Lind | FSDS | 0.59 c | 0.60 c | 0.59 c | 0.60 bc | 0.59 c | 0.62 ab | 0.63 a | 0.63 a | 0.62 ab
Lind | FSRW | 0.52 b | 0.45 c | 0.45 c | 0.45 c | 0.46 c | 0.53 ab | 0.53 a | 0.54 a | 0.54 a
Average | | 0.61 | 0.58 | 0.58 | 0.58 | 0.58 | 0.63 | 0.63 | 0.64 | 0.63
All abbreviations are defined in the text and Table 2. Within each trait, models labelled with the same letter are not significantly different (p = 0.05, Tukey's test).
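The letter groupings footnoted under Table 3 come from Tukey's test at p = 0.05. One way such pairwise model comparisons could be generated from per-fold accuracies is sketched below with statsmodels; the long-format `accuracy` and `model` arrays are simulated purely for illustration, and the study's actual comparison may have been implemented differently.

```python
# Minimal sketch: Tukey's HSD comparison of genomic-prediction models for one
# trait, using per-fold accuracies in long format. `accuracy` and `model` are
# simulated arrays (one entry per model x cross-validation fold).
import numpy as np
from statsmodels.stats.multicomp import pairwise_tukeyhsd

rng = np.random.default_rng(1)
models = ["RRBLUP", "BayesA", "RF", "SVM", "MLP", "CNN"]
# Simulated per-fold accuracies centred roughly on the Pullman FYELD means in Table 3.
accuracy = np.concatenate([rng.normal(loc=m, scale=0.02, size=25)
                           for m in [0.71, 0.61, 0.76, 0.76, 0.75, 0.74]])
model = np.repeat(models, 25)

# Pairwise comparisons at alpha = 0.05; models whose confidence interval for the
# difference includes zero would share a letter in a compact letter display.
result = pairwise_tukeyhsd(endog=accuracy, groups=model, alpha=0.05)
print(result.summary())
```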
Table 4. Genomic selection across-environment prediction accuracies for fourteen end-use quality traits evaluated with four different models. 2019_Pullman_Lind denotes the scenario where 2019_Pullman was predicted using the Lind datasets as the training set, and vice versa for 2019_Lind_Pullman. The highest accuracy for each trait is shown in bold.
Scenario | Trait | RRBLUP | RF | MLP | CNN
2019_Pullman_Lind | FYELD | 0.41 d | 0.48 b | 0.50 a | 0.46 c
2019_Pullman_Lind | BKYELD | 0.31 c | 0.38 b | 0.38 b | 0.40 a
2019_Pullman_Lind | MSCOR | 0.27 b | 0.30 a | 0.30 a | 0.30 a
2019_Pullman_Lind | TWT | 0.32 b | 0.37 a | 0.38 a | 0.38 a
2019_Pullman_Lind | GPC | 0.25 c | 0.30 b | 0.31 b | 0.33 a
2019_Pullman_Lind | KHRD | 0.32 c | 0.37 ab | 0.36 b | 0.38 a
2019_Pullman_Lind | KWT | 0.34 b | 0.37 a | 0.36 a | 0.36 a
2019_Pullman_Lind | KSIZE | 0.34 c | 0.38 b | 0.38 b | 0.40 a
2019_Pullman_Lind | CODI | 0.40 c | 0.45 b | 0.46 a | 0.46 a
2019_Pullman_Lind | FPROT | 0.35 c | 0.40 b | 0.40 b | 0.41 a
2019_Pullman_Lind | FASH | 0.40 b | 0.41 ab | 0.41 ab | 0.42 a
2019_Pullman_Lind | FSV | 0.27 c | 0.36 b | 0.39 a | 0.36 b
2019_Pullman_Lind | FSDS | 0.36 c | 0.44 a | 0.43 a | 0.41 b
2019_Pullman_Lind | FSRW | 0.36 c | 0.39 b | 0.41 a | 0.42 a
2019_Lind_Pullman | FYELD | 0.43 c | 0.47 b | 0.50 a | 0.49 a
2019_Lind_Pullman | BKYELD | 0.31 b | 0.40 a | 0.41 a | 0.40 a
2019_Lind_Pullman | MSCOR | 0.28 b | 0.29 b | 0.31 a | 0.31 a
2019_Lind_Pullman | TWT | 0.31 c | 0.36 ab | 0.35 b | 0.37 a
2019_Lind_Pullman | GPC | 0.27 b | 0.30 a | 0.28 b | 0.31 a
2019_Lind_Pullman | KHRD | 0.33 b | 0.33 b | 0.38 a | 0.37 a
2019_Lind_Pullman | KWT | 0.34 b | 0.37 a | 0.38 a | 0.37 a
2019_Lind_Pullman | KSIZE | 0.35 b | 0.39 a | 0.40 a | 0.40 a
2019_Lind_Pullman | CODI | 0.42 c | 0.44 b | 0.46 a | 0.46 a
2019_Lind_Pullman | FPROT | 0.34 c | 0.42 a | 0.42 a | 0.40 b
2019_Lind_Pullman | FASH | 0.41 a | 0.42 a | 0.42 a | 0.40 b
2019_Lind_Pullman | FSV | 0.30 c | 0.38 b | 0.38 b | 0.42 a
2019_Lind_Pullman | FSDS | 0.38 c | 0.41 a | 0.40 b | 0.40 b
2019_Lind_Pullman | FSRW | 0.37 c | 0.41 b | 0.41 b | 0.43 a
Average | | 0.34 | 0.38 | 0.39 | 0.39
All abbreviations are defined in the text and Table 2. Within each trait, models labelled with the same letter are not significantly different (p = 0.05, Tukey's test).
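The across-location scenario summarised in Table 4, where the model is trained entirely at one location and used to predict the 2019 lines at the other, follows the same pattern as the forward-prediction sketch above. A compact illustration, again with hypothetical `X`, `y`, `location`, and `year` arrays and a ridge stand-in, is given below.

```python
# Minimal sketch: across-location prediction, e.g. the 2019_Pullman_Lind scenario
# (train on all Lind data, predict Pullman 2019). All inputs are hypothetical.
import numpy as np
from sklearn.linear_model import Ridge

def across_location_accuracy(X, y, location, year, train_loc, test_loc, test_year=2019):
    train = location == train_loc
    test = (location == test_loc) & (year == test_year)
    model = Ridge(alpha=1.0)  # stand-in for the RRBLUP/RF/MLP/CNN models compared
    model.fit(X[train], y[train])
    return float(np.corrcoef(model.predict(X[test]), y[test])[0, 1])

# Example: the 2019_Pullman_Lind scenario from Table 4.
# acc = across_location_accuracy(X, y, location, year, "Lind", "Pullman")
```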
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
