Next Article in Journal
Utilization of qPCR to Determine Duration and Environmental Drivers Contributing to the Persistence of Human DNA in Soil
Previous Article in Journal
A Nonsynonymous Substitution of Lhx3 Leads to Changes in Body Size in Dogs and Mice
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Dual-Model GWAS Analysis and Genomic Selection of Maize Flowering Time-Related Traits

College of Agronomy, Xinjiang Agricultural University, Urumqi 830052, China
*
Author to whom correspondence should be addressed.
Genes 2024, 15(6), 740; https://doi.org/10.3390/genes15060740
Submission received: 23 April 2024 / Revised: 16 May 2024 / Accepted: 31 May 2024 / Published: 4 June 2024
(This article belongs to the Section Plant Genetics and Genomics)

Abstract

:
An appropriate flowering period is an important selection criterion in maize breeding. It plays a crucial role in the ecological adaptability of maize varieties. To explore the genetic basis of flowering time, GWAS and GS analyses were conducted using an associating panel consisting of 379 multi-parent DH lines. The DH population was phenotyped for days to tasseling (DTT), days to pollen-shedding (DTP), and days to silking (DTS) in different environments. The heritability was 82.75%, 86.09%, and 85.26% for DTT, DTP, and DTS, respectively. The GWAS analysis with the FarmCPU model identified 10 single-nucleotide polymorphisms (SNPs) distributed on chromosomes 3, 8, 9, and 10 that were significantly associated with flowering time-related traits. The GWAS analysis with the BLINK model identified seven SNPs distributed on chromosomes 1, 3, 8, 9, and 10 that were significantly associated with flowering time-related traits. Three SNPs 3_198946071, 9_146646966, and 9_152140631 showed a pleiotropic effect, indicating a significant genetic correlation between DTT, DTP, and DTS. A total of 24 candidate genes were detected. A relatively high prediction accuracy was achieved with 100 significantly associated SNPs detected from GWAS, and the optimal training population size was 70%. This study provides a better understanding of the genetic architecture of flowering time-related traits and provides an optimal strategy for GS.

1. Introduction

Maize (Zea mays L.) is the highest-yielding cereal crop in the world, playing a crucial role in food security [1]. The flowering time of maize is a crucial factor influencing the adaptability of maize varieties to regional environments and has been considered a critical selection criterion in crop breeding [2]. Days to tasseling (DTT), days to pollen-shedding (DTP), and days to silking (DTS) are important flowering time-related traits in maize. Flowering time-related traits in maize were typically quantitative traits regulated by many genes. Shi et al. reported that the broad-sense heritability across two environments of DTT, DTP, and DTS were 76.20%, 67.78%, and 68.03%, respectively [3]. The phenotypic variations in flowering time-related traits are mainly controlled by genetic factors [4].
A large number of QTLs have been identified for maize flowering time-related traits [5,6,7]. In the study of Liu et al. (2016), six QTLs on chromosomes 1, 5, 9, and 10 were detected for DTT, explaining 3.42–11.79% of the phenotypic variation; twenty-one QTLs on chromosomes 1, 5, 6, 7, 8, 9, and 10 were detected for DTP, explaining 0.8–12.95% of the phenotypic variation; twenty-two QTLs on chromosomes 1, 3, 4, 5, 6,7, 9, and 10 were detected for DTS, explaining 1.77–13.47% of the phenotypic variation; and seventeen QTLs on chromosomes 1, 3, 4, 5, 6, 7, 8, and 10 were detected for anthesis-silking interval (ASI), explaining 0.38–13% of the phenotypic variation [5]. In the RIL population of B73 × Abe2, eight QTLs with the phenotypic variation explained (PVE) ranging from 1.92 to 17.28%, thirteen QTLs with the PVE ranging from 2.09 to 13.08%, and fifteen QTLs with the PVE ranging from 2.28 to 14.87% were identified for days to heading (DTH), DTS, and days to anthesis (DTA), respectively (Shi et al., 2022) [3]. Wu et al. identified eleven QTLs associated with DTT explaining 11.4–32.5% of the phenotypic variation, four QTLs associated with DTP explaining 2.4–35.4% of the phenotypic variation, and six QTLs associated with DTT explaining 7.8–32.6% of the phenotypic variation [6]. Li et al. identified four and three QTLs associated with DTA and DTS explaining 35.37% and 34.22% of the total phenotypic variation, respectively [7].
GWAS is a powerful tool for exploring the genetic architecture of complex traits and candidate genes. In the GWAS panel consisting of 226 maize inbred lines, 82 SNPs and 117 candidate genes were identified for DTT, DTP, and DTS (Wu et al., 2023) [6]. Some SNPs were significantly associated with two or three-flowering time-related traits, indicating a significant genetic correlation between flowering time-related traits. Li et al. (2016) identified nearly 1000 flowering time-associated SNPs and 220 candidate genes in an extremely large multi-genetic background population [8]. All the studies are of great significance for understanding the genetic basis of maize flowering time, which is still unclear. Exploring new genetic loci and candidate genes is necessary for revealing the genetic regulation of maize flowering time.
To improve the breeding efficiency and cost-effectiveness, Meuwissen et al. introduced the concept of genomic selection (GS), where genome-wide markers were used for selection [9]. A training population with phenotype and genotype data was used to estimate the genomic estimated breeding values (GEBVs) of individuals in the prediction population with genotype data. The accuracy of the prediction is measured by the correlation between GEBVs and true breeding values. Yuan et al. (2019) conducted GS of anthesis date and ASI under well-watered (WW), drought stress (DS), heat stress (HS), and combined drought and heat stress (DTS) management conditions. The prediction accuracies were 0.64, 0.62, 0.45, and 0.51 for anthesis date, and 0.40, 0.55, 0.13, and 0.29 for ASI under WW, DS, HS, and DHS condition, respectively [10].
Previous studies have indicated that the prediction ability of GS depends on the prediction accuracy, which is influenced by various factors, including the trait heritability, prediction models, environmental and seasonal factors, training population size, the genetic relationship between training and prediction sets, marker density, and marker quality [11,12,13,14,15]. For traits affected by multiple small-effect QTL, GBLUP or RR-BLUP may achieve better prediction accuracy compared to the Bayesian models [15]. Genomic selection for flowering time-related traits using deep learning models and Bayesian models has been proposed by Mora-Poblete (2023), and deep learning models showed higher prediction consistently [16]. In the study of Beyene et al., the prediction accuracy of anthesis date (AD) was 0.37 and 0.40 under well-watered conditions and water-stressed conditions, respectively [17]. The prediction accuracy of flowering time-related traits reported previously was moderate to low. Efforts should be made to improve the prediction accuracy of flowering time-related traits to improve the breeding efficiency.
In the present study, 379 multi-parent doubled haploid (DH) lines were phenotyped for DTT, DTP, and DTS in four different environments and genotyped by the 48 K liquid-phase hybridization probe capture technique for GWAS and GS. This study aims to (1) identify SNPs and putative candidate genes conferring flowering time-related traits by different GWAS models; (2) explore the potential of GS for flowering time-related traits; and (3) estimate the effect of training population size, marker density, and significantly associated SNPs on prediction accuracy.

2. Materials and Methods

2.1. Plant Materials and Field Trials

In this study, a multi-parent DH population consisting of 379 DH lines was utilized as the association analysis population. The DH lines were developed from 21 maize hybrids (Supplementary Table S1). The experiment was conducted in four different environments, Sangong Town experimental station, Changji City, Xinjiang, China (SG, 87°12′57″ E, 43°56′54″ N) and Dafeng Town experimental station, Hutubi County, Xinjiang, China (DF, 86°34′49″ E, 44°10′47″ N) during the summer of 2022; Ledong experimental station, Hainan Province, China (LD, 108°57′14″ E, 18°27′14″ N) during the winter of 2022; and Qitai County experimental station, Xinjiang, China (QT, 89°44′19″ E, 44°5′77″ N) during the summer of 2023. The experiment was performed using a completely randomized block design with single-row plots. Each row had a length of 2.5 m, row spacing of 0.6 m, and plant spacing of 0.25 m. All field management, including fertilization, irrigation, pest and disease control, and weed management, followed the local standardized field management.

2.2. Evaluation and Data Analysis of Flowering Time-Related Traits

Days to tasseling (DTT), days to pollen-shedding (DTP), and days to silking (DTS) were evaluated as the number of dates from planting until half of the individuals in each row reached tasseling, pollen-shedding, and silking. R version 4.3.1 software [18] was employed for descriptive statistical analysis, variance analysis, correlation analysis, and visualization of the phenotypic data related to flowering time-related traits. The “lmer” function in the lme4 package of R was used to calculate variance components, best linear unbiased prediction (BLUP) values, and broad-sense heritability. BLUP values across all four environments were deployed for GWAS and GS. The broad-sense heritability was calculated on an entry-mean basis according to the method described by Hallauer et al. [19].

2.3. Genotyping

The DNA extraction and genotyping of the DH population were conducted by China Golden Marker Biotech Co., (Beijing, China). The 48 K liquid-phase hybridization probe capture technique was used. This process involved DNA concentration and quality evaluation, library construction, and sequencing. Clean reads were received after quality control using Trimmomatic-0.36 [20]. BWA 0.7.17 software [21] was used for SNP calling based on the Maize B73_RefGen_v4 reference genome. A total of 1,583,425 SNPs were received and filtered by vcftools (Danecek et al., 2011 [22]). Finally, 134,785 high-quality SNPs with the missing rate (MR) < 20%, and the minor allele frequency (MAF) > 0.05 were retained.

2.4. Genome-Wide Association Analysis

Two multi-locus models, fixed and random model circulating probability unification (FarmCPU) and bayesian information and linkage-disequilibrium iteratively nested keyway (BLINK), from the GAPIT version 3.0, were employed for GWAS of flowering time-related traits [23]. The population structure was controlled by the first three PCs. The FarmCPU model effectively controls false positives and reduces false negatives, allowing rapid analysis of large populations with numerous markers [24]. The BLINK model enhances statistical power by eliminating the assumption that causal genes are uniformly distributed in the genome [23]. A Bonferroni-corrected p-value of 3.71 × 10−7 (0.05/total number of markers) was used as the significance threshold, with a −log10 (p-value) of 6.4.

2.5. Candidate Gene Discovery

Genes located within 10 kb upstream and downstream of a significant SNP were retrieved as candidate genes based on the B73 RefGen_v4 reference genome on the maizeGDB website (https://www.maizegdb.org/ accessed on 1 September 2023), Candidate genes were annotated with databases of NCBI (https://www.ncbi.nlm.nih.gov/ accessed on 1 September 2023), GO (http://www.geneontology.org/ accessed on 1 September 2023), and the National Center for Bioinformation (https://www.cncb.ac.cn/ accessed on 1 September 2023).

2.6. Genomic Selection

Ridge regression best linear unbiased prediction (RR-BLUP) was one of the most commonly used models for GS. The rrBLUP package version 4.6.3 [25] in the R software was used for GS of flowering time-related traits. A k-fold cross-validation with k = 5 was performed 100 times, randomly selecting 20% of the population as the prediction population and the remaining 80% as the training population. The RR-BLUP model was fitted using the training data, and the response variable in the testing set was predicted. The average of prediction accuracy was calculated.
With all 134,785 SNPs, the training population size was set at 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, and 90% of the total population, and the remaining population was used as the prediction population for GS. The accuracy of GS for different population sizes was measured as the average of 100 replications.
The 5-fold cross-validation with 100 replications was employed to study the effect of marker density and significant SNPs on GS. The marker density was set to 10, 30, 50, 100, 300, 500, 1000, 3000, and 5000, which were selected randomly. The significant SNP markers identified by two GWAS models were used for GS. The SNPs were ordered according to p-value, and 1, 3, 5, 10, 30, 50, 100, 300, and 500 SNPs with the lowest p-value were selected for GS. The remaining SNPs were not used for GS. The one-way ANOVA with an LSD multiple comparison test was used to compare the differences between the prediction accuracy, estimated by all the markers and significant SNPs detected by two GWAS models, different training population sizes, and different marker densities.

3. Results and Analysis

3.1. Descriptive Statistical Analysis of Flowering Time-Related Traits

Descriptive statistical analysis was performed on the phenotypes of single and combined environments (Table 1). ALL three-flowering time-related traits varied among the environments and sufficient variation was observed in each environment. The average DTT was 59.94, 54.63, 56.41, 71.06, and 60.55 days in SG, DF, LD, QT, and combined environments, respectively. The greatest differentiation of DTT was observed in SG, where DTT ranged from 50.00 to 72.00. The smallest differentiation of DTT was observed in DF, where DTT ranged from 50.00 to 62.00.
The average DTP was 64.30, 62.80, 59.80, 72.90, and 65.00 in SG, DF, LD, QT, and combined environments, respectively. The greatest differentiation of DTP was observed in SG and QT, where DTP ranged from 54.0 to 77.0. The smallest differentiation of DTP was observed in combined environments, where DTP ranged from 58.2 to 73.0.
The average DTS was 65.87, 63.73, 61.29, 73.78, and 66.30 in SG, DF, LD, QT, and combined environments, respectively. The greatest differentiation was detected in SG, DF, and QT. The smallest differentiation was observed in combined environments, where DTS ranged from 58.8 to 75.0. The highest average values of DTT, DTP, and DTS were observed in QT. The values of skewness and kurtosis were within the range of −1 to 1, indicating that the three-flowering time-related traits followed a normal distribution.

3.2. Analysis of Variance for Flowering Time-Related Traits

A variance analysis was conducted for flowering time-related traits across multiple environments (Table 2). The results indicated that the differences in flowering time-related traits reached extremely significant levels in terms of genotype, environment, and genotype-environment interaction (p < 0.001). This suggested that, in addition to being controlled by genotype, the three-flowering time-related traits were also influenced by environmental factors and the interaction between genotype and environment. The heritability of DTT, DTP, and DTS was similar, with values of 82.75%, 86.09%, and 85.26%, respectively. This indicated that the three-flowering time-related traits were primarily under genetic control.

3.3. Correlation Analysis of Flowering Time-Related Traits

Correlation analysis was conducted based on the BLUP values of flowering time-related traits (Figure 1). The results showed that there was a highly significant positive correlation between each of the two traits. The correlation coefficient between DTP and DTS was the highest, with an r-value of 0.95, while the correlation coefficient between DTT and DTS was the lowest, with an r-value of 0.88.

3.4. Genome-Wide Association Analysis of Flowering Time-Related Traits

To accurately identify the significant loci associated with maize flowering time, BLINK, and FarmCPU, two multi-locus models were employed for the GWAS of flowering time-related traits. With the threshold at −log10p > 6.4, the BLINK model detected nine SNPs, including three for DTT, four for DTP, and two for DTS. The FarmCPU model detected thirteen SNPs, including three for DTT, six for DTP, and four for DTS (Table 3, Figure 2).
For DTT, the BLINK model identified three significantly associated SNPs on chromosomes 1, 9, and 10, accounting for 20.3% of phenotypic variation in total. The SNP 1_190275500 located on chromosome 1 had the lowest p-value of 5.52 × 10−11. The SNP 9_152140631 located on chromosome 9 was a major loci accounting for 16.92% of phenotypic variation. The FarmCPU model detected three SNPs associated with DTT on chromosomes 3, 9, and 10, accounting for 23.42 of phenotypic variation in total. The SNP 9_152140631 with the lowest p-value of 7.35 × 10−15 had a PVE of 15.53%. It was detected by both models.
For DTP, four significant SNPs located on chromosomes 3, 8, and 9 were identified by the BLINK model. The PVE of each SNP ranged from 2.64 to 12.68%, explaining 23.27% of the phenotypic variance in total. The most significant SNP 9_152140631 explained 12.68% of the phenotypic variance. Six SNP on chromosomes 3, 8, 9, and 10 were detected for DTP by the FarmCPU model. Each SNP explained 0.50 to 5.82% of the phenotypic variance, with a total PVE of 15.99%. SNP 9_152140631 was also identified in the FarmCPU model.
For DTS, two significant SNPs distributed across chromosomes 8 and 9 were identified by the BLINK model, explaining 3.39% and 24.17% of the phenotypic variation. Four SNPs located on chromosomes 3, 8, and 9 were detected by the FarmCPU model. The PVE of each SNP ranged from 2.30 to 8.57%, with a total PVE of 21.29%. SNPs 8_78666793 and 9_15214063 were co-located by both models.
Four SNPs, 3_198946071, 8_78666793, 9_146646966, and 9_152140631, were repeatedly identified by different models or traits (Figure 3). SNP 8_78666793 was co-detected for DTS by both BLINK and FarmCPU models. SNP 3_198946071 was co-detected for DTP by the BLINK model and DTT by FarmCPU model. SNP 9_146646966 was co-detected for DTP and DTS by the FarmCPU model. SNP 9_152140631 was detected for all three-flowering time-related traits with two models.

3.5. Annotation of Candidate Genes

Referring to the B73RefGen_v4 genome, nine candidate genes associated with DTT, fifteen with DTP, and seven with DTS were identified (Table 4). The function of candidate genes was annotated, and only two genes have unknown functions. Five candidate genes were found to be simultaneously related to multiple traits. The SNP 3_198946071, associated with DTT and DTP, was located within the coding region of Zm00001d043406. Zm00001d047969 and Zm00001d047968 were located in the LD interval of SNP 9_146646966, concurrently affecting DTP and DTS traits. The most crucial SNP 9_152140631, controlling all three-flowering time-related traits, was located in the LD interval of Zm00001d048190 and Zm00001d048191.

3.6. Genomic Selection for Flowering Time-Related Traits

The GS prediction accuracies estimated by a 5-fold cross-validation with all the SNPs were 0.47, 0.55, and 0.55 for DTT, DTP, and DTS, respectively.
The training population size has a noticeable impact on the prediction accuracy of GS (Figure 4A, Supplementary Table S2). As the training population size increased, the prediction accuracy of GS for flowering time-related traits gradually improved. When the training population size increased to 70%, the prediction accuracy reached a plateau, where the prediction accuracy rose from 0.27 to 0.46 for DTT, from 0.34 to 0.54 for DTP, and from 0.34 to 0.54 for DTS.
The effect of marker density on the prediction accuracy of GS is shown in Figure 4B and Supplementary Table S3. A higher marker density leads to a higher prediction accuracy. When marker density reached 3000, the prediction accuracy of all three traits reached a plateau, 0.46 for DTT, 0.52 for DTP, and 0.53 for DTS.
As the number of SNPs with the lowest p-value increased from 1 to 1000, the prediction accuracy first increased and then decreased (Figure 4C,D, Supplementary Table S4). When the number of significant SNPs increased to 100, the highest prediction accuracy was achieved. For the Blink model, the highest prediction accuracies for DTT, DTP, and DTS were 0.73, 0.79, and 0.74, respectively. For the FarmCPU model, the highest prediction accuracies for DTT, DTP, and DTS were 0.78, 0.84, and 0.82, respectively. The FarmCPU model attains higher prediction accuracy than the Blink model. Compared to GS with all the markers, the prediction accuracy using 100 significant markers can be greatly improved.

4. Discussion

4.1. Genetic Basis of Flowering Time-Related Traits in Maize

Maize has become one of the most widely planted crops in the world [26], mainly because it adapts to different geographical environments via flowering time regulation [27]. The flowering period is a crucial stage in the growth and development of maize. In this study, broad phenotypic variation in DTT, DTP, and DTS was observed in each environment. The heritability of the three-flowering time-related traits was relatively high, indicating that phenotypic selection in the early stage was effective. Two GWAS models were employed to explore the genetic basis of flowering time-related traits in maize. A total of 14 SNPs significantly associated with flowering time-related traits were detected on chromosomes 1, 3, 8, 9, and 10. Three SNPs exhibited a pleiotropic effect. The SNP 9_146646966 was significantly associated with both DTP and DTS. The SNP 3_198946071 was significantly associated with DTT and DTP. One SNP 9_152140631 was identified for all three traits by both models and the PVE ranged from 5.82% to 24.17%. It is inferred that the three-flowering time-related traits might share the same genetic control and possess a similar genetic foundation.
Numerous QTLs associated with maize flowering time-related traits have been localized, and a comprehensive analysis of these results indicates a tendency for flowering QTLs to be distributed on chromosomes 1, 3, 8, 9, and 10 [16,24,27,28,29,30]. Some SNPs detected in the present study were reported by previous studies. The SNP 9_152140631 detected for all three-flowering time-related traits by both models was close to SNP 9_149039896 <num_29 kb (referring to the B73RefGen_v2 genome), which was co-located for DA and DS in a natural association panel (Ames) with 1745 inbred lines from the USDA-ARS NCRPIS [8]. It was in the gene region of Zm00001d048190 and linked to Zm00001d048191. ZM00001D048190 encoded RNase L inhibitor protein-related. Zm00001d048191 encoded a membrane protein. SNP 8_77014638 for DTP and SNP 8_78666739 for DTS were in the same region of qDPSRS-8-1, a QTL identified for DTS by single-environment analysis [24]. SNP 8_114509981 for DTP was close to SNP S8_112412901, which was significantly associated with ASI [16]. SNP 10_146767704 and SNP 10_149653921 for DTT, and SNP 10_145158286 for DTP were located at bin 10.07 and in the same region of qDPSFS-10-1 and qDPSFJ-10-1 for DPS and qDTSFJ-10-1 for DTS reported by Liu et al. [24]. The SNP 9_146646966 co-located for DTP and DTS was close to SNP S9_144119131, which was identified for DTS in a panel of 258 tropical maize inbred lines [16]. SNP 1_190275500 detected at bin 1.06 for DTT was in the same interval of qdsilk13 and qdsilk45 mapped for DTS [27,28]. SNP 9_146646966 significantly associated with DTP and DPS was mapped in the same region of qdsilk16 and qdsilk43 [27,29]. SNP 3_198946071 for both DTT and DTP located at bin3.07 is consistent with the observations of Wang et al. [30]. The SNPs detected by different studies were stable loci controlling flowering time-related traits and some exhibited pleiotropic effects. Developing functional markers for marker-associated selection would effectively improve the breeding efficiency of flowering time-related traits.

4.2. Candidate Gene Analysis

Based on the B73 RefGen_v4 genome, candidate genes were screened within 10 Kb upstream and downstream of the 14 significantly associated SNPs for flowering-related traits. A total of 24 candidate genes were identified, of which 22 have functional annotations. Several candidate genes are known to be associated with flowering time. Zm00001d045323 encodes a double B-box zinc finger protein 11. B-box (BBX) proteins play a crucial role in plant growth regulation and development, including photomorphogenesis, photoperiodic regulation of flowering, and responses to biotic and abiotic stresses [31]. The transcription factor BBX is well-known for regulating flowering, photoperiod sensitivity, photomorphogenesis, and stress tolerance [32]. Studies have shown that AtBBX1, the first identified and characterized protein in the BBX family, plays a vital role in regulating flowering time and flower development [33]. In chrysanthemums, CmBBX24 has a dual function, delaying flowering and increasing cold or drought resistance [34]. Zm00001d009708 encodes a calcium-dependent protein kinase (CPK) family protein. A previous study has indicated that the CPK32 gene can control the growth of pollen tubes in tobacco and maize [35]. Plant growth and differentiation depend on the continuous functionality of meristematic tissues. The gene Zm00001d031444 encodes an IRK-interacting protein. It has been reported that the IRK-interacting protein is involved in maintaining and differentiating inflorescence meristematic tissues [36]. The IRK gene is expressed in proliferating and expanding tissues, such as stem meristematic tissues, floral buds, and root meristematic tissues [37]. In flowering plants, the formation of organs, like leaves and flowers, depends on the apical meristematic tissues of shoots [38].
Numerous studies have shown that various plant hormones are involved in regulating floral transition [39,40,41,42]. ETHYLENE(ETH) INSENSITIVE3/EIN3-LIKE1 modulates FLOWERING LOCUS C expression via histone demethylase interaction [43]. Zm00001d031445 encodes an ETHYLENE INSENSITIVE3-LIKE transcription factors that activate the ETH signaling pathway [44]. Zm00001d026397 encodes a protein with an RNA-binding domain, and its homolog in Arabidopsis thaliana (AT2G41060) is functionally annotated to be associated with the ethylene (ETH) biosynthesis processes [45,46]. Zm00001d039650, encoding Cytochrome P450 734A1, participates in brassinosteroid (BR) biosynthesis, influencing the flowering of Luculia gratissima and Mikania micrantha [47,48]. It is speculated that these genes affect the expression of flowering time-related traits by participating in hormone metabolism pathways.
According to the gene expression data of 23 tissues spanning vegetative and reproductive stages of maize [49], ZM00001d048190, ZM00001d043406, and ZM00001d047969 have the highest expression in female spikelet collected during the day compared with other samples of B73. These results indicate that ZM00001d048190, ZM00001d043406, and ZM00001d047969 are highly likely to be involved in regulating maize flowering time. These potential candidate genes provide a better understanding of the genetic basis of flowering time-related traits.

4.3. Factors Affecting Genomic Selection Prediction Accuracy

Genomic selection can improve breeding efficiency by capturing both minor and major QTLs of the target trait. The GS of flowering time-related traits has been reported in several studies [10,17,43]. The prediction accuracy was 0.64 for the anthesis date and 0.40 for ASI in 300 tropical and subtropical maize inbred lines [10]. These findings were consistent with our study. The prediction accuracy was 0.47 for DTT, 0.55 for DTP, and 0.55 for DTS using the 5-cross validation method.
Previous studies have shown that as the training population size increased, the prediction accuracy of GS rapidly improved and gradually stabilized [50,51]. Liu et al. (2023) showed that when the training population size increased from 60% to 90%, the increase in prediction accuracy of husk tightness could be negligible [52]. Our results showed a similar trend, where relatively high prediction accuracy was achieved with a training population size of 70%.
One of the factors limiting the application of genomic selection is the cost of genotyping. Our study showed that a good prediction accuracy of all three traits was obtained with 3000 SNPs, 0.46 for DTT, 0.52 for DTP, and 0.53 for DTS. Similar results were reported by previous studies. When the number of markers reached 5000, the prediction accuracy of common rust resistance reached a plateau in a GWAS panel [53]. These results can significantly reduce the cost of genotyping and promote the application of GS in breeding programs.
In 2014, Bernardo proposed a new approach for GS that integrates known marker-trait associations in prediction models to improve prediction accuracy [54]. In 2016, Spindel et al. improved the prediction accuracy by incorporating significantly associated loci detected by GWAS into the GS model [55]. The experimental results of Yang et al. in 2017 also demonstrated that incorporating markers with evolutionary information into GS models helps unravel the genetic basis of traits [56]. Liu et al. conducted GWAS and GS of Fusarium ear rot resistance in maize using CIMMYT germplasm. A relatively high prediction accuracy was obtained when integrating significantly associated SNPs detected from GWAS in GS [57]. In the DTMA panel, Yuan et al. [10] reported that the prediction accuracies evaluated by significant SNPs identified from GWAS were higher than those evaluated by all the SNPs for flowering time-related traits. In the present study, the prediction accuracy using 100 SNPs with the lowest p-value can be greatly improved compared to GS with all the markers. It is not that the higher the number of markers, the better the prediction effect [58]. Too many markers will not only bring great computational pressure but also reduce computational efficiency. Our study provides a theoretical basis for the application of GS in maize breeding.

5. Conclusions

In this study, the flowering time-related traits of the association population in maize exhibit rich phenotypic variation, with a relatively high genetic heritability, and strong positive correlations among these traits.
Through a dual-model whole-genome association analysis of maize flowering time-related traits, 14 significant SNP loci associated with the flowering period and 24 potential candidate genes closely related to flowering were detected. These loci play a crucial role in the genetic expression of flowering time-related traits.
Through genomic selection, it was found that the optimal training population size for three-flowering time-related traits is 70%, with a marker density of 3000 SNPs, resulting in high prediction accuracy for all three-flowering traits. When significant associated loci identified by two GWAS methods were incorporated into GS prediction, it significantly improved the accuracy of the prediction models. The significant loci identified by the FarmCPU model are more suitable for the genomic selection of flowering traits.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes15060740/s1, Table S1: The 21 maize hybrids used for DH population construction. Table S2: Genomic selection prediction accuracy estimated by different training population sizes. Table S3: Genomic selection prediction accuracy estimated by different marker densities. Table S4: Genomic selection prediction accuracy estimated by all the markers and significant SNPs.

Author Contributions

Conceptualization, Z.F., J.R. and P.W.; Data curation, Z.F., Y.Z., Y.M. and P.W.; Formal analysis, S.L., J.J., Y.Z. and Y.M.; Funding acquisition, J.R. and P.W.; Investigation, Z.F., S.L., J.J., Y.Z. and Y.M.; Methodology, Z.F., J.J., J.R. and P.W.; Project administration, J.R. and P.W.; Supervision, J.R. and P.W.; Visualization, S.L.; Writing—original draft, Z.F.; Writing—review and editing, J.R. and P.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Tianshan Innovation Team funding (grant number: 2022D14017); Xinjiang Uygur Autonomous Region Natural Science Foundation key project (grant number: 2022D01D34); Tianshan Yingcai (grant number: 2022TSYCJU0003); Xinjiang Uygur Autonomous Region Major Science and Technology Special Projects (grant number: 2022A02003-4); National Natural Foundation of China (grant numbers:32060484, U2003304, 32001561); XinJiang Agriculture Research System (grant number: XJARS-02).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available in this article and the Supplementary Materials.

Conflicts of Interest

The authors have no conflicts of interest to declare.

References

  1. Zhang, S.; Hu, R.; Peng, Z. Demand analysis and technological development directions in maize breeding. China Agric. Sci. 2000, 33, 1–8. [Google Scholar]
  2. Jung, C.; Muller, A.E. Flowering time control and applications in plant breeding. Trends Plant Sci. 2009, 14, 563–573. [Google Scholar] [CrossRef]
  3. Shi, J.; Wang, Y.; Wang, C.; Wang, L.; Zeng, W.; Han, G.; Qiu, C.; Wang, T.; Tao, Z.; Wang, K.; et al. Linkage mapping combined with GWAS revealed the genetic structural relationship and candidate genes of maize flowering time-related traits. BMC Plant Biol. 2022, 22, 328. [Google Scholar] [CrossRef]
  4. Buckler, E.S.; Holland, J.B.; Bradbury, P.J.; Acharya, C.B.; Brown, P.J.; Browne, C.; Ersoz, E.; Flint-Garcia, S.; Garcia, A.; Glaubitz, J.C.; et al. The genetic architecture of maize flowering time. Science 2009, 325, 714–718. [Google Scholar] [CrossRef]
  5. Liu, Y.H.; Yi, Q.; Hou, X.B.; Zhang, X.G.; Zhang, J.J.; Liu, H.M.; Hu, Y.F.; Huang, Y.B. Comparative quantitative trait locus mapping of maize flowering-related traits in an F2:3 and recombinant inbred line population. Genet. Mol. Res. 2016, 15. [Google Scholar] [CrossRef] [PubMed]
  6. Wu, X.; Liu, Y.; Lu, X.; Tu, L.; Gao, Y.; Wang, D.; Guo, S.; Xiao, Y.; Xiao, P.; Guo, X.; et al. Integration of GWAS, linkage analysis and transcriptome analysis to reveal the genetic basis of flowering time-related traits in maize. Front. Plant Sci. 2023, 14, 1145327. [Google Scholar] [CrossRef]
  7. Li, Z.; Li, K.; Yang, X.; Hao, H.; Jing, H.C. Combined QTL mapping and association study reveals candidate genes for leaf number and flowering time in maize. Theor. Appl. Genet. 2021, 134, 3459–3472. [Google Scholar] [CrossRef] [PubMed]
  8. Li, Y.X.; Li, C.; Bradbury, P.J.; Liu, X.; Lu, F.; Romay, C.M.; Glaubitz, J.C.; Wu, X.; Peng, B.; Shi, Y.; et al. Identification of genetic variants associated with maize flowering time using an extremely large multi-genetic background population. Plant J. 2016, 86, 391–402. [Google Scholar] [CrossRef] [PubMed]
  9. Meuwissen, T.H.E.; Hayes, B.J.; Goddard, M.E. Prediction of total genetic value using genome-wide dense marker maps. Genetics 2001, 157, 1819–1829. [Google Scholar] [CrossRef]
  10. Yuan, Y.; Cairns, J.E.; Babu, R.; Gowda, M.; Makumbi, D.; Magorokosho, C.; Zhang, A.; Liu, Y.; Wang, N.; Hao, Z.; et al. Genome-Wide Association Mapping and Genomic Prediction Analyses Reveal the Genetic Architecture of Grain Yield and Flowering Time Under Drought and Heat Stress Conditions in Maize. Front. Plant Sci. 2019, 9, 1919. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  11. Heslot, N.; Yang, H.P.; Sorrells, M.E.; Jannink, J.L. Genomic selection in plant breeding: A comparison of models. Crop Sci. 2012, 52, 146–160. [Google Scholar] [CrossRef]
  12. Millet, E.J.; Kruijer, W.; Coupel-Ledru, A.; Prado, S.A.; Cabrera-Bosquet, L.; Lacube, S.; Charcosset, A.; Welcker, C.; van Eeuwijk, F.; Tardieu, F. Genomic prediction of maize yield across European environmental conditions. Nat. Genet. 2019, 51, 952–956. [Google Scholar] [CrossRef]
  13. Zhang, A.; Wang, H.; Beyene, Y.; Semagn, K.; Liu, Y.; Cao, S.; Cui, Z.; Ruan, Y.; Burgueño, J.; San Vicente, F.; et al. Effect of trait heritability, training population size and marker density on genomic prediction accuracy estimation in 22 bi-parental tropical maize populations. Front. Plant Sci. 2017, 8, 1916. [Google Scholar] [CrossRef]
  14. Spindel, J.; Begum, H.; Akdemir, D.; Virk, P.; Collard, B.; Redona, E.; Atlin, G.; Jannink, J.L.; McCouch, S.R. Genomic selection and association mapping in rice (Oryza sativa): Effect of trait genetic architecture, training population composition, marker number and statistical model on accuracy of rice genomic selection in elite, tropical rice breeding lines. PLoS Genet. 2015, 11, e1004982. [Google Scholar]
  15. Chen, L.; Li, C.; Sargolzaei, M.; Schenkel, F.S. Impact of genotype imputation on the performance of GBLUP and Bayesian methods for genomic prediction. PLoS ONE 2014, 9, e101544. [Google Scholar] [CrossRef]
  16. Mora-Poblete, F.; Maldonado, C.; Henrique, L.; Uhdre, R.; Scapim, C.A.; Mangolim, C.A. Multi-trait and multi-environment genomic prediction for flowering traits in maize: A deep learning approach. Front. Plant Sci. 2023, 14, 1153040. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  17. Beyene, Y.; Gowda, M.; Pérez-Rodríguez, P.; Olsen, M.; Robbins, K.R.; Burgueño, J.; Prasanna, B.M.; Crossa, J. Application of Genomic Selection at the Early Stage of Breeding Pipeline in Tropical Maize. Front. Plant Sci. 2021, 12, 685488. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  18. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021. [Google Scholar]
  19. Hallauer, A.R.; Carena, M.J.; Miranda Filho, J.B. Quantitative Genetics in Maize Breeding; Iowa State University Press: Ames, IA, USA, 2010. [Google Scholar]
  20. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef]
  21. Li, H.; Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 2010, 26, 589–595. [Google Scholar] [CrossRef]
  22. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T.; et al. The variant call format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef] [PubMed]
  23. Wang, J.; Zhang, Z. GAPIT Version 3: Boosting Power and Accuracy for Genomic Association and Prediction. Genom. Proteom. Bioinform. 2021, 19, 629–640. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  24. Liu, X.; Huang, M.; Fan, B.; Buckler, E.S.; Zhang, Z. Iterative Usage of Fixed and Random Effect Models for Powerful and Efficient Genome-Wide Association Studies. PLoS Genet. 2016, 12, e1005767. [Google Scholar] [CrossRef]
  25. Endelman, J.B. Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome 2011, 4, 250–255. [Google Scholar] [CrossRef]
  26. Yang, N.; Yan, J. New genomic approaches for enhancing maize genetic improvement. Curr. Opin. Plant Biol. 2021, 60, 101977. [Google Scholar] [CrossRef] [PubMed]
  27. Ribaut, J.; Hoisington, D.A.; Deutsch, J.A.; Gonzalez-de-Leon, D. Identification of quantitative trait loci under drought conditions in tropical maize. 2. Flowering parameters and the anthesis-silking interval. Theor. Appl. Genet. 1996, 92, 905–914. [Google Scholar] [CrossRef]
  28. Rebai, A.; Blanchard, P.; Perret, D.; Vincourt, P. Mapping quantitative trait loci controlling silking date in a diallel cross among four lines of maize. Theor. Appl. Genet. 1997, 95, 451–459. [Google Scholar] [CrossRef]
  29. Khairallah, M.; Bohn, M.; Jiang, C.; Deutsch, J.A.; Jewell, D.C.; Mihm, J.A.; Melchinger, A.E.; Gonzalez-De-Leon, D.; Hoisington, D.A. Molecular mapping of QTL for southwestern corn borer resistance, plant height and flowering in tropical maize. Plant Breed. 1998, 117, 309–318. [Google Scholar] [CrossRef]
  30. Wang, H.; Xu, C.; Liu, X.; Guo, Z.; Xu, X.; Wang, S.; Xie, C.; Li, W.X.; Zou, C.; Xu, Y. Development of a multiple-hybrid population for genome-wide association studies: Theoretical consideration and genetic mapping of flowering traits in maize. Sci. Rep. 2017, 7, 40239. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  31. Shalmani, A.; Jing, X.Q.; Shi, Y.; Muhammad, I.; Zhou, M.R.; Wei, X.Y.; Chen, Q.Q.; Li, W.Q.; Liu, W.T.; Chen, K.M. Characterization of B-BOX gene family and their expression profiles under hormonal, abiotic and metal stresses in Poaceae plants. BMC Genom. 2019, 20, 27. [Google Scholar] [CrossRef]
  32. Talar, U.; Kiełbowicz-Matuk, A. Beyond Arabidopsis: BBX regulators in crop plants. Int. J. Mol. Sci. 2021, 22, 2906. [Google Scholar] [CrossRef]
  33. Valverde, F. CONSTANS and the evolutionary origin of photoperiodic timing of flowering. J. Exp. Bot. 2011, 62, 2453–2463. [Google Scholar] [CrossRef]
  34. Yang, Y.; Ma, C.; Xu, Y.; Wei, Q.; Imtiaz, M.; Lan, H.; Gao, S.; Cheng, L.; Wang, M.; Fei, Z. A zinc finger protein regulates flowering time and abiotic stress tolerance in chrysanthemum by modulating gibberellin biosynthesis. Plant Cell 2014, 26, 2038–2054. [Google Scholar] [CrossRef]
  35. Zhou, L.; Lan, W.; Jiang, Y.; Fang, W.; Luan, S. A calcium-dependent protein kinase interacts with and activates a calcium channel to regulate pollen tube growth. Mol. Plant 2014, 7, 369–376. [Google Scholar] [CrossRef]
  36. Kanamoto, H.; Hattan, J.; Takemura, M.; Yokota, A.; Kohchi, T. Molecular cloning and characterization of a gene coding for a putative receptor-like protein kinase with a Leucine-rich repeat expressed in inflorescence and root apices from Arabidopsis. Plant Biotechnol. 2002, 19, 113–120. [Google Scholar] [CrossRef]
  37. Hattan, J.; Kanamoto, H.; Takemura, M.; Yokota, A.; Kohchi, T. Molecular characterization of the cytoplasmic interacting protein of the receptor kinase IRK expressed in the inforescence and root apices of Arabidopsis. Biosci. Biotechnol. Biochem. 2004, 68, 2598–2606. [Google Scholar] [CrossRef]
  38. Tsuda, K.; Kurata, N.; Ohyanagi, H.; Hake, S. Genome-wide study of KNOX regulatory network reveals brassinosteroid catabolic genes important for shoot meristem function in rice. Plant Cell 2014, 26, 3488–3500. [Google Scholar] [CrossRef]
  39. Shu, K.; Luo, X.; Meng, Y.; Yang, W. Toward a molecular understanding of abscisic acid actions in floral transition. Plant Cell Physiol. 2018, 59, 215–221. [Google Scholar] [CrossRef]
  40. Lin, T.; Walworth, A.; Zong, X.; Danial, G.H.; Tomaszewski, E.M.; Callow, P.; Han, X.; Irina Zaharia, L.; Edger, P.P.; Zhong, G.Y.; et al. VcRR2 regulates chilling-mediated flowering through expression of hormone genes in a transgenic blueberry mutant. Hortic. Res. 2019, 6, 96. [Google Scholar] [CrossRef]
  41. Zhang, Z.; Hu, Q.; Liu, Y.; Cheng, P.; Cheng, H.; Liu, W.; Xing, X.; Guan, Z.; Fang, W.; Chen, S.; et al. Strigolactone represses the synthesis of melatonin, thereby inducing floral transition in Arabidopsis thaliana in an FLC-dependent manner. J. Pineal Res. 2019, 67, e12582. [Google Scholar] [CrossRef]
  42. Bao, S.; Hua, C.; Shen, L.; Yu, H. New insights into gibberellin signaling in regulating flowering in Arabidopsis. J. Integr. Plant Biol. 2020, 62, 118–131. [Google Scholar] [CrossRef]
  43. Choquette, N.E.; Weldekidan, T.; Brewer, J.; Davis, S.B.; Wisser, R.J.; Holland, J.B. Enhancing adaptation of tropical maize to temperate environments using genomic selection. G3 Genes Genomes Genet. 2023, 13, jkad141. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  44. Yan, P.; Li, W.; Zhou, E.; Xing, Y.; Li, B.; Liu, J.; Zhang, Z.; Ding, D.; Fu, Z.; Xie, H.; et al. Integrating BSA-Seq with RNA-Seq Reveals a Novel Fasciated Ear5 Mutant in Maize. Int. J. Mol. Sci. 2023, 24, 1182. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  45. Kim, C.Y.; Bove, J.; Assmann, S.M. Overexpression of wound-responsive RNA-binding proteins induces leaf senescence and hypersensitive-like cell death. New Phytol. 2008, 180, 57–70. [Google Scholar] [CrossRef]
  46. Schmid, M.; Davison, T.S.; Henz, S.R.; Pape, U.J.; Demar, M.; Vingron, M.; Schölkopf, B.; Weigel, D.; Lohmann, J.U. A gene expression map of Arabidopsis thaliana development. Nat Genet. 2005, 37, 501–506. [Google Scholar] [CrossRef]
  47. Liang, C.; Liu, L.; Zhang, Z.; Ze, S.; Pei, L.; Feng, L.; Ji, M.; Yang, B.; Zhao, N. Transcriptome analysis of critical genes related to flowering in Mikania micrantha at different altitudes provides insights for a potential control. BMC Genom. 2023, 24, 14. [Google Scholar] [CrossRef]
  48. Liu, X.; Wan, Y.; An, J.; Zhang, X.; Cao, Y.; Li, Z.; Liu, X.; Ma, H. Morphological, Physiological, and Molecular Responses of Sweetly Fragrant Luculia gratissima During the Floral Transition Stage Induced by Short-Day Photoperiod. Front. Plant Sci. 2021, 12, 715683. [Google Scholar] [CrossRef]
  49. Walley, J.W.; Sartor, R.C.; Shen, Z.; Schmitz, R.J.; Wu, K.J.; Urich, M.A.; Nery, J.R.; Smith, L.G.; Schnable, J.C.; Ecker, J.R.; et al. Integration of omic networks in a developmental atlas of maize. Science 2016, 353, 814–818. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  50. Heffner, E.L.; Jannink, J.L.; Iwata, H.; Souza, E.; Sorrells, M.E. Genomic selection accuracy for grain quality traits in biparental wheat populations. Crop Sci. 2011, 51, 2597–2606. [Google Scholar] [CrossRef]
  51. Crossa, J.; Beyene, Y.; Kassa, S.; Pérez, P.; Hickey, J.M.; Chen, C.; De Los Campos, G.; Burgueño, J.; Windhausen, V.S.; Buckler, E.; et al. Genomic prediction in maize breeding populations with genotyping-by-sequencing. G3 Genes Genomes Genet. 2013, 3, 1903–1926. [Google Scholar] [CrossRef]
  52. Liu, Y.; Ao, M.; Lu, M.; Zheng, S.; Zhu, F.; Ruan, Y.; Guan, Y.; Zhang, A.; Cui, Z. Genomic selection to improve husk tightness based on genomic molecular markers in maize. Front. Plant Sci. 2023, 14, 1252298. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  53. Ren, J.; Li, Z.; Wu, P.; Zhang, A.; Liu, Y.; Hu, G.; Cao, S.; Qu, J.; Dhliwayo, T.; Zheng, H.; et al. Genetic Dissection of Quantitative Resistance to Common Rust (Puccinia sorghi) in Tropical Maize (Zea mays L.) by Combined Genome-Wide Association Study, Linkage Mapping, and Genomic Prediction. Front. Plant Sci. 2021, 12, 692205. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  54. Bernardo, R. Genomewide selection when major genes are known. Crop Sci. 2014, 54, 68–75. [Google Scholar] [CrossRef]
  55. Spindel, J.E.; Begum, H.; Akdemir, D.; Collard, B.; Redoña, E.; Jannink, J.-L.; McCouch, S. Genome-wide prediction models that incorporate de novo GWAS are a powerful new tool for tropical rice improvement. Heredity 2016, 116, 395. [Google Scholar] [CrossRef]
  56. Yang, J.; Mezmouk, S.; Baumgarten, A.; Buckler, E.S.; Guill, K.E.; McMullen, M.D.; Mumm, R.H.; Ross-Ibarra, J. Incomplete dominance of deleterious alleles contributes substantially to trait variation and heterosis in maize. PLOS Genet. 2017, 13, e1007019. [Google Scholar] [CrossRef]
  57. Liu, Y.B.; Hu, G.H.; Zhang, A.; Loladze, A.; Hu, Y.; Wang, H.; Qu, J.; Zhang, X.; Olsen, M.; Vicente, F.S.; et al. Genome-wide association study and genomic prediction of Fusarium ear rot resistance in tropical maize germplasm. Crop J. 2021, 9, 325–341. [Google Scholar] [CrossRef]
  58. Nakaya, A.; Isobe, S.N. Will genomic selection be a practical method for plant breeding? Ann. Bot. 2012, 110, 1303–1316. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Figure 1. Correlation analysis of flowering time-related traits. DTT: days to tasseling; DTP: days to pollen-shedding; DTS: days to silking. ***: significant at the 0.001 level. The red lines are regression lines.
Figure 1. Correlation analysis of flowering time-related traits. DTT: days to tasseling; DTP: days to pollen-shedding; DTS: days to silking. ***: significant at the 0.001 level. The red lines are regression lines.
Genes 15 00740 g001
Figure 2. Manhattan and QQ plots of flowering time-related traits by dual-model. (A): days to tasseling of Blink model; (B): days to pollen-shedding of Blink model; (C): days to silking of Blink model; (D): days to tasseling of FarmCPU model; (E): days to pollen-shedding of FarmCPU model; (F): days to silking of FarmCPU model. In the manhattan plots, different colors represent different chromosomes. In the QQ plots, red lines represent that the expected −log10(P) is equal to the observed −log10(P).
Figure 2. Manhattan and QQ plots of flowering time-related traits by dual-model. (A): days to tasseling of Blink model; (B): days to pollen-shedding of Blink model; (C): days to silking of Blink model; (D): days to tasseling of FarmCPU model; (E): days to pollen-shedding of FarmCPU model; (F): days to silking of FarmCPU model. In the manhattan plots, different colors represent different chromosomes. In the QQ plots, red lines represent that the expected −log10(P) is equal to the observed −log10(P).
Genes 15 00740 g002
Figure 3. Venn plot and the number of significant SNPs detected by each model.
Figure 3. Venn plot and the number of significant SNPs detected by each model.
Genes 15 00740 g003
Figure 4. Genomic prediction accuracy of flowering time-related traits. (A) the effect of different training population size on prediction accuracy; (B) the effect of marker density on prediction accuracy; (C) the effect of different number of significant markers on prediction accuracy of GS in BLINK models; (D) the effect of different number of significant markers on prediction accuracy of GS in FarmCPU models.
Figure 4. Genomic prediction accuracy of flowering time-related traits. (A) the effect of different training population size on prediction accuracy; (B) the effect of marker density on prediction accuracy; (C) the effect of different number of significant markers on prediction accuracy of GS in BLINK models; (D) the effect of different number of significant markers on prediction accuracy of GS in FarmCPU models.
Genes 15 00740 g004aGenes 15 00740 g004b
Table 1. Statistical analysis of flowering time-related traits in multiple environments.
Table 1. Statistical analysis of flowering time-related traits in multiple environments.
Trait 1Environment 2RangeMean ± SDSKewnessKurtosisCV (%) 3
DTTSG50.0~72.059.94 ± 4.220.23−0.687.04
DF50.0~62.054.63 ± 2.470.44−0.824.53
LD50.0~68.056.41 ± 3.790.62−0.486.71
QT63.0~83.071.06 ± 3.480.000.294.90
Combined55.1~67.060.55 ± 2.270.36−0.423.75
DTPSG54.0~77.064.30 ± 4.200.33−0.356.53
DF53.0~75.062.80 ± 4.670.29−0.777.43
LD52.0~72.059.80 ± 3.710.44−0.316.21
QT63.0~86.072.90 ± 3.410.160.704.39
Combined58.2~73.065.00 ± 2.850.36−0.464.67
DTSSG54.0~78.065.87 ± 4.510.27−0.446.85
DF54.0~78.063.73 ± 5.450.42−0.888.55
LD52.0~74.061.29 ± 3.910.31−0.086.39
QT63.0~87.073.78 ± 3.780.780.805.13
Combined58.8~75.066.30 ± 3.160.39−0.474.76
1 Trait: DTT: days to tasseling; DTP: days to pollen-shedding; DTS: days to silking. 2 SG: Sangong Town experimental station, Changji City, Xinjiang, China; DF: Dafeng Town experimental station, Hutubi County, Xinjiang, China; LD: Ledong experimental station, Hainan Province, China; QT: Qitai County experimental station, Xinjiang, China; Combined: A combination of four environments. 3 CV: coefficient of variation.
Table 2. Variance analysis of flowering time-related traits.
Table 2. Variance analysis of flowering time-related traits.
Trait 1Variance Components 2H2 (%) 3
σ ^ g 2 σ ^ g e 2 σ ^ e 2
DTT13.61 ***9.22 ***4.26 ***82.75
DTP19.84 ***10.68 ***4.29 ***86.09
DTS23.98 ***14.12 ***4.92 ***85.26
1 DTT: days to tasseling; DTP: days to pollen-shedding; DTS: days to silking. 2 σ ^ g 2 : genotype variance; σ ^ g e 2 : genotype × environment interaction variance; σ ^ e 2 : error variance; ***: significant at the 0.001 level. 3 H2: broad-sense heritability.
Table 3. Significant SNPs for flowering time-related traits.
Table 3. Significant SNPs for flowering time-related traits.
ModelTrait 1SNP 2ChromosomePositionBinp-ValueR2 (%) 3
BLINKDTT1_19027550011902755001.065.52 × 10−111.41
9_15214063191521406319.072.48 × 10−916.92
10_1467677041014676770410.072.34 × 10−81.97
DTP3_19894607131989460713.073.23 × 10−82.64
3_22966531432296653143.092.68 × 10−73.90
8_11450998181145099818.042.26 × 10−73.70
9_15214063191521406319.073.98 × 10−1112.68
DTS8_786667938786667938.032.42 × 10−123.39
9_15214063191521406319.073.52 × 10−724.17
FarmCPUDTT3_19894607131989460713.072.52 × 10−84.34
9_15214063191521406319.077.35 × 10−1515.53
10_1496539211014965392110.071.27 × 10−73.55
DTS3_105372343105372343.032.67 × 10−81.02
8_770146388770146388.032.05 × 10−70.50
9_189462159189462159.021.42 × 10−73.74
9_14664696691466469669.063.90 × 10−91.18
9_15214063191521406319.078.05 × 10−135.82
10_1451582861014515828610.077.89 × 10−143.73
DTS3_19948538631994853863.071.35 × 10−77.16
8_786667938786667938.038.63 × 10−92.30
9_14664696691466469669.063.68 × 10−73.30
9_15214063191521406319.073.40 × 10−98.53
1 DTT: days to tasseling; DTP: days to pollen-shedding; DTS: days to silking. 2 SNP name, chromosome position. 3 R2: the phenotypic variance explained by each SNP.
Table 4. Putative candidate gene of flowering time-related traits.
Table 4. Putative candidate gene of flowering time-related traits.
SNPTraitGene IDAnnotation
1_190275500DTTZm00001d031445Ethylene-insensitive3-like protein
Zm00001d031444IRK-interacting protein
Zm00001d031443DUF2361 family protein
Zm00001d031447ABC transporter C family member 10
3_10537234DTPZm00001d039650Cytochrome P450 734A1
3_198946071DTT, DTPZm00001d043406Interactor of constitutive active ROPs 1
3_199485386DTSZm00001d043427Uncharacterized LOC109945323
Zm00001d043426Pentatricopeptide repeat-containing protein At3g58590
3_229665314DTPZm00001d044495Vacuolar-processing enzyme
Zm00001d044496Pentatricopeptide repeat-containing protein chloroplastic
8_77014638DTPZm00001d009708Calcium-dependent protein kinase 1
Zm00001d009707Protein ABIL1
8_78666793DTSZm00001d009738Tubulin γ-2 chain
8_114509981DTPZm00001d010425Coiled-coil domain-containing protein 25
9_146646966DTP, DTSZm00001d047969CONSTANS interacting protein 6
Zm00001d047968Zinc finger protein 7
9_152140631DTT, DTP, DTSZm00001d048190RNase L inhibitor protein-related
Zm00001d048191Membrane protein
9_18946215DTPZm00001d045323Double B-box zinc finger protein 11
Zm00001d045324Dual specificity protein phosphatase DSP8
10_145158286DTPZm00001d026397RNA-binding protein AKIP1
Zm00001d026396Formation of crista junctions protein 1
10_146767704DTTZm00001d026482Uncharacterized LOC111590308
10_149653921DTTZm00001d026668Receptor-like kinase TMK2
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fan, Z.; Lin, S.; Jiang, J.; Zeng, Y.; Meng, Y.; Ren, J.; Wu, P. Dual-Model GWAS Analysis and Genomic Selection of Maize Flowering Time-Related Traits. Genes 2024, 15, 740. https://doi.org/10.3390/genes15060740

AMA Style

Fan Z, Lin S, Jiang J, Zeng Y, Meng Y, Ren J, Wu P. Dual-Model GWAS Analysis and Genomic Selection of Maize Flowering Time-Related Traits. Genes. 2024; 15(6):740. https://doi.org/10.3390/genes15060740

Chicago/Turabian Style

Fan, Zehui, Shaohang Lin, Jiale Jiang, Yukang Zeng, Yao Meng, Jiaojiao Ren, and Penghao Wu. 2024. "Dual-Model GWAS Analysis and Genomic Selection of Maize Flowering Time-Related Traits" Genes 15, no. 6: 740. https://doi.org/10.3390/genes15060740

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop