Next Article in Journal
Goat miR-92a-3p Targets APOL6 Gene to Regulate the Differentiation of Intramuscular Precursor Adipocytes
Next Article in Special Issue
Estimation of Genetic Diversity and Number of Unique Genotypes of Cassava Germplasm from Burkina Faso Using Microsatellite Markers
Previous Article in Journal
A Comprehensive Evaluation of Generalizability of Deep Learning-Based Hi-C Resolution Improvement Methods
Previous Article in Special Issue
Genome-Wide Identification and Expression Analysis of the SBP-Box Gene Family in Loquat Fruit Development
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification and Analysis of Candidate Genes Associated with Yield Structure Traits and Maize Yield Using Next-Generation Sequencing Technology

by
Bartosz Nowak
1,
Agnieszka Tomkowiak
2,*,
Aleksandra Sobiech
2,
Jan Bocianowski
3,
Przemysław Łukasz Kowalczewski
4,
Julia Spychała
2 and
Tomasz Jamruszka
2
1
Smolice Plant Breeding Ltd., IHAR Group, Smolice 146, 63-740 Kobylin, Poland
2
Department of Genetics and Plant Breeding, Poznań University of Life Sciences, Dojazd 11, 60-632 Poznań, Poland
3
Department of Mathematical and Statistical Methods, Poznań University of Life Sciences, Wojska Polskiego 28, 60-637 Poznań, Poland
4
Department of Food Technology of Plant Origin, Poznań University of Life Sciences, Wojska Polskiego 31, 60-624 Poznań, Poland
*
Author to whom correspondence should be addressed.
Genes 2024, 15(1), 56; https://doi.org/10.3390/genes15010056
Submission received: 8 December 2023 / Accepted: 19 December 2023 / Published: 29 December 2023
(This article belongs to the Special Issue Advances in Genetics and Genomics of Plants)

Abstract

:
The main challenge of agriculture in the 21st century is the continuous increase in food production. In addition to ensuring food security, the goal of modern agriculture is the continued development and production of plant-derived biomaterials. Conventional plant breeding methods do not allow breeders to achieve satisfactory results in obtaining new varieties in a short time. Currently, advanced molecular biology tools play a significant role worldwide, markedly contributing to biological progress. The aim of this study was to identify new markers linked to candidate genes determining grain yield. Next-generation sequencing, gene association, and physical mapping were used to identify markers. An additional goal was to also optimize diagnostic procedures to identify molecular markers on reference materials. As a result of the conducted research, 19 SNP markers significantly associated with yield structure traits in maize were identified. Five of these markers (28629, 28625, 28640, 28649, and 29294) are located within genes that can be considered candidate genes associated with yield traits. For two markers (28639 and 29294), different amplification products were obtained on the electrophorograms. For marker 28629, a specific product of 189 bp was observed for genotypes 1, 4, and 10. For marker 29294, a specific product of 189 bp was observed for genotypes 1 and 10. Both markers can be used for the preliminary selection of well-yielding genotypes.

1. Introduction

Biological progress in modern plant breeding is defined as the development of new genotypes with traits relevant to agricultural practice [1,2]. These traits are associated with plant productivity and health, the suitability of produced materials for processing, as well as meeting the expectations of consumers (food) and non-consumable material users (e.g., cellulose-based resources). Maize breeding is aimed at developing high-yielding hybrid varieties [3,4,5]. This progress increasingly relies on the application of genomics and genetic engineering advancements [6].
Maize breeding worldwide is based on a wide range of research techniques in molecular genetics, primarily in two areas. The first is making selection decisions based on DNA nucleotide sequence analysis, and the second is expanding genetic variability in breeding populations through genetic modifications, primarily by developing plant organisms with foreign species genes [7,8]. This has not only created attractive prospects for achieving biological progress but also opened new possibilities for the utilization of not only maize but also other crops [5].
The introduction of molecular analysis for genetic markers has enabled the development of selection methodologies based on genetic markers—known as marker-assisted selection (MAS). The scope of applying this methodology is clearly dependent on the progress of knowledge about the genome of a given species [9,10]. The breakthrough in genomics was the completion of the first stage of sequencing the human genome and the announcement in February 2001 of two publications, published in “Nature” and “Science”, describing its organization [11,12]. This achievement has opened up a wide range of possibilities for characterizing the genomic sequences of crop plants [13,14,15,16]. The identification of genome sequences has revealed the presence of a vast number of differences in the studied genomes, with the majority being single-nucleotide polymorphisms (SNPs). They have been used to create a tool for a global genome analysis—the so-called SNP microarray—enabling the simultaneous determination of genotypes at thousands of loci [17]. SNP microarrays have become a key tool in the search for unknown polymorphisms (mutations) responsible for the occurrence of genetic diseases (e.g., monogenic) or predisposition to their development (e.g., complex diseases), as well as the phenotypic variability of production traits. The procedure based on SNP microarrays is commonly referred to as a GWAS (genome-wide association study) and is widely used primarily in human genomics and increasingly in animal and plant genomics, including maize [18].
To identify trait markers, it is necessary to have a large number of markers that densely and uniformly cover the genome. The density of this coverage depends on the linkage disequilibrium, which is species- and trait-dependent. As a result, such studies require markers obtained by next-generation sequencing methods such as GBS [17] or DArTseq and relatively large computational power. Two approaches are distinguished in association mapping: candidate gene association and genome-wide association studies (GWASs). Conducting a GWAS involves searching for trait–marker associations in the entire genome, assuming that there are markers showing linkage disequilibrium within the gene regulating the expression of a given trait [19]. Initially, association mapping performed in maize [20] did not consider population structure. This false-positive association was filtered out by the study of Pritchard, who included population structure in his maize study [21].
With the advancement of efficient marker methods and the availability of statistical software (Genstat 23), the number of analyzed species has increased, and DNA markers identified by this method are currently used in breeding practice [22,23,24]. Association mapping has proven useful for identifying the markers of traits whose quantitative loci explain a significant portion of trait variation [25]. However, this method has limited application for complex traits with weak effects of individual loci [26].
For several years, maize breeding worldwide has been supported by useful molecular markers. Many authors have stated in their publications that marker-assisted breeding accelerates yield growth not only in the USA but also in other countries, offering enormous potential to enhance maize productivity and germplasm value [27,28].
Therefore, the aim of the present study was to identify new markers linked to candidate genes determining grain yield using next-generation sequencing, gene association, and physical mapping, as well as to optimize the diagnostic procedures for the identification of 19 selected molecular markers.

2. Materials and Methods

2.1. Plant Material

The plant material included 64 inbred lines, 122 F1 hybrids, and 20 reference genotypes of maize (both high-yielding and low-yielding). The plant material was derived from Hodowla Roślin Smolice Sp. z o.o. Grupa IHAR (51°42′12″ N 17°10′10″ E) and Małopolskia Hodowla Roślin Sp. z o.o. (50°58′17″ N 16°55′50″ E). Part of the analyzed lines were flint grain lines of three different origins: F2 (a group related to the F2 line, bred at INRA in France from the Lacaune population), EP1 (a group related to the EP1 line, bred in Spain from the population derived from the Pyrenees), and German Flint. The second part of the plant material was dent-type kernels derived from various groups of origin from the United States: Iowa Stiff Stalk Synthetic (BSSS), Iowa Dent (ID), and Lancaster.

2.2. Methods

2.2.1. Open Field Experiment

This study was based on a total of 188 maize inbred and hybrid lines (G01.01, G01.02, G01.03, G01.04, G01.05, G01.06, G01.07, G01.08, G01.09, G01.10, G01.11, G01.12, G01.13, G01.14, G01.15, G01.16, G01.17, G01.18, G01.19, G01.20, G01.21, G02.01, G02.02, G02.03, G02.04, G02.05, G02.06, G02.07, G02.08, G02.09, G02.10, G02.11, G02.12, G02.13, G02.14, G02.15, G02.16, G02.17, G02.18, G02.19, G02.20, G02.21, G03.01, G03.02, G03.03, G03.04, G03.05, G03.06, G03.07, G03.08, G03.09, G03.10, G03.11, G03.12, G03.13, G03.14, G03.15, G03.16, G03.17, G03.18, G03.19, G03.20, G03.21, G04.01, G04.02, G04.03, G04.04, G04.05, G04.06, G04.07, G04.08, G04.09, G04.10, G04.11, G04.12, G04.13, G04.14, G04.15, G04.16, G04.17, G04.18, G04.19, G04.20, G04.21, G05.01, G05.02, G05.03, G05.04, G05.05, G05.06, G05.07, G05.08, G05.09, G05.10, G05.11, G05.12, G05.13, G05.14, G05.15, G05.16, G05.17, G05.18, G05.19, G05.20, G05.21, G06.01, G06.02, G06.03, G06.04, G06.05, G06.06, G06.07, G06.08, G06.09, G06.10, G06.11, G06.12, G06.13, G06.14, G06.15, G06.16, G06.17, K037, K038, K039, K040, K041, K042, K043, K044, K045, K046, K047, K048, K049, K050, K051, K052, K053, K054, K055, K056, K057, K058, K059, K060, K061, K062, K063, K064, K065, K066, K067, K068, K069, K070, K071, K072, K073, K074, K075, K076, K077, K078, K079, K080, K081, K082, K083, K084, K085, K086, K087, K088, K089, K090, K091, K092, K093, K094, K095, K096, K097, K098, K099, K100, S055, and S101). The experiment was set up in three replications in a randomized complete block design at two locations (Smolice and Kobierzyce) on plots of 10 m2. On average, there were 75 to 80 plants per plot. Morphological features were observed during the growing season and immediately after harvest. After harvest, the following yield structure traits were determined: ear length (cm), ear diameter (cm), kernel row length (cm), core length (cm), core diameter (cm), the number of rows, the number of kernels per row, and TSW (g). The grain yield (kg) from each plot was also analyzed.

2.2.2. Weather Conditions

The data used came from the weather station belonging to the Poznań University of Life Sciences. In 2022, the average rainfall in Smolice was 39.94 mm, lower than the multi-year average rainfall, which amounted to 48.27 mm. The wettest month in this year was July (102 mm of rainfall), and the greatest drought was recorded in October (3 mm of rainfall). The temperature in this year ranged from −2.1 °C in February to 21.6 °C in July. The average air temperature in this year was higher than the long-term average air temperature by 1.05 °C and amounted to 9.79 °C. In the second decade of April, no rainfall was recorded, which had an adverse effect on corn emergence, while in May, rainfall totals were higher, which resulted in soil sealing and uneven corn emergence. In 2022, the average rainfall in Kobierzyce was 40.93 mm, and similarly to Smolice, it was lower than the multi-year average rainfall by 7.37 mm. The highest rainfall was recorded in May (81 mm), and the lowest was recorded in October (12.4 mm). The average air temperature in 2022 was 10.3 °C in Kobierzyce, higher than the long-term average temperature by 1.42 °C (Figure 1). The warmest month was July (22.5 °C), and the coldest was February (−4.3 °C). In 2022, no weather anomalies were observed in Smolice and Kobierzyce. Despite the periodic drought, the weather was typical for these areas of Poland.

2.2.3. DNA Isolation

The isolation of DNA from 66 inbred lines and 122 F1 hybrids was conducted using a commercial reagent kit purchased from Promega. The samples of isolated DNA were subjected to next-generation sequencing. DNA isolation from 20 reference genotypes was carried out using a commercial reagent kit purchased from A&A Biotechnology. The concentration and purity of the isolated DNA samples were determined using a DS-11 spectrophotometer from DeNovix. The isolated DNA template was adjusted to an equal concentration of 100 ng μL−1 by diluting it with double-distilled water (ddH2O).

2.2.4. Genotyping

The methodology was taken from the work of Sobiech et al. [29]. DArTseq technology, which is based on next-generation sequencing, was applied for genotyping. The isolated DNA of the 188 maize plants tested (100 ng in 25 µL from each genotype) was sent in two 96-well Eppendorf plates for analysis to identify silicoDArT and SNPs. The analyses were performed at Diversity Arrays Technology, University of Canberra, Australia. Using methods proposed by Baird et al. [30], in the first step, the DNA template was digested with Ape KI, Pst I, and Msp I restriction enzymes to reduce genome complexity. The original GBS method used a single Ape KI enzyme (Elshire et al. [31]), and later, the method was expanded to include two additional enzymes: one infrequent cutter, Pst I, in combination with a frequent genomic DNA cutter, i.e., Msp I (Poland & Rife, [32]). Such an approach enabled the creation of a homogeneous library and the detection of most fragments associated with the infrequent cutting enzyme. A characteristic feature of the applied enzymes is their sensitivity to methylation, allowing for the filtering of non-coding regions and methylated repetitive sequences such as mobile elements. In the following step, genomic DNA fragments cleaved by restriction enzymes were ligated with adapters. Since the latter contains identifiers (so-called barcodes), the origin of each sample was strictly defined, and the identifiers met the appropriate criteria (Poland et al. [33]). The resulting PCR products were analyzed for size and constituted a genomic library, which was subsequently sequenced using a leading NGS platform (Kilian et al. [34]), Illumina, following the methodology detailed on the Diversity Arrays Technology’s website (https://www.diversityarrays.com/technology-and-resources/dartseq/) (URL accessed on 20 October 2023).

2.2.5. Association Mapping by Carrying Out a GWAS

Association mapping of 188 maize genotypes (66 lines and 122 hybrids) was performed for yield and yield structure traits by carrying out a GWAS. This mapping was conducted based on the results obtained from the genotyping and phenotyping analyses. The genotypic data were obtained from the DArTseq analysis, while the phenotypic data comprise results from field experiments concerning yield size and ear structure traits. The following yield structure traits were analyzed: ear length, ear diameter, core length, core diameter, the number of rows, the number of kernels per row, TSW, and yield per plot. Based on the GWAS, silicoDArT and SNP markers showing the highest significance level, i.e., those that were most strongly associated with yield structure traits and yield, were selected for further study.

2.2.6. Physical Mapping

Sequences of the silicoDArT and SNP markers, selected based on the GWAS, were subjected to BLAST (Basic Local Alignment Search Tool) analysis, which involved searching databases for sequences highly homologous to the selected silicoDArT and SNP markers. The following publicly available web browsers were used for this: CEPH Genotype database http://www.cephb.fr/en/cephdb/ (URL accessed on 20 October 2023), NCBI Map Viewer http://www.ncbi.nlm.nih.gov/projects/mapview/ (URL accessed on 20 October 2023), UCSC Genome Browser http://genome.ucsc.edu/ (URL accessed on 20 October 2023), Ensembl Map View http://ensembl.fugu-sg.org/common/helpview?kw=mapview;ref (URL accessed on 20 October 2023). The programs used helped identify the chromosomal locations of the retrieved sequences, similar to the analyzed sequences, and determine their physical location. The sequences of all genes located within the designated chromosomal region were further analyzed.

2.2.7. Functional Analysis of Gene Sequences

Our functional analysis was carried out using the Blast2GO program https://www.blast2go.com/ (URL accessed on 20 October 2023). The sequences of all genes located in the chromosomal regions identified from the BLAST analysis were subjected to analysis. The aim was to obtain information about the biological function of the gene sequences located in a designated chromosomal region.

2.2.8. Designing Primers for Identified SilicoDArT and SNPs Associated with Yield and Traits

The Primer 3 Plus program was used to design primers. The program can be accessed online and does not need to be downloaded or installed. Primer 3 Plus offers various options, ranging from various ways of specifying the sequence for which the primers are to be designed and general expectations regarding primer characteristics (size, the melting temperatures of both the primers and products, %GC, complementarity, etc.) to very detailed settings for primer parameters.

2.2.9. Polymerase Chain Reaction (PCR)

The identification of new markers linked to genes associated with the analyzed ear structure traits was carried out using a PCR. PCRs were conducted in a C1000 thermal cycler (Biorad, Hercules, CA, USA). The reaction mixture included the following: a 1 µ DNA template (50 ng µL−1), 4 µL polymerase buffer (5×), 1.6 µL dNTP (10 mM), 1.6 µL MgCl2 (25 mM), a 0.5 µL forward primer (10 µM), a 0.5 µL reverse primer (10 µM), 0.2 µL GoTaq polymerase (5 U µL−1), and 10.6 µL H2O. This composition was modified depending on the identified marker.
The PCR conditions were individually determined for each of the identified markers and differed in terms of primer annealing temperature, determined according to their respective melting temperatures. The following amplification temperature profile was used: initial denaturation for 5 min at 95 °C, followed by 35 cycles (denaturation for 45 s at 95 °C), primer annealing for 1 min (a different temperature was used for each pair of primers, consistent with their melting temperature), extension for 1 min at 72 °C, and final extension for 5 min at 72 °C before cooling to 4 °C.

2.2.10. Electrophoresis

The electrophoresis of the PCR products was conducted on a 2.5% agarose gel, with the addition of 1 µL of Midori Green solution, for 2 h at 100 V. The O’RangeRuler 50 bp (Fermentas, Waltham, MA, USA) was used as a reference to identify the sizes of the amplified products. The visualization of the separated DNA fragments was carried out under UV light and captured on digital images using the BIORAD gel visualization and documentation system.

3. Results

3.1. Field Experiment

The field experiment was established in two locations: Smolice (51°42′58.904″ N, 17°13′29.13″ E) and Kobierzyce (50°58′19.411″ N, 16°55′47.323″ E). This allowed us to perform and analyze biometric measurements of 188 maize genotypes. The measurement results were used for association mapping. After harvest, observations of the following yield structure traits were conducted: ear length, ear diameter, core length, core diameter, the number of rows, the number of grains per row, grain weight per ear, TSW, and yield per plot (Figure 2). Density plots were constructed to examine the distribution of all analyzed variables in both locations. The peaks in the density plots illustrate the ranges where the values of the analyzed traits are concentrated; e.g., for the majority of the analyzed genotypes in both locations (Smolice, Kobierzyce, Poland) ear length falls within the range of 17–19 cm. As demonstrated in the accompanying graphs, the distribution of the analyzed variables differed between the locations for core diameter, the number of rows, the number of grains per row, grain weight per ear, and yield per plot (Figure 2).

3.2. Phenotyping

An analysis of variance between the genotypes was performed for the recorded traits, and significant variation was observed for all traits. Our analysis of variance also showed statistically significant variation for all the studied traits between the locations where the field experiment was conducted. The line–location interaction was not significant, only being so for the number of rows (Table 1).
To determine the relationships between groups of variables in the dataset, i.e., observations of the yield structure traits and yield per plot in both locations, a multivariate technique was applied, namely canonical variate analysis. All traits were characterized by a normal distribution. The grouping of genotypes into lines and hybrids could be observed (Figure 3).
Correlations between the observed traits were analyzed in both locations, i.e., Smolice and Kobierzyce. It was demonstrated that in Smolice, the most strongly positively correlated traits were cob length and core length (97%), mass of grain from the cob and yield (92%), cob length and mass of grain from the cob (89%), and cob length and yield (87%) (Figure 4). In the case of Kobierzyce, the following traits were strongly positively correlated: cob and core length (98%), mass of grain from the cob and yield (97%), cob and core diameter (94%), cob diameter and mass of grain from the cob (93%), and cob diameter and yield (93%) (Figure 5).

3.3. DNA Isolation

DNA isolation from the 188 genotypes which were sent for next-generation sequencing was performed using a kit from Promega. The yield from individual isolations was high, ranging from 106 ng/μL for line 15 to 935.24 ng/μL for line 34. The purity of the isolated DNA was very good and averaged 1.8 for absorbance A260/A280. The exception were two samples: line 17, which had a purity of 1.54, and line 64, which had a purity of 2.39. Given the relatively high concentration of DNA obtained, the samples were adjusted to a uniform concentration of 100 ng μL−1, required for next-generation sequencing analyses.

3.4. Genotyping

A total of 92,614 molecular markers were obtained as a result of next-generation sequencing, including 60,436 SilicoDArT markers and 32,178 SNPs. MAF > 0.25 and a number of missing observations <10% were applied as criteria to determine the usefulness of the identified markers. This operation reduced the number of markers to 32,900 (26,234 DArTs and 6666 SNPs), which were subsequently used for association mapping (Table 2). The majority of SNP and Silico DArT markers were associated with yield (18,352—Kobierzyce and 18,751—Smolice), mass of grain from the cob (17,685—Kobierzyce and 18,314—Smolice), and core diameter (17,787—Kobierzyce and 16,018—Smolice). Few markers were associated with the number of rows of grain (12,757—Kobierzyce and 11,714—Smolice) and the number of grains in row (13,265—Kobierzyce and 13,981—Smolice) (Table 2). In order to narrow down the number of markers for physical mapping, 20 markers were selected from among all the significant ones that were associated with the same traits in both locations (Kobierzyce and Smolice).
Based on the identified SNP and SilicoDArT molecular markers, a dendrogram of genetic similarity was constructed for the 188 analyzed genotypes (Figure 6). The dendrogram very clearly shows two distinct similarity groups. The first group consisted of 65 inbred lines from HR in Kobierzyce, while the second group included 122 analyzed hybrids and 1 inbred line. Such an ideal clustering demonstrates the usefulness of SNP and silico DArT markers for grouping genotypes by genetic similarity.
A total of 20 of the 32,900 markers (26,234 DArTs and 6666 SNPs) significantly associated with the analyzed yield structure traits and yield were selected. These markers were significant for the same traits in both locations (Kobierzyce and Smolice) (Table 3). An attempt was also made to determine the location of selected SNP markers. Unfortunately, it was not possible to determine the position of one marker. The next step was to design primers for the identification of the 19 selected and localized markers. After determining the location of the 19 selected SNPs, an attempt was made to design primers for their identification. Primer sequences are shown in Table 4.

3.5. The Identification of New Molecular Markers Associated with Yield and Yield Structure Traits Using Polymerase Chain Reaction (PCR)

Of the 19 markers selected, 2 (28629 and 29294) produced different amplification products on the electropherograms. The first 10 genotypes based on field observations are classified as the highest yielding while genotypes numbered 11–20 are classified as the lowest yielding. For marker 28629, a specific product of 189 bp was observed for genotypes 1, 4, and 10. Non-specific products of 200 bp were obtained for the remaining genotypes (Figure 7). This marker is located on chromosome 8, 3130 bp upstream of “protein senescence-associated gene 21, mitochondrial” and 91 bp downstream of “uncharacterized protein loc100382335”. For marker 29294, a specific product of 189 bp was observed for genotypes 1 and 10. Non-specific products of 200 bp were obtained for the remaining genotypes (Figure 8). This marker is located on chromosome 5 within the hydroxyproline o-galactosyltransferase galt6 gene. Both markers will undergo further testing on a larger number of extreme genotypes to be used for the initial selection of high-yielding genotypes.

4. Discussion

The breeding of heterozygous maize varieties consistently aims to harness the potential of hybrid vigor, shorten the breeding process (e.g., by utilizing doubled haploid lines), and improve and reduce the costliness of seed production. The priority for all breeders is to obtain high-yielding and disease-resistant maize varieties [35].
In the present study, an analysis of variance was performed based on phenotypic observations (related to yield and yield structure traits). For all traits, significant variation was observed between the genotypes. Our analysis of variance also showed statistically significant variation for all studied traits between locations where the field experiment was established. The interaction of line and trial location was not significant only for the number of rows. To determine the relationships between the groups of variables in the dataset, i.e., observations of the yield structure traits and yield per plot in both locations, a multivariate technique was applied, namely canonical variate analysis. All traits were characterized by a normal distribution. The grouping of genotypes into lines and hybrids could be observed.
Phenotypic analysis, unfortunately, does not allow for the selection of parental components for heterosis crosses because traditional methods used in heterosis breeding are insufficient in the era of technological progress. In light of this challenge, modern agriculture has led to the harnessing of high-throughput techniques for analyzing the genomes of crop plants for their subsequent use in improving existing varieties, including maize [36]. Such a genomics-focused approach allows one to obtain information about coding regions that provide information on protein structure (genomic), as well as intergenic regions; both types can be successfully applied to improve crop plant varieties [37].
The introduction of next-generation sequencing (NGS) methods has enabled the elucidation of nucleotide sequences in plants other than model organisms such as Arabidopsis thaliana, which are characterized by a small genome. Crop species of interest mainly include maize, coffee, or sugarcane [38].
In recent years, many authors have attempted to identify molecular markers linked to functionally important traits in maize. Bocianowski et al. [39] used NGS technology and associative mapping to identify markers related to the heterosis effect in maize. Using the same methods, Sobiech et al. [40] identified markers linked to the resistance of maize plants to fusarium. In turn, Tomkowiak et al. [41] identified six SNP markers (1818; 14506; 2317; 3233; 11657; 12812) located inside genes, on chromosomes 8, 9, 7, 3, 5 and 1, related to the amount of yield in corn. The authors of [42] identified four genes—sucrose synthase 4 isoform ×2 gene, phosphoinositide phosphatase sac7 isoform ×1 gene, putative SET domain containing protein family isoform ×1 gene, and grx_c8–glutaredoxin subgroup iii—which can significantly regulate the level of seed vigor and germination of maize seeds.
In the present study, a total of 92,614 molecular markers were obtained utilizing next-generation sequencing, including 60,436 SilicoDArT markers and 32,178 SNPs. MAF > 0.25 and a number of missing observations <10% were applied as criteria to determine the usefulness of the identified markers. In this way, 32,900 markers (26,234 DArTs and 6666 SNPs) were obtained and applied for association mapping.
NGS technology is utilized for sequencing genomes and transcriptomes, studying protein–DNA/RNA interactions, assessing methylation levels, discovering new DNA polymorphisms, and conducting meta-genomic studies [43]. This technology allows for the analysis of various DNA fragments represented by multiple copies during a single reaction, library preparation, and the subsequent collection of gigabases of genomic data from a single sequencing run [44,45]. This not only increases the number of samples examined but also enhances the reliability of the obtained sequencing results. This is particularly valuable when the variation between specific genotypes is small [36]. The costs and time required for sequencing reactions, when calculated per unit of obtained information, are significantly lower compared to the costs of analyses conducted using traditional capillary sequencers [46].
Another sequencing strategy, primarily used to study the interactions between plants and their environment, is the use of NGS methods to characterize the plant transcriptome in different physiological states. The analysis of cDNA sequences provides information about expressed sequence tags (ESTs), which are transcribed in specific tissues and organs, and despite some limitations, these data are very useful for breeders [36,47,48]. Next-generation sequencing techniques also enable qualitative and quantitative analyses of genes expressed under different conditions, and the results of these analyses are used for association mapping [49,50,51,52].
In the present study, association mapping was carried out, and it was found that the highest number of SNPs and silico DArT markers were associated with yield per plot (18,352—Kobierzyce and 18,751—Smolice), grain weight per ear (17,685—Kobierzyce and 18,314—Smolice), and core diameter (17,787—Kobierzyce and 16,018—Smolice). The fewest markers were associated with the number of rows (12,757—Kobierzyce and 11,714—Smolice) and the number of grains per row (13,265—Kobierzyce and 13,981—Smolice). To narrow down the number of markers for physical mapping, 19 markers were selected from among all significant ones that were associated with the same traits in both locations (Kobierzyce and Smolice). These markers were tested on high- and low-yielding reference genotypes. As a result of testing, two markers (28629 and 29294) that differentiated the tested genotypes were selected. For marker 28629, a specific product of 189 bp was observed for genotypes 1, 4, and 10. For marker 29294, a specific product of 189 bp was observed for genotypes 1 and 10.
Thanks to their specific features, SNP and SilicoDArT markers find many applications, including in the creation of molecular linkage maps and the identification of quantitative trait loci (QTLs) responsible for the inheritance of quantitative traits. Additionally, they are used for origin analysis, fingerprinting of cultivated varieties, in studies on population genetic diversity and gene flow, and plant evolutionary genetics [52].
In the present study, we identified 19 SNP markers that are significantly associated with yield structure traits in maize. Five of these markers (28629, 28625, 28640, 28649, and 29294) are located within genes that can be considered candidate genes related to yield traits. Marker 28629 is located on chromosome 8 within the leucine-rich repeat receptor-like protein kinase gene. Receptor-like kinases (RLKs) are a diverse group of transmembrane proteins characterized with a ligand-binding domain to receive signal molecules, a membrane-spanning domain to anchor the protein, and a cytoplasmic protein kinase domain to transduce signals downstream [53]. According to reports in the literature, the first RLK was isolated from maize, and then numerous RLKs were identified in over 20 plant species [54]. RLKs can indirectly influence the yield of maize because they mediate many signaling messages on the cell surface and act as key regulators during developmental processes [55,56,57]. Genetic and biochemical studies conducted by other scientists have also shown that plant LRR-RLKs play an important role in various processes during growth and development [58,59]. CLV and RPK2 have also been found to be essential receptor-like kinases in the formation and maintenance of the shoot apical meristem [60,61]. Another significant marker was SNP 28625, located on chromosome 1 within the arabinosyltransferase (arad1) gene. As reported in [62], gene-encoding arabinosyltransferase ARAD1 catalyzes the polymerization of arabinose into the arabinan of arabinogalactan during secondary wall formation in loblolly pine. Research indicates a connection between arabinogalactan proteins and lignin biosynthesis for cell wall formation. It is well known that lignin occurs in the cell wall and is necessary for the transport of water and aqueous nutrients in plant stems. The polysaccharide components of plant cell walls are hydrophilic and therefore permeable to water, while lignin is hydrophobic. The cross-linking of polysaccharides by lignin prevents the absorption of water by the cell wall. Consequently, lignin enables the plant’s vascular tissue to conduct water efficiently, which is very important for the proper growth and development of the plant and its yield [63]. Therefore, there is a high probability that the ARAD1 arabinosyltransferase gene may influence the yield of maize. Research conducted in recent years has shown that cell walls can play an important role in intercellular communication, not only as a pathway for the transport of signaling molecules, or as an area of the cell in which the receptor domains of membrane proteins operate, but also as a source of signals that influence the functioning of cells [64,65,66]. The third significant SNP (28640) was located on chromosome 9 inside the sugar phosphate gene. Sucrose phosphate synthase (SPS) is a key enzyme in the sugar metabolic pathways in plants. SPS catalyzes the conversion of fructose-6-phosphate and uridine diphosphate-glucose (UDP-glucose) into sucrose-6-phosphate which is a substrate in the synthesis of sucrose [67,68]. SPS exists in many isoforms which may play various functional roles and are specific to different tissues and stages of development. Particularly, knowledge about the relationship of the localization of the individual forms of these enzymes and their role in plant responses to various stresses is highly desirable. It has been demonstrated, for instance, that several maize SPS sequences are most strongly expressed in the leaves and less intensively in pollen and kernels, and this is related to reactions to different abiotic factors [69]. SUS isoforms have been found in different areas of cell walls [70,71]). In maize, the specificity of the function of individual SUS isoforms in both cytoplasmic and membrane-associated sucrose degradation was emphasized in [72]. The fourth significant SNP (28649) was located on chromosome 4 within the gene annotated as ubiquitin carboxyl-terminal hydrolase 15 isoform x2. Ubiquitin carboxyl-terminal hydrolases (UCHs) belong to an enzymatic subclass of deubiquitinating enzymes (DUBs). The function of this gene in maize has not been described in the literature. The fifth significant SNP (29294) was located on chromosome 5 within the hydroxyproline o-galactosyltransferase galt6 gene. According to our analyses and data in the literature, this gene may be related to the yield of maize because, according to Kaur et al. (2023), hydroxyproline o-galactosyltransferase galt6 plays an important role in various stages of plant growth and development [73]. Moreira et al. (2023) have reported that arabinogalactan proteins (AGPs) are hydroxyproline-rich, sugar-rich glycoproteins widely distributed in the plant kingdom [74]. The synthesis of their complex carbohydrates is initiated by a family of hydroxyproline galactosyltransferase (Hyp-GALT) enzymes, which add the first galactose to Hyp residues in the protein backbone.

5. Conclusions

In the present study, we identified 19 SNP markers that are significantly associated with yield structure traits in maize. Five of these markers are located within genes that can be considered candidate genes related to yield traits: marker 28629, located on chromosome 8 within the leucine rich repeat receptor like protein kinase gene; marker 28625, located on chromosome 1 within the arabinosyltransferase (arad1) gene; marker 28640, located on chromosome 9 inside the sugar phosphate gene; marker 28649, located in ubiquitin carboxyl-terminal hydrolase 15 isoform x2ubiquitin carboxyl-terminal hydrolase 15 isoform x3; marker 29294, located in hydroxyproline o-galactosyltransferase galt6. These genes will be subjected to expression analysis in the next stage of our research. The identified markers were tested on high- and low-yielding reference genotypes. The testing was aimed at selecting markers that could be used to select high-yielding genotypes. In the case of two markers (28629 and 29294), different amplification products were obtained on the electrophorograms. For marker 28629, a specific product of 189 bp was observed in genotypes 1, 4, and 10. For marker 29294, a specific product of 189 bp was observed in genotypes 1 and 10. Both markers will be further tested on more extreme genotypes so that they can be applied in the preliminary selection of high-yielding genotypes.

Author Contributions

Conceptualization, A.T. and A.S.; methodology, A.T. and A.S.; software (Genstat 23), J.B.; validation, B.N., A.T., and J.B.; formal analysis, A.S.; investigation, A.T.; resources, A.T.; data curation, B.N.; writing—original draft preparation, A.T.; writing—review and editing, J.S., T.J., and P.Ł.K.; visualization, J.B.; supervision, A.T.; project administration, A.T. All authors have read and agreed to the published version of the manuscript.

Funding

The research presented in this publication was financed as part of the research project termed “Analysis of genetic determinants of heterosis effect and fusarium resistance in maize (Zea mays L.)”. PL: “Analiza genetycznych uwarunkowań związanych z efektem heterozji oraz odpornością na fuzarium u kukurydzy (Zea mays L.)”. The project was implemented under a grant from the Ministry of Agriculture and Rural Development, “Biological progress in plant production (recruitment 2020)”. Duration of the project: 2021–2026.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

Author Bartosz Nowak was employed by the company Smolice Plant Breeding Sp. Z O. O. IHAR Group. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Andrade-Sanchez, P.; Gore, M.A.; Heun, J.T.; Thorp, K.R.; Carmo-Silva, A.E.; French, A.N.; Salvucci, M.E.; White, J.W.; Andrade-Sanchez, P.; Gore, M.A.; et al. Development and Evaluation of a Field-Based High-Throughput Phenotyping Platform. Funct. Plant Biol. 2013, 41, 68–79. [Google Scholar] [CrossRef] [PubMed]
  2. Baldazzi, V.; Bertin, N.; Génard, M.; Gautier, H.; Desnoues, E.; Quilot-Turion, B. Challenges in Integrating Genetic Control in Plant and Crop Models. In Crop Systems Biology: Narrowing the Gaps between Crop Modelling and Genetics; Yin, X., Struik, P.C., Eds.; Springer International Publishing: Cham, Switzerland, 2016; pp. 1–31. ISBN 978-3-319-20562-5. [Google Scholar]
  3. Millet, E.J.; Kruijer, W.; Coupel-Ledru, A.; Alvarez Prado, S.; Cabrera-Bosquet, L.; Lacube, S.; Charcosset, A.; Welcker, C.; van Eeuwijk, F.; Tardieu, F. Genomic Prediction of Maize Yield across European Environmental Conditions. Nat. Genet. 2019, 51, 952–956. [Google Scholar] [CrossRef] [PubMed]
  4. Seifert, F.; Thiemann, A.; Schrag, T.A.; Rybka, D.; Melchinger, A.E.; Frisch, M.; Scholten, S. Small RNA-Based Prediction of Hybrid Performance in Maize. BMC Genom. 2018, 19, 371. [Google Scholar] [CrossRef] [PubMed]
  5. Cooper, M.; Technow, F.; Messina, C.; Gho, C.; Totir, L.R. Use of Crop Growth Models with Whole-Genome Prediction: Application to a Maize Multienvironment Trial. Crop. Sci. 2016, 56, 2141–2156. [Google Scholar] [CrossRef]
  6. Bevan, M.W.; Uauy, C.; Wulff, B.B.H.; Zhou, J.; Krasileva, K.; Clark, M.D. Genomic Innovation for Crop Improvement. Nature 2017, 543, 346–354. [Google Scholar] [CrossRef] [PubMed]
  7. Xu, Y.; Li, P.; Yang, Z.; Xu, C. Genetic Mapping of Quantitative Trait Loci in Crops. Crop. J. 2017, 5, 175–184. [Google Scholar] [CrossRef]
  8. Zenke-Philippi, C.; Frisch, M.; Thiemann, A.; Seifert, F.; Schrag, T.; Melchinger, A.E.; Scholten, S.; Herzog, E. Transcriptome-Based Prediction of Hybrid Performance with Unbalanced Data from a Maize Breeding Programme. Plant Breed. 2017, 136, 331–337. [Google Scholar] [CrossRef]
  9. Eathington, S.R.; Crosbie, T.M.; Edwards, M.D.; Reiter, R.S.; Bull, J.K. Molecular Markers in a Commercial Breeding Program. Crop. Sci. 2007, 47, S-154–S-163. [Google Scholar] [CrossRef]
  10. Xu, C.; Ren, Y.; Jian, Y.; Guo, Z.; Zhang, Y.; Xie, C.; Fu, J.; Wang, H.; Wang, G.; Xu, Y.; et al. Development of a Maize 55 K SNP Array with Improved Genome Coverage for Molecular Breeding. Mol. Breed. 2017, 37, 20. [Google Scholar] [CrossRef]
  11. International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature 2001, 409, 860–921. [Google Scholar] [CrossRef]
  12. Venter, J.C.; Adams, M.D.; Myers, E.W.; Li, P.W.; Mural, R.J.; Sutton, G.G.; Smith, H.O.; Yandell, M.; Evans, C.A.; Holt, R.A.; et al. The sequence of the human genome. Science 2001, 291, 1304–1351. [Google Scholar] [CrossRef] [PubMed]
  13. Cerrudo, D.; Cao, S.; Yuan, Y.; Martinez, C.; Suarez, E.A.; Babu, R.; Zhang, X.; Trachsel, S. Genomic Selection Outperforms Marker Assisted Selection for Grain Yield and Physiological Traits in a Maize Doubled Haploid Population Across Water Treatments. Front. Plant Sci. 2018, 9, 366. [Google Scholar] [CrossRef] [PubMed]
  14. Crossa, J.; Pérez-Rodríguez, P.; Cuevas, J.; Montesinos-López, O.; Jarquín, D.; de los Campos, G.; Burgueño, J.; González-Camacho, J.M.; Pérez-Elizalde, S.; Beyene, Y.; et al. Genomic Selection in Plant Breeding: Methods, Models, and Perspectives. Trends Plant Sci. 2017, 22, 961–975. [Google Scholar] [CrossRef] [PubMed]
  15. Heslot, N.; Jannink, J.-L.; Sorrells, M.E. Perspectives for Genomic Selection Applications and Research in Plants. Crop. Sci. 2015, 55, 1–12. [Google Scholar] [CrossRef]
  16. Shikha, M.; Kanika, A.; Rao, A.R.; Mallikarjuna, M.G.; Gupta, H.S.; Nepolean, T. Genomic Selection for Drought Tolerance Using Genome-Wide SNPs in Maize. Front. Plant Sci. 2017, 8, 550. [Google Scholar] [CrossRef] [PubMed]
  17. Elshire, R.J.; Glaubitz, J.C.; Sun, Q.; Poland, J.A.; Kawamoto, K.; Buckler, E.S.; Mitchell, S.E. A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species. PLoS ONE 2011, 6, e19379. [Google Scholar] [CrossRef]
  18. Kremling, K.A.G.; Diepenbrock, C.H.; Gore, M.A.; Buckler, E.S.; Bandillo, N.B. Transcriptome-Wide Association Supplements Genome-Wide Association in Zea Mays. G3 Genes Genomes Genet. 2019, 9, 3023–3033. [Google Scholar] [CrossRef] [PubMed]
  19. Rafalski, J.A. Association Genetics in Crop Improvement. Curr. Opin. Plant Biol. 2010, 13, 174–180. [Google Scholar] [CrossRef]
  20. Bar-Hen, A.; Charcosset, A.; Bourgoin, M.; Guiard, J. Relationship between Genetic Markers and Morphological Traits in a Maize Inbred Lines Collection. Euphytica 1995, 84, 145–154. [Google Scholar] [CrossRef]
  21. Pritchard, J.K. Deconstructing Maize Population Structure. Nat. Genet. 2001, 28, 203–204. [Google Scholar] [CrossRef]
  22. Guo, Z.; Magwire, M.M.; Basten, C.J.; Xu, Z.; Wang, D. Evaluation of the Utility of Gene Expression and Metabolic Information for Genomic Prediction in Maize. Theor. Appl. Genet. 2016, 129, 2413–2427. [Google Scholar] [CrossRef] [PubMed]
  23. Thomas, D. Gene–Environment-Wide Association Studies: Emerging Approaches. Nat. Rev. Genet. 2010, 11, 259–272. [Google Scholar] [CrossRef] [PubMed]
  24. Lamsal, A.; Welch, S.M.; White, J.W.; Thorp, K.R.; Bello, N.M. Estimating Parametric Phenotypes That Determine Anthesis Date in Zea Mays: Challenges in Combining Ecophysiological Models with Genetics. PLoS ONE 2018, 13, e0195841. [Google Scholar] [CrossRef] [PubMed]
  25. Soto-Cerda, B.; Cloutier, S. Association Mapping in Plant Genomes. In Genetic Diversity in Plants; IntechOpen: London, UK, 2012; ISBN 978-953-51-0185-7. [Google Scholar]
  26. Hall, D.; Tegstrom, C.; Ingvarsson, P.K. Using Association Mapping to Dissect the Genetic Basis of Complex Traits in Plants. Brief. Funct. Genom. 2010, 9, 157–165. [Google Scholar] [CrossRef] [PubMed]
  27. Dhliwayo, T.; Pixley, K.; Menkir, A.; Warburton, M. Combining Ability, Genetic Distances, and Heterosis among Elite CIMMYT and IITA Tropical Maize Inbred Lines. Crop. Sci. 2009, 49, 1201–1210. [Google Scholar] [CrossRef]
  28. Chander, S.; Guo, Y.Q.; Yang, X.H.; Zhang, J.; Lu, X.Q.; Yan, J.B.; Song, T.M.; Rocheford, T.R.; Li, J.S. Using Molecular Markers to Identify Two Major Loci Controlling Carotenoid Contents in Maize Grain. Theor. Appl. Genet. 2008, 116, 223–233. [Google Scholar] [CrossRef] [PubMed]
  29. Sobiech, A.; Tomkowiak, A.; Nowak, B.; Bocianowski, J.; Wolko, Ł.; Spychała, J. Associative and physical mapping of markers related to Fusarium in maize resistance, obtained by next-generation sequencing (NGS). Int. J. Mol. Sci. 2022, 23, 6105. [Google Scholar] [CrossRef]
  30. Baird, N.A.; Etter, P.D.; Atwood, T.S.; Curey, M.C.; Shiver, A.L.; Lewis, Z.A.; Selker, E.U.; Cresko, W.A.; Johnson, E.A. Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers. PLoS ONE 2008, 3, e3376. [Google Scholar] [CrossRef]
  31. Türkoğlu, A.; Haliloğlu, K.; Mohammadi, S.A.; Öztürk, A.; Bolouri, P.; Özkan, G.; Bocianowski, J.; Pour-Aboughadareh, A.; Jamshidi, B. Genetic Diversity and Population Structure in Türkiye Bread Wheat Genotypes Revealed by Simple Sequence Repeats (SSR) Markers. Genes 2023, 14, 1182. [Google Scholar] [CrossRef]
  32. Poland, J.A.; Rife, T.W. Genotyping-by-sequencing for plant breeding and genetics. Plant Genome 2012, 5, 92–102. [Google Scholar] [CrossRef]
  33. Poland, J.; Endelman, J.; Dawson, J.; Rutkoski, J.; Wu, S.; Manes, Y.; Dreisigacker, S.; Crossa, J.; Sánchez-Villeda, H.; Sorrells, M.; et al. Genomic selection in wheat breeding using genotyping-by-sequencing. Plant Genome 2012, 5, 103. [Google Scholar] [CrossRef]
  34. Kilian, B.; Graner, A. NGS technologies for analyzing germplasm diversity in genebanks. Briefinds Funct. Genom. 2012, 11, 38–50. [Google Scholar] [CrossRef]
  35. Wan, W.; Kim, S.; Castel, B.; Charoennit, N.; Chae, E. Genetics of Autoimmunity in Plants: An Evolutionary Genetics Perspective. New Phytol. 2021, 229, 1215–1233. [Google Scholar] [CrossRef] [PubMed]
  36. Liu, H.; Niu, Y.; Gonzalez-Portilla, P.J.; Zhou, H.; Wang, L.; Zuo, T.; Qin, C.; Tai, S.; Jansen, C.; Shen, Y.; et al. An Ultra-High-Density Map as a Community Resource for Discerning the Genetic Basis of Quantitative Traits in Maize. BMC Genom. 2015, 16, 1078. [Google Scholar] [CrossRef] [PubMed]
  37. Andorf, C.M.; Cannon, E.K.; Portwood, J.L.; Gardiner, J.M.; Harper, L.C.; Schaeffer, M.L.; Braun, B.L.; Campbell, D.A.; Vinnakota, A.G.; Sribalusu, V.V.; et al. MaizeGDB Update: New Tools, Data and Interface for the Maize Model Organism Database. Nucleic Acids Res. 2016, 44, D1195–D1201. [Google Scholar] [CrossRef] [PubMed]
  38. Austin, R.S.; Vidaurre, D.; Stamatiou, G.; Breit, R.; Provart, N.J.; Bonetta, D.; Zhang, J.; Fung, P.; Gong, Y.; Wang, P.W.; et al. Next-Generation Mapping of Arabidopsis Genes. Plant J. 2011, 67, 715–725. [Google Scholar] [CrossRef] [PubMed]
  39. Sobiech, A.; Tomkowiak, A.; Bocianowski, J.; Szymańska, G.; Nowak, B.; Lenort, M. Identification and analysis of candidate genes associated with maize fusarium cob resistance using next-generation sequencing technology. Int. J. Mol. Sci. 2023, 24, 16712. [Google Scholar] [CrossRef] [PubMed]
  40. Bocianowski, J.; Tomkowiak, A.; Bocianowska, M.; Sobiech, A. The use of DArTseq technology to identify markers related to the heterosis effects in selected traits in maize. Curr. Issues Mol. Biol. 2023, 45, 2644–2660. [Google Scholar] [CrossRef]
  41. Tomkowiak, A.; Nowak, B.; Sobiech, A.; Bocianowski, J.; Wolko, Ł.; Spychała, J. The use of DArTseq technology to identify new SNP and SilicoDArT markers related to the yield-related traits components in maize. Genes 2022, 13, 848. [Google Scholar] [CrossRef]
  42. Nowak, B.; Tomkowiak, A.; Bocianowski, J.; Sobiech, A.; Bobrowska, R.; Kowalczewski, P.; Bocianowska, M. The Use of DArTseq Technology to Identify Markers Linked to Genes Responsible for Seed Germination and Seed Vigor in Maize. Int. J. Mol. Sci. 2022, 23, 14865. [Google Scholar] [CrossRef]
  43. Soto, J.; Rodriguez-Antolin, C.; Vallespín, E.; de Castro Carpeño, J.; Ibanez de Caceres, I. The Impact of Next-Generation Sequencing on the DNA Methylation–Based Translational Cancer Research. Transl. Res. 2016, 169, 1–18.e1. [Google Scholar] [CrossRef] [PubMed]
  44. Addo-Quaye, C.; Buescher, E.; Best, N.; Chaikam, V.; Baxter, I.; Dilkes, B.P. Forward Genetics by Sequencing EMS Variation-Induced Inbred Lines. G3 Genes Genomes Genet. 2017, 7, 413–425. [Google Scholar] [CrossRef] [PubMed]
  45. Anandhakumar, C.; Kizaki, S.; Bando, T.; Pandian, G.N.; Sugiyama, H. Advancing Small-Molecule-Based Chemical Biology with Next-Generation Sequencing Technologies. ChemBioChem 2015, 16, 20–38. [Google Scholar] [CrossRef] [PubMed]
  46. Michael, C.R. Alavanja Introduction: Pesticides Use and Exposure, Extensive Worldwide. Rev. Environ. Health 2009, 24, 303–310. [Google Scholar] [CrossRef]
  47. Perez-de-Castro, A.M.; Vilanova, S.; Canizares, J.; Pascual, L.; Blanca, J.M.; Diez, M.J.; Prohens, J.; Pico, B. Application of Genomic Tools in Plant Breeding. Curr. Genom. 2012, 13, 179–195. [Google Scholar] [CrossRef] [PubMed]
  48. Huang, X.; Zhao, Y.; Wei, X.; Li, C.; Wang, A.; Zhao, Q.; Li, W.; Guo, Y.; Deng, L.; Zhu, C.; et al. Genome-Wide Association Study of Flowering Time and Grain Yield Traits in a Worldwide Collection of Rice Germplasm. Nat. Genet. 2012, 44, 32–39. [Google Scholar] [CrossRef] [PubMed]
  49. Ding, J.; Ali, F.; Chen, G.; Li, H.; Mahuku, G.; Yang, N.; Narro, L.; Magorokosho, C.; Makumbi, D.; Yan, J. Genome-Wide Association Mapping Reveals Novel Sources of Resistance to Northern Corn Leaf Blight in Maize. BMC Plant Biol. 2015, 15, 206. [Google Scholar] [CrossRef]
  50. Thompson, J.F.; Milos, P.M. The Properties and Applications of Single-Molecule DNA Sequencing. Genome Biol. 2011, 12, 217. [Google Scholar] [CrossRef]
  51. Jiao, Y.; Peluso, P.; Shi, J.; Liang, T.; Stitzer, M.C.; Wang, B.; Campbell, M.S.; Stein, J.C.; Wei, X.; Chin, C.-S.; et al. Improved Maize Reference Genome with Single-Molecule Technologies. Nature 2017, 546, 524–527. Available online: https://www.nature.com/articles/nature22971 (accessed on 11 November 2023). [CrossRef]
  52. Egan, A.N.; Schlueter, J.; Spooner, D.M. Applications of Next-Generation Sequencing in Plant Biology. Am. J. Bot. 2012, 99, 175–185. [Google Scholar] [CrossRef]
  53. Walker, J.C. Structure and function of the receptor-like protein kinases of higher plants. Plant Mol. Biol. 1994, 26, 1599–1609. [Google Scholar] [CrossRef] [PubMed]
  54. Walker, J.C.; Zhang, R. Relationship of a putative receptor protein kinase from maize to the S-locus glycoproteins of Brassica. Nature 1990, 345, 743–746. [Google Scholar] [CrossRef] [PubMed]
  55. Afzal, A.J.; Wood, A.J.; Lightfoot, D.A. Plant receptor-like serine threonine kinases: Roles in signaling and plant defense. Mol. Plant Microbe Interact. 2008, 21, 507–517. [Google Scholar] [CrossRef] [PubMed]
  56. Gish, L.A.; Clark, S.E. The RLK/Pelle family of kinases. Plant J. 2011, 66, 117–127. [Google Scholar] [CrossRef] [PubMed]
  57. Johnson, K.L.; Ingram, G.C. Sending the right signals: Regulating receptor kinase activity. Curr. Opin. Plant Biol. 2005, 8, 648–656. [Google Scholar] [CrossRef] [PubMed]
  58. Dievart, A.; Clark, S.E. LRR-containing receptors regulating plant development and defense. Development 2004, 131, 251–261. [Google Scholar] [CrossRef] [PubMed]
  59. Butenko, M.A.; Aalen, R.B. Receptor ligands in development. In Receptor-like Kinases in Plants: From Development to Defense; Tax, F., Kemmerling, B., Eds.; Springer: Berlin, Germany, 2012; pp. 195–226. [Google Scholar]
  60. Matsushima, N.; Miyashita, H. Leucine-Rich Repeat (LRR) Domains Containing Intervening Motifs in Plants. Biomolecules 2012, 2, 288–311. [Google Scholar] [CrossRef] [PubMed]
  61. Dey, D.; Biswas, P.; Paul, P.; Mahmud, S.; Ema, T.I.; Khan, A.A.; Ahmed, S.Z.; Hasa, M.M.; Saikat, A.S.M.; Fatema, B.; et al. Natural flavonoids effectively block the CD81 receptor of hepatocytes and inhibit HCV infection: A computational drug development approach. Mol. Divers. 2023, 27, 1309–1322. [Google Scholar] [CrossRef]
  62. Belanger, A.E.; Besra, G.S.; Ford, M.E.; Mikusová, K.; Belisle, J.T.; Brennan, P.J.; Inamine, J.M. The embAB genes of Mycobacterium avium encode an arabinosyl transferase involved in cell wall arabinan biosynthesis that is the target for the antimycobacterial drug ethambutol. Proc. Natl. Acad. Sci. USA 1996, 93, 11919–11924. [Google Scholar] [CrossRef]
  63. Zhang, Y.; Brown, G.; Whetten, R.; Loopstra, C.A.; Neale, D.; Kieliszewski, M.J.; Sederoff, R.R. An arabinogalactan protein associated with secondary cell wall formation in differentiating xylem of loblolly pine. Plant Mol. Biol. 2003, 52, 91–102. [Google Scholar] [CrossRef]
  64. Wojtaszek, P. Genes and plant cell walls: A difficult relationship. Biol. Rev. 2000, 75, 437–475. [Google Scholar] [CrossRef] [PubMed]
  65. Lucas, W.J.; Lee, J.-Y. Plasmodesmata as a supracellular control network in plants. Nat. Rev. Mol. Celi Biol. 2004, 5, 712–726. [Google Scholar] [CrossRef] [PubMed]
  66. Fleming, A.J. Intercellular Communication in Plants; Blackwell Publishing: Oxford, UK, 2005; pp. 85–108. [Google Scholar]
  67. Winter, H.; Huber, J.L.; Huber, S.C. Membrane association of sucrose synthase: Changes during the graviresponse and possible control by protein phosphorylation. FEBS Lett. 1997, 420, 151–155. [Google Scholar] [CrossRef] [PubMed]
  68. Chen, S.; Hajirezaei, M.; Börnke, F. Differential expression of sucrose-phosphate synthase isoenzymes in tobacco reflects their functional specialization during dark-governed starch mobilization in source leaves. Plant Physiol. 2005, 139, 1163–1174. [Google Scholar] [CrossRef] [PubMed]
  69. Lutfiyya, L.L.; Xu, N.; D’Ordine, R.L.; Morrell, J.A.; Miller, P.W.; Duff, S.M.G. Phylogenetic and expression analysis of sucrose phosphate synthase isozymes in plants. J. Plant Physiol. 2007, 164, 923–933. [Google Scholar] [CrossRef] [PubMed]
  70. Salnikov, V.V.; Grimson, M.J.; Seagull, R.W.; Haigler, C.H. Localization of sucrose synthase and callose in freeze-substituted secondary-wall-stage cotton fibers. Protoplasma 2003, 221, 175–184. [Google Scholar] [CrossRef] [PubMed]
  71. Persia, D.; Cai, G.; Del Casino, C.; Faleri, C.; Willemse, M.T.M.; Cresti, M. Sucrose synthase is associated with the cell wall of tobacco pollen tubes. Plant Physiol. 2008, 147, 1603–1618. [Google Scholar] [CrossRef] [PubMed]
  72. Duncan, K.A.; Hardin, S.C.; Huber, S.C. The three maize sucrose synthase isoforms differ in distribution, localization, and phosphorylation. Plant Cell Physiol. 2006, 47, 959–971. [Google Scholar] [CrossRef]
  73. Kaur, S.; Pennington, T.; Conley, E.J.; Green, A.; Kolmer, J.; Anderson, J.; Gupta, R.; Gill, U. High-Resolution Melting-Based Marker Development for Wheat Leaf Rust Resistance Gene Lr34. Phytopathology 2023, 113, 508–515. [Google Scholar] [CrossRef]
  74. Moreira, D.; Kaur, D.; Pereira, A.M.; Held, M.A.; Showalter, A.M.; Coimbra, S. Type II Arabinogalactans Initiated by Hydroxyproline-O-Galactosyltransferases Play Important Roles in Pollen–Pistil Interactions. Plant J. 2023, 114, 371–389. [Google Scholar] [CrossRef]
Figure 1. Temperature and rainfall in Smolice and Kobierzyce in 2022.
Figure 1. Temperature and rainfall in Smolice and Kobierzyce in 2022.
Genes 15 00056 g001
Figure 2. Density charts showing the distribution of the analyzed traits: cob length (cm), ear diameter (cm), core length (cm), core diameter (cm), the number of rows, the number of kernels per row, mass of grain per cob (g), TSW (g), and grain yield (kg).
Figure 2. Density charts showing the distribution of the analyzed traits: cob length (cm), ear diameter (cm), core length (cm), core diameter (cm), the number of rows, the number of kernels per row, mass of grain per cob (g), TSW (g), and grain yield (kg).
Genes 15 00056 g002
Figure 3. Canonical variate analysis for the studied traits.
Figure 3. Canonical variate analysis for the studied traits.
Genes 15 00056 g003
Figure 4. Heat map showing correlations between analyzed yield structure traits in Smolice. * p < 0.05, ** p < 0.01, *** p < 0.001.
Figure 4. Heat map showing correlations between analyzed yield structure traits in Smolice. * p < 0.05, ** p < 0.01, *** p < 0.001.
Genes 15 00056 g004
Figure 5. Heat map showing correlations between analyzed yield structure traits in Kobierzyce. * p < 0.05, ** p < 0.01, *** p < 0.001.
Figure 5. Heat map showing correlations between analyzed yield structure traits in Kobierzyce. * p < 0.05, ** p < 0.01, *** p < 0.001.
Genes 15 00056 g005
Figure 6. Dendrogram showing the genetic similarity between the analyzed genotypes.
Figure 6. Dendrogram showing the genetic similarity between the analyzed genotypes.
Genes 15 00056 g006
Figure 7. Electropherogram showing the amplification products of 189 bp, characteristic of the SNP28629 marker. L-ladder, 1–20 genotypes.
Figure 7. Electropherogram showing the amplification products of 189 bp, characteristic of the SNP28629 marker. L-ladder, 1–20 genotypes.
Genes 15 00056 g007
Figure 8. Electropherogram showing the amplification products of 189 bp, characteristic of the SNP 29294 marker. L-ladder, 1–20 genotypes.
Figure 8. Electropherogram showing the amplification products of 189 bp, characteristic of the SNP 29294 marker. L-ladder, 1–20 genotypes.
Genes 15 00056 g008
Table 1. F-statistics from our analysis of variance for the analyzed yield structure traits.
Table 1. F-statistics from our analysis of variance for the analyzed yield structure traits.
TraitAnalysis of Variance
F-StatisticsF pr.
Cob length20.84<0.001
Cob diameter24.10<0.001
Core length21.02<0.001
Core diameter22.65<0.001
The number of rows of grain11.170.569
The number of grains in row16.07<0.001
Mass of grain from the cob15.78<0.001
Weight of one thousand grains14.31<0.001
Yield per plot50.16<0.001
Table 2. SilicoDArT and SNP molecular markers significantly associated with analyzed yield structure traits in Kobierzyce (K) and Smolice (S) (significant associations selected at p < 0.001 with Benjamini–Hochberg correction for multiple testing).
Table 2. SilicoDArT and SNP molecular markers significantly associated with analyzed yield structure traits in Kobierzyce (K) and Smolice (S) (significant associations selected at p < 0.001 with Benjamini–Hochberg correction for multiple testing).
TraitCob LengthCob DiameterCore LengthCore DiameterThe Number of Rows of GrainThe Number of Grains in a RowMass of Grain from the CobWeight of One Thousand GrainsYield
LocationKSKSKSKSKSKSKSKSKS
Number of significant markersDArT10,20914,17214,40412,934918913,54214,25410,58410,054928610,76811,34214,31514,81611,20511,6881486515,173
SNP235633433383308421563139333026262703242824972639337034982631285734873578
Total12,56517,51517,78716,01811,34516,68117,58413,21012,75711,71413,26513,98117,68518,31413,83614,54518,35218,751
Minimal effectDArT−1.913−3.579−0.693−0.482−1.49−3.00−0.447−0.237−1.82−1.991−4.378−5.718−49.63−53.66−60.49−48.04−2.67−4.23
SNP−1.885−3.148−0.5575−0.398−1.432−2.94−0.378−0.212−2.01−1.965−4.228−5.602−43.60−45.43−60.26−47.82−2.31−3.59
Total−1.913−3.579−0.693−0.482−1.49−3.00−0.447−0.237−2.01−1.991−4.378−5.718−49.63−53.66−60.49−48.04−2.67−4.23
Maximum effectDArT2.8155.1740.94460.59112.0324.4060.6130.2862.6652.8926.9268.19174.5972.6390.4766.543.9125.911
SNP2.7885.1830.94930.60642.0614.4350.6200.2952.6792.6486.9098.21774.9272.6389.8467.123.9185.911
Total2.8155.1830.94930.60642.0614.4350.6200.2952.6792.8926.9268.21774.9272.6390.4767.123.9185.911
Medium effectDArT1.7022.5760.4660.3161.3052.2690.3040.1590.6911.3444.1804.82036.80835.87552.98124.5591.8982.822
SNP1.8672.7390.4930.3351.4232.4300.3260.1690.7081.3034.4525.01339.06037.70057.15227.1422.0122.993
Total1.7332.6070.4710.3201.3282.2990.3080.1610.6951.3354.2324.85737.23736.22353.77425.0671.9202.855
Total effectDArT17,37536,5046706408911,99430,72943281685695212,47645,01454,673526,910531,519593,654287,05128,21142,819
SNP43999158166910353068762710864431915316411,11813,228131,632131,874150,36877,546701710,709
Total21,77445,6628375512415,06238,35754132128886715,64056,13167,901658,542663,394744,022364,59735,22753,528
Table 3. Characteristics and locations of markers significantly associated with the analyzed traits.
Table 3. Characteristics and locations of markers significantly associated with the analyzed traits.
MarkerTypeChromosomeCandidate Genes
28629SNPchr. 8probable leucine-rich repeat receptor-like protein kinase
28630SNPchr. 8637 bp at 5′ side: putative protein phosphatase 2c 4658414 bp at 3′ side: metallothionein-like protein type 2
28631SNPchr. 499861 bp at 5′ side: uncharacterized protein loc1002769903113 bp at 3′ side: disease resistance protein rpm1 isoform x1
28632SNPchr. 7uncharacterized atp-dependent helicase ypra
28633SNPchr. 74486 bp at 5′ side: lon protease homolog 2, peroxisomal54848 bp at 3′ side: uncharacterized protein loc100280671
28634SNPchr. 7132577 bp at 5′ side: isoamylase-type starch debranching enzyme iso3 isoform x135136 bp at 3′ side: uncharacterized protein loc100272620
31977SNPchr. 11394 bp at 5′ side: 60 s ribosomal protein l32-like68287 bp at 3′ side: uncharacterized protein loc100193765 isoform x1
29503SNPchr.4uncharacterized protein loc103655564
28625SNPchr. 1probable arabinosyltransferase arad1
28640SNPchr. 9probable sugar phosphate/phosphate translocator
28648SNPchr. 1uncharacterized protein loc100502264
28639SNPchr. 83130 bp at 5′ side: protein senescence-associated gene 21, mitochondrial91 bp at 3′ side: uncharacterized protein loc100382335
28649SNPchr. 4ubiquitin carboxyl-terminal hydrolase 15 isoform x2ubiquitin carboxyl-terminal hydrolase 15 isoform x3
28654SNPchr. 5318 bp at 5′ side: uncharacterized protein loc100383290 isoform x12155 bp at 3′ side: uncharacterized protein loc111589274
30773SNPchr.430080 bp at 5′ side: uncharacterized protein loc103653173940 bp at 3′ side: uncharacterized protein loc100193686 isoform x1
30772SNPchr. 31814 bp at 5′ side: uncharacterized protein loc100192805 isoform x240087 bp at 3′ side: uncharacterized protein loc100278501
29294SNPchr. 5hydroxyproline o-galactosyltransferase galt6
28262SNPchr. 9uncharacterized protein loc100383550uncharacterized protein loc100383550 isoform x1
28263SNPchr. 3uncharacterized protein loc103650272
Table 4. Sequences of the designed primers used to identify the newly selected markers significantly associated with the analyzed traits.
Table 4. Sequences of the designed primers used to identify the newly selected markers significantly associated with the analyzed traits.
MarkerPrimer SequencesSize of the ProductTm [°C]
ForwardReverse
28629CACCTGGAGAGGCCCTGCAGTGAAGACTGGACTCGGTCGC43761
28630AGGGACAACATACACTGCAGACGGTAAAGAGACACAATTCCCT18958
28631AAATCTGCGATCAACTGCAGGACTGGAGAGCCAGAACCTG29155
28632AGAGAAGTATTGTCTTTCTGCAGGCTGTCCTTGATGCCAAGTC43757
28633GATTATCCTTCGAGCTGCAGACATGAATGTTGCAGGCAGG42361
28634TTTATTCCTCACTGCTGCAGAGAAAGAAGGAATGTAACAACACG34256
31977CCACTTGCTCTCTGCTGCAGCTCTCTATGGCTCGTCGCTG20759
29503ACTCAGTAGCAGCACTGCAGGTAGCTGCTGCCTCTTCCAA45559
28625GCGCGCTTGTGAAGCTGCAGTTTCAGGGCGGGAAGGTTCG48856
28640TACTCTGTTAACACCTGCAGATGCGCAGTTGCCTACTTAT15862
28648AATAAGAGCTTTTGCTGCAGAGCGACGAGTAATCAATCCC45354
28639GGCTTCCAGCTTGGCTGCAGATACGTGACGCACGAGACAAAC18958
28649TGGAGCAAAAGTATCTGCAGTTTTTCACCTCTTGACGGGC47161
28654GGAATCCATCAGTTCTGCAGCTTCCCCGAGTGCATATCCT16959
30773AAACCAAACCGCCGCTGCAGCAGGCAGAGCTCAGTCCGAA41161
30772GCATTTTACGGGGTCTGCAGAGAGCTTGCTCCCTTGAACG50057
29294GTTTGTGGGACAAACTGCAGAGGGCTTAATTTATTTCCAGCCA18457
28262TAAGAATGAGAAGCCTGCAGGAATGCACTGTTGTTCTGCC45459
28263GCCGGGAATAACTTCTGCAGGTTCTTACTTCCGCCAGGCT35859
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Nowak, B.; Tomkowiak, A.; Sobiech, A.; Bocianowski, J.; Kowalczewski, P.Ł.; Spychała, J.; Jamruszka, T. Identification and Analysis of Candidate Genes Associated with Yield Structure Traits and Maize Yield Using Next-Generation Sequencing Technology. Genes 2024, 15, 56. https://doi.org/10.3390/genes15010056

AMA Style

Nowak B, Tomkowiak A, Sobiech A, Bocianowski J, Kowalczewski PŁ, Spychała J, Jamruszka T. Identification and Analysis of Candidate Genes Associated with Yield Structure Traits and Maize Yield Using Next-Generation Sequencing Technology. Genes. 2024; 15(1):56. https://doi.org/10.3390/genes15010056

Chicago/Turabian Style

Nowak, Bartosz, Agnieszka Tomkowiak, Aleksandra Sobiech, Jan Bocianowski, Przemysław Łukasz Kowalczewski, Julia Spychała, and Tomasz Jamruszka. 2024. "Identification and Analysis of Candidate Genes Associated with Yield Structure Traits and Maize Yield Using Next-Generation Sequencing Technology" Genes 15, no. 1: 56. https://doi.org/10.3390/genes15010056

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop