Next Article in Journal
Emerging Trends in Ant–Pollinator Conflict in Extrafloral Nectary-Bearing Plants
Next Article in Special Issue
Evaluating and Predicting the Performance of Sorghum Lines in an Elite by Exotic Backcross-Nested Association Mapping Population
Previous Article in Journal
Time-Course Transcriptomic Analysis Reveals Molecular Insights into the Inflorescence and Flower Development of Cardiocrinum giganteum
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification of Quantitative Trait Loci (QTL) for Sucrose and Protein Content in Soybean Seed

1
Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR 72701, USA
2
Department of Food Science, University of Arkansas, Fayetteville, AR 72704, USA
*
Author to whom correspondence should be addressed.
Deceased author.
Plants 2024, 13(5), 650; https://doi.org/10.3390/plants13050650
Submission received: 31 December 2023 / Revised: 11 February 2024 / Accepted: 16 February 2024 / Published: 27 February 2024

Abstract

:
Protein and sugar content are important seed quality traits in soybean because they improve the value and sustainability of soy food and feed products. Thus, identifying Quantitative Trait Loci (QTL) for soybean seed protein and sugar content can benefit plant breeders and the soybean market by accelerating the breeding process via marker-assisted selection. For this study, a population of recombinant inbred lines (RILs) was developed from a cross between R08-3221 (high protein and low sucrose) and R07-2000 (high sucrose and low protein). Phenotypic data for protein content were taken from the F2:4 and F2:5 generations. The DA7250 NIR analyzer and HPLC instruments were used to analyze total seed protein and sucrose content. Genotypic data were generated using analysis via the SoySNP6k chip. A total of four QTLs were identified in this study. Two QTLs for protein content were located on chromosomes 11 and 20, and two QTLs associated with sucrose content were located on chromosomes 14 and. 11, the latter of which co-localized with detected QTLs for protein, explaining 10% of the phenotypic variation for protein and sucrose content in soybean seed within the study population. Soybean breeding programs can use the results to improve soybean seed quality.

1. Introduction

An expanding market for soy foods in the U.S. makes selecting corresponding traits increasingly important for breeding programs. Large- and small-seeded soybean varieties are ideal for producing various soy foods, and high protein and sugar content are generally desired regardless of seed size [1]. Knowledge of correlations between agronomic and seed quality traits and the location of major and minor QTLs associated with these traits will assist plant breeders in improving the food quality characteristics of their germplasm.
Due to consumer demand for alternate plant-based protein, soy protein has the potential to become a valuable source of protein in the human diet. A review by Erdman and Fordyce (1989) [2] showed that soy protein contains an excellent amino acid profile. Soybeans also contain large amounts of other components of a complete diet, such as fiber and beneficial fatty acids. In addition, undesirable constituents, such as saturated fatty acids and cholesterol, are present in low amounts or are missing from soybean seed. Despite the high quality of soy protein, there has been a significant effort to improve the overall quality of protein content in soybean seed further. One example of progress in this area is the overexpression of the GmCGS2 gene in soybean seed to improve total protein content, including Met, a limiting amino acid in soybean [3]. Such breeding efforts will further enhance soybeans’ viability as a complete source of indispensable amino acids [4].
The average protein concentration for the 2023 U.S. soybean crop was 33.9%, 0.2 points lower than the average for the 2022 crop when analyzed on a 13% moisture basis. In 2015, 30% of samples with the highest and lowest protein concentrations were analyzed for their concentrations of amino acids. This analysis showed that samples with the highest protein concentrations were slightly lower in their concentrations of the five most limiting amino acids compared with the 30% of samples with the lowest total protein concentrations. This insight is helpful because soybeans with protein concentrations containing higher fractions of the most limiting amino acids are a positive indicator of the efficiency with which animals can successfully utilize soy-based protein in their diet [5]. In 2023, the same pattern was maintained. Lower crude protein samples showed a better relative abundance of the five most limiting amino acids than the higher crude protein samples [6]. Such findings justify research and breeding efforts to identify methods for improving amino acid profiles and total protein content in soybean seeds.
Protein content, like other traits in soybean, is quantitatively inherited. Because genotype-by-environment interactions influence quantitative characteristics, it is valuable to conduct genetic studies on protein content in multiple years and environments with different populations [7]. Environmental conditions such as water stress or high temperatures can cause protein content to decrease below desired levels [8]. The protein content of U.S. commodity soybeans typically ranges from 30 to 40%. However, it can vary more broadly from 25 to 50% Co-operative Feed Dealers, 2017). This trait exhibits high heritability (roughly 83 to 94%), and population means usually demonstrate additive effects, suggesting that additional variability can be incorporated from wild germplasm sources [9].
In another study, a soybean mapping population from a cross between Benning × Danbaekkong was grown in five environments. They reported four QTLs associated with protein and amino acids in chromosomes. 14, 15, 17, and 20 [10]. Wang et al. [11] developed two separate populations from two high-protein and two low-protein lines, R05-1415 × R05-638 and V97-1346 × R05-4256, which were grown in five different environments. They identified a major QTL associated with protein positioned on 20 across the environment and four other QTLs on chromosomes. 1, 5, and 14, which explained 10–16% of the total variation in protein content. A GWAS analysis of three hundred and eight recombinant inbred lines (RIL) revealed two major QTLs, qPro15-1 and qPro20-1, associated with protein on chromosomes. 15 and 20, respectively [12]. Hyten et al. (2004) [13] identified four QTLs associated with protein content in 131 RILs, resulting in a cross between Essex and Williams. Chen et al. (2021) [14] conducted a study on a RILs population, resulting in a cross between the soybean genotypes Zhongdou27 and Hefeng25. They applied a high-density linkage map based on whole genome sequencing to detect QTLs associated with several agronomic traits and reported seven QTLs for protein content [15].
Specialty soybean food markets include tofu, natto, miso, and soymilk products. The presence and quantity of various sugars in raw soybean play a significant role in the taste and quality of these products. Miso and natto are two examples of soy food products that depend on the sugar profile of the soybean cultivar for sustainable production [16].
Of the approximately 33% of carbohydrates in soybean seed, roughly 16.6% can be classified as soluble sugars. The portion of sucrose over total sugars varies between 1.5 and 10.2% on a dry matter basis. Soluble sugars detected in soybean seed generally include sucrose, glucose, fructose, stachyose, and raffinose. However, the amounts vary significantly depending on the cultivar and maturity group under examination [16]. Stachyose and raffinose content ranged from 0.1 for stachyose to 2.1% for raffinose and 0.5 to 4.5% for stachyose dryly. Hou et al. [16] showed a distribution of sugar profiles into three categories. The first and most common category, or regular sugar profile, contained sucrose and stachyose as their main constituents. The second category identified represented a novel sugar profile with higher levels of both glucose and fructose. Such a profile would be valuable in breeding efforts to improve the digestibility of food products made from soybean. The final category identified was considered rare and valuable in breeding efforts intended for the soy food market because seed stachyose content would be limited. Because stachyose is generally undesirable, it is important to note that the 14 low stachyose genotypes discovered could be used as sources of genetic variations for modifying the sugar profile of soybean seed. Openshaw and Hadley [17] studied segregating generations from two different populations to discover heritability estimates for both populations’ parents and progeny. Results showed means of 0.38 and 0.14 for F2 plants and 0.69 for F3 progeny, which supported their observations for the heritability of sugar concentration.
Hou et al.(2009) [16] discovered multiple correlations between the five main detectable sugars of soybean seed. Strong positive correlations were observed for stachyose and total sugar (r = 0.73), raffinose and total sugar (r = 0.64), raffinose and stachyose (r = 0.68), sucrose and raffinose (r = 0.66), sucrose and total sugar (r = 0.82), as well as glucose and fructose (r = 0.98). The negative correlations included glucose or fructose as compared with sucrose (r = −0.68), raffinose (r = −0.59), total sugar (r = −0.27 to −0.24), and stachyose (r = −0.63 to −0.59). Sucrose and stachyose were negatively correlated at (r = −0.68). It is worth noting that the positive correlation reported here contradicts results published by Hymowitz et al. (1972) [18]. Relationships between individual plants sampled in a genetic study can significantly differ in the correlations discovered among traits. Therefore, just as studies on the location of QTLs for various traits should be conducted on multiple populations in different years and environments, the identification of correlations between traits should be studied for populations representing a wide range of genetic backgrounds [1].
Fitch et al. (2022) [19] conducted a GWAS on 266 soybean lines and identified seven QTL-associated chromosomes. 1, 6, 8, 9, 10 and 14. Liu et al. (2023) [20] performed high-density genetic mapping on 158 F7 lines resulting from a cross between Hefeng 25 and Zhongdou 27 and detected 16 QTLs associated with sucrose content. Another study constructed a RILs line population via interspecific hybridization between a William 82 and a wild soybean genotype, PI 483460B, and used 3343 polymorphic SNP markers to identify QTLs associated with soybean oil, protein, and sugar content. This study detected five, nine, and four QTLs for protein, oil, and sucrose content [21]. Salari et al. [22] conducted a study to identify QTLs associated with sucrose, raffinose, and stachyose in a RILs population resulting from a cross between two soybean lines—IA3023 and LD02-4485. They performed genotypic mapping using more than 3000 SNP markers and reported three QTLs—one on chromosome. 1 responsible for 10% variance and two major QTLs on chromosome 3, each explaining 22% phenotypic variation. Lee et al. [23] performed a GWAS on a collection of 220 soybean accessions using over 21 thousand SNP markers and identified seven major SNPs associated with sucrose on chromosomes. 2, 5, 8, 9, 10, 13, 14, and 15. A fine-mapping study was performed to identify sucrose content in a mapping population consisting of 190 Segment Substitution Lines (CSSLs) and identified a major QTL in chromosome 20. They screened nine candidate genes in the QTL region and reported that one gene was closely related to sucrose content in leaves.
The objectives of this study are to identify genetic structure associated with high sucrose and protein content in soybean lines grown in two locations in Arkansas. The Fayetteville Arkansas region (36.0627° N, 94.1606° W) is in northwest Arkansas in the Ozark Mountain Range between the Springfield Plateau and the Boston Mountains. The other environment in this study is Stuttgart, Arkansas (34.4941° N, 91.5581° W) located in Arkansas County; Stuttgart is in the Arkansas Delta. The second objective of this study is to identify extreme genotypes and recombinant types for protein and sucrose content.

2. Results and Discussion

As mentioned previously, due to the issue resulting from herbicide damage on the F2 generation and its effect on the later generations, an effort was made to minimize the number of plots lost in subsequent generations by using only those lines with an adequate seed number while considering the number of locations and replications needed. However, the issues were observed within plots each year. To improve statistical power in detecting QTLs, the decision was made to reduce the number of environments in the study to four (Fayetteville year 1, Fayetteville year 2, Stuttgart year 1, and Stuttgart year 2). This strategy positively impacts our research to overcome the problem. The ANOVA analysis of protein in the F2:3 population showed a normal distribution with a slight skew to the left (Figure 1A), suggesting that recombinant types for protein can inherit higher protein content than either parent, suggesting additive effects for this trait. These results are supported by similar investigations such as Qi et al. [24], Zhang et al. [25], and Teng et al. [26].
Similarly, the ANOVA analysis of sucrose on the F2:5 population showed a normal distribution with a slight skew to the left (Figure 1B), suggesting that recombinant types for sucrose can inherit higher protein content than their parents, implying an additive effect for this agronomic trait (Table 1).
In this study we have four environments (Fayetteville year 1, Fayetteville year 2, Stuttgart year 1, Stuttgart year 2). In these four environments, there are two important parameters (location and year). We decided to investigate the impact of these two parameters on sucrose and protein content. Analysis showed that there is a significant effect of location on protein content (Table 2). Despite no meaningful interaction of year on protein content, there is a significant interaction between year and location on protein which can be attributed to the location. Further statistical analysis showed there is a significant effect of year on sucrose content. While there was no meaningful interaction of location and sucrose content, there was significant interaction between year and location which can be attributed to the year effect. Single linear model analysis showed there is significant but negative correlation between protein and sucrose content with covariance = −0.81658 (Table 2). This result is supported by previous studies on protein and sugar correlations by both Maughan et al. [27] and Hymowitz et al. [18].

2.1. QTLs Associated with Protein

Two significant QTLs for protein content were located in this study. One QTL, observed across multiple locations and years, was situated on Chromosome. 20 between 2.28 × 107 and 4.63 × 107 bp, explaining 10% of the variation in protein content. This QTL was located between two SNP markers, Gm20_22826023_G_A and Gm20_46314790_C_T. In addition, this QTL co-localized with the previously reported QTL cqPRO-003 associated with protein [28]. The second QTL was located on chromosome. 11 between 3.77 × 107 and 3.91 × 107 bp and explained 9% of the variation for protein content. The QTL was flanked by Gm11_37749863_G_T and Gm11_39108822_A_G. In addition, both QTLs displayed a Linkage of Disequilibrium (LOD) value more significant than the threshold of 3 and additive effects of 0.5 and 0.4, respectively (Table 2, Supplementary Figures S1–S5). Overall, 241 QTLs through bi-parental study and 8 QTLs by GWAS study were reported in SoyBase [29]. Of these 249 reported QTLs, 25 were identified on chromosome 20, and 8 QTLs were identified on chromosome 11.

2.2. QTLs Associated with Sucrose

Two QTLs for sucrose content were located: one on chromosome 8 (3.804 × 107 and 3.91 × 107 bp) located between SNP markers Gm08_38044939_A_G and Gm08_39164008_C_T and the other one on chromosome. 14 (5.42 × 106 and 7.85 × 106 bp), positioned between Gm14_5418596_A_G and Gm14_7853431_G_T, which explained 8 and 11% of the variation for sucrose content, respectively. The LOD values of the QTLs on chromosomes 8 and 14 were 3.3 and 4.2, with additive effects of −0.3 and 1.1, respectively (Table 3, Supplementary Figures S1–S5). Overall, 37 QTLs were identified through bi-parental study and 2 by GWAS for sucrose content. Of these reported QTLs, only 1 was reported on Chromosome 14 [29].
In summary, protein and sucrose are two important soybean traits for different soy products. Along with increasing seed yield, improving protein and sucrose content are two goals in soybean breeding. Thus, many researchers have tried identifying genetic sources associated with these two agronomic traits. This research describes four QTLs in different chromosomes: two for total protein and two for percent sucrose. These QTLs offer a possible mode of selection for both traits separately and possibly simultaneously (when considering the QTLs on chromosome 11). We also report that the analysis of variance showed a significant GXE effect for protein content. There was also a strong environmental effect on protein and sucrose content. Therefore, field management, such as applying nitrogen, water availability, temperature, field location, etc., is essential for improving protein content [30,31].
To better understand how environmental factors influence protein and sucrose content, further investigation is needed to identify genetic correlations to specific soybean growing areas. These QTLs and associated markers could be potentially used for MAS on soybean seed sucrose content. As shown (Figure 1), there is some skew to the right for each graph, suggesting an additive effect and the possibility of selecting advantageous recombinant types for protein and sucrose content. Indeed, a Best Linear Unbiased Predictor (BLUP) analysis revealed UARK-2114, UARK-2116, and UARK-2141 as top performers for total seed protein and sucrose content in all environments.

3. Materials and Methods

3.1. Population Development and Field Experiment

The genetic population in this study was derived from a cross between high protein (R08-3221) and high sucrose (R07-2000) parents and, thus, represents a wide range of genetic diversity for seed protein and sucrose content. The high protein variety R08-3221 has an average of 47.9% protein, while the high sucrose parent R07-2000 averages 8.4 sucrose/stachyose. As a result, 186 true F1 hybrids were identified. The F2 population, consisting of 186 rows, was grown in Fayetteville, Arkansas. One single plant from each row was harvested to develop a mapping population. At the same time, the rest of each row was bulk harvested.
It should be noted that, unfortunately, due to herbicide damage, seed yield was low, and the plants produced thin pods with shriveled seeds. Although every line that produced a minimal amount of seed was harvested, those lines that displayed herbicide damage in the F2 generation also showed carry-over symptoms in subsequent generations. A quantity of 186 F2:3 populations from the individually harvested plants were advanced in the Costa Rica winter nursery. Then, a single plant from each F2:3 line was randomly selected, harvested, and advanced in the F3:4 generation the following summer. In addition, the remaining rows grown in Costa Rica were bulk harvested and grown as F2:5 lines in two Arkansas locations: Fayetteville (36.0627° N, 94.1606° W) and Stuttgart (34.4941° N, 91.5581° W) in a randomized complete block design with two replications in 3 m plots and 0.76 m row spacing.

3.2. Gentoyping

3.2.1. DNA Extraction

The DNA was isolated from fresh and fully developed trifoliate leaves from each F2:4 generation using the CTAB protocol described by Doyle [32] with some modifications. Leaves were ground to a powder in liquid nitrogen and placed in 2 mL tubes. A total of 750 uL of an extraction buffer (2% CTAB, 100 mM Tris-Cl, 20 mM EDTA pH 8.0, 1.4 M NaCl, and 1% of volume β-mercaptoethanol) was added to each tube. Next, a water bath was used to incubate the tubes for 1 h at 60 °C, and then 1 mL chloroform: isoamyl alcohol (24:1) was added. The tubes were centrifuged at R.T. for 15 min at 12,000 rpm to separate the layers. The top layer was pipetted to a new tube containing 1 mL of ice-cold ethanol and mixed gently. DNA pellets were separated from the solution in a centrifuge at 14,000 rpm for 10 min at R.T. The excess liquid was discarded, and the pellets were washed in 75% ethanol and dried overnight. Finally, 200 uL of distilled sterile water was used to dissolve the DNA pellets. DNA concentration and quality were verified with NanoDropTM ND-2000 (Thermo Scientific, Waltham, MA, USA).

3.2.2. Linkage Mapping and QTL Identification

For genetic mapping, DNA samples from the F2:4 generation and their parental lines were sent for genotyping using the BARC MSU Soy6k Illumina Infinium Genotyping HD Beadchip (652 K) on Illumina iScan (Illumina, San Diego, CA, USA) at the Michigan State University genotyping core facility, East Lansing, Michigan. Data from the analysis were then combined with the phenotypic data for protein content from the F2:4 and F2:5 generations. A total of 1922 polymorphic markers from the total of 6000 SNP markers from the 6K SNP chip were used for linkage mapping and then QTL analysis (Supplementary Table S1). MapDisto version 2.0 was then used to visualize the data and construct a linkage map. The Kosambi mapping function was used, and the Rf estimate was set as classical. The population type was a recombinant inbred line population to the fourth generation. The ordering criteria was SARF, and the ordering method was seriation. QGene version 4.4.0 was used to visualize those QTLs significantly affecting protein and sucrose content in each environment. Each of the environments was visualized separately and together. All of the QTL curves were overlaid to allow a straightforward interpretation of the effect of year and environment on the QTL detection (Supplementary Figures S1–S5).

3.3. Protein Analysis

Total Protein content was quantified using the DA 7250 NIR (near-infrared) analyzer instrument (Perten Instruments, Hägersten, Sweden) during the summer of 2016. Each line was measured non-destructively, as outlined in the user manual. However, many of the lines did not have enough seed to fill the cups evenly to the top, as recommended by the manufacturer. Therefore, a subsampling strategy was employed where a smaller cup held approximately 3 g of seed. Three subsamples were measured from each line and averaged to ensure that the smaller seed samples would provide a sufficiently accurate average compared with the larger sample size. No significant differences were observed between the subsampling and regular sampling methods. Instrument calibration files were built from the protein measurements of many yellow-seeded soybean varieties grown throughout the United States (Supplementary Table S2).

3.4. Sucrose Analysis

Total sucrose content was quantified via HPLC in collaboration with the National Center for Soybean Biotechnology and the Division of Plant Sciences at the University of Missouri. The system used was an Agilent 1200 series (Agilent USA) and was equipped with an evaporative light scattering detector (ELSD) to quantify the amounts of glucose, fructose, sucrose, raffinose, and stachyose contained in each of the samples. Samples were prepared for quantification by grinding and sifting the powder through 20-mesh sieves. The ground powder was then dried and lyophilized for 48 h, after which, 90.25 mg was weighed and transferred to fresh 2 mL tubes. The samples were then sent for HPLC analysis. At the University of Missouri, 900 uL of HPLC-grade water was added to each vial. The vials were agitated at 250 rpm for 30 min and then vortexed for 30 s. Once cooled, 900 uL acetonitrile was blended in, and the suspension was centrifuged for 30 min at 13.3 × 100 min−1× g. Five additional dilutions were performed on the supernatant with water: acetonitrile mixture of 65:35 (v/v). Sugar standards of 50, 100, 200, 300, 400, 500, and 1000 ug/mL were used for the HPLC analysis.
During analysis, the flow rate and gradient of two mobile phases were optimized to separate all sugars detected rapidly. Mobile phase A was pure water, while mobile phase B was acetonitrile: acetone mixture of 75:25 (v/v). The sample volume was 5 uL with high-grade nitrogen as the nebulizing gas at 3.4 bar. The column temperature was held constant at 35 °C, and the detector temperature was 55 °C [33] (Supplementary Table S2).

3.5. Data Analysis

For QTL analysis, a Best Linear Unbiased Predictor (BLUP) analysis was performed on the phenotypic data from the four population environments (Fayetteville year 1, Fayetteville year 2, Stuttgart year 1, Stuttgart year 2). The analysis was performed using the statistical analysis programming language and environment R (R: A language and Environment for Statistical Computing, 2021). This analysis revealed several possible recombinant types (UARK-2114, UARK-2116, and UARK-2141) high in total protein and sucrose content. Therefore, given the value of these traits for soy foods and feed, these lines could be used to develop soybean varieties high in total protein and sucrose content.

4. Conclusions

Protein and sugar content are important soybean agronomic characteristics used by food industries. Plant breeders aim to increase the value of their new cultivars in markets, especially for food and feeding purposes. We conducted this study in two years and two locations with a total of four environments to identify the QTLs associated with sucrose and protein. In this study, we identified four QTLs: two for protein located chromosomes 11 and 20 and two for sugar on chromosomes 11 and 14. The two detected QTLs on chromosome 11 were co-localized. This study can benefit plant breeders and the soybean market in general by accelerating the breeding process via marker-assisted selection.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants13050650/s1: Figure S1: QTL for total seed protein on Chr. 11 shows the QTL curve for total seed protein. As shown, the QTL’s peak with an LOD score of approximately 3.6. In addition, the additive effects are positive for protein content; Figure S2: QTL for total seed protein on Chr. 20. As shown in (A) the QTL curve for total seed protein. As shown, QTL’s peak with an LOD score of approximately 3.8; Figure S3: QTL for total sucrose on Chr. 11. A represents the QTL’s position in the QTL, and B represents the additive effect on the trait. The figure shows the QTL curve for sucrose. The QTL peaks with an LOD score of approximately 3.2; Figure S4: QTL for total sucrose on Chr. 14. The figure shows the QTL curve for sucrose. The QTL’s peak with an LOD score of approximately 4.2 is shown; Figure S5: QTL for total protein and sucrose on Chr. 11. QTL curves for total seed protein (yellow) and sucrose (blue) content on the same plot. Figure (A) represents the location of QTL in the chromosome, and figure (B) represents the interaction between sucrose and protein. As shown in (A), the LOD scores were 3.6. and 3.4 for protein and sucrose, respectively. Figure B revealed a negative correlation between protein and sucrose contents; Table S1: Genotypic data; Table S2: Phenotypic data of protein and sugar content from two locations and two years.

Author Contributions

Conceptualization, P.C. and D.R.J.; methodology, P.C., N.S.H., D.M.M. and E.S.; software, D.R.J.; validation, D.R.J. and E.S.; formal analysis, D.R.J., E.S. and P.C.; investigation, D.R.J.; resources, P.C.; data curation, D.R.J.; writing—original draft preparation, D.R.J.; writing—review and editing E.S., P.C., N.S.H. and D.R.J.; visualization, D.R.J., P.C. and E.S.; supervision, P.C. and E.S.; project administration, P.C. and E.S.; funding acquisition, P.C. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported via funds from the United Soybean Board and the Arkansas Soybean Promotion Board.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

We appreciate the support provided by Arkansas soybean producers through check-off funds administered by Arkansas Soybean Promotion Board for supporting this study. The authors would also like to thank the members of the soybean breeding program at the University of Arkansas.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Cicek, M.S.; Chen, P.; Maroof, M.A.; Buss, G.R. Interrelationships among agronomic and seed quality traits in an interspecific soybean recombinant inbred population. Crop Sci. 2006, 46, 1253–1259. [Google Scholar] [CrossRef]
  2. Erdman, J.W.; Fordyce, E.J. Soy products and the human diet. Am J Clin Nutr. 1989, 49, 725–737. [Google Scholar] [CrossRef] [PubMed]
  3. Zhang, Y.; Wang, Q.; Liu, Y.; Dong, S.; Zhang, Y.; Zhu, Y.; Tian, Y.; Li, J.; Wang, Z.; Wang, Y.; et al. Overexpressing GmCGS2 Improves Total Amino Acid and Protein Content in S oybean Seed. Int. J. Mol. Sci. 2023, 24, 14125. [Google Scholar] [CrossRef] [PubMed]
  4. Hertzler, S.R.; Lieblein-Boff, J.C.; Weiler, M.; Allgeier, C. Plant Proteins: Assessing Their Nutritional Quality and Effects on Health and Physical Function. Nutrients 2020, 12, 3704. [Google Scholar] [CrossRef] [PubMed]
  5. Miller-Garvin, J.; Orf, J.H.; Naeve, S.L. United States Soybean Quality—Annual Report. Retrieved 31 August 2015, from University of Minnesota. 2015. Available online: http://www.extension.umn.edu/Agriculture/soybean/seed/docs/2014-USSEC-soybean-quality-report.pdf (accessed on 10 January 2024).
  6. Miller, M.J.; Song, Q.; Li, Z. Genomic selection of soybean (Glycine max) for genetic improvement of yield and seed composition in a breeding context. Plant Genome 2023, 16, e20384. [Google Scholar] [CrossRef]
  7. Chen, Q.; Qi, H.; Zhang, X.; Li, W.; Hou, M.; Zhu, R.; Yin, Z.; Han, X.; Jiang, H.; Liu, C.; et al. SNP–SNP Interaction Analysis of Soybean Protein Content under Multiple Environments. Can. J. Plant Sci. 2017, 97, 1090–1099. [Google Scholar] [CrossRef]
  8. Pathan, S.M.; Vuong, T.; Clark, K.; Lee, J.D.; Shannon, J.G.; Roberts, C.A.; Sleper, D.A. Genetic mapping and Confirmation of Quantitative Trait Loci for Seed Protein and Oil Contents and Seed Weight in Soybean. Crop Sci. 2013, 53, 765–774. [Google Scholar] [CrossRef]
  9. Thorne, J.C.; Fehr, W.R. Incorporation of High-Protein, Exotic Germplasm into Soybean Populations by 2- and 3- way Crosses. Crop Sci. 1970, 10, 652–655. [Google Scholar] [CrossRef]
  10. Warrington, C.V.; Abdel-Haleem, H.; Hyten, D.L.; Cregan, P.B.; Orf, J.H.; Killam, A.S.; Bajjalieh, N.; Li, Z.; Boerma, H.R. QTL for seed protein and amino acids in the Benning× Danbaekkong soybean population. Theor. Appl. Genet. 2015, 128, 839–850. [Google Scholar] [CrossRef]
  11. Wang, J.; Chen, P.; Wang, D.; Shannon, G.; Zeng, A.; Orazaly, M.; Wu, C. Identification and mapping of stable QTL for protein content in soybean seeds. Mol. Breed. 2015, 35, 1–10. [Google Scholar] [CrossRef]
  12. Zhang, T.; Wu, T.; Wang, L.; Jiang, B.; Zhen, C.; Yuan, S.; Hou, W.; Wu, C.; Han, T.; Sun, S. A combined linkage and GWAS analysis identifies QTLs linked to soybean seed protein and oil content. Int. J. Mol. Sci. 2019, 20, 5915. [Google Scholar] [CrossRef]
  13. Hyten, D.L.; Pantalone, V.R.; Sams, C.E.; Saxton, A.M.; Landau-Ellis, D.; Stefaniak, T.R.; Schmidt, M.E. Seed quality QTL in a prominent soybean population. Theor. Appl. Genet. 2004, 109, 552–561. [Google Scholar] [CrossRef]
  14. Chen, H.; Pan, X.; Wang, F.; Liu, C.; Wang, X.; Li, Y.; Zhang, Q. Novel QTL and Meta-QTL mapping for major quality traits in soybean. Front. Plant Sci. 2021, 12, 774270. [Google Scholar] [CrossRef]
  15. Taira, H. Quality of soybean for processed foods in Japan. Jpn. Agric. Res. Q 1990, 24, 224–230. [Google Scholar]
  16. Hou, A.; Chen, P.; Alloatti, J.; Li, D.; Mozzoni, L.; Zhang, B.; Shi, A. Genetic Variability of Seed Sugar Content in Worldwide Soybean Germplasm Collections. Crop. Sci. 2009, 49, 903–912. [Google Scholar] [CrossRef]
  17. Openshaw, S.J.; Hadley, H.H. Selection to modify sugar content of soybean seed. Crop Sci. 1981, 21, 805–808. [Google Scholar] [CrossRef]
  18. Hymowitz, T.; Collins, F.I. Relationship Between the Content of Oil, Protein, and Sugar in Soybean Seed. Agron. J. 1972, 64, 613–616. [Google Scholar] [CrossRef]
  19. Ficht, A.; Bruce, R.; Torkamaneh, D.; Grainger, C.M.; Eskandari, M.; Rajcan, I. Genetic analysis of sucrose concentration in soybean seeds using a historical soybean genomic panel. Theor. Appl. Genet. 2022, 135, 1375–1383. [Google Scholar] [CrossRef]
  20. Liu, C.; Chen, H.; Yu, Q.; Gu, H.; Li, Y.; Tu, B.; Zhang, H.; Zhang, Q.; Liu, X. Identification of quantitative trait loci and candidate genes for seed sucrose and soluble sugar concentrations in soybean. Crop Science. 2023, 63, 2976–2992. [Google Scholar] [CrossRef]
  21. Patil, G.; Vuong, T.D.; Kale, S.; Valliyodan, B.; Deshmukh, R.; Zhu, C.; Wu, X.; Bai, Y.; Yungbluth, D.; Lu, F.; et al. Dissecting genomic hotspots underlying seed protein, oil, and sucrose content in an interspecific mapping population of soybean using high-density linkage mapping. Plant Biotechnol. J. 2018, 16, 1939–1953. [Google Scholar] [CrossRef]
  22. Salari, M.W.; Ongom, P.O.; Thapa, R.; Nguyen, H.T.; Vuong, T.D.; Rainey, K.M. Mapping QTL controlling soybean seed sucrose and oligosaccharides in a single family of soybean nested association mapping (SoyNAM) population. Plant Breed. 2021, 140, 110–122. [Google Scholar] [CrossRef]
  23. Lee, D.; Lara, L.; Moseley, D.; Vuong, T.D.; Shannon, G.; Xu, D.; Nguyen, H.T. Novel genetic resources associated with sucrose and stachyose content through genome-wide association study in soybean (Glycine max (L.) Merr.). Front. Plant Sci. 2023, 14, 1294659. [Google Scholar] [CrossRef] [PubMed]
  24. Qi, Z.; Hou, M.; Han, X.; Liu, C.; Jiang, H.; Xin, D.; Hu, G.; Chen, Q. Identification of quantitative trait loci (QTLs) for seed protein concentration in soybean and analysis for additive and epistatic effects of QTLs under multiple environments. Plant Breed. 2014, 133, 499–507. [Google Scholar] [CrossRef]
  25. Zhang, D.; Lü, H.; Chu, S.; Zhang, H.; Zhang, H.; Yang, Y.; Li, H.; Yu, D. The genetic architecture of water-soluble protein content and its genetic relationship to total protein content in soybean. Sci. Rep. 2017, 7, 5053. [Google Scholar] [CrossRef] [PubMed]
  26. Teng, W.; Li, W.; Zhang, Q.; Wu, D.; Zhao, X.; Li, H.; Han, Y.; Li, W. Identification of quantitative trait loci underlying seed protein content of soybean including main, epistatic, and QTL× environment effects in different regions of Northeast China. Genome 2017, 60, 649–655. [Google Scholar] [CrossRef] [PubMed]
  27. Maughan, P.J.; Maroof, M.S.; Buss, G.R. Identification of quantitative trait loci controlling sucrose contentin soybean (Glycine max). Mol. Breed. 2000, 6, 105–111. [Google Scholar] [CrossRef]
  28. Nichols, D.M.; Glover, K.D.; Carlson, S.R.; Specht, J.E.; Diers, B.W. Fine mapping of a seed protein QTL on soybean linkage group I and its correlated effects on agronomic traits. Crop Sci. 2006, 46, 834–839. [Google Scholar] [CrossRef]
  29. Brown, A.V.; Conners, S.I.; Huang, W.; Wilkey, A.P.; Grant, D.; Weeks, N.T.; Cannon, S.B.; Graham, M.A.; Nelson, R.T. A new decade and new data at SoyBase, the USDA-ARS soybean genetics and genomics database. Nucleic Acids Res. 2021, 49, D1496–D1501. [Google Scholar] [CrossRef]
  30. Bhangu, R.; Virk, H.K. Nitrogen management in soybean: A review. Agric. Rev. 2019, 40, 129–135. [Google Scholar] [CrossRef]
  31. Bosaz, L.B.; Gerde, J.A.; Borrás, L.; Cipriotti, P.A.; Ascheri, L.; Campos, M.; Gallo, S.; Rotundo, J.L. Management and environmental factors explaining soybean seed protein variability in central Argentina. Field Crops Res. 2019, 240, 34–43. [Google Scholar] [CrossRef]
  32. Doyle, J.J. Isolation of plant DNA from fresh tissue. Am. J. Clin. Nutr. 1990, 12, 725–737. [Google Scholar]
  33. Valliyodan, B.; Shi, H.; Nguyen, H.T. A simple analytical method for high-throughput screening of major sugars from soybean by normal-phase HPLC with evaporative light scattering detection. Chromatography Res. Intl. 2015, 2015, 757649. [Google Scholar] [CrossRef]
Figure 1. Histogram of percent seed protein content (A) and sucrose content (B). The population distribu-tion in (A), showing a normal distribution with a slight skew to the left, suggests that recombinant types for protein can inherit higher protein content than either parent, suggesting additive effects for this trait. The population distribution in (A), suggesting the skew of this trait to the right of the curve, could suggest additive effects for this trait, suggesting selection for advantageous recombinant types is possible. P1: Parental line R07-2000. P2: Parental line R08-3221, #: Symbol used as number.
Figure 1. Histogram of percent seed protein content (A) and sucrose content (B). The population distribu-tion in (A), showing a normal distribution with a slight skew to the left, suggests that recombinant types for protein can inherit higher protein content than either parent, suggesting additive effects for this trait. The population distribution in (A), suggesting the skew of this trait to the right of the curve, could suggest additive effects for this trait, suggesting selection for advantageous recombinant types is possible. P1: Parental line R07-2000. P2: Parental line R08-3221, #: Symbol used as number.
Plants 13 00650 g001
Table 1. ANOVA analysis of the environment in parental lines F2:5 population.
Table 1. ANOVA analysis of the environment in parental lines F2:5 population.
TraitParentsF2:5
R08-3221R07-2000µ *Range SD SE §F
Protein47.9%-43.937.1–55.92.30.175.707 **
Sucrose-8.4%6.10.5–12.21.880.142.0803 **
* μ, Average is estimated from F2:5 population resulting from cross between R08-3221 × R07-2000. , Range is the lowest and highest value of the trait under evaluation from the F2:5 population. , SD represents standard deviation estimated variation within F2:5 population. §, SE represents standard error measured from F2:5 population. **, Effects with p-values < 0.001 given between two parental lines of R08-3221 and R07-2000.
Table 2. Interaction between environmental conditions on protein and sucrose content.
Table 2. Interaction between environmental conditions on protein and sucrose content.
TraitEstimate *Std Error t Ratio Prob > |t| §
Protein
      Location0.35749790.0694925.14<0.0001 **
      Year−0.1134050.069492−1.630.1030
      Location × Year0.32375480.0694924.66<0.0001 **
Sucrose
      Location−0.0092910.054748−0.170.8653
      Year0.17241540.0547483.150.0017 **
      Location × Year0.20437960.0547483.730.0002 **
Protein × Sucrose−0.2308640.036256−6.37<0.0001 **
*, Estimate: an estimate of the model coefficients; , Std Error: an estimated standard deviation of the population for each of the parameters; , t Ratio: the ratio of the estimate to its standard error; § Prob > |t|: the effect of the p value between two parameters. **, Effects with p-values < 0.001.
Table 3. Detected QTLs associated with protein and sucrose content.
Table 3. Detected QTLs associated with protein and sucrose content.
QTLChromosome.Left MarkerRight MarkerPosition (bp)LODAdd.
Pro-UA1111Gm11_37749863_G_TGm11_39108822_A_G3.77 × 107–3.91 × 1073.50.4
Pro-UA2020Gm20_22826023_G_AGm20_46314790_C_T2.28 × 107–4.63 × 1073.650.5
Suc_UA111Gm11_38424068_C_TGm11_39108822_A_G3.84 × 107–3.91 × 1073.3−0.3
Suc2_UA214Gm14_5418596_A_GGm14_7853431_G_T3.804 × 107–3.96 × 1074.21.1
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Jamison, D.R.; Chen, P.; Hettiarachchy, N.S.; Miller, D.M.; Shakiba, E. Identification of Quantitative Trait Loci (QTL) for Sucrose and Protein Content in Soybean Seed. Plants 2024, 13, 650. https://doi.org/10.3390/plants13050650

AMA Style

Jamison DR, Chen P, Hettiarachchy NS, Miller DM, Shakiba E. Identification of Quantitative Trait Loci (QTL) for Sucrose and Protein Content in Soybean Seed. Plants. 2024; 13(5):650. https://doi.org/10.3390/plants13050650

Chicago/Turabian Style

Jamison, Daniel R., Pengyin Chen, Navam S. Hettiarachchy, David M. Miller, and Ehsan Shakiba. 2024. "Identification of Quantitative Trait Loci (QTL) for Sucrose and Protein Content in Soybean Seed" Plants 13, no. 5: 650. https://doi.org/10.3390/plants13050650

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop