1. Introduction
Astragalus membranaceus var.
mongholicus (
AMM) is a perennial herb of the genus
Astragalus in the family of Fabaceae. Its dried root has been used as medicine in China for more than 2000 years [
1], with exports extending to more than 30 countries (or regions) such as South Korea, Japan, and the United States [
2,
3,
4]. Pharmacological studies have shown that some extracts and active compounds in
AMM can enhance immunity, inhibit tumor growth, and provide anti-oxidant effects [
5,
6]. In addition to its medicinal uses,
AMM has been used traditionally as a food additive in soup and porridge for its nutritional and health benefits in China. In 2023, the National Health Commission of China officially included
AMM in the food and medicinal material catalog [
7].
Wild
AMM is primarily distributed in Heilongjiang (Hulunbuir League), Inner Mongolia, Hebei, and Shanxi. Before the 1960s, the herb was mainly harvested from the wild. However, with the growing demand in both domestic and international markets, cultivation has now become the main source of supply for the herb. At present, it is mainly cultivated in Inner Mongolia, Gansu, Shanxi, Hebei, etc. [
8,
9]. Due to the lack of systematic scientific collation and evaluation of
AMM germplasm resources, the progress in breeding superior varieties has been slow. To date, no more than six new varieties have been successfully selected and bred in China [
8]. Cultivated
AMM suffers from species confusion, unstable yields, and wide variations in medicinal material quality [
10].
The collection, collation, and evaluation of
AMM germplasm resources are the basis for the breeding of superior varieties. Plant phenotypes, which are influenced by a combination of genetic factors and the environment over a long period of evolution [
11], intuitively reflect differences between germplasms and are a primary focus of research in plant breeding. As medicinal plants, the content of medicinal components is the most important factor in ensuring the quality of herbs [
12]. More than 180 components of
Astragalus have been reported, and the active components mainly include flavonoids, saponins, and polysaccharides [
13,
14,
15]. There are about 60 published flavonoid constituents and 48 saponin constituents of
AMM [
15,
16]. However, the concentration of each of these components within the plants is generally below 1% [
17]. It is therefore essential to obtain as much information as possible on the content of multiple compounds for evaluating the quality of
AMM. Currently, no studies have evaluated the chemical composition differences among various germplasm resources of
AMM.
Research on
AMM germplasm resources and their genetic diversity is relatively scarce. Jiang et al. [
18] analyzed the genetic diversity of wild and cultivated
Astragalus germplasm resources from 30 regions in Inner Mongolia using an optimized ISSR reaction system. Their results revealed a high level of genetic diversity among the germplasm, with genetic relationships correlated with geographic location. Zhang [
19] screened four germplasm samples with superior comprehensive characteristics from 26
AMM materials based on Astragaloside IV content, combined with seed characteristics, agronomic traits, RAPD genetic diversity, and kinship analysis. Sun [
20] evaluated 12
Astragalus germplasm resources and screened two with the potential for drought tolerance and high quality. The botanical, agronomic, and quality traits of
Astragalus germplasm resources exhibit complex diversity [
8,
21]. However, combining botanical, agronomic, and chemical traits based on metabolomics characterization for a comprehensive evaluation of
AMM germplasm and exploring the relationships among these traits has not been reported in relevant studies.
Powdery mildew is a major foliar disease that significantly impacts the cultivation of
AMM. It is widespread across
AMM growing regions in China and typically emerges at the beginning of summer [
22,
23]. During the initial infection stages, white, mold-like spots appear on the leaf surfaces, gradually spreading to the entire plant. As the disease progresses, black cleistothecia form, leading to plant death. In severe cases, the incidence rate can reach 100% in commonly affected areas [
23]. Previous studies have primarily focused on pathogen identification [
24,
25], disease occurrence patterns, and screening of chemical control agents [
26,
27,
28]. However, there are no reports on
AMM powdery mildew resistant germplasm.
In this study, we collected 33 AMM germplasms from five major production areas in China and cultivated them in the same germplasm nursery. These germplasms were thoroughly evaluated based on 14 botanical traits, 4 agronomic traits, 13 quality traits (5 flavonoids and 8 saponins), and resistance to powdery mildew. The aim was to establish a comprehensive evaluation method combining phenotypic traits and active ingredients, in order to select germplasm with high yield, high active ingredient content, and resistance to powdery mildew. These selected germplasms will serve as valuable materials for breeding new varieties of AMM. Furthermore, we explored the correlations among these traits, providing important insights into genetic improvement and further utilization of key traits.
2. Materials and Methods
2.1. Materials
The seeds of 33 AMM germplasms were collected from Gansu, Inner Mongolia, Shaanxi, Shanxi, and Hebei provinces. They were sown in nutrient pots in a greenhouse next to the nursery at the Shengqitang Company Astragalus planting base in Helinger County, Hohhot City, Inner Mongolia (40°15′ N, 111°43′ E, altitude 1200 m) between late March and early April 2021. After germination, the seedlings were transplanted to the field of germplasm resource nursery on 7–8 June 2021. The experimental site is located in a mesothermal continental monsoon climate, with an annual average temperature of 6.2 °C, an annual average sunshine duration of 2942 h, a frost-free period of 118 days, and an annual average precipitation of 392.8 mm.
The field experiment was conducted from March to December 2023. A completely randomized group design was employed, with a total experimental area of 800 m
2. Each plot contained 3–8 plant materials, with a uniform planting spacing of 70 cm between the rows and 70 cm between individual plants. From each germplasm, 5–10 phenotypically consistent plants were selected for marking to ensure the homogeneity of the survey subjects. The age and phenological stage of the plants were consistent throughout the process. Botanical traits such as stems, leaves, and flowers were observed and recorded in May–June; 5–15 plants per germplasm were randomly selected in August, and powdery mildew resistance in the field was counted. When the labeled plants were mature, 3–4 plants per germplasm were randomly selected and harvested for the determination of agronomic trait indexes and complete metabolomics analysis based on UPLC-ESI-QTOF-MS
E. All the germplasms were subjected to uniform field management measures. The above plants were identified as
Astragalus mongholicus Bunge (syn.
A.membranaceus (Fisch.) Bunge var.
mongholicus (Bunge) P. K. Hsiao) by Professor Bengang Zhang from the Institute of Medicinal Plants, Chinese Academy of Medical Sciences, Beijing, China. The names of the plant materials are shown in
Table S1.
2.2. Chemicals and Reagents
The analytical-grade alcohol was used the brand of Beijing Chemical Works. Pure water (18.2 MΩ) was obtained from a Milli-Q System (Millipore, Billerica, MA, USA). Reference substances were used to compare the MS data, and retention time (RT) of the identified compounds consisting of Calycosin-7-glucoside, Ononin, 9-O-Methylnissolin 3-O-glucoside, Calycosin, Astraisoflavan-7-O-β-D-glucoside, Astragaloside I, Astragaloside II, Astragaloside III, Astragaloside IV, Isoastragaloside I, Isoastragaloside II, Cyclocephaloside II, Cycloastragenol, purchased from Chengdu Manster Biotechnology Co., Ltd. (Chengdu, China). For UPLC-MS analysis, LC/MS-grade acetonitrile, methanol, and formic acid were used the brand of Thermo Fisher (Waltham, MA, USA), and water was used the brand of Guangzhou Watsons Food and Beverage Co., Ltd. (Guangzhou, China).
2.3. Measurement of Phenotypic Traits
A total of 14 botanical and 4 agronomic traits were set (
Table S2). Plant morphology was observed visually, while leaf color, stem color, and floret color were assessed using the PANTONE CMYK Coated color chart. The number of stem branches (NSB), the number of compound leaflets (NCL), the number of florets (NF) on a single inflorescence, and the number of seeds per pod (SP) were counted. The crown width (CW) and plant height (PH) were measured by a straightedge with 0.1 cm precision. The stem diameter (SD), compound leaf length (CLL) and width (CLW), leaflet length and width, and primary root diameter (RD) were measured by a vernier caliper with 0.01 mm precision. Root fresh weight (RFW) was evaluated by an electronic balance with 0.01 g precision and the number of lateral roots (NLR) was counted. The index of leaflet size and dry matter content were calculated as follows:
;
2.4. Measurement of Quality Traits
The fresh root segments of AMM were dried in an oven at 60 °C until reaching constant weight, then ground into a fine powder and passed through a No. 4 sieve. Subsequently, 20 mL of 80% methanol was added, and the mixture was weighed precisely. It was ultrasonicated (Power 250 W, Frequency 40 HZ) for 1 h, cooled down, and the lost weight was made up. After thorough shaking, the mixture was filtered. The resulting filtrate was injected into a liquid phase vial through a 0.22 μm filter membrane.
UPLC separation was achieved on a Waters ACQUITY I-Class system (Waters Corporation, Milford, MA, USA) using a Waters ACQUITY BEH C18 column (2.1 × 100 mm, 1.7 μm, MA, USA). The mobile phase consisted of 0.1% formic acid water (A) and acetonitrile (B). The gradient conditions were as follows: 0–5 min, 10–20% B; 5–20 min, 20–65% B; 20–25 min, 65–90% B; 25–26 min, 90–10% B; 26–30 min, 10% B. The online UV spectra were recorded in the range of 200–400 nm. The column and autosampler were maintained at 30 and 10 °C, respectively. The flow rate was 0.3 mL/min, and the injection volume was 1 μL.
Mass spectrometry was operated on the Waters Xevo G2-XS Q-TOF mass spectrometer (Waters Corporation, Milford, MA, USA) equipped with an electrospray ionization (ESI) source controlled by MassLynx 4.2 software (Waters, Corporation, Milford, MA, USA). A full scan was run in positive modes, with a mass range from m/z 100–1500 Da and with a 1 s scan time. Nitrogen was used as a nebulizer and auxiliary gas. In positive ion mode, the following parameters were found: capillary voltage of 3 kV; sampling cone voltage of 40 V; source temperature of 100 °C; desolvation temperature of 250 °C; cone gas flow of 50 L/h; desolvation gas flow of 600 L/h. The instrument was performed in both low-energy and high-energy scan functions, and the collision energy was 6 and 20–50 eV, respectively. Leucine enkephalin was used as a lock mass with a reference mass value at m/z = 556.2771 [
29].
The sample data collected by UPLC-Q-TOF/MS were imported into Progenesis QI 2.3 (Waters, Milford, MA, USA) software and ion forms such as [M+H]+, [M+Na]+, [M+K]+, [M+NH4]+, [2M+H]+, [2M+Na]+, [M+H-H2O]+, and [M+H-2H2O]+ were selected to unwrap the spectral data to improve the identification rate of the data. The data were imported in a .raw format, then QI automatically selected a more standard sample for calibration based on the imported sample, followed by peak alignment, peak extraction, normalization, and compound identification based on the reference substance.
2.5. Measurement of the Disease Incidence and Disease Index of Powdery Mildew
The plants were scored on the disease grade on a scale of 0, 1, 3, 5, 7, 9, and the rating was based upon the percentage of diseased area to whole plant leaf area (
Table 1).
The disease incidence and disease index were calculated based on the Formulas (1) and (2) as follows:
Note: NDP (number of diseased plants), TNP (total number of plants), NDPDG (number of diseased plants in each disease grade), SLDG (scale level of disease grade), and MDG (maximum disease grade).
2.6. Statistical Analysis
Based on the measurements, the mean values of each phenotypic trait were used for statistical analysis. Qualitative traits were graded and assigned values on a scale of 1–3. Morphological diversity was evaluated by the frequency of trait dispersion and the Shannon–Wiener diversity index (
H′) [
30]. Quantitative parameter statistics included minimum (Min), maximum (Max), mean, standard deviation (SD), coefficient of variation (CV, %) and
H′. Based on the mean (X) and standard deviation (σ), quantitative traits were categorized into 10 levels, from level 1 [Xi < (X − 2σ)] to level 10 [Xi ≥ (X + 2σ)], with an interval of 0.5σ between each level. The relative frequency of each level was calculated to obtain
H′. The
H′ for each trait was calculated by using the following formula:
H′ = −∑Pi × ln Pi (Pi is the proportion of the individual number of this trait in total individual number).
Correlation analysis (Pearson correlation was used when the absolute values of kurtosis < 10 and skewness < 3, otherwise Spearman correlation was used [
31]), systematic clustering, and mapping were performed using Origin Pro 2021 software. Principal component analysis (PCA) was conducted using SPSS 26.0 software, with principal components extracted based on the characteristic value > 1. A comprehensive evaluation model was constructed, and the comprehensive scores for different germplasm resources were calculated using this model.
4. Discussion
The phenotypic traits of plants represent their external genetic characteristics and are often used as visible markers for the selection of superior germplasms [
32]. The
H′ value and CV are the main indices used to evaluate the diversity of germplasm resources. Generally, when the
H′ value exceeds 1, it indicates rich diversity [
33,
34]. In this study, we analyzed the phenotypic traits of 33
AMM germplasm resources. The
H′ values for the four qualitative traits were all below 1, while the
H′ values for all 14 quantitative traits exceeded 1.9 (
Table 2 and
Table 3). The
H′ values for quantitative traits were higher than that for qualitative traits, likely due to the fact that qualitative traits are more influenced by allelic variations, making them highly heritable and less affected by environmental factors [
35,
36], consistent with previous results in
Medicago ruthenica L. [
37]. When the CV is greater than 10%, it indicates considerable variation between individuals [
38,
39]. In this study, except for the NCL and DMC, the CV values of the remaining 12 quantitative traits were all greater than 10% (
Table 3 and
Table 4). We also found that the underground traits showed higher variation than the above-ground traits, which is similar to the previous results obtained by Zhang for 35
AMM cultivated germplasms grown in Beijing and Shanxi province [
19]. We think this is probably due to
AMM being drought-tolerant plants, and the high genetic diversity of root traits may contribute to adaptation to arid environments, although no relevant studies have been reported.
We developed a UPLC-Q-TOF/MS method that can simultaneously measure 13 secondary metabolites of
AMM. The top six compounds by content in the
AMM population were Calycosin-7-glucoside, Isoastragaloside II, Ononin, Astragaloside IV, 9-O-methylnissolin 3-O-glucoside, and Cycloastragenol (
Figure 3B). They exhibit pharmacological activities such as anti-tumor, anti-oxidant, and anti-diabetic effects [
40,
41,
42]. Additionally, the composition of flavonoids and saponins varied across
AMM germplasms from different regions (
Figure 3A). The comparison revealed that the three germplasms with the highest total flavonoid content were from Inner Mongolia (An-25, An-26, and An-31), while the germplasm with the lowest content was from Gansu (Ag-16). However, Chen et al. [
43] found that the total flavonoid content of
Astragalus in Inner Mongolia was lower than that in Gansu. Similarly, the top three germplasms with the highest total saponin content were from Shanxi and Inner Mongolia (As-20, An-28, and An-31), whereas the lowest content was found in a germplasm from Shaanxi (Ax-3). Zheng et al. [
44] also reported that the total saponin content in
Astragalus from Shanxi and Inner Mongolia was higher than that from Shaanxi. Since 69.70% of the germplasms in this study were sourced from Inner Mongolia, whether the levels of total flavonoids and total saponins are associated with geographical regions requires further investigation with an expanded sample size.
High content traits are a major breeding goal for
AMM, and understanding the relationship between phenotypic traits and active compound content can aid in the early screening of germplasm based on visible traits. Correlation analysis revealed a significant positive correlation between flavonoid content and the number of leaflets, but a negative correlation with crown width and stem diameter (
Figure 4). This suggests that a higher number of leaves promotes the synthesis of secondary metabolites, leading to a higher flavonoid content in the roots. In contrast, saponin content showed a negative correlation with botanical traits, with higher saponin levels observed when the above-ground parts were shorter. Similarly, saponin content was negatively correlated with agronomic traits as well. This may be because saponins in
Astragalus are primarily located in the root phloem [
45,
46,
47]. When the roots are thinner, the proportion of phloem increases, leading to higher saponin content. In summary, selecting breeding materials with taller plants, thicker stems, and abundant foliage is beneficial for achieving higher yields. Germplasm with more leaves is likely to have higher flavonoid content, while shorter plants with thinner roots may exhibit higher saponin content.
A comprehensive score (F-value) could be calculated based on the generated PCA, which fully accounts for the correlations and variations among multiple indicators, thus reflecting the overall quality of the germplasm. This method is commonly applied to the comprehensive evaluation of various crop varieties [
48,
49,
50]. In addition, all germplasm resources were classified into five categories by cluster analysis, each exhibiting distinct characteristics (
Figure 5). This classification allows for the selection of specific cluster materials based on practical breeding objectives in future improvement programs. Moreover, significant phenotypic differentiation was observed among the clusters, offering valuable references for selecting hybrid parents and optimizing breeding combinations [
51].
Severe powdery mildew infection was observed in the
AMM germplasm resource nursery. Our investigation found that there were two types of disease resistance, one with relatively lower incidence and a disease index below 30%, indicating mild symptoms, such as An-17 and An-21. The other type has a higher incidence but mild symptoms, which belong to the type of disease tolerance (
Figure 6). This suggests diverse resistance mechanisms across germplasms, and their molecular basis needs to be further studied. Overall, the proportion of
AMM disease-resistant germplasm was low (15.15%). In order to reduce the occurrence of powdery mildew and decrease pesticide use, disease-resistant breeding needs to be carried out. Additionally, we observed that the powdery mildew disease index was negatively correlated with Ononin and flavonoid content (
Figure 7). For example, the germplasm An-26, which had the lowest disease index, exhibited higher flavonoid content compared to 28 highly susceptible germplasms. This may be related to powdery mildew-induced leaf senescence, which reduces photosynthesis and regulates gene expression in the flavonoid biosynthesis pathway [
52,
53,
54]. Therefore, plants with a high flavonoid content may have strong resistance to powdery mildew, which provides us with a screening marker for breeding highly resistant varieties.