Next Article in Journal
A Missense Mutation (c.1037 G > C, p. R346P) in PAPSS2 Gene Results in Autosomal Recessive form of Brachyolmia Type 1 (Hobaek Form) in A Consanguineous Family
Next Article in Special Issue
Genetic Characterization of Some Saudi Arabia’s Accessions from Commiphora gileadensis Using Physio-Biochemical Parameters, Molecular Markers, DNA Barcoding Analysis and Relative Gene Expression
Previous Article in Journal
ABCA1 Polymorphism R1587K in Chronic Hepatitis C Is Gender-Specific and Modulates Liver Disease Severity through Its Influence on Cholesterol Metabolism and Liver Function: A Preliminary Study
Previous Article in Special Issue
Estimation of Nuclear DNA Content in Some Aegilops Species: Best Analyzed Using Flow Cytometry
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Assessment of the Genetic Distinctiveness and Uniformity of Pre-Basic Seed Stocks of Italian Ryegrass Varieties

Laboratory of Genetics and Genomics for Plant Breeding, Department of Agronomy, Food, Natural Resources, Animals and Environment, Campus of Agripolis, University of Padova, 35020 Legnaro, Italy
*
Author to whom correspondence should be addressed.
Genes 2022, 13(11), 2097; https://doi.org/10.3390/genes13112097
Submission received: 30 September 2022 / Revised: 3 November 2022 / Accepted: 9 November 2022 / Published: 11 November 2022

Abstract

:
Lolium multiflorum Lam., commonly known as Italian ryegrass, is a forage grass mostly valued for its high palatability and digestibility, along with its high productivity. However, Italian ryegrass has an outbreeding nature and therefore has high genetic heterogeneity within each variety. Consequently, the exclusive use of morphological descriptors in the existing varietal identification and registration process based on the Distinctness, Uniformity, and Stability (DUS) test results in an inadequately precise assessment. The primary objective of this work was to effectively test whether the uniformity observed at the phenological level within each population of Italian ryegrass was confirmed at the genetic level through an SSR marker analysis. In this research, using 12 polymorphic SSR loci, we analyzed 672 samples belonging to 14 different Italian ryegrass commercial varieties to determine the pairwise genetic similarity (GS), verified the distribution of genetic diversity within and among varieties, and investigated the population structure. Although the fourteen commercial varieties did not show elevated genetic differentiation, with only 13% of the total variation attributable to among-cultivar genetic variation, when analyzed as a core, each variety constitutes a genetic cluster on its own, resulting in distinct characteristics from the others, except for two varieties. In this way, by combining a genetic tool with the traditional morphological approach, we were able to limit biases linked to the environmental effect of field trials, assessing the real source of diversity among varieties and concretely answering the key requisites of the Plant Variety Protection (PVP) system.

1. Introduction

Lolium multiflorum Lam., commonly known as Italian ryegrass, is one of the most important groups of grasses largely used as forage crops due to its high productivity and high nutritional value as livestock feed, especially in terms of fiber palatability and digestibility [1,2]. Over the years, advantages have also been demonstrated from an environmental point of view: L. multiflorum can be used as a soil stabilizer during the winter to provide ground cover against soil erosion and depletion. Moreover, despite its lower persistence and stress tolerance in comparison with Lolium perenne, Italian ryegrass is more productive and can provide faster ground cover due to its timely emergence and seedling vigor.
Both annual and biennial varieties are available on the market, and to prevent any misunderstanding, it is fundamental to clarify that they both belong to this species: the truly annual forms of L. multiflorum Lam. var. westerwoldicum Wittm. (Westerwolds ryegrass) can be distinguished by their complete flowering in the year of sowing from the biennial forms of L. multiflorum Lam. ssp. italicum (A. Br.) Volkarts, which generally produce very few seed heads in the sowing year, complete the cycle the following year after cold and short-day conditions, and remain leafy during the entire season.
Although Italian ryegrass is naturally diploid, tetraploid forms have been developed by chromosome doubling to achieve higher biomass production and better nutritional characteristics. The increase in an organism’s cellular ploidy caused by genome replication without mitosis, as occurred in Italian ryegrass using colchicine (i.e., the most efficient anti-mitotic agent and mitosis inhibitor [3]), has been shown to play an important role in physiology and development via cellular, metabolic, and genetic effects [4,5]. Compared to their diploid counterparts, polyploid forage crops display enlarged leaf dimensions and plant height [6,7] but also faster regrowth after grazing and an increasing number of branches and stems [8]. In particular, tetraploid cultivars of Italian ryegrass showed not only faster leaf elongation rates with respect to diploid cultivars, resulting in longer leaves, but also larger shoot dry weights under stress-free conditions [9,10]. Moreover, positive effects have also been reported in enhanced forage quality, since the increased production of secondary metabolites in the induced polyploids together with a higher cell content/cell wall ratio leads to improved succulence and higher forage intake [11,12].
The most common method used to produce Italian ryegrass varieties is based on a recurrent selection strategy, consisting of repeated cycles of intercrossing among selected superior individuals. Almost 700 varieties of Italian ryegrass were reported in the 2022 edition of the OECD (Organization for Economic Co-operation and Development) list of varieties eligible for seed certification [13]. This highlights the economic value of registering a commercial variety and the consequent necessity of protecting breeder rights. The official regulation of Italian ryegrass cultivar identification and registration drafted by the International Union for the Protection of New Varieties of Plants (UPOV) is based on the DUS (Distinctness, Uniformity, and Stability) test, considering all morphological traits, such as length and width of leaves, the intensity of green color, height and width of plants, length of the inflorescence, and the number of spikelets. Ploidy level is the only non-morphological trait considered as a distinction criterion by UPOV. Grow-out tests are performed in both spaced plant and row plot trials, and the visual scoring of traits is frequently performed using numerical scales with different minimum–maximum ranges (e.g., 1–3, 1–5, or 1–9). However, considering the outbreeding nature of Italian ryegrass, and thus the high genetic heterogeneity within each variety, the exclusive use of morphological descriptors to discriminate and compare the increasing number of varieties is becoming difficult and costly. In addition, the morphology and physiology of a plant are the results of strong interactions with environmental factors such as rainfall, temperature, light exposure, and soil composition. Thus, the idea of a unique and fixed guideline based on the evaluation of morpho-physiological traits results in an inaccurate assessment, especially in a worldwide market.
Without a more precise and standardizable characterization, it is impossible to clearly differentiate one variety of Italian ryegrass from another and guarantee real compliance with the DUS requirements in the current Plant Variety Protection (PVP) system.
In this scenario, molecular tools would be useful in implementing the traditional DUS test, as proposed by Gilliland et al. [14], to avoid the plagiarism risk and demonstrate a clear improved value for cultivation and use (VCU) for a new registered variety. Indeed, a DNA marker system could avoid strong environmental dependence, having the potential to investigate many more samples in genotypic assays. In other words, molecular markers such as simple sequence repeats (SSRs) and single-nucleotide polymorphisms (SNPs) represent more robust and objective tools to be integrated into the PVP system for Italian ryegrass. In particular, microsatellite loci consist of short DNA sections of tandemly repeated di-, tri-, and tetranucleotide motifs, and consequent polymorphisms derive from a different number of repetitions. These differences can be detected with conserved PCR primers designed on the nonrepetitive flanking regions. Since they are genetically well-defined and codominant, SSRs have become a powerful genetic tool for marker-assisted breeding (MAS) and the assessment of diversity and uniformity. In addition, SSRs are highly polymorphic markers and are therefore able to discriminate closely related varieties, as often happens for Italian ryegrass. Finally, the SSR assay is easily performed by PCR, a method that does not require specific and expensive laboratory equipment. Despite the recent development of next-generation sequencing (NGS) techniques, particularly restriction site-associated DNA sequencing (RAD-seq) [15] and genotyping-by-sequencing (GBS) technologies, such as genotyping by random amplicon sequencing and direct (GRAS-Di) [16], we want to provide a molecular assay that is less costly and time-consuming and that could be practically applied by small laboratories of seed companies. From the literature, the use of molecular markers in the evaluation of the genetic diversity of L. multiflorum accessions has been reported by few studies using sequence-related amplified polymorphisms (SRAPs) [17], randomly amplified polymorphic DNA (RAPD) [18], and SSRs combined with bulk strategies [19].
In this study, we used 12 polymorphic SSR loci to genotype 14 Italian ryegrass commercial varieties, each composed of 48 individuals (672 samples in total). In the downstream analysis, we determined the genetic similarity (GS), verified the distribution of genetic diversity within and among varieties, and investigated the population structure. The main aim was to concretely test whether the uniformity observed at the phenological level within each population of Italian ryegrass was confirmed at the genetic level through an SSR marker analysis. Thus, in this way, we limited biases linked to the environmental effect of field trials. Thus, we were able to evaluate the real source of diversity among varieties and obtained a concrete answer to the fundamental requisites of the PVP system.

2. Materials and Methods

2.1. Plant Materials

In this study, 14 commercial varieties of Italian ryegrass, differing in ploidy level (diploid or tetraploid) and vegetative habit (annual or biennial) (Table 1), were provided and bred by the same private company. For each variety (here named from A to P), 80 seeds belonging to the latest generation of a purification process were sown in growing boards with a common potting soil and grown in a greenhouse under short-day conditions and a temperature of 20 °C at the “L. Toniolo” experimental farm of the University of Padova in Legnaro (PD). After three weeks, leaf samples were collected from 48 seedlings of each variety, for a total of 672 samples.

2.2. DNA Extraction

Genomic DNA (gDNA) was extracted from young freeze-dried leaves finely ground with a TissueLyser II mill (Qiagen, Valencia, CA, USA) using a DNeasy® 96 Plant Kit (Qiagen) following the manufacturer’s protocol. The quantity and quality of the gDNA were checked using a NanoDrop spectrophotometer (Thermo Fisher Scientific, Pittsburgh, PA, USA) in terms of concentration and 260/280 and 260/280 ratios. Extracted gDNA was stored at −20 °C until further PCR amplification.

2.3. SSR Marker Genotyping

For genotyping analysis, an initial batch of 28 primer pairs was chosen from Hirata [20] and Guan [21] considering their polymorphism information content (PIC) and their distribution within the seven linkage groups (LG). Preliminary tests on a subset of four randomly chosen samples were performed in singleplex reactions to verify primer efficiency in terms of the presence of polymorphic alleles for each marker. Thus, 12 primer pairs were selected and combined into three different multiplex groups (Table 2) based on similar annealing temperatures, diverse predicted amplicon sizes, and minimum tendencies of dimer formation checked with PerlPrimer v1.1.21. software. Multiplex PCRs were performed according to the three-primer system described by Schuelke [22], with some minor changes. Each locus was amplified by a pair of locus-specific primers, one with an oligonucleotide tail at the 5′ end (M13, PAN-1, PAN-2, or PAN-3) and a third primer complementary to the tail and labeled with a fluorescent dye (6-FAM, VIC, NED, or PET), necessary to subsequently discriminate the different loci during chromatogram screening.
The multiplex PCR mixtures had a total reaction volume of 20 µL resulting from 1 × Platinum® Multiplex PCR Master Mix, 10% GC Enhancer (Applied Biosystems, Carlsbad, CA, USA), 0.25 μM of each tailed primer, 0.75 μM of each tailed reverse primer, 0.5 μM of each fluorescent labeled primer (Applied Biosystems), 20–30 ng of gDNA, and sterile water to volume. Touchdown PCRs were completed for all three multiplexes in 96-well plates under the following conditions: initial denaturation of 5 min at 95 °C, followed by 6 cycles at 95 °C for 30 s, extension at 65 °C for 45 s, which decreased by 1 °C with each cycle, and at 72 °C for 45 s, then 34 cycles at 95 °C for 30 s, at 59 °C for 45 s, and at 72 °C for 45 s. The reaction finished with a final extension step of 30 min at 60 °C. Finally, the quality of the amplicons was verified by electrophoresis run on a 2% agarose/1 TAE gel containing 1× SYBR Safe DNA Gel Stain (Life Technologies).
The PCR products were dried at 65 °C and investigated with capillary electrophoresis using an ABI 3730 DNA analyzer, adopting the GeneScan 500 Liz as the molecular weight internal standard. Thus, chromatograms were screened to define amplicon size at each locus using Peak Scanner software 2.0 (Applied Biosystems).

2.4. Genetic Similarity Estimates and Genetic Diversity and Relationship Analyses

Two distinct datasets for marker statistics and analyses were constructed, setting different thresholds of missing data: a dataset admitting samples with missing data of two SSR loci at most, and a second dataset composed of samples with missing data of five SSR loci at most. The strictest dataset (576 samples) was uploaded to the NTSYS software package v.2.21c [23] and used to calculate the genetic similarity (GS) among individuals in all possible pairwise comparisons based on Rohlf’s simple matching (SM) coefficient. This coefficient was used because it also takes into account negative co-occurrences of alleles between different samples and populations. In addition, the average GS within and among each variety was calculated. Using the same software and Rohlf’s GS matrix, a principal coordinates analysis (PCoA) was carried out.
The genetic relationship analysis was performed according to the maximum-likelihood method (ML) implemented in IQ-Tree v1.6.12 software. This approach enabled us to associate phylogenetic and combinatorial optimization techniques into a fast and effective tree search algorithm with the implementation of the ultrafast bootstrap approximation approach [24]. The SSR marker dataset (576 samples) resulting matrix was analyzed as binary data using the GTR2 method (GTR2 + I + G4 + FO), according to the BIC value found with the ModelFinder algorithm available in the IQ-Tree. The GTR model, used to investigate the genetic relationship with SSR data, was also selected according to the scientific research of Huang et al. [25], Vieira et al. [26], and Minin et al. [27]. Statistical support for the ML dendrogram was computed by running 1000 replicates until convergence for ultrafast bootstrap (UFB) (-bb 1000) [28,29] and 1000 rounds of SH-like approximate likelihood ratio tests (SH-aLRT) (-alrt 1000) [30].
On the other hand, the second most comprehensive dataset, including 644 samples, was used both for the SSR statistics and the population structure analysis. GenoDive v3.0 software [31] was used to perform the Analysis of Molecular Variance (AMOVA) through F-statistics to analyze the molecular variance at different levels of the population structure (i.e., individual, subpopulation, and population levels), with the number of permutations = 999. GenoDive allows for a correction of the unknown dosage of alleles, permitting the statistics to estimate without or with a reduced bias. This correction uses a maximum likelihood method based on random mating within populations, as first proposed by De Silva et al. [32] and then modified by Meirmans et al. [33]. Genetic diversity was also investigated through the following statistics: number of observed alleles (Na), number of effective alleles (Ne), and observed (Ho) and expected (He) heterozygosity within subpopulations assuming Hardy–Weinberg equilibrium according to Nei. These parameters were also calculated for each Italian ryegrass variety.
F-statistics were computed, and the fixation index (FST) and gene flow (Nm) were calculated for each marker locus. FST measures the amount of genetic variance that can be explained by population structure based on Wright’s F-statistics [34], while Nm = {(1/FST) − 1}/4. An FST value of 0 indicates no differentiation between the subpopulations, while a value of 1 indicates complete differentiation [35]. Inbreeding coefficients (FIT and FIS) were computed to measure the deficiency (positive values) or excess (negative values) of heterozygotes for each assessed microsatellite. Similarly, inbreeding coefficients were calculated at the multilocus level to estimate the genetic effect of total population subdivision as a proportional reduction in overall heterozygosity due to variation in SSR allele frequencies among different subpopulations. Finally, marker allele frequencies for each locus were also determined.
Considering each variety separately, Wright’s inbreeding coefficient GIS was evaluated to investigate the deficiency and excess of heterozygosity within every Italian ryegrass variety. Moreover, a matrix with Nei’s gene diversity (Gst) values (analogous to FST) was constructed for all possible pairwise comparisons between the fourteen varieties.

2.5. Genetic Structure Analyses

In addition, the population structure of the 644 samples was assessed using the clustering algorithm of STRUCTURE v2.2 software. Since no a priori knowledge of the origin of the populations under study was available, the admixture model was chosen. We ran the Markov chain Monte Carlo (MCMC) model with 100,000 iterations and a burn-in of 20,000 samples under the assumption that the allele frequencies in the populations were correlated. Ten iterations were conducted for each value of the number of populations (K), with K ranging from 1 to 20. The obtained results were analyzed using the STRUCTURE HARVESTER web software to calculate the best value of K according to Evanno et al. [36], and then estimates of membership were plotted as a histogram using an Excel spreadsheet.

3. Results

3.1. Descriptive Statistics of SSR Marker Loci

A total of 239 alleles were detected across the 14 varieties for the 12 microsatellite loci with an average number of observed alleles (Na) of 19.9, ranging from 10 (02_01B) to 48 (02_10B). Moreover, the effective number of alleles (Ne) per locus varied from 1.7 (02_01B) to 5.2 (02_10B), as reported in Table 3. The same parameters calculated for each variety were equal to 7.3 (Na) and 3.5 (Ne), ranging from 4.4 in variety I to 9.3 in variety N and from 2.5 (variety A) to 4.6 (variety D), respectively (Table 4).
Allele frequencies were calculated per locus within each population, permitting the identification of the most common genotype.
Out of 239 alleles, 108 were considered “rare”, since their overall frequency was lower than 1% [37]. Rare alleles were found at each locus, ranging from 4 to 16. Private alleles, those detected only in a specific variety but absent in all the others, were also searched. Fifty-nine private alleles were detected. On average, each variety presented five private alleles (from 1 of A to 20 of O), with the exception of G and I, which did not show any private allele. Most of them, i.e., 52 out of 59, had a frequency lower than 5% but higher than 1% (not considerable as rare alleles).
Descriptive statistics for Nei’s genetic diversity (H-statistics) and Wright’s inbreeding coefficients (F-statistics and GIS) for single marker locus and population accessions were computed. The mean observed heterozygosity Ho equal to 0.498 ranged from 0.384 to 0.671 among the 12 loci; similarly, He ranged from 0.42 (02_01B) to 0.827 (02_10B) among loci, with a mean value of 0.649. The inbreeding coefficient (FIS) had an average value of 0.302 for microsatellite loci, derived from all positive values, except −0.024 of the 16_03D marker. Finally, FIT and FST were both positive and, on average, equal to 0.385 and 0.125, respectively, while the gene flow (Nm) was equal to 1.753. The calculated Nm values were >1 in all assayed marker loci (except for 02_02C and 12_05E), ranging from 0.871 to 3.987 over all accessions (on average, Nm = 2.183), hence supporting the little genetic differentiation between the fourteen varieties in the analysis.
Considering each variety separately, the mean Ho amounted to 0.498, ranging from 0.276 in variety D to 0.686 in variety N, whereas the mean expected heterozygosity He of 0.652 varied from 0.553 in variety I to 0.727 in variety D. An important finding is that the expected heterozygosity (He) over all the plant accessions scored a mean value significantly higher than the observed heterozygosity. As a consequence, Wright’s inbreeding coefficient GIS scored positive values, revealing a marked deficiency of heterozygotes across more than half of the varieties (especially varieties B, D, E, H, L, and O), although negative values of GIS were estimated for varieties F, I, M, and P, suggesting an excess of heterozygous individuals within these varieties.
Pairwise GST values indicate some genetic differentiation between populations (Table 5). Levels of pairwise population differentiation were variable, ranging from 0.019 to 0.236. The lowest GST value (0.019) was detected between the M and N varieties, which showed that these varieties had the lowest genetic differentiation and differences. Conversely, the highest GST value (0.236) was presented between the A and L varieties, which had the highest genetic differentiation and differences.
Pairwise comparisons among the 12 microsatellites and all 14 varieties revealed a significant deviation from the Hardy–Weinberg equilibrium with a significant multilocus heterozygosity deficiency (Wahlund effect), since at the multilocus level, the inbreeding coefficient GIS showed all positive values, ranging from 0.142 to 0.62 (Table 6).

3.2. Genetic Diversity and Clustering Analysis

Genetic variability within and between varieties was investigated primarily by calculating genetic similarity (GS) for all possible pairwise comparisons among the 576 samples (the strictest dataset) using the entire set of marker alleles scored at all genomic loci. In particular, a pairwise genetic similarity matrix was calculated using a simple matching coefficient. Rohlf’s genetic similarity ranged from 78% to 99% among all individuals analyzed. When calculated within each variety, GS varied on average from 88.5% (±2.03) within the N variety to 92.6% (±1.99) within the G variety (Table 7). In the pairwise comparisons between varieties, the N and H populations showed the lowest average value (86.48 ± 1.99%), while the I and A populations exhibited the highest value (90.45 ± 2.07%) (Figure 1).
In addition, an analysis of molecular variance (AMOVA) was conducted to better understand the distribution of genetic differentiation among and within varieties of Italian ryegrass. Findings from AMOVA revealed that 87% (resulting from the sum of 60% found within all the individuals and 27% found among the individuals nested in populations) of the total genetic variation was contributed by differences within varieties, which was notably and significantly higher than that among varieties, since only 13% of the total genetic variation was due to differences among varieties.
Clustering analyses were performed to identify PCoA centroids based on the genetic similarity estimates and to draw the maximum likelihood (ML) dendrogram. Generally, the ML dendrogram revealed a slightly structured distribution of Italian ryegrass accessions (Figure 2). In most cases, individuals from different varieties clustered separately, although two admixed groups were present. Interestingly, the A variety appeared totally distinguishable from all the others, with a statistical UFB support of 79. Also identified were two main monophyletic subclusters, which divided most of the samples of the P-M/N-F-I-L-G varieties from the E-H-D-B-C varieties. Notably, samples from the P and 8 of the D variety formed a distinct cluster that differed from other varieties. Additionally, varieties D, E, and M/N showed slight fragmentation, since they were split into three or two parts. Notably, samples belonging to the O variety did not cluster together, spreading into almost all other clusters, especially into the two admixed groups and with the C and E samples.
Principal coordinates analysis graphically represented the spatial distribution of the samples (Figure 3). To obtain clearer and more readable graphical representations, we plotted the annual varieties separate from the biennial data. In the case of the annual varieties, the first two principal components were able to explain 55.2% of the total genetic variation found within the population as a whole, although most of the accessions were closely plotted in the central area of the four main quadrants. In particular, with the first component, explaining 28.8% of the total diversity, accessions of the I variety could be discriminated from the subgroup samples of O and B and part of the samples belonging to the H variety. The second component, which explained 26.4% of the total diversity, was clearly able to distinguish the F samples from individuals of I and H varieties.
For the biennial varieties, even though most of the accessions were nearly plotted in the central part of the four main quadrants, individuals belonging to the L variety were grouped together and well separated from the other samples. The first two principal components were able to explain 60.3% of the total genetic variation found within the population of the biannual varieties. In particular, the first component, which explained 33.9% of the total diversity, discriminated samples of the L and G varieties from the C and D varieties. On the other hand, with the second component, explaining 26.4% of the total diversity, accessions of the L variety could be distinguished from the G samples and part of the individuals belonging to the M and N varieties. In addition, as shown in the dendrogram, C could not be distinguished from D and partially from E.

3.3. Population Structure

Based on the marker alleles at all SSR loci, the genetic structure of the Italian ryegrass core collection (644 samples) was analyzed using STRUCTURE v. 2.2 software [38] and Structure Harvester software [39] to determine the most likely number of ancestral genotypes represented by the core collection. Following the procedure of Evanno et al. [36], a clear maximum for the ΔK value at K = 2 was found (ΔK = 117.5), and a second lowest value was found for K = 13 (ΔK = 9.8). In particular, for K = 2 (Figure 4a), the population was split into two genetically distinguishable subgroups, representing the major marker allele clusters or the ancestral multilocus haplotypes. Similarly, the result observed for K = 13 (Figure 4b) enabled the discriminative clustering of the analyzed varieties, with one exception for varieties M and N.
Starting from the graphical representation of K = 2, each sample is plotted as a vertical histogram divided into K = 2 (and then for K = 13) with colored segments representing the estimated membership in each hypothesized ancestral genotype. The first clustering of the considered genotypes revealed that 518 of the 644 samples showed a strong ancestral association (>90%). Individuals’ membership with the first identified cluster was observed to be higher than 80% in almost all of the analyzed accessions. Particularly, populations A, B, C, D, O, and P had 94%, 91%, 98%, 89%, 92%, and 79% of samples with membership to the first cluster higher than the considered threshold, respectively. On average, all of these varieties presented a membership higher than 90% (except for P with 84%) with the main cluster. On the other hand, populations E, F, G, H, I, and L had 80%, 100%, 100%, 87%, 93%, and 98% of the samples with membership with the second cluster higher than 80%, respectively. Additionally, in this case, the average membership with the second cluster of all these varieties was higher than 90%, with the exception of the E variety with 87%. In contrast, most of the admixed genotypes (<80% membership in a single ancestral genotype) were from the M and N varieties, with 62% and 67% of the samples, respectively. Specifically, these two populations showed complementary average membership to the two hypothesized ancestral genotypes: the M variety had 63% and 37% membership to Cluster 1 and Cluster 2, while the N variety had 40% and 60%, respectively. More generally, other populations had a very low percentage of admixed samples with this clustering, ranging from 24% and 20% for the E and P varieties to the total absence of admixed genotypes in the F and G populations.
The second largest ΔK revealed an additional level of population structure and allowed the clustering of all investigated genotypes into thirteen additional subgroups. Interestingly, the ancestral population size K = 13 corresponded to the number of varieties used in this study, combining the M and N varieties into a unique admixed group. On average, all individuals of the same variety showed a membership higher than 50% to a specific cluster, which was different for each variety (exceptions were observed for varieties M and N). In addition, membership mean values higher than 80% were reached in the B (82.4%-Cluster 12), C (84.6%-Cluster 4), F (84%-Cluster 7), G (83.9%-Cluster 11), I (87.5%-Cluster 1), and L varieties (80.5%-Cluster 13) by the majority of their individuals. The only cluster shared by two different varieties (M and N) was Cluster 3, with an average membership equal to 50.8% and 68.8%, respectively. Moreover, considering those samples with a membership to the major cluster lower than 80% as admixed, the percentage of admixed samples within the same population ranged from 64% (the M variety) to 13% (the I variety). In particular, the highest proportions of admixed samples were shown within the H (58%), D (57%), N (56%), and O varieties (52%). In contrast, the I, C, and F populations had only 13%, 25%, and 25% admixed samples, respectively.

4. Discussion

Morphological characteristics have been used for descriptive purposes and are traditionally used to distinguish plant varieties. However, these methods are questionable because of the strong effect of the environment on morphological traits. In addition, this approach is inefficient because of the time and cost involved [40]. Moreover, morphological criteria alone are not sufficient to distinguish some different varieties that are morphologically similar. Therefore, the molecular fingerprinting of a plant variety is extremely important for protecting plant breeders’ rights (PBR) [41,42]. Although some issues were related to the application of molecular tools (e.g., the lack of information related to the allelic dosage, the requirement of an appropriate number of markers, and the need for specific bioinformatic knowledge), SSR markers are considered more reliable than RAPD and AFLP markers because of their ability to produce high-fidelity profiles thanks to their codominant nature and specificity [43]. As a consequence, several researchers have reported the application of SSR markers for genetic diversity assessment and variety identification in crops [44,45,46], horticultural plants [47,48,49], and forages [50,51,52].
In this study, 14 commercial varieties of Italian ryegrass were tested with 12 SSR loci, analyzing a total of 672 samples to assess their intrapopulation and interpopulation genetic variation and differentiation and the structure of the population as a whole, and in turn to evaluate them in the PVP context. Detecting genetic variability by means of marker allele composition, allele proportion, and multilocus genotypes in different accessions of L. multiflorum Lam. is a crucial step toward a more trusted variety identification and stronger protection for breeder rights.
As a general rule, the outcrossing breeding system of self-incompatible species, along with the annual life cycle, may have a central role in affecting the conservation of genetic diversity in Italian ryegrass. Indeed, for species with outcrossing mating systems, within-variety variation is usually high because pollen can be widely spread between varieties, resulting in a low degree of differentiation attributable to among-varieties variation [53]. In this study, 87% of the total variation was attributable to within-cultivar genetic variation, while variance among cultivars was only responsible for 13% of the total variation, which is consistent with results from previous studies in cross-pollinated forage plants. For example, Nie et al. [19] reported a within-variety genetic variation value equal to 82% of the total variation, testing six Chinese L. multiflorum Lam. varieties. Moreover, in perennial ryegrass cultivars, the total within-cultivar component of genetic variation was 85% [54]. In tetraploid white clover, the within-cultivar variation explained 84% of the total variation [55].
The simultaneous occurrence of a marked inbreeding rate (FIS = 0.302, on average), a reduced fixation index (FST = 0.125, on average), and a high gene flow (Nm = 2.18, on average) between overall accessions confirm that most allele diversity and genotype variation are found within populations and that genetic differentiation among varieties is low. Such a low genetic differentiation can be attributable to a high gene flow among populations because of pollen dispersal among too close multiplication fields of different varieties. In addition, along the variety selection and commercialization process other possible causes could be related to variability of heading time and consequent preferential pairing, climate/location effect during seed batch multiplication, or any possible admixture events during storage and bagging processes. Interestingly, the heterozygosity deficiency seen in our results, with a consequent robust deviation from Hardy–Weinberg equilibrium, is not only a common observation in commercial varieties due to the human-induced bottleneck and artificial selection, but also attests to the low variability in these accessions.
The STRUCTURE analysis suggested moderate structuration of populations and the existence of admixed ancestry based on two or thirteen distinct gene pools. It should be noted that the subdivision in two genetically distinguishable subgroups does not reflect either the two different ploidy levels or the two vegetative habitus. However, the resulting information from the STRUCTURE analysis partially confirmed the outcomes from the maximum likelihood dendrogram. In particular, the I, L, and G varieties could have the same hypothesized ancestral genotype according to STRUCTURE analysis and formed a monophyletic group in the ML cladogram with significant UFB support (91). Moreover, samples belonging to the M and N varieties appear to be closely related to each other. In fact, when K = 13 in the STRUCTURE analysis, M and N samples were the only ones to share the same ancestral gene pool (with the majority of the samples considered admixed, with a membership to Cluster 3 ranging from 79% to 1%), and in the ML dendrogram, they were always mixed and clustered together. Additionally, in the pairwise FST comparisons, the M and N varieties showed the lowest value of genetic differentiation (FST = 0.019), and again, the intra-variety genetic similarity value of the N population was practically the same as that among the M and N populations (88.54% and 88.59%, respectively). In other words, we can assume that the populations with the highest within variability (M and N), according also to STRUCTURE results, can be considered the most closely related. According to all these results, we can therefore say that the M and N varieties are not distinguishable from each other. In contrast, ML clustering analysis clearly suggested that the A accession has a gene pool clearly distinct from the other accessions, hence clustering apart from the core of varieties. Furthermore, in the pairwise comparisons with other varieties, the genetic similarity values of the L population were the lowest, confirming the resulting values in the pairwise FST comparisons.
In conclusion, if it is true that these fourteen commercial varieties of Italian ryegrass did not show high degrees of genetic differentiation, it is also true that, when analyzed as a core, each variety described a genetic cluster on its own, resulting in distinguishability from the others. The only evident exception was represented by the M and N varieties. Given their biennial tetraploid nature, a common recent origin from a shared ancestor could be assumed, as reported also by the population structure analysis. Interestingly, from what was reported in the varietal sheets, these two cultivars are morphologically similar, showing a similar growth habit and comparable heading time and inflorescence length, although the M variety should have longer stems and bigger leaves than the N variety.
The registration of a new variety in the Italian National Register of Varieties still does not take genetic information into account, and the features used to distinguish the candidate variety are genealogical, phenotypic, and other agronomic traits [56]. Molecular characterization by DNA markers, in combination with standard morphological descriptors, can improve variety identification, especially in forage crops. Distinguishing one variety from another is not only the foundation of the seed production market but also the necessary precondition for the successful management of seed companies. In forage crops, especially grasses where morphological differences are not always easily noticeable and objective, the implementation of efficient genetic tools for variety identification is needed. Thus, in the process of potential registration of new varieties, after DUS testing by morphological characters, performing high-throughput marker analysis on the candidate varieties would be highly useful, especially when their phenotypes are not significantly different from existing varieties. This work represents an example of how molecular genetics can be a viable tool to protect both breeders and customers by reducing commercial fraud [57,58].

Author Contributions

Conceptualization, E.P. and G.B.; formal analysis, E.P. and F.P.; data curation, E.P.; writing—original draft preparation, E.P.; writing—review and editing, E.P. and F.P.; supervision, G.B.; project administration, G.B.; funding acquisition, G.B. All authors have read and agreed to the published version of the manuscript.

Funding

This study was carried out within the research contract signed with the seed company Mediterranea Sementi S.r.l. The corresponding author declares no conflict of interest with the seed company.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are contained within the article and supplementary materials are available on request from the corresponding author. The raw data are not publicly available due to privacy reasons.

Acknowledgments

The authors would like to thank the seed company Mediterranea Sementi S.r.l. for funding the PhD project of Elisa Pasquali.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Nashiki, M.; Narita, H.; Higashiyama, Y. Herbage mass, nutritivevalue and palatability of five grass weeds for cattle in the northern tohoku region in Japan. Weed Biol. Mang. 2005, 5, 110–117. [Google Scholar] [CrossRef]
  2. Kim, K.-Y.; Lee, S.-H.; Choi, G.J.; Park, H.S.; Hwang, T.-Y.; Lee, K.-W. A Medium Maturing Variety of Italian Ryegrass (Lolium multiflorum Lam.), ‘IR605′, with High Forage Productivity in Southern Region of Korea. Journal of The Korean Society of Grassland and Forage Science. Korean Soc. Grassl. Forage Sci. 2020, 40, 156–160. [Google Scholar] [CrossRef]
  3. Touchell, D.H.; Palmer, I.E.; Ranney, T.G. In vitro Ploidy Manipulation for Crop Improvement. Front. Plant Sci. 2020, 11, 722. [Google Scholar] [CrossRef] [PubMed]
  4. Scholes, D.R.; Paige, K.N. Plasticity in ploidy: A generalized response to stress. Trends Plant Sci. 2015, 20, 165–175. [Google Scholar] [CrossRef] [PubMed]
  5. Wu, J.; Cheng, X.; Kong, B.; Zhou, Q.; Sang, Y.; Zhang, P. In vitro octaploid induction of Populus hopeiensis with colchicine. BMC Plant Biol. 2022, 22, 176. [Google Scholar] [CrossRef] [PubMed]
  6. Sattler, M.C.; Carvalho, C.R.; Clarindo, W.R. The polyploidy and its key role in plant breeding. Planta 2016, 243, 281–296. [Google Scholar] [CrossRef]
  7. El-Naby, Z.; Mohamed, N.A.; Radwan, K.H.; El-Khishin, D.A. Colchicine induction of polyploidy in Egyptian clover genotypes. J. Am. Sci. 2012, 8, 221–227. [Google Scholar]
  8. Joshi, P.; Verma, R.C. High Frequency Production of Colchicine Induced Autotetraploids in Faba Bean (Vicia faba L.). Cytologia 2004, 69, 141–147. [Google Scholar] [CrossRef] [Green Version]
  9. Sugiyama, S. Polyploidy and Cellular Mechanisms Changing Leaf Size: Comparison of Diploid and Autotetraploid Populations in Two Species of Lolium. Ann. Bot. 2005, 96, 931–938. [Google Scholar] [CrossRef] [Green Version]
  10. Sugiyama, S. Responses of shoot growth and survival to water stress gradient in diploid and tetraploid populations of Lolium multiflorum and L. perenne. Grassl. Sci. 2006, 52, 155–160. [Google Scholar] [CrossRef]
  11. Rauf, S.; Ortiz, R.; Malinowski, D.P.; Clarindo, W.R.; Kainat, W.; Shehzad, M.; Waheed, U.; Hassan, S.W. Induced Polyploidy A. Tool for Forage Species Improvement. Agriculture 2021, 11, 210. [Google Scholar] [CrossRef]
  12. Gilliland, T.J.; Annicchiarico, P.; Julier, B.; Ghesquière, M. A proposal for enhanced EU herbage VCU and DUS testing procedures. Ann. Bot. 2020, 75, 227–241. [Google Scholar] [CrossRef]
  13. Dhawan, O.P.; Lavania, U.C. Enhancing the productivity of secondary metabolites via induced polyploidy: A review. Euphytica 1996, 87, 81–89. [Google Scholar] [CrossRef]
  14. OECD. Promoting the Use of Certified Agriculture Seed. Available online: https://www.oecd.org/agriculture/seeds/ (accessed on 16 September 2022).
  15. Yu, Q.; Ling, Y.; Xiong, Y.; Zhao, W.; Xiong, Y.; Dong, Z.; Yang, J.; Zhao, J.; Zhang, X.; Ma, X. RAD-seq as an effective strategy for heterogenous variety identification in plants—A case study in Italian Ryegrass (Lolium multiflorum). BMC Plant Biol. 2022, 22, 231. [Google Scholar] [CrossRef]
  16. Tamura, K.I.; Arakawa, A.; Kiyoshi, T.; Yonemaru, J.I. Genetic diversity and structure of diploid Italian ryegrass (Lolium multiflorum Lam.) cultivars and breeding materials in Japan based on genome-wide allele frequency. Grassl. Sci. 2022, 68, 263–276. [Google Scholar] [CrossRef]
  17. Huang, Y.; Zou, Q.; Sun, X.H.; Zhao, L.P. Computational identification of microRNAs and their targets in perennial Ryegrass (Lolium perenne). Appl. Biochem. Biotechnol. 2014, 173, 1011–1022. [Google Scholar] [CrossRef] [PubMed]
  18. Vieira, E.A.; Castro, C.M.; de Oliveira, A.C.; de Carvalho, F.I.F.; Zimmer, P.D.; Martins, L.F. Genetic structure of annual ryegrass (Lolium multiflorum) populations estimated by RAPD. Sci. Agric. 2004, 61, 407–413. [Google Scholar] [CrossRef] [Green Version]
  19. Nie, G.; Huang, T.; Ma, X.; Huang, L.; Peng, Y.; Yan, Y.; Li, Z.; Wang, X.; Zhang, X. Genetic variability evaluation and cultivar identification of tetraploid annual ryegrass using SSR markers. PeerJ 2019, 7, e7742. [Google Scholar] [CrossRef] [Green Version]
  20. Hirata, M.; Cai, H.; Inoue, M.; Yuyama, N.; Miura, Y.; Komatsu, T.; Takamizo, T.; Fujimori, M. Development of simple sequence repeat (SSR) markers and construction of an SSR-based linkage map in Italian ryegrass (Lolium multiflorum Lam.). Theor. Appl. Genet. 2006, 113, 270–279. [Google Scholar] [CrossRef]
  21. Guan, X.; Yuyama, N.; Stewart, A.; Ding, C.; Xu, N.; Kiyoshi, T.; Cai, H. Genetic Diversity and Structure of Lolium Species Surveyed on Nuclear Simple Sequence Repeat and Cytoplasmic Markers. Front. Plant Sci. 2017, 8, 584. [Google Scholar] [CrossRef] [Green Version]
  22. Schuelke, M. An economic method for the fluorescent labeling of PCR fragments. Nat. Biotechnol. 2000, 18, 233–234. [Google Scholar] [CrossRef] [PubMed]
  23. Rohlf, F.J. NTSYS: Numerical Taxonomy and Multivariate Analysis System Version 2.02; Applied Biostatistics Inc.: Setauket, NY, USA, 1998. [Google Scholar]
  24. Nguyen, L.T.; Schmidt, H.A.; von Haeseler, A.; Minh, B.Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 2015, 32, 268–274. [Google Scholar] [CrossRef] [PubMed]
  25. Huang, G.; Cao, J.; Chen, C.; Wang, M.; Liu, Z.; Gao, F.; Yi, M.; Chen, G.; Lu, M. Genome survey of Misgurnus anguillicaudatus to identify genomic information, simple sequence repeat (SSR) markers, and mitochondrial genome. Mol. Biol. Rep. 2022, 49, 2185–2196. [Google Scholar] [CrossRef] [PubMed]
  26. Vieira Ldo, N.; Dos Anjos, K.G.; Faoro, H.; Fraga, H.P.; Greco, T.M.; Pedrosa Fde, O.; de Souza, E.M.; Rogalski, M.; de Souza, R.F.; Guerra, M.P. Phylogenetic inference and SSR characterization of tropical woody bamboos tribe Bambuseae (Poaceae: Bambusoideae) based on complete plastid genome sequences. Curr. Genet. 2016, 62, 443–453. [Google Scholar] [CrossRef]
  27. Minin, V.; Abdo, Z.; Joyce, P.; Sullivan, J. Performance-based selection of likelihood models for phylogeny estimation. Syst. Biol. 2003, 52, 674–683. [Google Scholar] [CrossRef]
  28. Minh, B.Q.; Nguyen, M.A.; von Haeseler, A. Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol. 2013, 30, 1188–1195. [Google Scholar] [CrossRef] [Green Version]
  29. Hoang, D.T.; Chernomor, O.; von Haeseler, A.; Minh, B.Q.; Vinh, L.S. UFBoot2: Improving the Ultrafast Bootstrap Approximation. Mol. Biol. Evol. 2018, 35, 518–522. [Google Scholar] [CrossRef]
  30. Guindon, S.; Dufayard, J.F.; Lefort, V.; Anisimova, M.; Hordijk, W.; Gascuel, O. New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 3.0. Syst. Biol. 2010, 59, 307–321. [Google Scholar] [CrossRef] [Green Version]
  31. Meirmans, P.G. GENODIVE version 3.0: Easy-to-use software for the analysis of genetic data of diploids and polyploids. Mol. Ecol. Resour. 2020, 20, 1126–1131. [Google Scholar] [CrossRef] [Green Version]
  32. De Silva, H.N.; Hall, A.J.; Rikkerink, E.; McNeilage, M.A.; Fraser, L.G. Estimation of allele frequencies in polyploids under certain patterns of inheritance. Heredity 2005, 95, 327–334. [Google Scholar] [CrossRef]
  33. Meirmans, P.G.; Liu, S.; van Tienderen, P.H. The Analysis of Polyploid Genetic Data. J. Hered. 2018, 109, 283–296. [Google Scholar] [CrossRef] [PubMed]
  34. Wright, S. The Interpretation of Population Structure by F-Statistics with Special Regard to Systems of Mating. Evolution 1965, 19, 395–420. [Google Scholar] [CrossRef]
  35. Bird, C.E.; Karl, S.A.; Smouse, P.E.; Toonen, R.J. Detecting and measuring genetic differentiation. In Phylogeography Population Genetics in Crustacea; Routledge: Abingdon, UK, 2011; Volume 19, pp. 31–55. [Google Scholar]
  36. Evanno, G.; Regnaut, S.; Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: A simulation study. Mol. Ecol. 2005, 14, 2611–2620. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Raychaudhuri, S. Mapping rare and common causal alleles for complex human diseases. Cell 2011, 147, 57–69. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Pritchard, J.K.; Stephens, M.; Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 2000, 155, 945–959. [Google Scholar] [CrossRef]
  39. Earl, D.A.; Vonholdt, B.M. STRUCTURE HARVESTER: A website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 2012, 4, 359–361. [Google Scholar] [CrossRef]
  40. Singh, A.; Prasad, S.; Singh, V.; Chaturvedi, G.; Singh, B. Morphological traits for vegetative stage drought tolerance in rice (Oryza sativa). In Resilient Crops for Water Limited Environments; International Maize and Wheat Improvement Center (CIMMYT): Veracruz, Mexico, 2004; p. 188. [Google Scholar]
  41. Cooke, D.E.L.; Young, V.; Birch, P.R.J.; Toth, R.; Gourlay, F.; Day, J.P.; Carnegie, S.F.; Duncan, J.M. Phenotypic and genotypic diversity of Phytophthora infestans populations in Scotland (1995–97). Plant Pathol. 2003, 52, 181–192. [Google Scholar] [CrossRef]
  42. Laido, G.; Mangini, G.; Taranto, F.; Gadaleta, A.; Blanco, A.; Cattivelli, L.; Marone, D.; Mastrangelo, A.M.; Papa, R.; De Vita, P. Genetic Diversity and Population Structure of Tetraploid Wheats (Triticum turgidum L.) Estimated by SSR, DArT and Pedigree Data. PLoS ONE 2013, 8, e67280. [Google Scholar] [CrossRef] [Green Version]
  43. McCouch, S.R.; Chen, X.; Panaud, O.; Temnykh, S.; Xu, Y.; Cho, Y.G.; Huang, N.; Ishii, T.; Blair, M. Microsatellite marker development, mapping and applications in rice genetics and breeding. Plant Mol. Biol. 1997, 35, 89–99. [Google Scholar] [CrossRef]
  44. Huang, Q.; Borner, A.; Roder, S.; Ganal, W. Assessing genetic diversity of wheat (Triticum aestivum L.) germplasm using microsatellite markers. Appl. Genet. 2002, 105, 699–707. [Google Scholar] [CrossRef]
  45. Anand, D.; Prabhu, K.V.; Singh, A.K. Analysis of molecular diversity and fingerprinting of commercially grown Indian rice hybrids. J. Plant Biochem. Biotechnol. 2012, 21, 173–179. [Google Scholar] [CrossRef]
  46. Karaagac, E.; Yilma, S.; Cuesta-Marcos, A.; Vales, M.I. Molecular Analysis of Potatoes from the Pacific Northwest Tri-State Variety Development Program and Selection of Markers for Practical DNA Fingerprinting Applications. Am. J. Potato Res. 2014, 91, 195–203. [Google Scholar] [CrossRef]
  47. Tsai, W.S.; Shih, S.L.; Rauf, A.; Safitri, R.; Hidayati, N.; Huyen, B.T.T.; Kenyon, L. Genetic diversity of legume yellow mosaic begomoviruses in Indonesia and Vietnam. Ann. Appl. Biol. 2013, 163, 367–377. [Google Scholar] [CrossRef]
  48. Xie, R.J.; Zhou, J.; Wang, G.Y.; Zhang, S.M.; Chen, L.; Gao, Z.S. Cultivar Identification and Genetic Diversity of Chinese Bayberry (Myrica rubra) Accessions Based on Fluorescent SSR Markers. Plant Mol. Biol. Rep. 2011, 29, 554–562. [Google Scholar] [CrossRef]
  49. Miao, H.; Zhang, S.P.; Wang, X.W.; Zhang, Z.H.; Li, M.; Mu, S.Q.; Cheng, Z.C.; Zhang, R.W.; Huang, S.W.; Xie, B.Y.; et al. A linkage map of cultivated cucumber (Cucumis sativus L.) with 248 microsatellite marker loci and seven genes for horticulturally important traits. Euphytica 2011, 182, 167–176. [Google Scholar] [CrossRef]
  50. Sharifi Tehrani, M.; Mardi, M.; Sahebi, J.; Catalán, P.; Díaz-Pérez, A. Genetic diversity and structure among Iranian tall fescue populations based on genomic-SSR and EST-SSR marker analysis. Plant Syst. Evol. 2009, 282, 57–70. [Google Scholar] [CrossRef]
  51. Herrmann, D.; Flajoulot, S.; Julier, B. Sample size for diversity studies in tetraploid alfalfa (Medicago sativa) based on codominantly coded SSR markers. Euphytica 2010, 171, 441–446. [Google Scholar] [CrossRef]
  52. Jiang, L.F.; Zhang, X.Q.; Ma, X.; Huang, L.K.; Xie, W.G.; Ma, Y.M.; Zhao, Y.F. Identification of orchardgrass (Dactylis glomerata L.) cultivars by using simple sequence repeat markers. Genet. Mol. Res. 2013, 12, 5111–5123. [Google Scholar] [CrossRef]
  53. Roldán-Ruiz, I.; Dendauw, J.; Van Bockstaele, E.; Depicker, A.; De Loose, M. AFLP markers reveal high polymorphic rates in ryegrasses (Lolium spp.). Mol. Breed. 2000, 6, 125–134. [Google Scholar] [CrossRef]
  54. Kubik, C.; Sawkins, M.; Meyer, W.A.; Gaut, B.S. Genetic diversity in seven perennial Ryegrass (Lolium perenne L.) cultivars based on SSR markers. Crop Sci. 2001, 41, 1565–1572. [Google Scholar] [CrossRef]
  55. Kölliker, R.; Jones, E.S.; Jahufer, M.Z.Z.; Forster, J.W. Bulked AFLP analysis for the assessment of genetic diversity in white clover (Trifolium repens L.). Euphytica 2001, 121, 305–315. [Google Scholar] [CrossRef]
  56. MIPAAFT. Modalità Operative Inerenti la Procedura Informatica per L’iscrizione di Varietà Vegetali; Ministero delle Politiche Agricole Alimentari e Forestali. 2015; Vol. D.M. n.10803. 26 May 2015. [Google Scholar]
  57. Barcaccia, G.; Lucchin, M.; Cassandro, M. DNA Barcoding as a Molecular Tool to Track Down Mislabeling and Food Piracy. Diversity 2016, 8, 2. [Google Scholar] [CrossRef] [Green Version]
  58. Valentini, P.; Galimberti, A.; Mezzasalma, V.; De Mattia, F.; Casiraghi, M.; Labra, M.; Pompa, P.P. DNA Barcoding Meets Nanotechnology: Development of a Universal Colorimetric Test for Food Authentication. Angew. Chem. Int. Ed. Engl. 2017, 56, 8094–8098. [Google Scholar] [CrossRef]
Figure 1. Box plot of mean GS within each variety in descending order. The second and third quartiles are marked inside the square and are divided by a bar (median). The cross (×) within each box represents the mean value. Dots show outlier samples. Different letters and colors refer to the different varieties examined.
Figure 1. Box plot of mean GS within each variety in descending order. The second and third quartiles are marked inside the square and are divided by a bar (median). The cross (×) within each box represents the mean value. Dots show outlier samples. Different letters and colors refer to the different varieties examined.
Genes 13 02097 g001
Figure 2. Maximum likelihood dendrogram topology portraying the genetic relationships among the different varieties identified by different colors and letters.
Figure 2. Maximum likelihood dendrogram topology portraying the genetic relationships among the different varieties identified by different colors and letters.
Genes 13 02097 g002
Figure 3. Principal coordinates analysis (PCoA) based on the GS matrix calculated with Rohlf’s coefficient in all possible pairwise comparisons. Samples are labeled following the biotype of belonging (identified by different colors and letters), and their centroids are subgrouped to better represent overlapping areas. (a) PCoA of biennial varieties. (b) PCoA of annual varieties.
Figure 3. Principal coordinates analysis (PCoA) based on the GS matrix calculated with Rohlf’s coefficient in all possible pairwise comparisons. Samples are labeled following the biotype of belonging (identified by different colors and letters), and their centroids are subgrouped to better represent overlapping areas. (a) PCoA of biennial varieties. (b) PCoA of annual varieties.
Genes 13 02097 g003
Figure 4. Genetic structure analysis of the Italian ryegrass core collection. Identified most likely value of K = 2 (a) and K = 13 (b). Clusters and samples’ memberships (indicated by different colors) agree with the variety to which they belong, identified by different letters.
Figure 4. Genetic structure analysis of the Italian ryegrass core collection. Identified most likely value of K = 2 (a) and K = 13 (b). Clusters and samples’ memberships (indicated by different colors) agree with the variety to which they belong, identified by different letters.
Genes 13 02097 g004
Table 1. Plant material information, including variety code, relative ploidy level, and vegetative habit.
Table 1. Plant material information, including variety code, relative ploidy level, and vegetative habit.
VarietyPloidy LevelVegetative Habit
A4xannual
B2xannual
C4xbiennial
D2xbiennial
E2xbiennial
F4xannual
G2xbiennial
H4xannual
I4xannual
L2xbiennial
M4xbiennial
N4xbiennial
O2xannual
P4xannual
Table 2. Sequences (5′ to 3′) of the primer pairs used to amplify the SSR markers. For each primer pair, locus name, linkage group membership, maximum and minimum amplicon size (bp), multiplex to which the SSR marker locus belongs, fluorescent dye used, and temperature of melting.
Table 2. Sequences (5′ to 3′) of the primer pairs used to amplify the SSR markers. For each primer pair, locus name, linkage group membership, maximum and minimum amplicon size (bp), multiplex to which the SSR marker locus belongs, fluorescent dye used, and temperature of melting.
Locus NameLGExpected SizeMultiplexFluo DyeTm (°C)Forward PrimerReverse Primer
02-01B3309–34116-FAM60CTTTATTGCACTTTACTTGCCTTGCGATGTTCCACGTCAGGTG
13-07A°2124–2021VIC62CACGGAGGCATTTGATTCCCGCGACCAGTTCCTCGATCT
16-03D6153–2261NED63TTGCTGTTGGGTTTGCTCCCCTAGAGGTTAGGTCATCTCAAGCG
18-08C4317–3401NED62GCATCAGGGTCGATTCCTCTTGTCCTATGCTAAAGCTGACATCC
02-10B6276–3791PET61AGTTGGAGTTAACCCCATAGTCATCTCATCCATATATAGTCAAGCATAGTG
02-05D7243–2782VIC65ACTGCCTGCACTGGTGCTTGGCTTACTTCTGCTGACACTGTTTTACA
10-09E5204–2372NED62TCCAAGTGAACGAGTTGCGTCATCGTCACCACAGTGGC
13-12D5285–3042PET63TTGCTGCTGCACCAATAGCGGAGCCGATGATGCCACATTC
15-10H3200–23436-FAM62ACGCTACAAACCGAGGTGGATTTGCCTGGAACTGACCCC
12-05E5122–1603VIC59TTCCTGCCGAACGTCTGCTTGCCAAAAGAGGAGAGGA
14-01A7277–3183NED61CGGCATGCAGTTGAATCTGATAATCCACGCCGATCCACG
02-02C4180–2783PET61CCGAATTGTGCCGTATTTGGATACCCACCACATTCGTAAAATG
Table 3. Descriptive statistics for all SSR loci. The mean number of observed (Na) alleles per locus, mean number of effective (Ne) alleles per locus, observed heterozygosity (Ho), expected heterozygosity (He), inbreeding coefficients (FIT and FIS), fixation index (FST), and gene flow estimates (Nm) are reported for each SSR locus.
Table 3. Descriptive statistics for all SSR loci. The mean number of observed (Na) alleles per locus, mean number of effective (Ne) alleles per locus, observed heterozygosity (Ho), expected heterozygosity (He), inbreeding coefficients (FIT and FIS), fixation index (FST), and gene flow estimates (Nm) are reported for each SSR locus.
LocusNaNeHoHeFITFISFSTNm
02_01B101.70.4980.420.0740.0130.0623.917
13_07A202.10.3840.5210.5790.5170.131.616
16_03D142.70.6050.6360.038−0.0240.063.718
18_08C152.40.4340.5960.4870.40.1461.486
02_10B485.20.4180.8270.6050.5510.1211.644
02_05D184.80.6710.8030.1660.1140.0593.987
10_09E182.70.4420.6390.4360.3380.1481.363
13_12D123.60.5860.7350.2510.1840.0832.691
15_10H163.10.4620.6860.4980.420.1341.616
12_05E212.20.4620.5450.5070.3660.2230.881
14_01A163.10.4070.6920.6410.5950.1121.962
02_02C313.10.6020.6910.3330.1440.220.809
Total19.93.10.4980.6490.3850.3020.1251.658
Table 4. Descriptive statistics of genetic diversity calculated for each of the 14 varieties analyzed. The mean number of observed (Na) alleles, the mean number of effective (Ne) alleles, observed heterozygosity (Ho), expected heterozygosity (He), Wright’s inbreeding coefficient (GIS), private alleles (PA) for each variety, and private alleles with a frequency higher than 5% are reported for each variety.
Table 4. Descriptive statistics of genetic diversity calculated for each of the 14 varieties analyzed. The mean number of observed (Na) alleles, the mean number of effective (Ne) alleles, observed heterozygosity (Ho), expected heterozygosity (He), Wright’s inbreeding coefficient (GIS), private alleles (PA) for each variety, and private alleles with a frequency higher than 5% are reported for each variety.
VarietyNaNeHoHeGIS PAPA > 5%
A5.82.50.4970.5710.131
B7.43.50.30.6720.55422
C7.33.60.5640 6640.1515
D8.24.60.2760.7270.6261
E7.33.80.3480.6820.4896
F6.42.90.6480 605−0 0702
G5.62.50.5230.5620.069-
H8.54.00.4240.6770.37441
I4.42.50.6280.553−0.135-
L8.74.60.4240.710.40282
M8.64.10.680.652−0.04321
N9.34.40.6860.7140.0381
O7.23.90.3220.6830.5282
P7.92.90.6450.616−0.04720
Overall7.33.50.4980.6520.241597
Table 5. Matrix with Nei’s gene diversity (GST) values (analogous to FST) for all possible pairwise comparisons between the 14 varieties. Green and red indicate lower and higher GST values, respectively.
Table 5. Matrix with Nei’s gene diversity (GST) values (analogous to FST) for all possible pairwise comparisons between the 14 varieties. Green and red indicate lower and higher GST values, respectively.
A--
B0.135--
C0.1360.079--
D0.1290.0770.055--
E0.1720.1210.120.09--
F0.1520.1390.1350.130.093--
G0.170.2020.1840.1850.190.174--
H0.1860.1450.1150.0820.110.1590.162--
I0.1060.1670.160.1580.1430.150.150.126--
L0.2360.2050.2080.1790.1690.2020.1750.120.167--
M0.1020.0970.0990.0660.1050.0950.1280.1340.1380.174--
N0.110.0980.1020.0730.1070.0960.1390.1210.1330.130.019--
O0.0930.0990.0760.0670.1080.1330.1590.1240.1260.1840.0690.081--
P0.1160.1250.0970.0940.1450.1170.130.1410.1580.1950.0540.0690.086--
ABCDEFGHILMNOP
Table 6. Inbreeding coefficient GIS in pairwise comparisons among the 12 SSR loci and the 14 varieties under study. GIS values at the multilocus level are also reported.
Table 6. Inbreeding coefficient GIS in pairwise comparisons among the 12 SSR loci and the 14 varieties under study. GIS values at the multilocus level are also reported.
Population02_01B13_07A16_03D18_08C02_10B02_05D10_09E13_12D15_10H12_05E14_01A02_02CMultilocus
A−0.0420.165−0.2930.6060.5390.0810.5270.0380.5010.6160.5370.5610.317
B0.3010.8010.5550.3860.7300.4920.6030.1340.8100.4550.8830.5070.554
C−0.1770.693−0.2430.3940.3960.2680.5551.0000.4600.8350.2090.4310.405
D0.4160.7010.4000.8270.5310.5120.7620.8580.4700.6430.8310.4380.620
E0.0670.7140.3230.9390.7730.1890.4650.3220.6490.2990.6360.3360.489
F−0.0620.392−0.4270.4230.4460.0700.0630.0350.4430.0230.697−0.1260.144
G−0.1020.607−0.404−0.1250.5600.0830.612−0.1150.1680.1810.357−0.2410.161
H0.1400.9010.2860.5540.5840.4850.5600.1520.7990.5160.452−0.1380.450
I0.1530.112−0.2980.6240.747−0.087−0.180−0.2100.7940.3700.266−0.2880.142
L0.2870.8080.4740.2750.5470.3190.1350.5070.423−0.0080.752−0.1910.402
M−0.0710.360−0.201−0.0320.683−0.0410.236−0.0190.1800.4100.555−0.0820.163
N−0.1690.666−0.1280.1310.420−0.1080.3450.0950.3370.3680.663−0.0150.216
O0.1030.3870.5330.9010.8990.0540.6890.5630.5250.4450.8160.4270.538
P−0.1540.198−0.328−0.1570.561−0.0420.131−0.0160.0350.2640.9180.4630.172
Overall0.0670.5420.0330.4170.5960.1600.3880.2430.4640.3920.6120.1930.348
Table 7. Average genetic similarity percentages (GS%) calculated within and among each variety using Rohlf’s simple matching coefficient. Red and green indicate lower and higher GS%, respectively.
Table 7. Average genetic similarity percentages (GS%) calculated within and among each variety using Rohlf’s simple matching coefficient. Red and green indicate lower and higher GS%, respectively.
A92.31%
B89.42%90.73%
C89.28%89.28%90.67%
D89.12%89.21%89.28%90.11%
E88.83%88.70%88.66%88.85%90.41%
F89.31%88.16%88.02%88.01%89.30%91.24%
G90.29%88.19%88.50%88.17%88.58%89.35%92.55%
H88.08%87.87%88.03%88.42%88.44%87.52%88.72%89.60%
I90.45%88.38%88.26%88.07%88.83%89.16%90.35%88.99%92.28%
L87.25%87.09%86.61%87.02%87.56%86.86%88.22%87.96%88.34%90.02%
M89.31%87.89%87.99%88.08%88.15%88.69%88.97%86.86%88.56%86.70%89.50%
N88.60%87.53%87.24%87.50%87.63%88.10%87.99%86.48%88.04%86.82%88.59%88.54%
O89.64%88.99%88.97%88.95%88.64%88.27%88.69%87.87%88.46%87.15%88.33%87.66%89.98%
P89.92%88.27%88.76%88.43%87.98%88.71%89.92%87.29%88.96%86.71%89.24%88.37%88.87%90.88%
ABCDEFGHILMNOP
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Pasquali, E.; Palumbo, F.; Barcaccia, G. Assessment of the Genetic Distinctiveness and Uniformity of Pre-Basic Seed Stocks of Italian Ryegrass Varieties. Genes 2022, 13, 2097. https://doi.org/10.3390/genes13112097

AMA Style

Pasquali E, Palumbo F, Barcaccia G. Assessment of the Genetic Distinctiveness and Uniformity of Pre-Basic Seed Stocks of Italian Ryegrass Varieties. Genes. 2022; 13(11):2097. https://doi.org/10.3390/genes13112097

Chicago/Turabian Style

Pasquali, Elisa, Fabio Palumbo, and Gianni Barcaccia. 2022. "Assessment of the Genetic Distinctiveness and Uniformity of Pre-Basic Seed Stocks of Italian Ryegrass Varieties" Genes 13, no. 11: 2097. https://doi.org/10.3390/genes13112097

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop