1. Introduction
Blueberry is an economically and medicinally high-value crop that belongs to the genus
Vaccinium L., which contains about 400–500 species native to countries all over the world except Antarctica and Australia [
1,
2]. The five major groups of blueberries grown commercially include (1) lowbush (
V. angustifolium Ait.; 2n = 4x = 48), (2) highbush (
V. corymbosum L.; 2n = 4x = 48), (3) half-high (the product of hybridization between highbush and lowbush blueberries), (4) southern highbush (hybrids between
V. corymbosum and mostly
V. darrowi Camp and/or
V. ashei Reade) and (5) Rabbiteye (
V. virgatum Ait.; syn.
V. ashei; 2n = 6x = 72) [
3]. Blueberries are consumed fresh or in other commercially processed forms mainly for their high antioxidant activity, which fight off harmful radicals in the body. The high antioxidation activity is due to high concentrations of anthocyanins, flavonoids and phenolic acids. These phenolic compounds are linked to an improvement of night vision, prevention of macular degeneration, anticancerous activity and reduction in heart disease [
4,
5]. The clinical benefits of blueberry are not only limited to its fruits; blueberry leaves have also been shown to possess antidiabetic [
6] and antimicrobial activities [
7] and have been used as a traditional remedy for the treatment of diabetic symptoms [
8,
9].
Because of their health benefits, there has been a steep surge in demand for high-quality blueberries. Therefore, efforts were made at St. John’s Research and Development Centre of Agriculture and Agri-Food Canada (AAFC) in Newfoundland and Labrador, Canada, in 1999 to create cultivars that can sustain in cool climates [
10]. Earlier efforts to improve blueberry involved breeding and selection, where suitable and novel traits, such as fruit quality, tolerance to drought and cold and disease resistance, were pooled together by interspecific hybridization [
11]. The knowledge of genetic diversity in available berry germplasm can facilitate reliable and fast identification of genotypes. This knowledge can be invaluable for conservation efforts, designing specific breeding programs, selecting various combinations of parents with high diversity to create offspring with maximum genetic variation and introducing desirable genes into the available germplasm for berry improvement [
3].
It has been a regular practice to use molecular markers for diversity analyses in blueberries, including restriction fragment length polymorphism (RFLP) [
12], random amplified polymorphic DNA (RAPD) [
13,
14], amplified fragment length polymorphism (AFLP) [
13], inter simple sequence repeat (ISSR) [
15,
16], simple sequence repeat (SSR) [
10,
17,
18,
19], express sequence tag–polymerase chain reaction (EST–PCR) and cleaved amplified polymorphic sequences (CAPSs) derived from EST–PCR markers [
10,
19]. EST–PCR markers were initially used in highbush blueberries [
20] and have also been found appropriate for genetic association studies on lowbush [
10,
21,
22] and rabbiteye blueberries [
23]. While SSRs are repeats of short nucleotide sequences, mostly with six or less bases, EST–PCRs are sequence-tagged site (STS) markers that use precise primers of ~20-mer primers in expressed sequence tags (ESTs) [
23].
The objective of the present study is to examine the antioxidant properties, genetic diversity and relationship between antioxidant properties and molecular diversity in two groups of selected hybrids developed under an on-going blueberry genetic enhancement program, four groups of lowbush blueberry wild clones and a group of blueberry cultivars. Total antioxidant activity (TAA), and total phenolic (TPC) and flavonoid contents (TFC) were estimated to check biochemical diversity. Three types of DNA-based molecular markers, including EST–SSR, genomic (G)–SSR and EST–PCR, were used for a detailed molecular biodiversity study that will improve the efficiency in cultivar development, variety protection and registration, along with phenotypic selection. Comparative efficiency of molecular markers with those of biochemical diversity has not been reported in blueberries.
4. Discussion
The study presented here provides insight into genetic diversity, with respect to genetic relationship and structure, and biochemical properties of two groups of selected hybrids of lowbush and half-high blueberries, half-high wild blueberry clones, and highbush and lowbush blueberry cultivars. The antioxidant properties of blueberries are well known for their medicinal value in negating the harmful effects of free radicals [
56]. The leaves of blueberry wild clones and cultivars can have higher antioxidant activity [
57,
58], polyphenolics, and proanthocyanidins than the fruit [
59,
60].
The antioxidant activity depends on the synergistic and antagonistic interaction of various compounds and environmental factors [
61]. There is no standard agreed method for estimating antioxidant activity because of its complexity [
62]. In the present study, we used the DPPH radical scavenging method, as it is sensitive and cheaper than other known procedures [
63]. Out of all the groups, the NB wild clones had the highest TAA, followed by CV, Cross 2, and NL wild clones. TPC and TFC were highest in NL wild clones, followed by CV. The wild clones from NL and NB proved to be an important resource for improving antioxidant properties in the blueberry breeding program. Phenolics are the abundantly available secondary metabolite derived from phenylalanine via the secondary metabolic pathway, catalyzed by phenylalanine lyase L (PAL). Various biotic and abiotic factors can cause stress in source plants and trigger higher activity of PAL [
64]. Low levels of light in the NL province could have contributed to higher levels of TAA, TPC, and TFC. Leaf maturity can have a significant impact on phytochemical composition in blueberry. In their study, Riihinen et al. [
60] reported the red leaves of
V. corymbosum possessed higher levels of quercetin and kaempferol, p-coumaric, and caffeic acids than the green leaves. This could be the case because solar radiation increases these compounds as a part of the photoprotective mechanism [
60]. On top of that, the red leaves contained a small amount of anthocyanins, while green leaves did not have any anthocyanin content [
65]. Therefore, TPC and TFC may not sufficiently explain total antioxidant activity as they are the cocktail of various compounds and their activities. The DPPH value is calculated by the addition of various antioxidant compounds, which depends on the chemical used during the extraction of leaves [
66]. However, Wang and Lin [
67] reported contradicting observation that the young leaves from different varieties of blackberries, raspberries and strawberries possessed higher TPC and TAA than older leaves. There was a positive correlation between TAA with TPC and TFC, which was also reported in previous studies involving blueberries [
28,
29,
68].
The biochemical analysis in the present study provides important information about the diversity of antioxidant properties. However, biochemical characteristics by themselves are not enough for the presence of genetic diversity. The DNA marker system provides a precise and reliable method for further analysis of variability. The extent of genetic diversity between and within populations is often the outcome of a combination of factors such as gene flow, genetic drift, inbreeding, mutation and the selection effect [
43]. It is very expensive, time-consuming, and laborious to develop species-specific molecular markers. Because of these constraints, we used EST–SSR, G–SSR, and EST–PCR markers developed for highbush blueberries [
19,
20]. Our report is apparently the first to use these three types of markers to assess genetic diversity in a group of hybrids obtained by crossing lowbush with half-high blueberries. Microsatellite markers have also been used for hybrid identification in closely related wild
Petunia species [
69]. Although G–SSR markers are highly abundant in the plant genome and are attractive due to their reproducibility and polymorphic nature, most of them lack close linkage to transcribed regions and do not have a specific genic function. On the other hand, SSR markers derived from EST sequences are associated with the genome’s transcribed or expressed regions [
70]. The single-pass sequence of cDNA clones that are picked randomly is the source of EST–SSR and EST–PCR markers [
71]. All primer pairs used in this study showed an elevated polymorphism that confirmed the high degree of genetic diversity in the blueberry genome of the current material.
In the present study, the discriminatory power of EST–SSR, G–SSR, and EST–PCR primer pairs was compared by
PIC,
EMR,
MI, D, and
R. These values help in determining the effectiveness of a specific primer pair in the analysis of genetic diversity. Although
PIC values for EST–SSR (average 0.35) and G–SSR primer pairs (average 0.30) were higher than EST–PCR (average 0.28),
EMR,
MI, D, and
R values for EST-PCR markers were the highest (average 2.05, 0.61, 0.79 and 2.37, respectively) followed by G–SSR (average 1.28, 0.38, 0.65 and 1.08, respectively) and EST–SSR (average 1.13, 0.33, 0.54 and 1.02, respectively). The highest
PIC value for EST-SSR primer pair CA112 (0.96), combined with very low values for
EMR,
MI, D and
R (0.08 for
EMR,
MI,
D and 0.09 for
R;
Table 4), proved that this primer pair is not worthy for analyzing present blueberry hybrids, wild clones, and cultivars. On the other hand, the EST-PCR primer pair CA227, with its highest
MI value among all primers (1.16), was the best for overall utility to study the present material, and it was followed by CA1423 (
MI = 1.00) and CA54 (
MI = 0.99). These three EST-PCR primer pairs may be very valuable for analyzing blueberry hybrids. However, the moderate-to-high values for most of the primer pairs could be attributed to their effectiveness in studying the genetic diversity of the present material.
In the present study, the mean allele number for EST–SSR, G–SSR and EST–PCR were 6.30, 5.36 and 4.40, respectively, which is comparable or lower than previous studies involving SSR and/or EST–PCR primer pairs in blueberries (22.4; [
72]; 4.8; [
21]; 18.5; [
73]; 8.33; [
16]; 17; [
10]; 14.24; [
18]; 10; [
74]; 20.5; [
22]; 14.33; [
17]). The average Shannon’s index (
I) of 0.34 for EST–SSR, 0.29 for G–SSR and 0.26 for EST–PCR are lower than those recorded for
V. vitis-idaea (0.57; [
75]) and
V. myrtillus (0.55; [
13]),
V. uliginosum (0.65; [
76]),
V. corymbosum (0.62; [
18]) and
Vaccinium species (1.93; [
10]; 2.56; [
22]). Average
He values for EST–SSR (0.23), G–SSR (0.19) and EST-PCR (0.16) were also less than those reported in previous studies with blueberries (0.88; [
73]; 0.81; [
74]; 0.87; [
18]; 0.86; [
10]; 0.80; [
22]; 0.56; [
17]. The lower values of diversity parameter could be an indication of genetic erosion resulting from selective farming and deforestation [
3].
We used three complementary methods: STRUCTURE, NJ tree, and PCoA to study population structure and genotype relationships in wild, cultivated, and hybrid blueberries using 26 PCR-based marker pairs. Genotype identification using DNA markers is favoured due to their consistency and reliability, as they are unaffected by the environment [
77]. The combined STRUCTURE analysis divided the genotypes into three major groups, with some admixtures confirmed by PCoA and NJ analyses for most genotypes. Admixtures in the wild blueberry clones that were observed in the present material with STRUCTURE analysis might be due to the consequence of a glacial bottleneck and quick colonization of these blueberries, along with increased regional gene flow due to the migration of human beings and trade in agriculture [
78]. Although the hybrids were distributed in all three clusters, most of them formed distinct subgroups, either alone or with lowbush or half-high cultivars. This might be because they had been developed through crossing between lowbush and half-high blueberries and share the genes from both parents. However, most of the wild lowbush blueberries, except for the NB clones and the half-high cultivars, were grouped based on their phenotypes. While lowbush blueberries are less than 0.5 m tall, the suckering to crown-forming of half-high blueberry plants are 0.5 to 1.0 m tall. Highbush blueberry plants are crown-forming and 2.0 or higher in height [
79]. In the present study, most of the wild clones, although collected from four different provinces, did not group based on their collection place. Although there is a wide genetic variation among the wild clones, there is no pattern of differentiation based on their collection places. This was also observed in AMOVA analysis, where in the combined analysis, the variation among groups was 19%, and most of the variations (81%) were among the genotypes within provinces, cultivars or hybrid groups. Similar observations were also reported by Debnath [
10] and Tailor, Bykova, Igamberdiev and Debnath [
22], who worked with different sets of wild lowbush blueberry clones and observed that wild clones were grouped into different clusters. In the present study, it was evident that there was no clear difference between the wild, cultivated, and hybrid blueberries, indicating that diversity-wise, the present genotypes are all heterogeneous in nature. This might be due to a smaller variation among different groups than the variation among the genotypes within a group. STRUCTURE, NJ and PCoA analyses, along with AMOVA analysis, were complementary to each other and, thus, instead of using one method, a number of procedures are more informative for drawing valid conclusions. Similarly, using more than one type of molecular marker is always better than using a single type of molecular marker [
10,
22]. In our study, STRUCTURE, NJ, PCoA and AMOVA analyses using EST–SSR, G–SSR and ESTPCR markers have well discriminated the wild blueberry clones, cultivars and hybrids from the wild and cultivated blueberries that are part of our current germplasm repository for the cool climates of Canada.
There are no reports available on the relationship between molecular markers and biochemical properties in blueberry. Our study found no parallels between genetic and biochemical data, as observed by phylogenetic trees, PCA–PCoA graphs, and the Mantel test of correlation. The poor correlation between genetic clustering from biochemical data indicates varying genomic coverage in blueberries. Molecular markers span across the genome and most of which are not expressed at the phenotypic level. The noncoding regions of the genome that are not accessible to phenotypic expression might be the reason for the dissimilarity between molecular and chemical diversity [
77]. There are only three reports available on the comparative analysis of molecular markers with biochemical properties. In their study, Debnath and Sion [
80] reported no correlation between genetic diversity based on ISSR markers and chemical diversity based on antioxidant activity and anthocyanin content in lingonberry. Similar observations were also reported in strawberry [
81] and cranberry using ISSR, EST–SSR, and EST–PCR markers [
82]. We also studied the association of EST–SSR, G–SSR and EST–PCR markers with 70 genotypes of blueberry and found that only one marker was associated with TPC, as revealed by DPPH assay, and five markers were associated with both TPC and TFC. This can be explained by the polyploid nature of blueberry and the distribution of associated alleles across the whole genome. However, our study is the first one to use SMRA to identify markers associated with antioxidant properties. This method can provide easy and reliable identification of favourable genotypes or populations in a breeding program at an early stage and has been used to associate molecular markers with traits in numerous species such as mulberry [
83], buckthorn [
84], and Tunisian olive [
85]. This approach is a convenient and quick tool for marker–trait association, without the need to map populations. Multigenic control of TPC, TFC and antioxidant traits can have practical uses in future blueberry breeding programs.
Blueberries are of significant importance for their antioxidant phytochemicals, especially phenolic metabolites that play a significant role in human health benefits and plant defence mechanisms [
86]. Most of plant phenolics are flavonoids and nonflavonoids. Flavonoids are of two types: anthocyanins and anthoxanthins. While anthocyanins are pigment molecules (red, blue and purple), anthoxanthins are white to yellow or colourless molecules and include flavanols, flavonols, flavones and isoflavones. Nonflavonoids are comprised of phenolic acids, lignans and stilbenes. Tannin and lignin are the other nonflavonoid subclasses [
86]. The flavonol quercetin is an important nutritional bioactive compound with high bioaccessibility (~80%) [
87]. Quercetin helps in protecting against osteoporosis, cancer, pulmonary and cardiovascular diseases, and ageing [
88]. In blueberry, anthocyanins were found to possess the highest inhibition effects on in vitro colon cancer cell proliferation, followed by flavonols and tannins [
89]. Biomarker-based human clinical studies showed that regular and moderate consumption of blueberries and/or anthocyanins is associated with reduced risk of death, cardiovascular disease and type 2 diabetes [
90]. In another study with in vitro cell bioassays for anti-inflammatory and antioxidant activities, Grace et al. [
91] reported that the anthocyanin group of phenolics was mainly responsible for the bioactivity, and blueberry extract suppressed proinflammatory markers (in-terleukin-1β, cyclooxygenase-2, inducible nitric oxidesynthase, and interleukin-6 [
92]). Polyphenol-, anthocyanin- and proanthocyanidin-rich components of crude wild blueberry extract were found to suppress mRNA biomarkers of acute inflammation, and mlvidin-3-glucoside suppressed the effects of proinflammatory genes that are responsible for transcriptional regulation and cytokine-mediated inflammation [
92]. It has been observed that in-vitro antioxidant assays with blueberries resulted in a strong correlation with those of total phenolic and total anthocyanin contents [
91]. In the current study, we measured total phenolic, flavonoid and antioxidant contents to study genetic and biochemical diversity in blueberry germplasm and to identify blueberry genotypes with high bioactive components and wide diversity for use in an on-going breeding program. Identifying phenolic-rich cultivars for breeding species with high bioactive composition is an important approach to improving the nutritional quality of blueberries. Crossing between selected genotypes is expected to develop new cultivars that combine superior health-promoting bioactive components with diverse adaptability under a changing environment. However, blueberry genotypes with more specific profiles of polyphenols are of significant importance for human health, which can be explored in future research with some of the selected promising genotypes from the current material. As to the mass balance of phenolics and what is in human blood, the amount observed is low, especially for highly hydrophilic phenolics. In this, researchers have earlier ignored the metabolites that are also present; if this was done appropriately, the actual intake is much higher than was originally thought. Efforts have also been made to lipophilize phenolic compounds to enhance their absorption. Therefore, having more phenolics, especially those with different polarities, is a good idea, as a mixture of phenolics is present in each material (personal communication: F. Shahidi). Valuable single phenolic compounds can be estimated with the selected material after chromatographic separation, as the total assays do not reflect the situation in terms of polyphenols; they also can target amino acids and reducing agents.
However, when dealing with the same type of material, the trends provided are quite informative, and although absolute values may not be exact, the trends are always valid. This assumes that the amino acids/proteins present are not varied to any great extent, which is a valid assumption in almost all cases (personal communication: F. Shahidi).