Next Article in Journal
Diarrheagenic Escherichia coli in Stool Specimens Collected from Patients Attending Primary Healthcare Facilities in Ethiopia: Whole-Genome Sequencing-Based Molecular Characterization
Previous Article in Journal
Correction: Yang et al. Genome-Wide Identification and Expression Analysis of the Class III Peroxidase Gene Family under Abiotic Stresses in Litchi (Litchi chinensis Sonn.). Int. J. Mol. Sci. 2024, 25, 5804
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification of Genomic Regions Associated with Differences in Flowering Time and Inflorescence Architecture between Melastoma candidum and M. normale

1
School of Life Sciences, State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Stress Biology, Sun Yat-sen University, Guangzhou 510275, China
2
School of Ecology, Sun Yat-sen University, Guangzhou 510275, China
3
Guangzhou Institute of Forestry and Landscape Architecture, Guangzhou Collaborative Innovation Center on Science-Tech of Ecology and Landscape, Guangzhou 510405, China
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2024, 25(19), 10250; https://doi.org/10.3390/ijms251910250
Submission received: 25 August 2024 / Revised: 16 September 2024 / Accepted: 18 September 2024 / Published: 24 September 2024
(This article belongs to the Section Molecular Genetics and Genomics)

Abstract

:
Understanding the genetic basis of species differences in flowering time and inflorescence architecture can shed light on speciation and molecular breeding. Melastoma shows rapid speciation, with about 100 species formed in the past few million years, and, meanwhile, possesses high ornamental values. Two largely sympatric and closely related species of this genus, M. candidum and M. normale, differ markedly in flowering time and flower number per inflorescence. Here, we constructed an F2 population between M. candidum and M. normale, and used extreme bulks for flowering time and flower number per inflorescence in this population to identify genomic regions underlying the two traits. We found high differentiation on nearly the whole chromosome 7 plus a few regions on other chromosomes between the two extreme bulks for flowering time. Large chromosomal inversions on chromosome 7 between the two species, which contain flowering-related genes, can explain recombinational suppression on the chromosome. We identified 1872 genes with one or more highly differentiated SNPs between the two bulks for flowering time, including CSTF77, FY, SPA3, CDF3, AGL8, AGL15, FHY1, COL9, CIB1, FKF1 and FAR1, known to be related to flowering. We also identified 680 genes with one or more highly differentiated SNPs between the two bulks for flower number per inflorescence, including PNF, FIL and LAS, knows to play important roles in inflorescence development. These large inversions on chromosome 7 prevent us from narrowing down the genomic region(s) associated with flowering time differences between the two species. Flower number per inflorescence in Melastoma appears to be controlled by multiple genes, without any gene of major effect. Our study indicates that large chromosomal inversions can hamper the identification of the genetic basis of important traits, and the inflorescence architecture of Melastoma species may have a complex genetic basis.

1. Introduction

Flowering, which is essential for plants to reproduce, has a pivotal role in the sexual reproduction and evolution of flowering plants. Flowering time differentiation between populations is considered as one of the most important means to trigger speciation and maintain species integrity in plants [1,2]. For example, speciation in diurnally and nocturnally flowering populations can be initiated by differences in flowering time [3], and the divergence between two wild rice species, Oryza nivara and O. rufipogon, is maintained mainly by flowering time difference [4]. Six genetic pathways of flowering time, namely, vernalization, photoperiod/circadian clock, ambient temperature, gibberellin, autonomous, and age pathways, have been comprehensively characterized in the model plant Arabidopsis thaliana [5,6]. However, we know little about the genetic basis of flowering time difference in other plants, particularly non-model plants.
Meanwhile, floral traits, such as flowering time and inflorescence architecture (like the number of flowers per inflorescence), are also critical components of economical and ornamental values in crops and horticultural plants. In some crops, such as wheat (Triticum), rice (Oryza sativa), and barley (Hordeum vulgare), the number of spikelets per spike and the number of fertile florets per spikelet mainly determine grain yield, with more yield for greater grain numbers per inflorescence [7]. In horticultural plants, more flowers per inflorescence usually have higher ornamental value and longer flowering time. Therefore, exploring the genetic basis of floral traits can provide insights into not only the evolution of reproductive isolation between closely related species, but also the breeding of new cultivars with more agricultural and horticultural values.
Identifying genomic regions and molecular markers associated with traits of interest is an important goal for studying trait evolution and molecular-assisted selection. Bulk segregation analysis (BSA) was applied to two bulked DNA samples with extreme phenotypes from a segregating population generated from two parents with different traits to develop allelic variation associated with the traits [8]. Compared with traditional QTL mapping, BSA is a shortcut to detect genetic differences between two populations without characterizing a large number of individuals, which dramatically reduces the costs, labor and space [9]. Next-generation sequencing (NGS)-based genotyping techniques dramatically increase the effectiveness and accuracy of BSA for identification of markers tightly linked to the causal gene for a given phenotype and QTL mapping. Up to now, NGS-based bulk segregation analyses have been widely applied in QTL mapping, and have discovered QTLs associated with agronomic traits, such as seed weight in chickpea (Cicer arietinum) [10], flowering time and leaf shape in pigeonpea (Cajanus cajan) [11], twig trichome color variation in Melastoma normale [12], flowering time, fruit flesh thickness, pre-harvest sprouting and downy mildew resistance in cucumber [13,14,15,16], cold tolerance, salt tolerance and blast resistance in rice [9,17,18], cotyledon color and nutritional ingredient in soybean [8,19], and so on.
Melastoma L., a shrub genus mainly distributed in tropical Asia and Oceania, consists of approximately 100 species [20,21]. It has undergone rapid species radiation within the past 4 million years [22], providing an excellent opportunity to study rapid speciation and the genetic basis of adaptive traits. Species of Melastoma also possess high ornamental values, and some cultivars have been bred by artificial interspecific crosses [23]. Two main species of this genus, M. candidum and M. normale, are both light-demanding opportunists and usually occur in open fields, grasslands and roadsides [24]. They are often sympatric in South China. Although their distribution and habitats are overlapped, no hybrids between them are observed in nature due to their different flowering time. In South China, M. normale typically flowers from late March to late April, while M. candidum flowers from early June to late July [24]. The difference in flowering time leads to the reproductive isolation of the two species, which should play an important role in maintaining their genetic integrity despite their being in sympatry. They also show marked differences in inflorescence architecture, especially the number of flowers per inflorescence, with, usually, more than nine (up to fifteen) flowers per inflorescence in M. normale and at most five flowers per inflorescence in M. candidum. The number of flowers per inflorescence in Melastoma is one of its most important ornamental traits, and it may be related to the efficiency of pollination and sexual reproduction [25]. Therefore, the two species provide ideal materials to investigate the genetic basis of these key traits in Melastoma, a non-model genus of woody plants.
Many floral traits, including flowering time, have a complex genetic architecture and are controlled by multiple genes [6]. Flowering time and inflorescence architecture vary greatly among species of Melastoma; however, their genetic basis remains unknown. In this study, an F2 segregating population from interspecific crosses between M. candidum and M. normale was constructed to identify genomic regions and candidate genes associated with flowering time and inflorescence architecture (flower number per inflorescence), using bulk segregation analysis based on Illumina sequencing. Understanding species differences in the floral characteristics of Melastoma may not only shed light on the mechanisms of reproductive isolation and floral evolution, but also contribute to molecular-assisted selection of new cultivars in this genus.

2. Results

2.1. Highly Differentiated Regions and SNPs between the Two Extreme Bulks for Flowering Time

As described in the Introduction, the flowering time of M. normale is much earlier than that of M. candidum (Figure 1), and there is an interval of about one month between the end of flowering of M. normale (late April) and the beginning of flowering of M. candidum (early June). The earliest and latest flowering times in the F2 population are 4 April and 23 June, respectively. Calculated from January 1, days to flowering in this population range from 93 to 173. Days to flowering for the selected early flowering and later flowering bulks range from 93 to 111 and from 155 to 173, respectively.
After mapping Illumina reads of the two extreme bulks for flowering time to the M. candidum reference genome, a total of 7,996,211 SNPs were identified on the 12 chromosomes for the two bulks, with SNP densities ranging from 28.03 to 41.32 SNPs/kb across the 12 chromosomes. Genetic differentiation between the two bulks in the F2 population is generally low across the genome (average Fst = 0.019 based on all SNPs across the genome) except for chromosome 7 (average Fst = 0.057 based on 701,408 SNPs on this chromosome) (Figure 2, Table S1). It is surprising that nearly the whole of chromosome 7 shows high differentiation between the two extreme bulks, regardless of the window size we used in the sliding-window Fst analysis (Figure 2 and Figure 3). Across the genome, there are 4476 SNPs with an Fst value > 0.3, among which there are 49 SNPs with an Fst value > 0.5 (the maximum Fst = 0.89) and 312 SNPs with an Fst value of 0.4–0.5. Among the 4476, 312 and 49 SNPs, 4168 (93.12%), 284 (91.03%) and 24 (48.98%) were located on chromosome 7 (Figure 3), indicating that most highly differentiated SNPs concentrate on this chromosome.
The genomic location classification of the 4476 highly differentiated SNPs is shown in Figure 4. Most of these SNPs are located in intergenic spacers, and only 426 (9.52%) and 629 (14.05%) of them were found in exonic and intronic regions, respectively. The 4476 SNPs occur in 1872 genes, among which 1685 (90.01%) were located on chromosome 7. Among the 1872 genes, 1497 (79.97%) were functionally annotated by searching against the Swiss-Prot database, 1263 (67.47%) were assigned to GO terms of biological process and 412 (22.01%) were assigned to the KEGG pathways. Only one significantly enriched GO term, nucleosome assembly (GO:0006334, corrected P ≈ 0.005), was found (Table S2). No significantly enriched KEGG pathways were found (Table S3).
The 292 flowering time-related genes in Arabidopsis thaliana summarized in previous studies [26,27] correspond to 367 orthologous genes in M. candidum (Table S4), suggesting there is M. candidum-specific duplication of some genes, which is consistent with the existence of two rounds of whole genome duplications in Melastoma after it diverges from Eucalyptus [22]. Of the 367 genes, 37 are located on chromosome 7, suggesting no enrichment of flowering time-related genes on chromosome 7. Of the 367 genes, 15, namely CSTF77, FY, SPA3, CDF3, AGL8, FUS3, AGL15, FHY1, EZA1, COL9, CIB1, FKF1, FAR1, PGM and HAP2A, contain one or more highly differentiated SNPs, and are all located on chromosome 7 (Table 1). Among the genes in the six pathways of flowering time in Arabidopsis, four orthologous genes in Melastoma, AGL8 (belonging to the age pathway), CDF3, CIB1 and FKF1 (belonging to the photoperiod pathway), contain one or more highly differentiated SNPs (Table 1).
The SNP with the highest Fst value (0.89) was located on chromosome 7 and in the intergenic spacer between two M. candidum genes (Gene IDs MLD38_025448 and MLD38_025449), which encode G6PDC (glucose-6-phosphate 1-dehydrogenase, chloroplastic) and MAD57 (MADS-box transcription factor 57), respectively. MAD57 in rice controls the outgrowth of axillary buds, and overexpression of MADS57 can sustain rice tiller growth under cold stress [28,29], but it is unknown if this gene has a function in inflorescence development in Melastoma. The most highly differentiated region on chromosome 11 (see Figure 2) contains a gene (Gene ID MLD38_036042) encoding BCAL2 (branched-chain amino acid aminotransferase-like protein 2).

2.2. Large Inversions on Chromosome 7 between M. candidum and M. normale

The high differentiation on nearly the whole chromosome 7 between the two extreme bulks for flowering time is unusual in bulk segregation analysis, and this suggests that recombination on chromosome 7 between M. candidum and M. normale is highly suppressed. Because large chromosomal inversions can suppress recombination [30], we test if there is any large chromosomal inversion on chromosome 7 between the two species. Through mapping of Illumina sequencing data of the three F1 individuals on the M. candidum genome, we identified five inversions > 1 Mb in size on this chromosome, and all of them were supported by paired-end reads and split reads (Table 2, Figure S1). As one haplotype in the F1 individuals stems from M. candidum and the other from M. normale, inversions identified in the F1 individuals indicate that there exist inversions between the two species, although we do not know if these inversions are fixed in one species or not. These inversions were also identified in the two extreme bulks for flowering time from the F2 population. The longest inversion is 12.7 Mb, from the nucleotide position 6.3 Mb to 19.0 Mb on chromosome 7 (Figure 5), and the shortest inversion is about 1.8 Mb, from the nucleotide position 10.9 Mb to 12.7 Mb. These inversions span 18.3 Mb (from the nucleotide position 0.7 Mb to 19.0 Mb) of chromosome 7, covering 74.7% of the 24.5 Mb chromosome 7. All the inversion regions contain one or more genes known to be related to flowering in Arabidopsis.

2.3. Highly Differentiated Regions and SNPs between the Two Extreme Bulks for Flower Number per Inflorescence

Both M. candidum and M. normale have terminal, dichasial cymes, but they differ in flower number per inflorescence: M. candidum usually has 3–5 flowers per inflorescence, while M. normale usually has >9 flowers per inflorescence (Figure 6A,B). More flowers per inflorescence in M. normale result from more inflorescence branching on the rachis, with only first-order branches (three flowers) and first- and second-order branches (five flowers) in the inflorescence of M. candidum, and first-, second- and third-order branches in that of M. normale (Figure 6C,D). The flower number per inflorescence in the F2 population ranges from 3.3 to 11.3. The flower number per inflorescence for the selected high flower number per inflorescence and the lower flower number per inflorescence bulk range from 3.3 to 4.3 and from 8.0 to 11.3, respectively.
After mapping Illumina reads of the two extreme bulks for flower number per inflorescence to the M. candidum reference genome, a total of 7,673,521 SNPs were identified on the 12 chromosomes for the two bulks, with SNP densities ranging from 27.3 to 39.9 SNPs/kb across the 12 chromosomes. Again, sliding-window Fst analyses with different window and step sizes showed that genetic differentiation between the two bulks in the F2 population is generally low across the genome (average Fst = 0.014 based on all SNPs across the genome). Unlike the two extreme bulks for flowering time, which show high differentiation on nearly the whole chromosome 7, the two extreme bulks for flower number per inflorescence exhibit only a few highly differentiated regions on several chromosomes (Figure 7, Table S1). Across the genome, there are 759 SNPs with an Fst value > 0.2, among which there are 3 SNPs with an Fst value > 0.5 (the maximum Fst = 0.569), 6 SNPs with an Fst value of 0.4–0.5, 45 SNPs with an Fst value of 0.3–0.4 and 705 SNPs with an Fst value of 0.2–0.3.
The genomic location classification of 759 highly differentiated SNPs is shown in Figure 8. Most of these SNPs are located in intergenic spacers, and only 48 (6.32%) and 60 (7.91%) of them were found in exonic and intronic regions, respectively. The 759 SNPs were assigned to 680 genes, among which 489 (71.91%) genes were functionally annotated by searching against the Swiss-Prot database, 387 (56.91%) genes were assigned to GO terms of biological process and 115 (16,91%) genes were assigned to the KEGG pathways. Two significantly enriched GO terms, DNA integration (GO:0015074) and proteolysis (GO:0006508), were found (Table S5), but they are not related to flowering. Also, no significantly enriched KEGG pathways were found (Table S6). The 66 inflorescence architecture-related genes in A. thaliana and other species correspond to 100 orthologous genes in M. candidum (Table S7), again suggesting there is M. candidum-specific duplication of some genes. Four of the 100 genes, namely JAG, PNF, FIL and LAS, contain one or more highly differentiated SNPs (Table 3).
The SNP with the highest Fst value (0.57) was located in the intergenic spacer of two M. candidum genes (Gene IDs MLD38_038216 and MLD38_038217) on chromosome 12, which encode NF-YC11 (nuclear factor Y, subunit C11) and WTR38 (WAT1-related protein), respectively. The SNP with the second highest Fst value (0.52) was located in the intergenic spacer of two genes (Gene IDs MLD38_021340 and MLD38_021341) on chromosome 6, which encode HCAR (7-hydroxymethyl chlorophyll a reductase, chloroplastic) and HGL1 (heteroglycan glucosidase 1), respectively. The most highly differentiated region on chromosome 11 (see Figure 7) contains one gene (Gene ID MLD38_036043) encoding BCAL2 (branched-chain amino acid aminotransferase-like protein 2).

3. Discussion

3.1. Genomic Regions Related to Flowering Time in Melastoma

Our BSA-seq revealed that nearly the whole chromosome 7, plus a few regions on other chromosomes, show high differentiation between the two extreme bulks for flowering time. It is surprising that nearly the whole chromosome shows high differentiation between the two bulks, because meiotic recombination will cause allelic exchanges between homologous chromosomes except the genomic regions related to flowering time we selected for the two extreme bulks. There are two possible explanations for the chromosome-scale high differentiation. One explanation is the enrichment of most flowering time-related genes on nearly the whole chromosome 7, and these genes collectively determine the flowering time difference between M. candidum and M. normale. However, no enriched GO terms and KEGG pathways related to flowering were found in the GO and KEGG enrichment analysis for highly differentiated genes (most on chromosome 7). Furthermore, of 367 M. candidum orthologs of flowering-related genes in Arabidopsis, only 37 (10.1%) were located on chromosome 7. So, this explanation does not hold. The other explanation is that large inversion(s) exists on chromosome 7 between M. candidum and M. normale, and, meanwhile, the inversion(s) contains one or more genes contributing mainly to flowering time differences between the two species, because large inversion(s) can highly suppress recombination [31,32,33,34]. Our results are consistent with this explanation. On one hand, we identified five large inversions on chromosome 7, including the largest one of 12.7 Mb, which collectively span nearly the whole chromosome. On the other hand, the highly differentiated regions on chromosome 7 indeed contain a few genes related to flowering time (Table 2).
Among 15 candidate flowering time-related genes, CDF3, CIB1 and FKF1 genes are included in the photoperiod pathway that controls flowering in Arabidopsis. CDF3 (Cyclic DOF factor 3) is one of the finger (DOF)-type transcription factors that are implicated in responses to light and phytohormones, and repress flowering by downregulating CONSTANS (CO), a gene that accelerates flowering in response to long days [6,35]. CIB1 (cryptochrome-interacting basic helix-loop-helix) interacts with CRY2 (a cryptochrome that mediates light response) and CIB1-related proteins to promote CRY2-dependent floral initiation [36]. FKF1 (FLAVIN-BINDING, KELCH REPEAT, F-BOX 1) controls the daytime expression pattern of CO [37,38]. AGL8, also known as FRUITFULL (FUL), belongs to the age pathway in Arabidopsis, and regulates FLOWERING LOCUS C (FLC) [39], the key gene in both the vernalization pathway and the autonomous flowering pathway [40]. FY-encoding protein interacts with FLC and FCA, with the latter being a post-transcriptional regulator that strongly promotes the transition to flowering in Arabidopsis [41]. The interactions of FY, FCA and FLC regulate flowering time through the autonomous pathway in Arabidopsis [42]. CSTF77 (cleavage stimulation factor subunit 77) is a conserved RNA 3′-end-processing factor, which is required in FLC silencing and, thus, regulates the onset of flowering [43]. SPA3 (SPA1-related 3), a member of the Suppressor of phyA-105 (SPA1) gene family, regulates photomorphogenesis in the light in Arabidopsis seedlings [44]. COL9 is a member of the CONSTANS-like (COL) gene family, and overexpression of COL9 results in delayed flowering in transgenic Arabidopsis [45]. AGL15 acts on the upstream of FLOWERING LOCUS T (FT), and increased expression of FT can delay flowering time [46,47]. FHY1 (FAR-RED ELONGATED HYPOCOTYL 1), a member of the photoperiod pathway and primarily involved in far-red light sensing, mediates nuclear accumulation of the Phytochrome A (PHYA) in Arabidopsis [48,49]. NFYA1 (nuclear transcription factor Y subunit A-1) is a subunit of the HAP complex, and the expression of HAP complex subunits affects flowering time by reducing activation of FT in Arabidopsis. In summary, the M. candidum orthologs of these flowering-related genes contain highly differentiated SNP(s), suggesting that they might contribute to flowering time differences between M. candidum and M. normale.
Large chromosomal inversions, which can severely inhibit recombination at the inversion regions, play important roles in reproductive isolation and local adaptation [31,50]. For example, 21 polymorphic inversions likely facilitate local adaptation, and cause near-complete suppression of recombination when heterozygous in deer mice [51]. An inversion of 115 Mb was discovered in the genome of the common quail, and individuals with this inversion are larger and have darker throat coloration and rounder wings, which cause poorer flight efficiency [52]. In the stick insect, a chromosomal inversion suppresses effective recombination and is associated with crypsis, which contributes largely to adaptation to different microhabitats [53]. A major inversion between subspecies of Boechera stricta contains genes controlling ecologically important traits, including flowering time difference, which is expected to enhance reproductive isolation between subspecies [54]. Locally adapted alleles can be captured by inversions and promote incipient speciation [34]. It is likely that a few flowering-related genes were captured by large inversions on chromosome 7 in Melastoma, which causes high differentiation on nearly the whole chromosome between the two extreme bulks for flowering time.
However, we could not determine which gene(s) confer flowering time differences between M. candidum and M. normale. Due to the presence of large inversions, it is possible that even variation in one causal gene related to flowering time within an inversion can cause high differentiation in the whole region spanning the inversion between the two extreme bulks for flowering time. So, we could not narrow down the region associated with flowering time within the inversion.

3.2. Genomic Regions Related to Flower Number per Inflorescence in Melastoma

All species of Melastoma have terminal, dichasial cymes, and most species, including M. candidum, usually have 3–5 flowers per inflorescence, while several species, including M. normale, M. imbricatum, M. joffrei and M. malabathricum, usually have >9 flowers per inflorescence [24]. As shown in Results, more flowers per inflorescence result from more inflorescence branching on the rachis.
Among candidate genes related to inflorescence architecture identified in other species, PNF is expressed in the shoot apical meristem in Arabidopsis and is required for the maintenance of boundaries between the initiating floral primordia and inflorescence meristem [55]; FIL (FILAMENTOUS FLOWER) regulates the inflorescence meristem and floral meristem development, and the fil mutant forms two types of inflorescences and flowers [56]. LAS is implicated in the process of axillary meristem initiation in Arabidopsis [57]. NF-YC11 is expressed in vascular tissues and is a potential target for flowering time control in Arabidopsis [58], and its role in inflorescence development remains unclear. The BCAL2 gene is an uncharacterized aminotransferase-like sequence [59]. As this gene also shows high differentiation between the two bulks for flowering time, we infer that this gene may have dual roles. A previous study also showed that some genes related to flowering also have functions in inflorescence development [6]. These genes with highly differentiated SNPs between the two bulks exist in different chromosomes, suggesting that flower number per inflorescence in Melastoma might be controlled by multiple genes, without any gene of major effect. The divergence of these genes between M. candidum and M. normale may result in differences in flower number per inflorescence between them. This can be tested in other species of Melastoma that also show differences in this trait.
In dogwoods (Cornus), expressional alternations of two inflorescence architecture genes (TFL1 and AP1) among different species correlate with major evolutionary shifts in inflorescence architecture [60], and we speculate that difference in flower number per inflorescence among Melastoma species may also be associated with expressional changes in some inflorescence architecture genes identified in this study, which needs further testing. If so, molecular-assisted selection can be used for new cultivar breeding by examining the expressional patterns of these genes.

4. Materials and Methods

4.1. Construction of an F2 Segregating Population and Bulk Sampling

One individual each of M. normale and M. candidum was collected from Zhuhai, Guangdong and Shanglin, Guangxi, respectively, and both of them were planted on the campus of Sun Yat-sen University, Guangzhou. The pollens of M. normale were collected in March and stored immediately at −80 °C until the flowering of M. candidum in early June 2018. Stamens of M. candidum were removed one day before flowering, and pollens of M. normale were placed on the stigma of M. candidum on the first day of flowering. Pollinated flowers were bagged for two days until the styles fell. When the fruits ripened, F1 seeds were obtained and sowed. F2 hybrids were obtained through F1 selfing when F1 individuals started to flower in 2020. The F2 individuals were grown in 10 cm × 10 cm plastic pots in the greenhouse of Sun Yat-sen University under natural light conditions. Finally, an F2 segregating population comprising 188 individuals was obtained and used in the subsequent analysis. For each F2 individual, the flowering date of the first flower and flower number per inflorescence (we used the average number from three randomly chosen inflorescences) were recorded in 2022.
Extreme bulks for flowering time and flower number per inflorescence from the F2 population were sampled separately. Fresh leaves of 30 individuals each, with the earliest flowering, the latest flowering, the highest flower number per inflorescence and the lowest flower number per inflorescence, were collected, respectively. The leaves were dried in plastic bags with silica gels. For each of the four extreme bulks mentioned above, we pooled the dried leaf tissues with equal weights of all 30 individuals for DNA isolation, as conducted in [61]. In addition, we also collected fresh leaves of three F1 individuals and dried them using the same method.

4.2. Library Preparation and Illumina Sequencing

We extracted DNA from the four pooled samples and three F1 individuals using a Magen HiPure Plant DNA Mini Kit (Magen Biotech, Guangzhou, China). The quality and quantity of genomic DNA were determined by 1% agarose gel electrophoresis and a Qubit 3.0 Fluorometer (Invitrogen, Carlsbad, CA, USA). For each DNA sample, a sequencing library with an insert size of 400 bp was constructed using an Illumina TruSeq Nano DNA Library Prep Kit (Illumina, San Diego, CA, USA). The resulting four DNA libraries were sequenced on an Illumina HiSeq X Ten platform, generating about 30 Gb (~117 × of the M. candidum genome at 256.2 Mb) of paired-end raw reads of 150 bp (PE-150) for each sample. For the three F1 individuals, about 6.1, 6.7 and 7.1 Gb of PE-150 were generated, respectively.

4.3. Bulk Segregation Analysis

The raw reads were filtered using Trimmomatic (version 0.39) [62] with the parameters “ILLUMINACLIP:TruSeq3-PE:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:100”. Clean reads of each sample were mapped to the reference genome of M. candidum (GenBank accession: GCA_023653495.1) using BWA-MEM [63] with default parameters. The aligned reads were sorted using the sort module and the PCR duplications were removed using the rmdup module in SAMtools version 1.10 [64]. Final alignment files for each extreme bulk pair were combined to generate multiple pileup files using the mpileup module in SAMtools.
The multiple pileup files were supplied to Popoolation2 [65] for analyzing the genetic differentiation between each of the two extreme bulk pairs. The pileup file for each bulk pair was synchronized using the mpileup2sync.jar module in Popoolation2. To reduce the influence of insertions/deletions (indels) on SNP identification, the two perl scripts identify-indel-regions.pl and filter-sync-by-gtf.pl in Popoolation2 were used to filter out regions around indels with default parameters. The Fst for each SNP was calculated using the script fst-sliding.pl in Popoolation2 with parameters of suppress-noninformative, min-count 2, min-coverage 20, max-coverage 2%, min-covered-fraction 1 and pool-size 30 and five different sets of window size and step size (window-size 50 kb, step-size 10 kb; window-size 10 kb, step-size 2 kb; window-size 5 kb, step-size 1 kb; window-size 1 kb, step-size 0.5 kb; and window-size 1 bp, step-size 1 bp).

4.4. Annotation of Highly Differentiated SNPs between Each Bulk Pair

The gene annotation file of the M. candidum genome was downloaded from the NCBI database, and converted to a GenePred file using the gtfToGenePred tool (https://hgdownload.cse.ucsc.edu/admin/exe/linux.x86_64/gff3ToGenePred; accessed on 13 March 2018). Then, the mRNA sequences were retrieved using the perl script retrieve_seq_from_fasta.pl in ANNOVAR version 2020-06-07 [66], and SNPs showing high genetic differentiation between each bulk pair were annotated using the script annotate_variation.pl with the parameter --neargene 2000 to identify genes containing these SNPs.

4.5. GO and KEGG Pathway Enrichment Analyses of Candidate Genes Associated with Flowering Time and Flower Number per Inflorescence

The identified genes were functionally annotated using BLAST 2.2.24+ (ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/; accessed on 9 September 2011) against the Swiss-Prot database (https://ftp.ncbi.nlm.nih.gov/blast/db/FASTA/swissprot.gz; accessed on 6 February 2024). The Gene Ontology (GO) and KEGG pathway enrichment analyses of these candidate genes were carried out with the functional enrichment analysis tool DAVID [67], with all the M. candidum genes as the background.

4.6. Detection of Inversions between the Genomes of M. candidun and M. normale

Delly v.1.2.6 [68] was applied to detect inversions between the genomes of M. candidun and M. normale by analyzing Illumina sequencing data of three F1 individuals and two extreme bulks for flowering time, respectively. Briefly, Illumina sequencing data of the three F1 individuals were mapped to the reference genome of M. candidum using the same method described before. Then, the alignment file of each individual was used as input to call structural variants using Delly with default parameters. For an inversion between the genomes of M. candidun and M. normale, it is expected that one of the two alleles in the F1 individuals should carry the inversion. Therefore, we kept only inversions supported by about half of the paired-end reads and split reads on the mapped position. We then examined whether these inversions also exist in the two extreme bulks for flowering time. These inversions were further visualized and verified in Integrative Genomics Viewer (IGV) v. 2.16.2 [69].

4.7. Identification of Orthologous Proteins between Arabidopsis thaliana and M. candidum

The protein sequences of A. thaliana were downloaded from the TAIR website (https://www.arabidopsis.org/download/file?path=Proteins/Araport11_protein_lists/Araport11_pep_20220914_representative_gene_model.gz; accessed on 14 September 2022). Orthogroups of proteins between A. thaliana and M. candidum were identified using OrthoFinder v2.4.0 [70]. Orthogroups containing 292 flowering time genes in A. thaliana summarized in previous studies [26,27] were extracted (Table S4), and the corresponding orthologs in M. candidum were identified. Genes involved in the regulation of inflorescence architecture in A. thaliana and other species summarized from previous studies [71,72,73,74] were extracted (Table S7).

5. Conclusions

Using bulk segregation analyses, we identified genomic regions and genes associated with differences in flowering time and flower number per inflorescence between two Melastoma species. Surprisingly, nearly the whole of chromosome 7 shows high differentiation between early and late flowering time bulks in the F2 population. A gene or multiple genes related to flowering time difference within large chromosomal inversions on this chromosome between the two species should cause this chromosome-scale differentiation. These large inversions also prevent us from narrowing down the genomic region(s) associated with flowering time differences between the two species. Further studies can select other Melastoma species differing in flowering time but lacking in large inversions between them to identify genomic regions and genes underlying this trait. Flower number per inflorescence in Melastoma appears to be controlled by multiple genes without any gene of major effect. It remains unknown whether the expressional alternations of these identified genes correlate with differences in the number of flowers per inflorescence between species. Further RNA-seq experiments are needed to test this hypothesis.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/ijms251910250/s1.

Author Contributions

J.C. and R.Z. designed the study. J.C., Y.Z. and P.Z. collected the samples. J.C., Y.Z. and R.Z. performed the analyses. J.N., Y.L. and S.D. contributed technical assistance. J.C. and R.Z. drafted the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported financially by the Guangzhou Collaborative Innovation Center on S&T of Ecology and Landscape (202206010058) and the Natural Science Foundation of China (32170217).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All raw sequencing data used in this research have been deposited in a genome sequence archive (GSA, Beijing, China, https://ngdc.cncb.ac.cn/gsa/). The accession number of this project is PRJCA008154, and the accession numbers of raw Illumina data are CRA005999, CRX327799, CRX327798, CRX327797 and CRX327796.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

References

  1. Gaudinier, A.; Blackman, B.K. Evolutionary processes from the perspective of flowering time diversity. New Phytol. 2020, 225, 1883–1898. [Google Scholar] [CrossRef] [PubMed]
  2. Rieseberg, L.H.; Willis, J.H. Plant speciation. Science 2007, 317, 910–914. [Google Scholar] [CrossRef] [PubMed]
  3. Matsumoto, T.; Yasumoto, A.A.; Nitta, K.; Hirota, S.K.; Yahara, T.; Tachida, H. Difference in flowering time can initiate speciation of nocturnally flowering species. J. Theor. Biol. 2015, 370, 61–71. [Google Scholar] [CrossRef] [PubMed]
  4. Xu, X.; Meng, Q.; Geng, M.; Ren, N.; Zhou, L.; Du, Y.; Cai, Z.; Wang, M.; Wang, X.; Wang, X.; et al. Divergence in flowering time is a major component contributing to reproductive isolation between two wild rice species (Oryza rufipogon and O. nivara). Sci. China Life Sci. 2020, 63, 1714–1724. [Google Scholar] [CrossRef]
  5. Cheng, J.; Zhou, Y.; Lv, T.; Xie, C.; Tian, C. Research progress on the autonomous flowering time pathway in Arabidopsis. Physiol. Mol. Biol. Plants 2017, 23, 477–485. [Google Scholar] [CrossRef]
  6. Fornara, F.; de Montaigu, A.; Coupland, G. SnapShot: Control of flowering in Arabidopsis. Cell 2010, 141, 550.e1–550.e2. [Google Scholar] [CrossRef]
  7. Sakuma, S.; Schnurbusch, T. Of floral fortune: Tinkering with the grain yield potential of cereal crops. New Phytol. 2020, 225, 1873–1882. [Google Scholar] [CrossRef]
  8. Song, J.; Li, Z.; Liu, Z.; Guo, Y.; Qiu, L.J. Next-generation sequencing from bulked-segregant analysis accelerates the simultaneous identification of two qualitative genes in soybean. Front. Plant Sci. 2017, 8, 919. [Google Scholar] [CrossRef]
  9. Tiwari, S.; Sl, K.; Kumar, V.; Singh, B.; Rao, A.R.; Mithra Sv, A.; Rai, V.; Singh, A.K.; Singh, N.K. Mapping QTLs for salt tolerance in rice (Oryza sativa L.) by bulked segregant analysis of recombinant inbred lines using 50K SNP chip. PLoS ONE 2016, 11, e0153610. [Google Scholar] [CrossRef]
  10. Singh, V.K.; Khan, A.W.; Jaganathan, D.; Thudi, M.; Roorkiwal, M.; Takagi, H.; Garg, V.; Kumar, V.; Chitikineni, A.; Gaur, P.M.; et al. QTL-seq for rapid identification of candidate genes for 100-seed weight and root/total plant dry weight ratio under rainfed conditions in chickpea. Plant Biotechnol. J. 2016, 14, 2110–2119. [Google Scholar] [CrossRef]
  11. Singh, V.; Sinha, P.; Obala, J.; Khan, A.W.; Chitikineni, A.; Saxena, R.K.; Varshney, R.K. QTL-seq for the identification of candidate genes for days to flowering and leaf shape in pigeonpea. Heredity 2022, 128, 411–419. [Google Scholar] [CrossRef] [PubMed]
  12. Huang, G.; Wu, W.; Chen, Y.; Zhi, X.; Zou, P.; Ning, Z.; Fan, Q.; Liu, Y.; Deng, S.; Zeng, K.; et al. Balancing selection on an MYB transcription factor maintains the twig trichome color variation in Melastoma normale. BMC Biol. 2023, 21, 122. [Google Scholar] [CrossRef] [PubMed]
  13. Lu, H.; Lin, T.; Klein, J.; Wang, S.; Qi, J.; Zhou, Q.; Sun, J.; Zhang, Z.; Weng, Y.; Huang, S. QTL-seq identifies an early flowering QTL located near Flowering Locus T in cucumber. Theor. Appl. Genet. 2014, 127, 1491–1499. [Google Scholar] [CrossRef] [PubMed]
  14. Xu, X.; Lu, L.; Zhu, B.; Xu, Q.; Qi, X.; Chen, X. QTL mapping of cucumber fruit flesh thickness by SLAF-seq. Sci. Rep. 2015, 5, 15829. [Google Scholar] [CrossRef] [PubMed]
  15. Win, K.T.; Vegas, J.; Zhang, C.; Song, K.; Lee, S. QTL mapping for downy mildew resistance in cucumber via bulked segregant analysis using next-generation sequencing and conventional methods. Theor. Appl. Genet. 2017, 130, 199–211. [Google Scholar] [CrossRef]
  16. Cao, M.; Li, S.; Deng, Q.; Wang, H.; Yang, R. Identification of a major-effect QTL associated with pre-harvest sprouting in cucumber (Cucumis sativus L.) using the QTL-seq method. BMC Genom. 2021, 22, 249. [Google Scholar] [CrossRef]
  17. Yang, Z.; Huang, D.; Tang, W.; Zheng, Y.; Liang, K.; Cutler, A.J.; Wu, W. Mapping of quantitative trait loci underlying cold tolerance in rice seedlings via high-throughput sequencing of pooled extremes. PLoS ONE 2013, 8, e68433. [Google Scholar] [CrossRef]
  18. Zheng, W.; Wang, Y.; Wang, L.; Ma, Z.; Zhao, J.; Wang, P.; Zhang, L.; Liu, Z.; Lu, X. Genetic mapping and molecular marker development for Pi65(t), a novel broad-spectrum resistance gene to rice blast using next-generation sequencing. Theor. Appl. Genet. 2016, 129, 1035–1044. [Google Scholar] [CrossRef]
  19. Dobbels, A.A.; Michno, J.M.; Campbell, B.W.; Virdi, K.S.; Stec, A.O.; Muehlbauer, G.J.; Naeve, S.L.; Stupar, R.M. An induced chromosomal translocation in soybean disrupts a KASI ortholog and is associated with a high-sucrose and low-oil seed phenotype. G3 Genes Genomes Genet. 2017, 7, 1215–1223. [Google Scholar] [CrossRef]
  20. Zhang, X.; Dai, J.H.; Liu, X.; Li, Z.; Lee, S.Y.; Zhou, R.; Tan, G. Lectotypification of the name Melastoma candidum f. albiflorum and its taxonomic status. PhytoKeys 2020, 146, 47–52. [Google Scholar] [CrossRef]
  21. Zheng, W.; Ren, Y.; Wu, M.; Yang, Y.; Fan, Y.; Piao, X.; Ge, Y.; Wang, S. A review of the traditional uses, phytochemistry and biological activities of the Melastoma genus. J. Ethnopharmacol. 2021, 264, 113322. [Google Scholar] [CrossRef] [PubMed]
  22. Zhong, Y.; Wu, W.; Sun, C.; Zou, P.; Liu, Y.; Dai, S.; Zhou, R. Chromosomal-level genome assembly of Melastoma candidum provides insights into trichome evolution. Front. Plant Sci. 2023, 14, 1126319. [Google Scholar] [CrossRef] [PubMed]
  23. Dai, S.; Zhang, J.; He, M.; Xiong, Y.; Wang, W.; Liu, W.; Ni, J.; Li, X.; Zhou, R. Two new Melastoma cultivars ‘Tianjiao’ and ‘Xinyuan’. Acta Hortic. Sin. 2016, 43, 1847–1848. [Google Scholar] [CrossRef]
  24. Chen, J. Melastomataceae. In Flora Reipublicae Popularis Sinicae; Science Press: Beijing, China, 1984; Volume 53, pp. 152–162. [Google Scholar]
  25. Vasconcelos, T.N.; Proença, C.E. Floral cost vs. floral display: Insights from the megadiverse Myrtales suggest that energetically expensive floral parts are less phylogenetically constrained. Am. J. Bot. 2015, 102, 900–909. [Google Scholar] [CrossRef]
  26. Atwell, S.; Huang, Y.S.; Vilhjálmsson, B.J.; Willems, G.; Horton, M.; Li, Y.; Meng, D.; Platt, A.; Tarone, A.M.; Hu, T.T. Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines. Nature 2010, 465, 627–631. [Google Scholar] [CrossRef]
  27. Srikanth, A.; Schmid, M. Regulation of flowering time: All roads lead to Rome. Cell. Mol. Life Sci. 2011, 68, 2013–2037. [Google Scholar] [CrossRef]
  28. Guo, S.; Xu, Y.; Liu, H.; Mao, Z.; Zhang, C.; Ma, Y.; Zhang, Q.; Meng, Z.; Chong, K. The interaction between OsMADS57 and OsTB1 modulates rice tillering via DWARF14. Nat. Commun. 2013, 4, 1566. [Google Scholar] [CrossRef]
  29. Chen, L.; Zhao, Y.; Xu, S.; Zhang, Z.; Xu, Y.; Zhang, J.; Chong, K. OsMADS57 together with OsTB1 coordinates transcription of its target OsWRKY94 and D14 to switch its organogenesis to defense for cold adaptation in rice. New Phytol. 2018, 218, 219–231. [Google Scholar] [CrossRef]
  30. Ortiz-Barrientos, D.; Engelstädter, J.; Rieseberg, L.H. Recombination rate evolution and the origin of species. Trends Ecol. Evol. 2016, 31, 226–236. [Google Scholar] [CrossRef]
  31. Wellenreuther, M.; Bernatchez, L. Eco-evolutionary genomics of chromosomal inversions. Trends Ecol. Evol. 2018, 33, 427–440. [Google Scholar] [CrossRef]
  32. Noor, M.A.F.; Gratos, K.L.; Bertucci, L.A.; Reiland, J. Chromosomal inversions and the reproductive isolation of species. Proc. Natl. Acad. Sci. USA 2001, 98, 12084–12088. [Google Scholar] [CrossRef] [PubMed]
  33. Rieseberg, L.H. Chromosomal rearrangements and speciation. Trends Ecol. Evol. 2001, 16, 351–358. [Google Scholar] [CrossRef] [PubMed]
  34. Kirkpatrick, M.; Barton, N. Chromosome inversions, local adaptation and speciation. Genetics 2006, 173, 419–434. [Google Scholar] [CrossRef] [PubMed]
  35. Corrales, A.R.; Carrillo, L.; Lasierra, P.; Nebauer, S.G.; Dominguez-Figueroa, J.; Renau-Morata, B.; Pollmann, S.; Granell, A.; Molina, R.V.; Vicente-Carbajosa, J.; et al. Multifaceted role of cycling DOF factor 3 (CDF3) in the regulation of flowering time and abiotic stress responses in Arabidopsis. Plant Cell Environ. 2017, 40, 748–764. [Google Scholar] [CrossRef] [PubMed]
  36. Liu, H.; Yu, X.; Li, K.; Klejnot, J.; Yang, H.; Lisiero, D.; Lin, C. Photoexcited CRY2 interacts with CIB1 to regulate transcription and floral initiation in Arabidopsis. Science 2008, 322, 1535–1539. [Google Scholar] [CrossRef]
  37. Imaizumi, T.; Schultz, T.F.; Harmon, F.G.; Ho, L.A.; Kay, S.A. FKF1 F-Box protein mediates cyclic degradation of a repressor of CONSTANS in Arabidopsis. Science 2005, 309, 293–297. [Google Scholar] [CrossRef]
  38. Imaizumi, T.; Tran, H.G.; Swartz, T.E.; Briggs, W.R.; Kay, S.A. FKF1 is essential for photoperiodic-specific light signalling in Arabidopsis. Nature 2003, 426, 302–306. [Google Scholar] [CrossRef]
  39. Balanzà, V.; Martínez-Fernández, I.; Ferrándiz, C. Sequential action of FRUITFULL as a modulator of the activity of the floral regulators SVP and SOC1. J. Exp. Bot. 2014, 65, 1193–1203. [Google Scholar] [CrossRef]
  40. Sheldon, C.C.; Rouse, D.T.; Finnegan, E.J.; Peacock, W.J.; Dennis, E.S. The molecular basis of vernalization: The central role of FLOWERING LOCUS C (FLC). Proc. Natl. Acad. Sci. USA 2000, 97, 3753–3758. [Google Scholar] [CrossRef]
  41. Macknight, R.; Bancroft, I.; Page, T.; Lister, C.; Schmidt, R.; Love, K.; Westphal, L.; Murphy, G.; Sherson, S.; Cobbett, C.; et al. FCA, a gene controlling flowering time in Arabidopsis, encodes a protein containing RNA-binding domains. Cell 1997, 89, 737–745. [Google Scholar] [CrossRef]
  42. Simpson, G.G.; Dijkwel, P.P.; Quesada, V.; Henderson, I.; Dean, C. FY is an RNA 3′ end-processing factor that interacts with FCA to control the Arabidopsis floral transition. Cell 2003, 113, 777–787. [Google Scholar] [CrossRef] [PubMed]
  43. Liu, F.; Marquardt, S.; Lister, C.; Swiezewski, S.; Dean, C. Targeted 3′ processing of antisense transcripts triggers Arabidopsis FLC chromatin silencing. Science 2010, 327, 94–97. [Google Scholar] [CrossRef] [PubMed]
  44. Laubinger, S.; Hoecker, U. The SPA1-like proteins SPA3 and SPA4 repress photomorphogenesis in the light. Plant J. 2003, 35, 373–385. [Google Scholar] [CrossRef] [PubMed]
  45. Cheng, X.; Wang, Z. Overexpression of COL9, a CONSTANS-LIKE gene, delays flowering by reducing expression of CO and FT in Arabidopsis thaliana. Plant J. 2005, 43, 758–768. [Google Scholar] [CrossRef] [PubMed]
  46. Adamczyk, B.J.; Lehti-Shiu, M.D.; Fernandez, D.E. The MADS domain factors AGL15 and AGL18 act redundantly as repressors of the floral transition in Arabidopsis. Plant J. 2007, 50, 1007–1019. [Google Scholar] [CrossRef]
  47. Turck, F.; Fornara, F.; Coupland, G. Regulation and identity of florigen: FLOWERING LOCUS T moves center stage. Annu. Rev. Plant Biol. 2008, 59, 573–594. [Google Scholar] [CrossRef]
  48. Hiltbrunner, A.; Tscheuschler, A.; Viczián, A.; Kunkel, T.; Kircher, S.; Schäfer, E. FHY1 and FHL act together to mediate nuclear accumulation of the Phytochrome A photoreceptor. Plant Cell Physiol. 2006, 47, 1023–1034. [Google Scholar] [CrossRef]
  49. Genoud, T.; Schweizer, F.; Tscheuschler, A.; Debrieux, D.; Casal, J.J.; Schäfer, E.; Hiltbrunner, A.; Fankhauser, C. FHY1 mediates nuclear import of the light-activated Phytochrome A photoreceptor. PLoS Genet. 2008, 4, e1000143. [Google Scholar] [CrossRef]
  50. Hoffmann, A.A.; Rieseberg, L.H. Revisiting the impact of inversions in evolution: From population genetic markers to drivers of adaptive shifts and speciation? Annu. Rev. Ecol. Evol. Syst. 2008, 39, 21–42. [Google Scholar] [CrossRef]
  51. Harringmeyer, O.S.; Hoekstra, H.E. Chromosomal inversion polymorphisms shape the genomic landscape of deer mice. Nat. Ecol. Evol. 2022, 6, 1965–1979. [Google Scholar] [CrossRef]
  52. Sanchez-Donoso, I.; Ravagni, S.; Rodríguez-Teijeiro, J.D.; Christmas, M.J.; Huang, Y.; Maldonado-Linares, A.; Puigcerver, M.; Jiménez-Blasco, I.; Andrade, P.; Gonçalves, D.; et al. Massive genome inversion drives coexistence of divergent morphs in common quails. Curr. Biol. 2022, 32, 462–469.e6. [Google Scholar] [CrossRef] [PubMed]
  53. Lindtke, D.; Lucek, K.; Soria-Carrasco, V.; Villoutreix, R.; Farkas, T.E.; Riesch, R.; Dennis, S.R.; Gompert, Z.; Nosil, P. Long-term balancing selection on chromosomal variants associated with crypsis in a stick insect. Mol. Ecol. 2017, 26, 6189–6205. [Google Scholar] [CrossRef] [PubMed]
  54. Lee, C.R.; Wang, B.; Mojica, J.P.; Mandáková, T.; Prasad, K.V.S.K.; Goicoechea, J.L.; Perera, N.; Hellsten, U.; Hundley, H.N.; Johnson, J.; et al. Young inversion with multiple linked QTLs under selection in a hybrid zone. Nat. Ecol. Evol. 2017, 1, 0119. [Google Scholar] [CrossRef] [PubMed]
  55. Kanrar, S.; Onguka, O.; Smith, H.M.S. Arabidopsis inflorescence architecture requires the activities of KNOX-BELL homeodomain heterodimers. Planta 2006, 224, 1163–1173. [Google Scholar] [CrossRef] [PubMed]
  56. Sawa, S.; Ito, T.; Shimura, Y.; Okada, K. FILAMENTOUS FLOWER controls the formation and development of Arabidopsis inflorescences and floral meristems. Plant Cell 1999, 11, 69–86. [Google Scholar] [CrossRef]
  57. Greb, T.; Clarenz, O.; Schafer, E.; Muller, D.; Herrero, R.; Schmitz, G.; Theres, K. Molecular analysis of the LATERAL SUPPRESSOR gene in Arabidopsis reveals a conserved control mechanism for axillary meristem formation. Genes Dev. 2003, 17, 1175–1187. [Google Scholar] [CrossRef]
  58. Siefers, N.; Dang, K.K.; Kumimoto, R.W.; Bynum Iv, W.E.; Tayrose, G.; Holt Iii, B.F. Tissue-specific expression patterns of Arabidopsis NF-Y transcription factors suggest potential for extensive combinatorial complexity. Plant Physiol. 2009, 149, 625–641. [Google Scholar] [CrossRef]
  59. Liepman, A.H.; Olsen, L.J. Genomic analysis of aminotransferases in Arabidopsis thaliana. Crit. Rev. Plant Sci. 2004, 23, 73–89. [Google Scholar] [CrossRef]
  60. Feng, C.; Xiang, Q.; Franks, R.G. Phylogeny-based developmental analyses illuminate evolution of inflorescence architectures in dogwoods (Cornus s. l., Cornaceae). New Phytol. 2011, 191, 850–869. [Google Scholar] [CrossRef]
  61. Zhou, R.; Ling, S.; Zhao, W.; Osada, N.; Chen, S.; Zhang, M.; He, Z.; Bao, H.; Zhong, C.; Zhang, B.; et al. Population genetics in nonmodel organisms: II. Natural selection in marginal habitats revealed by deep sequencing on dual platforms. Mol. Biol. Evol. 2011, 28, 2833–2842. [Google Scholar] [CrossRef]
  62. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [PubMed]
  63. Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv 2013, arXiv:1303.3997. [Google Scholar]
  64. Danecek, P.; Bonfield, J.K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M.O.; Whitwham, A.; Keane, T.; McCarthy, S.A.; Davies, R.M.; et al. Twelve years of SAMtools and BCFtools. Gigascience 2021, 10, giab008. [Google Scholar] [CrossRef] [PubMed]
  65. Kofler, R.; Pandey, R.V.; Schlötterer, C. PoPoolation2: Identifying differentiation between populations using sequencing of pooled DNA samples (Pool-Seq). Bioinformatics 2011, 27, 3435–3436. [Google Scholar] [CrossRef] [PubMed]
  66. Wang, K.; Li, M.; Hakonarson, H. ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010, 38, e164. [Google Scholar] [CrossRef] [PubMed]
  67. Sherman, B.; Hao, M.; Leidos, L.; Jiao, X.; Baseler, M.; Lane, H.; Imamichi, T.; Chang, W. DAVID: A web server for functional enrichment analysis and functional annotation of gene lists (2021 update). Nucleic Acids Res. 2022, 50, W216–W221. [Google Scholar] [CrossRef]
  68. Rausch, T.; Zichner, T.; Schlattl, A.; Stütz, A.M.; Benes, V.; Korbel, J.O. DELLY: Structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics 2012, 28, i333–i339. [Google Scholar] [CrossRef]
  69. Thorvaldsdóttir, H.; Robinson, J.T.; Mesirov, J.P. Integrative Genomics Viewer (IGV): High-performance genomics data visualization and exploration. Brief. Bioinform. 2013, 14, 178–192. [Google Scholar] [CrossRef]
  70. Emms, D.M.; Kelly, S. OrthoFinder: Phylogenetic orthology inference for comparative genomics. Genome Biol. 2019, 20, 238. [Google Scholar] [CrossRef]
  71. Ma, Q.; Liu, X.; Franks, R.G.; Xiang, Q. Alterations of CorTFL1 and CorAP1 expression correlate with major evolutionary shifts of inflorescence architecture in Cornus (Cornaceae)—A proposed model for variation of closed inflorescence forms. New Phytol. 2017, 216, 519–535. [Google Scholar] [CrossRef]
  72. Chen, D.; Yan, W.; Fu, L.; Kaufmann, K. Architecture of gene regulatory networks controlling flower development in Arabidopsis thaliana. Nat. Commun. 2018, 9, 4534. [Google Scholar] [CrossRef] [PubMed]
  73. Wils, C.R.; Kaufmann, K. Gene-regulatory networks controlling inflorescence and flower development in Arabidopsis thaliana. Biochim. Biophys. Acta (BBA)-Gene Regul. Mech. 2017, 1860, 95–105. [Google Scholar] [CrossRef] [PubMed]
  74. Li, J.; Yao, X.; Lai, H.; Zhang, X.; Zhong, J. The diversification of the shoot branching system: A quantitative and comparative perspective in meristem determinacy. Curr. Opin. Plant Biol. 2024, 81, 102574. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Flowering time difference between Melastoma candidum (left) and M. normale (right). The two species are largely sympatrical in South China. Melastoma normale typically flowers from late March to late April, much earlier than M. candidum, which flowers from early June to late July. This photo was taken on the Zhuhai campus of Sun Yat-sen University, Zhuhai, Guangdong, China, on 11 May 2012, when M. normale had already set fruits, while M. candidum had not begun to flower.
Figure 1. Flowering time difference between Melastoma candidum (left) and M. normale (right). The two species are largely sympatrical in South China. Melastoma normale typically flowers from late March to late April, much earlier than M. candidum, which flowers from early June to late July. This photo was taken on the Zhuhai campus of Sun Yat-sen University, Zhuhai, Guangdong, China, on 11 May 2012, when M. normale had already set fruits, while M. candidum had not begun to flower.
Ijms 25 10250 g001
Figure 2. Genomic differentiation between two extreme bulks for flowering time in an F2 population between Melastoma candidum and M. normale. Sliding window Fst analyses were conducted with five sets of different window size and step size, as shown in the upper left of each panel.
Figure 2. Genomic differentiation between two extreme bulks for flowering time in an F2 population between Melastoma candidum and M. normale. Sliding window Fst analyses were conducted with five sets of different window size and step size, as shown in the upper left of each panel.
Ijms 25 10250 g002
Figure 3. Genetic differentiation on chromosome 7 between two extreme bulks for flowering time in an F2 population between Melastoma candidum and M. normale. Sliding window Fst analyses were conducted with five sets of different window sizes and step sizes, as shown in the upper left of each panel.
Figure 3. Genetic differentiation on chromosome 7 between two extreme bulks for flowering time in an F2 population between Melastoma candidum and M. normale. Sliding window Fst analyses were conducted with five sets of different window sizes and step sizes, as shown in the upper left of each panel.
Ijms 25 10250 g003
Figure 4. Genomic location classification of highly differentiated (Fst > 0.3) SNPs between two extreme bulks for flowering time in an F2 population between Melastoma candidum and M. normale. The numbers of SNPs and their percentages are shown. Exonic: SNPs occurring in a coding region; intronic: SNPs occurring in an intron; splicing: SNPs occurring within 2 bp near the exon/intron boundary in an intron; upstream: SNPs occurring in 2 kb upstream of transcription start site; downstream: SNPs occurring in 2 kb downstream of transcription end site; upstream;downstream: SNPs occurring in 2 kb upstream of transcription start site of a gene and, meanwhile, 2 kb downstream of transcription end site of another gene; intergenic: SNPs occurring in intergenic region; UTR5: SNPs occurring in a 5′ untranslated region; UTR3: SNPs occurring in a 3′ untranslated region; UTR5; UTR3: SNPs occurring in a 5′ untranslated region of one gene and, meanwhile, a 3′ untranslated region of another gene.
Figure 4. Genomic location classification of highly differentiated (Fst > 0.3) SNPs between two extreme bulks for flowering time in an F2 population between Melastoma candidum and M. normale. The numbers of SNPs and their percentages are shown. Exonic: SNPs occurring in a coding region; intronic: SNPs occurring in an intron; splicing: SNPs occurring within 2 bp near the exon/intron boundary in an intron; upstream: SNPs occurring in 2 kb upstream of transcription start site; downstream: SNPs occurring in 2 kb downstream of transcription end site; upstream;downstream: SNPs occurring in 2 kb upstream of transcription start site of a gene and, meanwhile, 2 kb downstream of transcription end site of another gene; intergenic: SNPs occurring in intergenic region; UTR5: SNPs occurring in a 5′ untranslated region; UTR3: SNPs occurring in a 3′ untranslated region; UTR5; UTR3: SNPs occurring in a 5′ untranslated region of one gene and, meanwhile, a 3′ untranslated region of another gene.
Ijms 25 10250 g004
Figure 5. Visualization of the largest inversion on chromosome 7 detected from Illumina sequencing data mapping files of two extreme bulks for flowering time in an F2 population between Melastoma candidum and M. normale using Integrative Genomics Viewer. To visualize, we used the options of “set color alignments by pair orientation” and “view mate region in split screen”. Paired-end reads with blue and cyan support inversion. The region ranges containing the inversion breakpoints on chromosome 7 are indicated on the top.
Figure 5. Visualization of the largest inversion on chromosome 7 detected from Illumina sequencing data mapping files of two extreme bulks for flowering time in an F2 population between Melastoma candidum and M. normale using Integrative Genomics Viewer. To visualize, we used the options of “set color alignments by pair orientation” and “view mate region in split screen”. Paired-end reads with blue and cyan support inversion. The region ranges containing the inversion breakpoints on chromosome 7 are indicated on the top.
Ijms 25 10250 g005
Figure 6. Differences in inflorescence architecture (flower number per inflorescence) between Melastoma candidum and M. normale. (A) One fruit and four unopen flowers in an inflorescence of M. candidum; (B) the fruits in an infructescence of M. normale (we show the fruits rather than the flowers because we could not provide photos to show clear inflorescence architecture when flowering). (C) A schematic diagram of the inflorescence architecture of M. candidum; (D) a schematic diagram of the inflorescence architecture of M. normale. (C,D) Purple filled circles, green crescents and green ovals represent flowers, bracts and leaves, respectively, and Br1, Br2 and Br3 represent the first-, second- and third-order branches in the inflorescence.
Figure 6. Differences in inflorescence architecture (flower number per inflorescence) between Melastoma candidum and M. normale. (A) One fruit and four unopen flowers in an inflorescence of M. candidum; (B) the fruits in an infructescence of M. normale (we show the fruits rather than the flowers because we could not provide photos to show clear inflorescence architecture when flowering). (C) A schematic diagram of the inflorescence architecture of M. candidum; (D) a schematic diagram of the inflorescence architecture of M. normale. (C,D) Purple filled circles, green crescents and green ovals represent flowers, bracts and leaves, respectively, and Br1, Br2 and Br3 represent the first-, second- and third-order branches in the inflorescence.
Ijms 25 10250 g006
Figure 7. Genomic differentiation between two extreme bulks for flower number per inflorescence in an F2 population between Melastoma candidum and M. normale. Sliding window Fst analyses were conducted with five sets of different window sizes and step sizes, as shown in the upper left of each panel.
Figure 7. Genomic differentiation between two extreme bulks for flower number per inflorescence in an F2 population between Melastoma candidum and M. normale. Sliding window Fst analyses were conducted with five sets of different window sizes and step sizes, as shown in the upper left of each panel.
Ijms 25 10250 g007
Figure 8. Genomic location classification of highly differentiated (Fst > 0.2) SNPs between two extreme bulks for flower number per inflorescence in an F2 population between Melastoma candidum and M. normale. See Figure 4 for the legend of each category.
Figure 8. Genomic location classification of highly differentiated (Fst > 0.2) SNPs between two extreme bulks for flower number per inflorescence in an F2 population between Melastoma candidum and M. normale. See Figure 4 for the legend of each category.
Ijms 25 10250 g008
Table 1. Fifteen candidate genes related to flowering time differences identified in Melastoma. These genes contain one or more highly differentiated SNPs between two extreme bulks for flowering time in an F2 population between Melastoma candidum and M. normale and are orthologous to flowering time-related genes in Arabidopsis thaliana.
Table 1. Fifteen candidate genes related to flowering time differences identified in Melastoma. These genes contain one or more highly differentiated SNPs between two extreme bulks for flowering time in an F2 population between Melastoma candidum and M. normale and are orthologous to flowering time-related genes in Arabidopsis thaliana.
Gene_IDChromosomeGene NameNumber of Highly Differentiated SNPsFst of Highly Differentiated SNP(s)Arabidopsis OrthologsPathways of Flowering Time in Arabidopsis to Which the Gene Belongs
MLD38_0263377CSTF7750.30–0.36AT1G17760-
MLD38_0240737FY20.33, 0.35AT5G13480-
MLD38_0270997SPA310.31AT1G53090, AT3G15354-
MLD38_0261747CDF320.38, 0.43AT3G47500, AT5G62430Photoperiod pathway
MLD38_0261947AGL810.47AT5G60910Age pathway
MLD38_0244337FUS390.31–0.40AT3G26790-
MLD38_0240587AGL1510.35AT5G13790-
MLD38_0265167FHY120.31, 0.32AT2G37678, AT5G02200-
MLD38_0251187EZA110.32AT1G02580, AT4G02020-
MLD38_0264757COL960.33–0.41AT3G07650, AT5G48250
MLD38_0270287CIB120.30, 0.38AT4G34530Photoperiod pathway
MLD38_0255347FKF160.32–0.45AT1G68050Photoperiod pathway
MLD38_0239617FAR150.32–0.40AT4G15090, AT4G19990-
MLD38_0256457PGM120.30–0.40AT5G51820-
MLD38_0253637HAP2A40.32–0.34AT5G12840-
Table 2. Large inversions on chromosome 7 between Melastoma candidum and M. normale.
Table 2. Large inversions on chromosome 7 between Melastoma candidum and M. normale.
Start PositionEnd PositionInversion Size (bp)Flowering-Related Genes Contained within the Inversion
757,4253,760,5093,003,084FY, AGL15, FAR1
6,342,11119,027,72012,685,609CSTF77, CDF3, AGL8, EZA1, COL9, PGMP, HAP2A, FKF1
10,533,03817,653,6897,120,651AGL8, PGM, FKF1
10,896,85712,709,2751,812,418FKF1
11,415,94914,960,3733,544,424FKF1
Table 3. Four candidate genes related to flower number per inflorescence difference identified in Melastoma. These genes contain one or more highly differentiated SNPs between two extreme bulks for flower number per inflorescence in an F2 population between Melastoma candidum and M. normale and are orthologous to inflorescence architecture-related genes in Arabidopsis thaliana.
Table 3. Four candidate genes related to flower number per inflorescence difference identified in Melastoma. These genes contain one or more highly differentiated SNPs between two extreme bulks for flower number per inflorescence in an F2 population between Melastoma candidum and M. normale and are orthologous to inflorescence architecture-related genes in Arabidopsis thaliana.
Gene_IDChromosomeGene NameNumber of Highly Differentiated SNPsFst of Highly Differentiated SNP(s)Arabidopsis Orthologs
MLD38_04042312JAG50.22–0.30AT1G13400, AT1G68480
MLD38_0040292PNF10.22AT2G27990
MLD38_0238197FIL10.26AT2G45190
MLD38_03510710LAS20.24, 0.24AT1G55580
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, J.; Zhong, Y.; Zou, P.; Ni, J.; Liu, Y.; Dai, S.; Zhou, R. Identification of Genomic Regions Associated with Differences in Flowering Time and Inflorescence Architecture between Melastoma candidum and M. normale. Int. J. Mol. Sci. 2024, 25, 10250. https://doi.org/10.3390/ijms251910250

AMA Style

Chen J, Zhong Y, Zou P, Ni J, Liu Y, Dai S, Zhou R. Identification of Genomic Regions Associated with Differences in Flowering Time and Inflorescence Architecture between Melastoma candidum and M. normale. International Journal of Molecular Sciences. 2024; 25(19):10250. https://doi.org/10.3390/ijms251910250

Chicago/Turabian Style

Chen, Jingfang, Yan Zhong, Peishan Zou, Jianzhong Ni, Ying Liu, Seping Dai, and Renchao Zhou. 2024. "Identification of Genomic Regions Associated with Differences in Flowering Time and Inflorescence Architecture between Melastoma candidum and M. normale" International Journal of Molecular Sciences 25, no. 19: 10250. https://doi.org/10.3390/ijms251910250

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop