Next Article in Journal
Future Climate Effects on Yield and Mortality of Conventional versus Modified Oil Palm in SE Asia
Next Article in Special Issue
Phylogeography of 912 Cherry Accessions Insight into Independent Origins of Fruiting Cherries and Domestication Footprints of Cultivated Chinese Cherry (Prunus pseudocerasus Lindl.)
Previous Article in Journal
Growth and Primary Metabolism of Lettuce Seedlings (Lactuca sativa L.) Are Promoted by an Innovative Iron-Based Fenton-Composted Amendment
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Transcriptome-Based Study on the Phylogeny and Hybridization of Marattialean Ferns (Marattiaceae)

1
School of Ecology and Environmental Science, Yunnan University, Kunming 650091, China
2
School of Life Sciences, Yunnan University, East Outer Ring Road, Chenggong District, Kunming 650500, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Plants 2023, 12(12), 2237; https://doi.org/10.3390/plants12122237
Submission received: 31 March 2023 / Revised: 7 May 2023 / Accepted: 31 May 2023 / Published: 7 June 2023
(This article belongs to the Special Issue Plant Taxonomy, Systematics, and Phylogeography)

Abstract

:
Marattiaceae is a phylogenetically isolated family of tropical eusporangiate ferns including six genera with more than one-hundred species. In Marattiaceae, monophyly of genera has been well-supported phylogenetically. However, the phylogenetic relationships among them were elusive and controversial. Here, a dataset of 26 transcriptomes (including 11 newly generated) were used to assess single-copy nuclear genes and to obtain the organelle gene sequences. Through phylotranscriptomic analysis, the phylogeny and hybridization events of Marattiaceae were explored and a robust phylogenomic framework for the evolution of Marattiaceae was provided. Using both concatenation- and coalescent-based phylogenies, the gene-tree discordance, incomplete lineage sorting (ILS) simulations, and network inference were examined. Except the low support with mitochondrial genes of Marattiaceae, nuclear genes and chloroplast genes strongly supported a sister relationship between Marattiaceae and leptosporangiate ferns. At the genus level, all phylogenetic analysis based on nuclear genes datasets recovered five genera in Marattiaceae as monophyletic with strong support. Danaea and Ptisana were the first two diverged clades in turn. Christensenia was a sister clade to the clade Marattia + Angiopteris s.l. In Angiopteris s.l., three clades (Angiopteris s.s., the Archangiopteris group, and An. sparsisora) were well identified with maximum support. The Archangiopteris group was derived from Angiopteris s.s. at ca. 18 Ma. The putative hybrid species An. sparsisora between Angiopteris s.s. and the Archangiopteris group was verified by the species network analyses and the maternal plastid genes. This study will improve our understanding for using the phylotranscriptomic method to explore phylogeny and investigate hybridization events for difficult taxa in ferns.

1. Introduction

Marattiales Link is one of the most ancient lineages of ferns [1]. Only one family, Marattiaceae Kaulf., was recognized in extant Marattiales, including six genera with ca. 110 species [2]. Marattiales have one of the richest fossil records of all fern lineages (e.g., [3,4,5,6,7,8,9]). Although extant Marattiales are essentially restricted to the tropical regions of the world [1,10], their fossil records have been found in all continents [9,11,12,13,14,15,16,17,18], making it an ideal group for divergence time estimation and learning about the process driving evolution and diversification [19,20]. The eusporangiate Marattiales were often resolved as the sister group of the leptosporangiate ferns (=Polypodiidae Cronquist, Takht. & W. Zimm.) [19,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37]. However, some studies instead resolved Marattiaceae as sister to the clade Psilotales Prant + Ophioglossales Link [38,39,40] or to Equisetales DC. ex Bercht. & J. Presl [41].
In Marattiaceae, Murdock [1] recognized six genera based on a phylogenetic relationship and morphological characters [1,10]. He divided Marattia s.l. into three genera (Marattia s.s. Sw., Eupodium J.Sm., and Ptisana Murdock) and combined another three genera (Angiopteris Hoffm. s.s., Archangiopteris Christ & Giesenh, and Macroglossum Copel.) into Angiopteris s.l. Murdock’s [10] classification was widely accepted and adopted by subsequent scholars [2,42,43,44]. In all previous phylogenetic studies for Marattiaceae, Danaea was recovered as sister to the rest of Marattiaceae and the clade Eupodium + Ptisana was always well-supported (Figure 1). However, depending on different datasets and analysis methods, phylogenetic relationships among Angiopteris s.l., Christensenia, Marattia, and the clade Eupodium + Ptisana varied (Figure 1; also see [10,19,20,21,34,45,46,47,48,49]). Recently, by incorporating good sampling and abundant molecular and morphological data for the extant species, Lehtonen et al. [19] reconstructed the phylogeny of Marattiaceae. Thus, Christensenia was resolved as the sister group to Marattia. May et al. [20] inferred a similar phylogeny to that of Lehtonen et al. [19], but Christensenia was resolved as sister to Angiopteris s.l. instead of to Marattia.
Although Angiopteridaceae Fée ex J was reduced into Marattiaceae and widely accepted and recognized, Angiopteridaceae (=Angiopteris s.l.) was a historically accepted and adopted family, especially in Chinese flora and works (e.g., [50,51,52,53]), which often include three genera: Angiopteris s.s., Archangiopteris, and Macroglossum [54]. Angiopteris s.l. (=Angiopteridaceae Fée ex J) includes three historic genera: Angiopteris s.s., Archangiopteris (=the Archangiopteris group), and Macroglossum [54,55]. Angiopteris s.l. has 30~40 species and represents the highest diversity of Marattiaceae in Paleotropics, especially in China, with ca. 30 species [2,42]. Although Angiopteris s.l. has always been resolved as a well-supported monophyletic and morphologically distinguishable group, its phylogenetic placement was uncertain in Marattiaceae [10,47,56]. In Angiopteris s.l., the Archangiopteris group contains about 10 species and is mainly distributed in Southwest China and northern Vietnam [51,52,53]. Morphologically, species of the Archangiopteris group are smaller, have simple pinnate leaves (except An. bipinnata (Ching) J. M. Camus = (Ar. bipinnata (Ching) J. M. Camus), suberect or creeping rhizomes, one pulvinus (rarely reached five) on the stipe, long and thin sori, and spiny spores, which distinguished them from Angiopteris s.s. However, some hypotheses about natural hybrid species with transitional morphology among Angiopteris s.s. and the Archangiopteris group were also reported, e.g., An. sparsisora Ching and An. sugongii Gui L. Zhang, J.Y. Xiang & Ting Wang [42,51,57,58,59]. A triploid species (An. itoi (W. C. Shieh) J. M. Camus (=Ar. itoi J. M. Camus)) was reported as a hybrid between An. lygodiifolia Rosenstock and An. somae (Hayata) Makino and Nemoto (=Ar. somae Hayata) [60], which was supported by the RAPD markers [60] and chromosome numbers [61]. However, these hybrids have never been studied and tested in phylogenetic analysis. The morphological evolution between Angiopteris s.s. and the Archangiopteris group is another historically debated question since Archangiopteris was reconstructed. Some scholars insisted that the Archangiopteris group has primitive morphological characteristics [47,62], which is supported by some earlier molecular studies (e.g., [23,56,63]). However, comparative study on morphology shows that the Archangiopteris group is very likely divergent from Angiopteris s.s. [51,64]. The Archangiopteris group has been always resolved as a monophyletic group in phylogeny but clustered within Angiopteris s.s. [10]. However, in previous studies, only a few species of the Archangiopteris group were sampled and the phylogenetic relationship between the Archangiopteris group and Angiopteris s.s. was poorly resolved.
One of the underlying reasons for this controversy over the phylogeny in Marattiaceae is due to the limited molecular data. Nowadays, phylogenetic inference based on RNA-Seq is more efficient and cost-effective when lacking whole genome data [39], in which sequences from the nuclear genome contain the majority of genetic information among the three genomes in plants [34]. RNA-seq data have been used to explore the backbone phylogeny in ferns [28,34,37,39,40], but such data were rarely used for specific families or genera in ferns before. In this manuscript, using RNA-seq data, a large number of nuclear genes can be used either for inferring the phylogenetic relationships or dealing with hybridization, speciation, and incomplete lineage sorting in closely related species of Marattiaceae, as demonstrated in previous studies in angiosperms [65,66,67,68].
To explore the deeper relationships of Marattiales, one set of broad representative taxon data, including newly generated transcriptomes data, has been used in this study. Multiple strategies, including different datasets and detection and removal of intra-locus recombinant regions, were applied to minimize systematic errors. Moreover, concatenated approaches, coalescent methods, network analysis, and incomplete lineage sorting (ILS) simulation were used to provide new insights into the evolution of Marattiales.
In this study, we used twenty-six transcriptomes, including Marattiaceae (nearly all extant genera, sixteen samples), Ophioglossaceae (two samples), Psilotaceae (one sample), Polypodiidae (two samples), Equisetaceae (one sample), and (Amborellaceae) in order to (1) revisit the phylogenetic framework of Marattiaceae; (2) investigate phylogenetic incongruence among nuclear gene trees based on different datasets and analysis methods using genome-scale data; (3) explore the phylogenetic relationship between Angiopteris s.s. and the Archangiopteris group; and (4) test the hybridization events of an assumed hybrid species, An. sparsisora, between Angiopteris s.s. and the Archangiopteris groups.

2. Results

2.1. Characteristics of Transcriptomes and Nuclear Gene Datasets

We used sixteen transcriptomes (including eleven newly generated in this study) with a wide range of Marattiaceae, including fourteen species from five genera, and ten species from six families (Equisetaceae, Psilotaceae, Ophioglossaceae, Osmundaceae Martinov, Hymenophyllaceae Mart., and Amborellaceae Pichonas) were used as outgroups in this study (Table 1). Of the 11 newly generated transcriptomes, we finally retained a total of 41,613,264–52,013,624 reads of the clean data. After data quality filtering, the percentage of GC content ranged from 46.78% to 47.70% (Table S3).
Twenty single-copy nuclear gene datasets were generated for phylogenetic analysis with concatenation and coalescent approaches. Including all twenty-six samples generated by Orothofinder(O) and SonicParanoid (S), four datasets comprising the aligned amino acid sequences (aa: 26-O-aa and 26-S-aa) and the codon-aligned nucleotide sequences (cds: 26-O-cd and 26-S-cds) were used. Four datasets (16-O-aa, 16-O-cds, 16-S-aa, and 16-S-cds) including only sixteen samples from the Marattiaceae were used to test the earliest evolutionary lineage in Marattiaceae. Due to computational restrictions and given our main focus on potential reticulation among major clades of Marattiaceae at family level, genus level, and species level, another 12 datasets (Family-O-aa, Family-O-cds, Family-S-aa, Family-S-cds, Genus-O-aa, Genus-O-cds, Genus-S-aa, Genus-S-cds, Species-O-aa, Species-O-cds, Species-S-aa, and Species-S-cds) were generated. Statistics of the datasets are presented in Table 2. With the exception of percentage of the pairwise identity and identical sites, all the remaining statistics in datasets generated by SonicParanoid are generally 1.1–2.8 times higher than those of datasets generated by OrthoFinder.

2.2. Phylogenetic Reconstruction Using Single-Copy Nuclear Genes

Based on the 20 single-copy nuclear gene datasets, a total of 40 species trees were inferred using concatenation and coalescent approaches in IQ-tree (Figure 2b, Figure 3b,d, Figure 4d, Figure 5b, Figure 6b, Figure 7b, Figures S1b,d, S2b, S3–S5b, S6b,d and S7–S12b) and Astral-II (Figure 2a, Figure 3a,c, Figure 4d, Figure 5a, Figure 6a, Figure 7a, Figures S1a,c, S2a, S3–S5a, S6a,c and S7–S12a), respectively. In analyses of nucleotide and amino acid sequences, Marattiaceae were found to be sister to Polypodiidae (ML-BS = 100; AS-PP = 1; Figure 2a,b, Figure 4d, Figure 5a,b, Figures S1a–d, S2a,b and S3–S5a,b). The conflict analyses confirmed the monophyly of Marattiaceae and placed it as sister to Polypodiidae by a majority of gene trees, 50 (out of 139, internode certainty all (ICA) = 0.27) (Figure 2a), 201 (out of 712, ICA = 0.44) (Figure 5a), 24 (out of 61, ICA = 0.21) (Figure S1a), 33 (out of 77, ICA = 0.28) (Figure S1c), 63 (out of 154, ICA = 0.33) (Figure S2a), 298 (out of 793, ICA = 0.56) (Figure S3a), 409 (out of 1234, ICA = 0.52) (Figure S4a), 519 (out of 1329, ICA = 0.70) (Figure S5a), and having high quartet sampling (QS) support (0.45–0.77/0/0.98–1) (Figure 2b, Figure 5b, Figures S1b,d, S2b and S3–S5b).
At the genus level in Marattiaceae, in those datasets (26-O-aa/26-O-cds/26-S-aa/26-S-cds), except that the ML tree of 26-S-aa resolved Danaea as the first evolutionary lineage (Figure 2b), the remaining species trees resolved the Danaea + Ptisana clade as the first evolutionary lineage (Figure 2a,b, Figures S1a–d and S2a,b). However, the former (Danaea as the first evolutionary lineage) was with full QS support (1/−/1) (Figure 2b), the latter (the Danaea + Ptisana clade as the first evolutionary lineage a) with counter-support QS values (−0.38–0.2/0/0.94–0.99) (Figures S1b,d and S2b). To further detect the first diverging lineage in Marattiaceae, the 16 taxa datasets (16-O-aa/16-O-cds/16-S-aa/16-S-cds) using Danaea (ML-BS = 100, AS-PP = 1; Figure 3a,b and Figure S6a,b) or Danaea + Ptisana (ML-BS = 100, AS-PP = 1; Figure 3c,d and Figure S6c,d) as outgroups were analyzed, respectively. The conflict analyses of both have full QS support (1/−/1) (Figure 3b,d and Figure S6b,d) but with higher gene trees support in these trees using Danaea as the first divergent clade by 99% (567 out of 571, ICA = 1) (Figure 3a) and 98% (453 out of 460, ICA = 1) (Figure S6a) than the trees using Danaea and Ptisana 91% (785 out of 863, ICA = 1) (Figure 3c), and 88% (548 out of 626, ICA = 1) (Figure S6c). We presume that Danaea was the first evolutionary lineage in Marattiaceae (for more discussion, see “Discussion” section). In all subsequent analyses, Danaea was always rooted as the outgroup.
In our phylogenetic analyses based on the datasets of 16-O-cds, 16-S-cds, 16-O-aa, 16-S-aa, Genus-O-cds, Genus-S-cds, Genus-O-aa, and Genus-S-aa, the monophyly and phylogenetic relationships of five genera in Marattiaceae were well-resolved (ML-BS = 100; AS-PP = 0.98–1; Figure 3a,b, Figure 6a,b, Figures S6a,b and S7–S9a,b). Danaea was rooted as an outgroup. Ptisana is sister to the rest of Marattiaceae. Angiopteris s.l. is sister to Marattia, and they are together sister to Christensenia. In Angiopteris s.l., three well-supported clades were identified: Angiopteris s.s., the Archangiopteris group, and An. sparsisora (a potential hybrid; see below).
Analysis of phylogenetic conflicts confirmed a high level of gene tree accordance at genus level and monophyly of Angiopteris s.l., Danaea, Ptisana, Christensenia, and Marattia with full QS support values (1/−/1) (Figure 2a,b, Figure 3a–d, Figures S1a–d, S2, S3a,b and S5a,b). Danaea is sister to other genus in Marattiaceae by 567 (out of 571; ICA = 1) in 16-O-cds datasets (Figure 3a), 823 (out of 910; ICA = 1) in Genus-O-cds datasets (Figure 6a), 453 (out of 460; ICA = 1) in 16-O-cds datasets (Figure S6a), 630 (out of 699; ICA = 1) in Genus-O-aa datasets (Figure S7a), 989 (out of 1101; ICA = 1) in Genus-S-aa datasets (Figure S8a), and 1254 gene trees (out of 1387; ICA = 1) in Genus-S-cds datasets (Figure S9a), respectively. Ptisana as the sister to the clade Christensenia + Marattia + Angiopteris s.l. was highly supported by 541 (out of 571, ICA = 1) (Figure 3a), 810 (out of 910; ICA = 1) (Figure 6a), 431 (out of 460; ICA = 1) (Figure S6a), 618 (out of 699; ICA = 1) (Figure S7a), 965 (out of 1101; ICA = 1) (Figure S8a), and 1232 gene trees (out of 1387; ICA = 1) (Figure S9a), respectively. The sister relationship between Christensenia and the clade Marattia + Angiopteris s.l. was supported by the majority of gene trees, 495 (out of 571; ICA = 0.89) (Figure 3a), 784 (out of 910; ICA = 0.87) (Figure 6a), 341 (out of 460; ICA = 0.84) (Figure S6a), 530 (out of 699; ICA = 0.82) (Figure S7a), 836 (out of 1101; ICA = 0.84) (Figure S8a), and 1206 (out of 1387; ICA = 0.89) (Figure S9a), respectively. Marattia was sister to Angiopteris s.l., supported by 536 (out of 571, ICA = 0.95) (Figure 3a), 855 (out of 912, ICA = 0.94) (Figure 6a), 412 (out of 460, ICA = 0.87) (Figure S6a), 609 (out of 699, ICA = 0.89) (Figure S7a), 948 (out of 1101, ICA = 0.90) (Figure S8a), and 1305 gene trees (out of 1387, ICA = 0.95) (Figure S9a).
In Angiopteris s.l., interspecific relationships of Angiopteris s.s. and the Archangiopteris group were resolved with medium to high support (ML-BS = 60–100; AS-PP = 0.49–1; QS = −0.54–1/0–0.75/0.96–1; ICA = −0.15–0.42) (Figure 3a,b, Figure 7a,b, Figures S6a,b and S10–S12a,b). Angiopteris s.s. (except An. sparsisora) and the Archangiopteris group were recovered as monophyletic in all concatenation and coalescent species trees, with support by only 87/137 gene trees (out of 460/571; ICA = 0.16–1) (Figure 3a,b, Figure 7a,b, Figures S6a,b and S10–S12a,b) but with strong support (ML-BS = 100; AS-PP = 0.98–1) and full QS support (1/−/1) (Figure 3a,b, Figure 7a,b, Figures S6a,b and S10–S12a,b).

2.3. Phylogenetic Reconstruction of Organelle Genes

Using the customized pipeline, we recovered 87 and 32 protein-coding genes from the CT and MT datasets, respectively. Finally, alignments of CT dataset with 78,897 bp in length, MT dataset with 36,568 bp in length, and the combined matrix (CM) with 115,465 bp in length were used to infer the phylogeny. Detailed information on the assembled chloroplast genes and mitochondria genes are provided in Figshare: https://doi.org/10.6084/m9.figshare.21707546.
At the family level, phylogenetic analysis of the CT and CM datasets recovered Marattiaceae sister to Polypodiidae with strong support (ML-BS = 98; Figure 4a,c, Figures S13 and S15), but the MT dataset resolved Marattiaceae as sister to the clade Equisetaceae + Psilotaceae with weak support (ML-BS = 41; Figure 4b and Figure S14). At the genus level, the plastid genes result corroborates the nuclear phylogeny (Figure S13). The phylogeny based on mitochondrion genes (MT dataset) inferred that five genera are non-monophyletic but with extremely weak support (Figure S14). The CM dataset infers that the clade Danaea + Ptisana is the first diverged clade with strong support (ML-BS = 90). Marattia is nested with Angiopteris s.l., and they are together sister to Christensenia (Figure S15).
To avoid the impact of missing data of chloroplast genes, we only retain Marattia sp. as the outgroup to infer phylogeny of Angiopteris s.l. with chloroplast genes. Three medium-supported clades (Angiopteris s.s. (ML-BS = 97), Angiopteris sparsisora (a putative hybrid species) clade, and the Archangiopteris group (ML-BS = 51)) were identified (Figure S16). Angiopteris sparsisora was sister to Angiopteris s.s. based on chloroplast phylogenomics instead of sister to the Archangiopteris group in nuclear genes.

2.4. Coalescent Simulations

To investigate whether ILS alone could explain the cytonuclear and nuclear gene-tree discordance, coalescent simulation analysis with datasets of Family-O-aa/Family-O-cds/Family-S-aa/Family-S-cds/Genus-O-aa/Genus-O-cds/Genus-S-aa/Genus-S-cds/Species-O-aa/Species-O-cds/Species-S-aa/Species-S-cds were executed. The datasets of Family-O-aa/Family-O-cds/Family-S-aa/Family-S-cds mainly focused on the ILS at the family level in Marattiaceae, the datasets of/Genus-O-aa/Genus-O-cds/Genus-S-aa/Genus-S-cds mainly focused on the ILS at the genus level in Marattiaceae, and the datasets of Species-O-aa/Species-O-cds/Species-S-aa/Species-S-cds mainly focused on the ILS in Angiopteris s.l.
At the genus level and family level, distribution of tree-to-tree distances between simulated gene trees and the observed gene trees largely overlapped (Figure 5, Figure 6c,d, Figures S3–S5c,d and S7–S9c,d) using the Family-O-aa/Family-O-cds/Family-S-aa/Family-S-cds/Genus-O-aa/Genus-O-cds/Genus-S-aa/Genus-S-cds datasets. However, using the same ILS simulation workflows within the remaining datasets at species level, ILS tests for most of the discordance in Angiopteris s.l. were accounted for. All the analyses distinguishing between simulated and observed trees distance distributions show that ILS cannot account for the discordance in Angiopteris s.l. (Figure 7c,d and Figures S10–S12c,d).

2.5. Network Analysis

The datasets used in ILS simulations were used to perform phylogenetic network analysis in PhyloNet while accounting for both hybridization and ILS. At the family level (Figure 5e and Figures S3–S5e), the inferred Network4 has the highest log pseudo-likelihood (Table 3). Gene flow was detected in all outgroups (Figure 5e and Figures S3–S5e), indicating that hybridization could be a possible cause of discordance. Up to five hybridization events at the generic level in Marattiaceae were also examined; the inferred network with two reticulation events has the highest log pseudo-likelihood (Table 3). One reticulation event from ancestral Angiopteris fokiensis and An. Subrotundata lineage to An. Sparsisora was inferred in all five examinations (Figure 6e), with an inheritance probability of 0.413–0.582 from the An. fokiensis lineage and 0.418–0.587 from the An. subrotundata lineage, and it was detected in all five examinations in the Genus-S-cds dataset (Figure S9e), network1-network3 in the Genus-O-aa dataset (Figure S7e), and network2-network5 in the Genus-S-aa dataset (Figure S8e). Three (Figure S7e: Network 3, Figure S8e: Network 4), six (Figure 6e: Network 2-Network3, Figure S9e: Network 2−Network 5), and three (Figure 6e: Network 4−Network5; Figure S8e: Network1) inferred hybridization events between Christensenia, Danaea, and Ptisana were also detected. For the reticulation event of Christensenia, Danaea, and Ptisana, one of the gene flow events was basically from ancestral in Marattiaceae (except Danaea) lineage to the ancestors of each other (Figure 6e: Network 2−Network5; Figure S7e: Network3; Figure S8e: Network1 and Network4; Figure S9e: Network2−Network5).
In Angiopteris s.l., up to five hybridization events among Angiopteris s.s. were allowed (Figure 7e and Figures S10–S12e). Angiopteris sparsisora was inferred to have a genetic contribution from the lineage of Angiopteris s.s. and the Archangiopteris group in all datasets, and its reticulation event was detected in all 20 examinations. Angiopteris sparsisora has an inheritance probability of 0.371–0.506 from the Angiopteris s.s. lineage and 0.494–0.629 from the Archangiopteris group lineage (Figure 7e and Figures S10–S12e). Although the inferred network with the highest log pseudo-likelihood included one reticulation event (Table 3, except Species-S-cds dataset with two reticulations: −267.86003727524945), the genera-level hybridization events that included more reticulations, which were mainly caused by hybridization events among Angiopteris, were calculated (Figure 6e and Figures S7–S9e).

2.6. Divergence Dating Estimation

Our analyses infer a stem age of Marattiaceae dating to the late Carboniferous–early Permian (ca. 319 Ma; Figure 8). The split between Danaea and the rest of extant Marattiaceae occurred in ca. 201 Ma, corresponding to the late Triassic. The stem age of Christensenia was estimated to be ca. 165 Ma (middle Jurassic), and the crown age was ca. 43 Ma (Figure 8). The stem age of Marattia was estimated to be ca. 46 Ma (middle Paleogene; Figure 8). In Angiopteris s.l., the Archangiopteris group diverged from Angiopteris s.s. at ca. 37 Ma during middle Cenozoic (Figure 8).

3. Discussion

3.1. New Insights into Phylogeny of Marattiaceae

In previous phylogenetic reconstruction for Marattiaceae, less than four genera and four species were used for phylogenetic analysis based on nuclear genes [28,34,37,38,39,40]. In this work, using a broad taxon sample of newly generated transcriptomes, the phylogenetic relationships of Marattiales were explored through analysis of uniparentally inherited organelle genes and single-copy biparental nuclear genes.
In this study, two alternative relationships of Marattiaceae with other fern lineages were inferred: sister to Polypodiidae or sister to the clade Equisetaceae + Psilotaceae. All the phylogenies inferred from datasets of nuclear, plastid, or concatenated plastid and mitochondrial genes supported the sister relationship between Marattiaceae and Polypodiidae (Figure 4a,c,d), which is consistent with previous results based on plastid markers [19,21,22,23,24,25,26,30,31,32,33,35], concatenated plastid and mitochondrial genes [27], and nuclear genes [28,34]. However, our mitochondrial dataset resolved Marattiaceae as sister to the clade Equisetaceae + Psilotaceae with weak support (Figure 4b). As Murdock [10] and Lehtonen et al. [19] have noted, inferences of the first diverging lineage in extant Marattiaceae are sensitive to datasets choice, LBA, outgroup choice, and use of optimality criteria.
With extensive outgroups, two alternatives for the first divergent lineage, Danaea (Figure 2b) or the clade Danaea + Ptisana (Figure 2a, Figures S1a–d and S2a,b), were found. The latter, however, has a higher proportion of gene tree conflicts and low ICA and QS values (Figure 2a,b, Figures S1a–d and S2a,b). To further test the first lineage of Marattiaceae, conflict analysis results of different root settings were compared using single-copy nuclear genes specific to Marattiaceae, Danaea as the first evolutionary lineage of Marattiaceae with higher gene proportion (Figure 3a–d and Figure S6a–d). In addition, phylogenetic reconstruction based on chloroplast genes also shows the root of Marattiaceae separates Danaea from the rest of the genus (Figure S13). In all, our phylogenomic results are consistent with Murdock’s [10] study based on chloroplast DNA data but with higher support for each clade (Figure 3a,b, Figure 6a,b, Figures S6a,b and S7–S9a,b). Although Eupodium was not included in our phylogenetic analysis, it has been resolved as the sister to Ptisana in all previous studies (e.g., [1,10,19,20,48]). Here, TreeShrink was further selected to reduce the influence of long branch attraction and improved the accuracy of phylogeny in all datasets, which could identify and remove those sequences that lead to unrealistically long branch lengths within each cluster [70]. Nearly all previous phylogenetic studies have resolved Danaea as the first divergent clade in Marattiaceae [1,10,19,20,23,48,56,63]. In this study, we also recovered Danaea as a sister to the rest of the family in phylogenetic analyses of the multi-locus plastid genes; all the conflict analyses based on different single-copy nuclear gene datasets and the all analyses were concordant (ICA = 1; QS = 1/−/1) (Figure 3a,b, Figure 6a,b, Figures S6a,b, S7–S9a,b and S13). Ptisana was sister to the clade comprised by three genera (Angiopteris s.l., Christensenia, and Marattia) in Marattiaceae. In some previous studies, phylogenetic placement of Danaea and Christensenia was tentative and they both showed long branches and possibly demonstrated convergent evolution in protein coding genes [1,10,63]. In our phylogenetic analysis, Christensenia was resolved as sister to the clade Marattia + Angiopteris s.l. with full support (ML-BS = 100; AS-PP = 1; QS = 1/−/1; ICA = 0.82–0.89), which was different from Lehtonen et al. [19]’s study that either recovered Christensenia as sister to Marattia (in plastid genome dataset) or as the second divergent clade sister to remaining genera except Danaea (in plastid genome and 77 phenotypic datasets). However, our inferred phylogeny based on multi-locus chloroplast genes also supports the same major clade position of Marattiaceae based on single-copy nuclear genes. In this case, two alternative explanations can be assumed: the phylogenetic reconstruction may be confounded by chloroplast RNA editing, and maternally inherited plastome representing only a single coalescent history may not reflect the species phylogenetic relationship [71,72,73,74,75]. The whole plastome is not “the holy grail” [76]; the effectiveness of plastome-scale data is ultimately reflected by the extent to which they reveal the “true” phylogenetic relationships of a given lineage [77]. Incomplete lineage sorting (ILS), introgressive hybridization, and horizontal gene transfer can also result in such high confliction [78]. Based on the phylogenetic analysis of transcriptome data and previous phylogenetic results, we determined that the most likely phylogenetic relationship of six genera in Marattiaceae is (Danaea, ((Ptisana + Eupodium), (Christensenia, (Marattia + Angiopteris s.l.)))).

3.2. Resolution of Angiopteris s.l.

In this study, a total of 10 samples representing 10 species of Angiopteris s.l. were included. Three well-supported clades (Angiopteris s.s., Angiopteris sparsisora, and the Archangiopteris group) were identified based on single-copy nuclear genes datasets and chloroplast genes (Figure 3a–d, Figure 7a,b, Figures S6a–d, S10–S12a,b and S16). Because of the similar morphology and poor phylogenetic resolution, Angiopteris s.s., Archangiopteris, Macroglossum, Protomarattia Hayata, and Protangiopteris Hayata were usually combined into Angiopteris s.l. [1,2,10,42,55,79]. In previous studies, Angiopteris s.s. often was not resolved as a monophyletic group [1,10,23,56,63]. Macroglossum has only one or two species with limited geographical distribution in Borneo and Sumatra [80]. Archangiopteris seems to be a monophyletic group [1] with about 10 species distributed in China and Vietnam [51,52,53]. We sampled four species and our phylogeny confirmed that the Archangiopteris group was a well-supported monophyletic clade in both nuclear gene datasets and chloroplast genes (Figure 3a–d, Figure 7a,b, Figures S6a–d, S10–S12a,b and S16). Angiopteris s.s. is paraphyletic, including two clades: the hybrid An. sparsisora and Angiopteris s.s., in all the nuclear gene datasets. Angiopteris sparsisora has been recognized as a putative hybrid between Angiopteris s.s. and the Archangiopteris group [42,57]. Ching and Wang [57] thought that An. sparsisora is more similar to Angiopteris s.s. morphologically and placed it in Angiopteris, a position that is also supported by our chloroplast genes (Figure S16). However, our phylogenetic analysis showed that An. sparsisora has a closer relationship with the Archangiopteris group instead of Angiopteris s.s. in all nuclear gene datasets. Morphologically, the Archangiopteris group is different from Angiopteris s.s. in having creeping rhizomes and often once-pinnate blades, the stipes with one or several (1–7) naked pulvini (presenting in whole growth phases of plants) [81].
Archangiopteris (=the Archangiopteris group) was established by Christ and Giesenhagen [62] based on Ar. henryi (=An. latipinna). The prefix “arch-” of Archangiopteris means primitive, representing Archangiopteris with more primitive morphological characters (or earlier differentiation) than Angiopteris s.s., which was also demonstrated in a chloroplast genes dataset with one outgroup (Figure S16). Such ideas were supported by some previous morphological studies and phylogenetic analysis [23,47,51,56,63]. However, our phylogenetic analysis recovered that Angiopteris s.s. was diverged earlier than the Archangiopteris group with full support value in all nuclear gene datasets (Figure 3a–d, Figure 7a,b, Figure 8, Figures S6a–d and S10–S12a,b). An estimated divergence time based on nuclear genes showed that the Archangiopteris group evolved ca. 27 Ma, nearly 10 Ma later than Angiopteris. It was also noted by some authors [51,64] who thought that the Archangiopteris group probably was derived from Angiopteris s.s. based on the taxonomy, geographical distribution, morphological, and anatomical features. In order to understand the phylogenetic relationship of Angiopteris s.l. well, potentially, some taxonomic ranks (nothosubgenus, subgenus, or section) in Angiopteris s.l. could be recognized and proposed, and hybridization events in Marattiaceae need to be further investigated in the future.

3.3. Phylogenetic Incongruence as Further Evidence for ILS and Hybridization

With different materials and methods, a large number of previous phylogenetic studies have yielded many incongruent topologies in Marattiaceae (Figure 1) [1,10,19,20,48,56,63]. In addition, our analyses showed that there was a very high level of heterogeneity among single-copy nuclear gene trees, and cytonuclear discordance with different phylogenetic relationships of Equisetaceae, Marattiaceae, Ophioglossiodeae, Polypodiidae, and Psilotaceae. While these inconsistencies have never been well explained, it is often believed that phylogenetic incongruence in plants is mainly due to hybridization and ILS [82,83,84,85,86].
Here, the phylogenetic relationship of Marattiaceae was inferred using coalescent methods, which also allow us to assess the ancient hybridization, introgression, and incomplete lineage sorting (ILS) [87]. The confliction within gene trees and among concatenation and coalescent trees at all topologies was detected in this study (Figure 2a,b, Figure 3a–d, Figure 5a,b, Figure 6a,b, Figure 7a,b, Figures S1a–d, S2a,b, S3–S5a,b, S6a–d and S7–S12a,b). At family level and genus level, nearly overlapping RF distances were detected between the empirical trees and the coalescent simulations, indicating that ILS alone could explain most of the gene tree discordance (Figure 5, Figure 6c,d, Figures S3–S5c,d and S7–S9c,d). Meanwhile, ILS is likely to apply regarding mainly reasons for family-level and genus-level conflict that diverged during rapid speciation events and/or had large population sizes [88]. We also detected inter-family and inter-genus geneflow among almost all lineages (Figure 3e, Figures S2, S3 and S5e). High hybridization frequency had been reported in ferns [89,90,91,92,93], and it is generally believed that hybridization and allopolyploidization are the main driving forces in the evolution of ferns [94]. Topology conflict in Marattiaceae at family level and genus level was potentially caused by ILS and hybridization together.
In Angiopteris s.l., our simulation analysis showed a large difference in both frequency and tree-to-tree distance between the coalescent simulations and the empirical trees (Figure 7c,d and Figures S10–S12c,d), and the ILS, therefore, cannot explain the unresolved phylogenetic relationship between Angiopteris s.s. and the Archangiopteris group. Potentially, gene flow is frequent between Angiopteris s.s. and the Archangiopteris group, leading to topological conflict. Hybridization analysis further suggested that An. sparsisora was of hybrid origin and was detected in all 20 examinations, with an inheritance probability of 0.371–0.506 from the Angiopteris s.s. lineage and 0.494–0.629 from the Archangiopteris group lineage (Figure 7e and Figures S10–S12e). Using the command CalGTProb, the greatest likelihood of the network was computed. It indicated that only one hybridization event was selected, which conflicts with the previous hypothesis of intergeneric hybridization event times (Table 3). Angiopteris sparsisora is endemic to a valley under evergreen broad-leaved forests in Xichou County, Fadou Town, which is located in southeastern Yunnan Province, China. Ching and Wang [57] have assumed that this distinct species is a natural hybrid between Angiopteris s.s. and the Archangiopteris group, with a number of intermediate morphological characteristics [42,57]. Ching and Wang [57] placed An. sparsisora in Angiopteris s.s. rather than in the Archangiopteris group because of the fact that, in overall impression, the new taxon appears more similar to the former than the latter. China and northern Vietnam was thought to be the original center, with high diversity of the Archangiopteris group. In this area, a large number of species of Angiopteris s.s. and the Archangiopteris group occur, growing side by side in great abundance. Another presumptive hybrid species, An. bipinnata (=Ar. bipinnata), was sampled in our study. We confirmed that An. bipinnata is a member of the Archangiopteris group. However, our analysis did not detect obvious gene flow from other lineages (Figure 7e and Figures S10–S12e). In Angiopteris s.s., some other hybrid species have also been reported. (e.g., An. sugongii, An. itoi (=Ar. itoi)), and it is necessary to investigate them with more materials and molecular data.

4. Materials and Methods

4.1. Taxon Sampling and RNA Sequencing

In this study, 26 samples representing 24 species (Table 1) were sampled, including almost all major lineages recognized in Marattiaceae plus outgroups. Eleven samples were newly generated in this study and 15 samples were obtained from GenBank SRA database (Table 1). In total, except Eupodium (a neotropical genus with three or four species [19,95]), all recognized genera of Marattiaceae in PPG I [2] were sampled. New transcriptome data were sequenced on an Illumina HiSeq2000 platform. Total RNA extraction, library preparation, raw data cleaning, and quality control were performed in 2018 at Majorbio, Shanghai, China.

4.2. Assembly and Single-Copy Orthologue Identification

Illumina adapter and poor-quality bases were removed from the raw reads of the transcriptomes using Fastp v0.12.4 [96] with default parameters. The cleaned data were used for de novo assembly using Trinity v2.8.5 [97] and only the longest transcripts were selected for subsequent analysis using Trinity’s own perl script to remove over-represented sequences. Minimap2 v2.17-r941 [98] and Samtools v1.11 [99] were chosen to filter the chloroplast and mitochondrial transcripts using the closest publicly available reference chloroplast genome and mitochondrion genome (Table S1) as references. CD-HIT-EST v4.8.1 [100] was used to further remove redundant contigs with a threshold of 0.99. TransDecoder v5.5.0 [101] was used to identify candidate coding regions. To cluster transcripts into orthologous genes, the OrthoFinder v2.3.8 [102,103] and SonicParanoid v1.3.8 [104] software were used to infer core-orthogroups based on all-against-all searches with default parameters and only single-copy orthologues present in all samples were selected for subsequent analyses. This resulted in 92 single-copy nuclear genes that shared with all 26 samples in total by using OrthoFinder and 187 single-copy nuclear genes that shared with all 26 samples in total by using SonicParanoid, respectively.

4.3. Nuclear Genes Dataset Generation and Phylogenetic Inference

Considering the later hybridization analysis with computational restrictions and by using different datasets to reconstruct the relationship of Marattiaceae, we first constructed 20 single-copy nuclear gene datasets composed of different species (Table S2) and discussed the phylogenetic relationship at the family level, genus level, species level, and the root of Marattiaceae. Each of the acquired amino acid sequences was aligned first then trimmed by using plugins Mafft v7.450 (set E-INS-i) [105,106] and trimAI v1.3 (set -automated1) [107] in PhyloSuite v1.2.2 [108], respectively. Given that recombination within loci might bias the inference of the species tree [109], we further used PhiPack v1.1 [110] to test the pairwise homoplasy index Φ for recombination with default sliding window size of 100 bp, and alignments that showed a strong signal of recombination with p ≤ 0.05 were removed from all subsequent phylogenetic analyses. Subsequently, for the concatenation approach in which all genes in a concatenated matrix are considered a single locus to infer phylogeny, sequences of aligned genes in each dataset were concatenated using the PhyloSuite plugins “Concatenate Sequence”. The concatenation ML tree inferred in IQ-tree v2.1.2 [111] with 10,000 ultrafast bootstraps [112] and performed model selection using the bias-corrected Akaike information criterion (AICc) in ModelFinder [113]. In addition, a coalescent approach was applied to estimate the species tree. Each gene tree was generated via IQ-tree with 1000 ultrafast bootstraps and the coalescent tree was inferred by Astral-II v5.7.8 [114]. In addition, TreeShrink v 1.3.7 [89] was selected to reduce the long branch attraction (LBA) influence. The retained sequences were used to construct the concatenation and coalescent trees with the same method mentioned above. Finally, the corresponding codon alignments were converted from amino acid sequences using PAL2NAL v14.0 [115] to generate datasets, remove recombination loci, reduce LBA, and for phylogeny reconstruction.

4.4. Organelle Genome Assembly, Locus Extraction, and Phylogenetic Analysis

Chloroplast and mitochondrion genes were obtained from assembly transcripts mapped to the closest available reference chloroplast and mitochondrion genome (Table S1) in Geneious Prime [116] with the parameter settings in the “Geneious RNA” program as highest sensitivity. Annotation was subsequently conducted in Geneious [set 90% similarity]. Then, we filled in the gaps with ‘N’ for all of the protein-coding genes and extracted. Each of the acquired protein-coding genes was aligned first then trimmed by using plugins Mafft (set E-INS-i) and trimAI (set -automated1) in PhyloSuite, respectively. For the concatenated matrix, we constructed chloroplast genes dataset (CT), mitochondrial genes dataset (MT), and chloroplast genes and mitochondrial genes combined dataset (CM) and then used the 5000 ultrafast bootstrap replicates for branch support in IQ-tree.

4.5. Concordance, ILS Simulation, and Hybridization Inference

We used two approaches to explore discordance among nuclear gene trees. First, we mapped the individual gene trees onto the coalescent tree and calculated the internode certainty all (ICA [117]) value to quantify the degree of conflict on each node and the number of conflicting and concordant bipartition on each node of the coalescent trees using the gene trees with maximum likelihood bootstrap supports (MLBS) values at least 50% for the corresponding node. Both ICA and conflicting/concordant bipartitions were calculated with Phyparts [118]. Then, we used quartet sampling (QS [119]) to distinguish strong conflict from weakly supported branches using the concatenation tree with 1000 replicates. For the ILS analysis, we used Dendropy v4.5.2 [120] to simulate gene trees using the species tree as a guide tree and generated a total of 20,000 gene trees. Then, we calculated the Robinson-Foulds (RF) distance between the species tree and each simulated or observed gene tree by using the workflow of Wang et al. [82]. To test the hypothesis of hybridization, we used PhyloNet v3.8.0 [121] to infer an evolutionary network with the command “InferNetworks_ML” [122] from individual gene trees. Due to computational restrictions, a maximum of five reticulation events was set and run with 50 runs to ensure accuracy. To estimate the optimal number of hybridizations, we computed the likelihood scores of the given individual gene trees using the command “CalGTProb” [123]. The networks were visualized in Dendroscope v3.8.1 [124].

4.6. Age Estimation

Divergence time estimations were performed based on the concatenated 16-O-cds dataset using penalized likelihood in treePL v2.6.3 [125]. We were mainly concerned with the chronological order of divergence of the two lineages of Angiopteris s.s. and the Archangiopteris group. We used two calibration points based on the estimation of Lehtonen et al. [19], who applied a parsimony-based total-evidence dating approach, which suggested a Triassic age for the extant crown group, which is consistent with the fossil records that have been found so far [12,49,126,127]: (1) an age estimate ranging from 307 Ma to 320 Ma for the Marattiales, and (2) the split between Danaea and the most recent common ancestor of the rest extant species of Marattiaceae was estimated in the Late Triassic (201–236 Ma). Following the guidelines of Maurin [128], cross-validation analysis was conducted with rate-smoothing values from 1010 to 10–30 and a “cvmultstep” of 0.1 and resulted in an optimal smoothing parameter of 0.00001. Lastly, used TreeAnnotater v2.4.7 to summarize all dated 1000 bootstrap replicates into a maximum clade credibility (MCC) tree with 0% of burn-in and mean node heights. The MCC tree was displayed in FigTree v1.4.3 [129].

5. Conclusions

Using different phylogenetic analyses, the phylogeny of Marattiaceae was reconstructed with a dataset from 26 transcriptomes. In general, our results are consistent with previous phylogenetic results [1,10]. The most likely phylogenetic relationship of six genera in Marattiaceae is (Danaea, ((Ptisana + Eupodium), (Christensenia, (Marattia + Angiopteris s.l.)))). Moreover, some new insights have been revealed for hybridization in Angiopteris s.l. The Archangiopteris group (=Archangiopteris) forms a well-supported clade that is nested within Angiopteris s.s. The putative hybrid species Angiopteris sparsisora between Angiopteris s.l. and the Archangiopteris group was confirmed by the network analysis and maternal chloroplast genes. ILS and gene flow could be the main reason for the puzzled systematists at family level, genus level, and species level in Marattiaceae.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants12122237/s1. Table S1: The references of complete organelle genome used in this study. Table S2: Sample composition of different datasets. Table S3: Information of transcriptomes included in this study. Figure S1: Phylogenomic analysis of concatenation-based trees from the 26-O-aa and 26-O-cds datasets in Marattiaceae (a,b. 26-O-aa datasets, c,d. 26-O-cds datasets); a,c. astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) values are shown above branches, and numbers of gene trees (concordant/conflicting) were shown below branches. Pie charts present the proportion of congruent gene trees that supports that clade (blue), the proportion of discordant gene trees of the main alternative topology for that clade (green), the proportion of discordant trees for the remaining alternative topologies (red), the proportion of uninformative gene trees (dark gray; bootstrap support <50%), and the proportion of missing data (light gray); b,d. maximum likelihood bootstrap support (ML-BS) is shown above the branches, and quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches. Figure S2: Phylogenetic analysis of species trees based on 26-S-cds in Marattiaceae. a. Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) values are shown above branches, and numbers of gene trees (concordant/conflicting) were shown below branches. Pie charts present the proportion of congruent gene trees that supports that clade (blue), the proportion of discordant gene trees of the main alternative topology for that clade (green), the proportion of discordant trees for the remaining alternative topologies (red), the proportion of uninformative gene trees (dark gray; bootstrap support <50%), and the proportion of missing data (light gray). b. Maximum likelihood bootstrap support (ML-BS) is shown above the branches, and quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches. Figure S3: a,b. Phylogenetic analysis of species tree with Family-O-cds dataset in Marattiaceae (a. Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; b. maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); c. distributions of topology frequencies of observed and simulated gene trees based on Family-O-cds dataset; d. distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; e. species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Family-O-cds dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node. Figure S4: a,b. Phylogenetic analysis of species tree with Family-S-aa dataset in Marattiaceae (a. Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; b. maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); c. distributions of topology frequencies of observed and simulated gene trees based on Family-S-aa dataset; d. distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; e. species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Family-S-aa dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node. Figure S5: a,b. Phylogenetic analysis of species tree with Family-S-cds dataset in Marattiaceae (a. Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; b. maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); c. distributions of topology frequencies of observed and simulated gene trees based on Family-S-cds dataset; d. distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; e. species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Family-S-cds dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node. Figure S6: Phylogenomic analysis of species trees based on 16-O-aa and 16-S-aa datasets in Marattiaceae (a,b. 16-O-aa datasets; c,d. 16-S-aa datasets). a,c. Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) values are shown above branches, and numbers of gene trees (concordant/conflicting) were shown below branches. Pie charts present the proportion of congruent gene trees that supports that clade (blue), the proportion of discordant gene trees of the main alternative topology for that clade (green), the proportion of discordant trees for the remaining alternative topologies (red), the proportion of uninformative gene trees (dark gray; bootstrap support <50%), and the proportion of missing data (light gray); b,d. maximum likelihood bootstrap support (ML-BS) is shown above the branches, and quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches. Figure S7: a,b. Phylogenetic analysis of species tree with Genus-O-aa dataset in Marattiaceae (a. Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; b. maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); c. distributions of topology frequencies of observed and simulated gene trees based on Genus-O-aa dataset; d. distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; e. species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Genus-O-aa dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node. Figure S8: a,b. Phylogenetic analysis of species tree with Genus-S-aa dataset in Marattiaceae (a. Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; b. maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); c. distributions of topology frequencies of observed and simulated gene trees based on Genus-S-aa dataset; d. distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; e. species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Genus-S-aa dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node. Figure S9: a,b. Phylogenetic analysis of species tree with Genus-S-cds dataset in Marattiaceae (a. Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; b. maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); c. distributions of topology frequencies of observed and simulated gene trees based on Genus-S-cds dataset; d. distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; e. species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Genus-S-cds dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node. Figure S10: a,b. Phylogenetic analysis of species tree with Species-O-cds dataset in Marattiaceae (a. Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; b. maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); c. distributions of topology frequencies of observed and simulated gene trees based on Species-O-cds dataset; d. distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; e. species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Species-O-cds dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node. Figure S11: a,b. Phylogenetic analysis of species tree with Species-S-aa dataset in Marattiaceae (a. Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; b. maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); c. distributions of topology frequencies of observed and simulated gene trees based on Species-S-aa dataset; d. distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; e. species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Species-S-aa dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node. Figure S12: a,b. Phylogenetic analysis of species tree with Species-S-cds dataset in Marattiaceae (a. Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; b. maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); c. distributions of topology frequencies of observed and simulated gene trees based on Species-S-cds dataset; d. distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; e. species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Species-S-cds dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node. Figure S13: Maximum likelihood phylogram derived from CT dataset. Bootstrap support values (ML-BS) are shown above the branches. Figure S14: Maximum likelihood phylogram derived from MT dataset. Bootstrap support values (ML-BS) are shown above the branches. Figure S15: Maximum likelihood phylogram derived from CM dataset. Bootstrap support values (ML-BS) are shown above the branches. Figure S16: Relationship of Angiopteris s.s. with one outgroup based on chloroplast genes. Bootstrap support values (ML-BS) are shown above the branches.

Author Contributions

Design of the research: H.Y. and Z.H.; performance of the research: J.Z. and X.Z.; data analysis: J.Z., X.Z., S.F. and Z.Z.; collection: Z.H., H.Y., Y.L., S.F., J.Z. and X.Z.; interpretation: J.Z., X.Z., Z.H. and H.Y.; writing the manuscript: J.Z. and X.Z.; funding acquisition: Z.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by National Natural Science Foundation of China (Grant Nos. 30260011, 31260078, and 31560096).

Institutional Review Board Statement

The study did not require ethical approval.

Informed Consent Statement

Not applicable.

Data Availability Statement

The multiple sequence alignments, concatenated alignments, and phylogenetic trees for this study are publicly available at Figshare repository: https://doi.org/10.6084/m9.figshare.21707546.

Acknowledgments

We thank Dong-Rui Jia, Jia-Liang Li, Huan Xie, Qian Xiong, Qiong Zhang, and Cheng Zhang for their help and suggestions on data analysis. We also thank Wei-Ming Chu for identification assistance. We are very grateful to Jing Liu and Dong Wang for their careful help and advice to improve the manuscript and the help of Wei-Ming Chu to identify some materials used in this study.

Conflicts of Interest

The authors declare that they have no known competing financial interest or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Murdock, A.G. A taxonomic revision of the eusporangiate fern family Marattiaceae, with description of a new genus Ptisana. Taxon 2008, 57, 737–755. [Google Scholar] [CrossRef]
  2. Ppg, I. A community-derived classifification for extant lycophytes and ferns. J. Syst. Evol. 2016, 54, 563–603. [Google Scholar] [CrossRef]
  3. Mamay, S.H. Some American Carboniferous fern fructififications. Ann. Mo. Bot. Gard. 1950, 37, 409–476. [Google Scholar] [CrossRef]
  4. Millay, M.A.; Taylor, T.N. An unusual Botryopterid sporangial aggregation from the middle Pennsylvanian of north America. Am. J. Bot. 1980, 67, 758–773. [Google Scholar] [CrossRef]
  5. Stewart, W.N.; Rothwell, G.W. Paleobotany and the Evolution of Plants, 2nd ed.; Cambridge University Press: Cambridge, UK, 1993; pp. 1–405. [Google Scholar]
  6. Millay, M.A. A review of permineralized Euramerican Carboniferous tree ferns. Rev. Palaeobot. Palynol. 1997, 95, 191–209. [Google Scholar] [CrossRef]
  7. DiMichele, W.L.; Phillips, T.L. The ecology of Paleozoic ferns. Rev. Palaeobot. Palynol. 2002, 119, 143–159. [Google Scholar] [CrossRef]
  8. Taylor, T.N.; Taylor, E.L.; Krings, M. Paleobotany, the Biology and Evolution of Fossil Plants, 2nd ed.; Academic Press: Burlington, MA, USA, 2009. [Google Scholar]
  9. Rothwell, G.W.; Millay, M.A.; Stockey, R.A. Escapia gen. nov.: Morphological Evolution, Paleogeographic Diversifification, and the Environmental Distribution of Marattialean Ferns through Time. In Transformative Paleobotany: Papers to Commemorate the Life and Legacy of Thomas N. Taylor; Krings, M., Harper, C.J., Cúneo, N.R., Rothwell, G.W., Eds.; Elsevier: New York, NY, USA, 2018; pp. 271–360. [Google Scholar]
  10. Murdock, A.G. Phylogeny of marattioid ferns (Marattiaceae): Inferring a root in the absence of a closely related outgroup. Am. J. Bot. 2008, 95, 626–641. [Google Scholar] [CrossRef]
  11. Hill, C.R.; Wagner, R.H.; El-Khayal, A.A. Qasimia gen. nov., an early Marattia-like fern from the Permian of Saudi Arabia. Scripta. Geol. 1985, 79, 1–50. [Google Scholar]
  12. Hill, C.R. Jurassic Angiopteris (Marattiales) from north Yorkshire. Rev. Palaeobot. Palynol. 1987, 51, 65–93. [Google Scholar] [CrossRef]
  13. Lesnikowska, A.; Galtier, J. A reconsideration of four genera of permineralized Marattiales from the Stephanian and Autunian of France. Rev. Palaeobot. Palynol. 1991, 67, 141–152. [Google Scholar] [CrossRef]
  14. Delevoryas, T.; Taylor, T.N.; Taylor, E.L. A marattialean fern from the Triassic of Antarctica. Rev. Palaeobot. Palynol. 1992, 74, 101–107. [Google Scholar] [CrossRef]
  15. Liu, Z.-H.; Li, C.-S.; Hilton, J. Zhutheca Liu, Li et Hilton gen. nov., the fertile pinnules of Fascipteris densata Gu et Zhi and their significance in marattialean evolution. Rev. Palaeobot. Palynol. 2000, 109, 149–160. [Google Scholar] [CrossRef] [PubMed]
  16. Liu, Z.-H.; Li, C.-S.; Hilton, J. Fertile pinnules of Danaeites rigida Gu and Zhi (Marattiales) from the Upper Permian of south China. Bot. J. Linn. Soc. 2001, 136, 107–117. [Google Scholar] [CrossRef]
  17. Hu, S.; Taylor, D.W.; Brenner, G.J.; Basha, S.H. A new marsilealean fern species from the Early Cretaceous of Jordan. Palaeoworld 2008, 17, 235–245. [Google Scholar] [CrossRef]
  18. Escapa, I.H.; Bomflfleur, B.; Cuneo, N.R.; Scasso, R. A new marattiaceous fern from the Lower Jurassic of Patagonia (Argentina): The renaissance of Marattiopsis. J. Syst. Palaeontol. 2014, 13, 677–689. [Google Scholar] [CrossRef]
  19. Lehtonen, S.; Poczai, P.; Sablok, G.; Hyvonen, J.; Karger, D.N.; Flores, J. Exploring the phylogeny of the marattialean ferns. Cladistics 2020, 36, 569–593. [Google Scholar] [CrossRef]
  20. May, M.R.; Contreras, D.L.; Sundue, M.A.; Nagalingum, N.S.; Rothfels, C.J. Inferring the total-evidence timescale of Marattialean fern evolution in the face of model sensitivity. Syst. Biol. 2021, 70, 1232–1255. [Google Scholar] [CrossRef]
  21. Rai, H.S.; Graham, S.W. Utility of a large; multigene plastid data set in inferring higher-order relationships in ferns and relatives (monilophytes). Am. J. Bot. 2010, 97, 1444–1456. [Google Scholar] [CrossRef]
  22. Kuo, L.-Y.; Li, F.-W.; Chiou, W.-L.; Wang, C.-N. First insights into fern matK phylogeny. Mol. Phylogenet. Evol. 2011, 59, 556–566. [Google Scholar] [CrossRef]
  23. Lehtonen, S. Towards resolving the complete fern tree of life. PLoS ONE 2011, 6, e24851. [Google Scholar] [CrossRef] [Green Version]
  24. Grewe, F.; Guo, W.; Gubbels, E.A.; Hansen, A.K.; Mower, J.P. Complete plastid genomes from Ophioglossum californicum; Psilotum nudum; and Equisetum hyemale reveal an ancestral land plant genome structure and resolve the position of Equisetales among monilophytes. BMC Evol. Biol. 2013, 13, 8. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Kim, H.T.; Chung, M.G.; Kim, K.-J. Chloroplast genome evolution in early diverged leptosporangiate ferns. Mol. Cells. 2014, 37, 372–382. [Google Scholar] [CrossRef] [Green Version]
  26. Lu, J.-M.; Zhang, N.; Du, X.-Y.; Wen, J.; Li, D.-Z. Chloroplast phylogenomics resolves key relationships in ferns. J. Syst. Evol. 2015, 53, 448–457. [Google Scholar] [CrossRef] [Green Version]
  27. Knie, N.; Fischer, S.; Grewe, F.; Polsakiewicz, M.; Knoop, V. Horsetails are the sister group to all other monilophytes and Marattiales are sister to leptosporangiate ferns. Mol. Phylogenet. Evol. 2015, 90, 140–149. [Google Scholar] [CrossRef] [PubMed]
  28. Rothfels, C.J.; Johnson, A.K.; Hovenkamp, P.H.; Swofford, D.L.; Roskam, H.C.; Fraseer-Jenkins, C.R.; Windham, M.D.; Pryer, K. Natural hybridization between genera that diverged from each other approximately 60 million years ago. Am. Nat. 2015, 185, 433–442. [Google Scholar] [CrossRef]
  29. Testo, W.; Sundue, M. A 4000-species dataset provides new insight into the evolution of ferns. Mol. Phylogenet. Evol. 2016, 105, 200–211. [Google Scholar] [CrossRef] [Green Version]
  30. Labiak, P.H.; Karol, K.G. Plastome sequences of an ancient fern lineage reveal remarkable changes in gene content and architecture. Am. J. Bot. 2017, 104, 1008–1018. [Google Scholar] [CrossRef] [Green Version]
  31. Lehtonen, S.; Silvestro, D.; Karger, D.N.; Scotese, C.; Tuomisto, H.; Kessler, M.; Pena, C.; Wahlberg, N.; Antonelli, A. Environmentally driven extinction and opportunistic origination explain fern diversification patterns. Sci. Rep. 2017, 7, 4831. [Google Scholar] [CrossRef] [Green Version]
  32. Gitzendanner, M.A.; Soltis, P.S.; Wong, G.K.S.; Ruhfel, B.R.; Soltis, D.E. Plastid phylogenomic analysis of green plants: A billion years of evolutionary history. Am. J. Bot. 2018, 105, 291–301. [Google Scholar] [CrossRef]
  33. Kuo, L.-Y.; Qi, X.-P.; Ma, H.; Li, F.-W. Order-level fern plastome phylogenomics: New insights from Hymenophyllales. Am. J. Bot. 2018, 105, 1545–1555. [Google Scholar] [CrossRef]
  34. Qi, X.-P.; Kuo, L.-Y.; Guo, C.; Li, H.; Li, Z.-Y.; Qi, J.; Wang, L.-B.; Hu, Y.; Xiang, J.-Y.; Zhang, C.-F.; et al. A well-resolved fern nuclear phylogeny reveals the evolution history of numerous transcription factor families. Mol. Phylogenet. Evol. 2018, 127, 961–977. [Google Scholar] [CrossRef] [PubMed]
  35. Lehtonen, S.; Cardenas, G.G. Dynamism in plastome structure observed across the phylogenetic tree of ferns. Bot. J. Linn. Soc. 2019, 190, 229–241. [Google Scholar] [CrossRef]
  36. Du, X.-Y.; Kuo, L.-Y.; Zuo, Z.-Y.; Li, D.-Z.; Lu, J.M. Structural variation of plastomes provides key insight into the deep phylogeny of ferns. Front. Plant Sci. 2022, 13, 862772. [Google Scholar] [CrossRef] [PubMed]
  37. Pelosi, J.A.; Kim, E.H.; Barbazuk, W.B.; Sessa, E.B. Phylotranscriptomics illuminates the placement of whole genome duplications and gene retention in ferns. Front. Plant Sci. 2022, 13, 882441. [Google Scholar] [CrossRef]
  38. Wickett, N.J.; Mirarab, S.; Nguyen, N.; Warnow, T.; Carpenter, E.; Matasci, N.; Ayyampalayam, S.; Barker, M.S.; Burleigh, J.G.; Gitzendanner, M.A.; et al. Phylotranscriptomic analysis of the origin and early diversifification of land plants. Proc. Natl. Acad. Sci. USA 2014, 111, E4859–E4868. [Google Scholar] [CrossRef] [Green Version]
  39. Shen, H.; Jin, D.; Shu, J.-P.; Zhou, X.L.; Lei, M.; Wei, R.; Shang, H.; Wei, H.J.; Zhang, R.; Liu, L.; et al. Large-scale phylogenomic analysis resolves a backbone phylogeny in ferns. GigaScience 2018, 7, gix116. [Google Scholar] [CrossRef] [Green Version]
  40. One Thousand Plant Transcriptomes Initiative. One thousand plant transcriptomes and the phylogenomics of green plants. Nature 2019, 574, 679–685. [Google Scholar] [CrossRef] [Green Version]
  41. Qiu, Y.-L.; Li, L.; Wang, B.; Chen, Z.; Dombrovska, O.; Lee, J.; Kent, L.; Li, R.; Jobson, R.W.; Hendry, T.A.; et al. A nonflowering land plant phylogeny inferred from nucleotide sequences of seven chloroplast; mitochondrial; and nuclear genes. Int. J. Plant Sci. 2007, 68, 691–708. [Google Scholar] [CrossRef]
  42. He, Z.-R.; Christenhusz, M.J.M. Marattiaceae. In Flora of China; Wu, Z.Y., Raven, P.H., Hong, D.Y., Eds.; Science Press: Beijing, China; Missouri Botanical Garden Press: St. Louis, MI, USA, 2013; Volume 2–3, pp. 82–89. [Google Scholar]
  43. Yan, Y.-H.; Zhou, X.-L. Ferns of Hainan; China Forestry Publishing: Beijing, China, 2018; 45p. [Google Scholar]
  44. Wang, T.; Xiao, B.; Liu, E.-D.; Nguyen, K.S.; Duan, J.-Q.; Wang, K.-L.; Yan, Y.-H.; Xiang, J.-Y. Rediscovery of Angiopteris tonkinensis (Marattiaceae) after 100 years; and its revision. PhytoKeys 2020, 161, 1–9. [Google Scholar] [CrossRef]
  45. Hill, C.R.; Camus, J.M. Evolutionary cladistics of marattialean ferns. Bull. Br. Mus. Nat. Hist. 1986, 14, 219–300. [Google Scholar]
  46. Hill, C.R.; Camus, J.M. Pattern cladistics or evolutionary cladistics. Cladistics 1986, 2, 362–375. [Google Scholar] [CrossRef] [PubMed]
  47. Liu, Z.-H.; Hilton, J.; Li, C.-S. Review on the origin; evolution and phylogeny of Marattiales. Chinese Bull. Bot. 2000, 17, 39–52. [Google Scholar]
  48. Senterre, B.; Rouhan, G.; Fabre, I.; Morel, C.; Christenhusz, M.J.M. Revision of the fern family Marattiaceae in the Seychelles with two new species and a discussion of the African Ptisana fraxinea complex. Phytotaxa 2014, 158, 19–57. [Google Scholar] [CrossRef] [Green Version]
  49. Rothwell, G.W.; Millay, M.A.; Stockey, R.A. Resolving the overall pattern of marattialean fern phylogeny. Am. J. Bot. 2018, 105, 1304–1314. [Google Scholar] [CrossRef] [PubMed]
  50. Ching, R.-C. Systematic arrangements of families and genera of Chinese Pteridophytes with corresponding names in Chinese. Acta Phytotax. Sin. 1954, 3, 93–99. [Google Scholar]
  51. Ching, R.-C. A revision of the ferns genus Archangiopteris Christ & Giesenhagen. Acta Phytotax. Sin. 1958, 7, 212–224. [Google Scholar]
  52. Ching, R.-C. Flora of China; Science Press: Beijing, China, 1959; Volume II, pp. 330–345. [Google Scholar]
  53. Chu, W.-M.; He, Z.-R. Flora of Yunnan; Science Press: Beijing, China, 2006; Volume 20, pp. 117–128. [Google Scholar]
  54. Pichi-Sermolli, R.E.G. Tentamen Pteridophytorum genera in taxonomicum ordinem redigendi. Webbia 1977, 31, 315–512. [Google Scholar] [CrossRef]
  55. Smith, A.R.; Pryer, K.M.; Schuettpelz, E.; Korall, P.; Schneider, H.; Wolf, P. A classification for extant ferns. Taxon 2006, 55, 705–731. [Google Scholar] [CrossRef]
  56. Li, C.-X.; Lu, S.-G. Phylogeny and divergence of Chinese Angiopteridaceae based on chloroplast DNA sequence data (rbcL and trnL-F). Chin. Sci. Bull. 2007, 52, 91–97. [Google Scholar] [CrossRef]
  57. Ching, R.-C.; Wang, Z.-R. Angiopteris sparsisora Ching; sp. nov.—A putative bigeneric hybrid. Acta Phytotax. Sin. 1982, 20, 347–350. [Google Scholar]
  58. Wang, C.-Y.; Yang, B.; He, Z.-R.; Wang, H.-B. Analysis on the geographical distribution characteristics of Angiopteridaceae and its causes in Yungui Plateau and Hengduan Mountains; China. Plant Div. Res. 2012, 344, 317–325. [Google Scholar] [CrossRef]
  59. Wang, T.; Zhang, G.-L.; Yan, Y.-H.; Sun, D.-K.; Shu, J.-P.; Duan, J.-Q.; Zhao, G.-H.; Zhang, G.-S.; Xiang, J.-Y.; Wang, K.-L. Angiopteris sugongii (Marattiaceae); a new diploid species with transitional morphology between Angiopteris (sen. str.) and Archangiopteris. Phytotaxa 2021, 516, 275–282. [Google Scholar] [CrossRef]
  60. Wu, W.H. A Study on the Natural Fern Hybrid–Archangiopteris Itoi Shieh; National Taiwan University: Taipei, Taiwan, 2002; 40p. [Google Scholar]
  61. Hsieh, T.-H.; Chang, Y.-H.; Chiou, W.-L.; Huang, Y.-M. Chromosome numbers of five species of the Marattiaceae in Taiwan. Taiwan. J. For. Sci. 2008, 234, 335–339. [Google Scholar]
  62. Christ, H.; Giesenhagen, K.  Archangiopteris. Flora 1899, 89, 72–79. [Google Scholar]
  63. Christenhusz, M.J.M.; Tuomisto, H.; Metzgar, J.S.; Pryer, K.M. Evolutionary relationships within the neotropical; eusporangiate fern genus Danaea (Marattiaceae). Mol. Phylogenet. Evol. 2008, 46, 34–48. [Google Scholar] [CrossRef] [PubMed]
  64. Zhang, Z.-Y. The morphylogy of Archangiopteris Christ et Gies and its relationship with Angiopteris Hoffm. Acta. Bot. Sin. 1973, 15, 261–270. (In Chinese) [Google Scholar]
  65. Duarte, J.M.; Wall, P.K.; Edger, P.P.; Landherr, L.L.; Ma, H.; Pires, J.C.; Leebens-Mack, J.; Depamphilis, C. Identification of shared single copy nuclear genes in Arabidopsis; Populus; Vitis and Oryza and their phylogenetic utility across various taxonomic levels. BMC Evol. Biol. 2010, 10, 61. [Google Scholar] [CrossRef] [Green Version]
  66. Zhang, N.; Zeng, L.-P.; Shan, H.-Y.; Ma, H. Highly conserved low-copy nuclear genes as effective markers for phylogenetic analyses in angiosperms. New Phytol. 2012, 195, 923–937. [Google Scholar] [CrossRef]
  67. Zimmer, E.A.; Wen, J. Using nuclear gene data for plant phylogenetics: Progress and prospects. Mol. Phylogenet. Evol. 2012, 65, 774–785. [Google Scholar] [CrossRef]
  68. Li, Z.; de La Torre, A.R.; Sterck, L.; Cánovas, F.M.; Avila, C.; Merino, I.; Cabezas, J.A.; Cercas, F.M.; Ingvarsson, P.K.; de Peer, Y.V. Single-copy genes as molecular markers for phylogenomic studies in seed plants. Genome Biol. Evol. 2017, 9, 1130–1147. [Google Scholar] [CrossRef] [Green Version]
  69. Bakkali, M.; Martín-Blázquez, R.; Ruiz-Estévez, M.; Garrido-Ramos, M.A. De novo sporophyte transcriptome assembly and functional annotation in the endangered fern species Vandenboschia speciosa (Willd.) G. Kunkel. Genes 2021, 12, 1017. [Google Scholar] [CrossRef]
  70. Mai, U.; Mirarab, S. TreeShrink: Fast and accurate detection of outlier long branches in collections of phylogenetic trees. BMC Genom. 2018, 19, 4046. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  71. Gastony, G.J.; Yatskievych, G. Maternal inheritance of the chloroplast and mitochondrial genomes in cheilanthoid ferns. Am. J. Bot. 1992, 79, 716–722. [Google Scholar] [CrossRef]
  72. Maddison, W.P. Gene trees in species trees. Syst. Biol. 1997, 46, 523–536. [Google Scholar] [CrossRef]
  73. Vogel, J.C.; Russell, S.J.; Rumsey, F.J.; Barrett, J.A.; Gibby, M. Evidence for the maternal transmission of chloroplast DNA in the genus Asplenium (Aspleniaceae; Pteridophyta). Botanica Acta 1998, 111, 247–249. [Google Scholar] [CrossRef]
  74. Guillon, J.M.; Raquin, C. Maternal inheritance of chloroplasts in the horsetail Equisetum variegatum (Schleich.). Curr. Genet. 2000, 37, 53–56. [Google Scholar] [CrossRef]
  75. Kuo, L.-Y.; Tang, T.-Y.; Li, F.-W. Organelle Genome Inheritance in Deparia Ferns (Athyriaceae; Aspleniineae; Polypodiales). Front. Plant Sci. 2018, 9, 486. [Google Scholar] [CrossRef] [Green Version]
  76. Olmstead, R.G.; Bedoya, A.M. Whole genomes: The holy grail. A commentary on: ‘Molecular phylogenomics of the tribe Shoreeae (Dipterocarpaceae) using whole plastidgenomes’. Ann. Bot. 2019, 123, iv–v. [Google Scholar] [CrossRef] [Green Version]
  77. Walker, J.F.; Walker-Hale, N.; Vargas, O.M.; Larson, D.A.; Stull, G.W. Characterizing gene tree conflict in plastome inferred phylogenies. PeerJ 2019, 7, e7747. [Google Scholar] [CrossRef] [Green Version]
  78. Galtier, N.; Daubin, V. Dealing with incongruence in phylogenomic analyses. Philos. Trans. R. Soc. B Biol. Sci. 2008, 363, 4023–4029. [Google Scholar] [CrossRef] [Green Version]
  79. Camus, J.M. Marattiaceae. In The Families and Genera of Vascular Plants: Pteridophytes and Gymnosperms; Kubitzki, K., Kramer, K.U., Green, P.S., Eds.; Springer: Berlin, Germany, 1990; Volume 1, pp. 174–180. [Google Scholar]
  80. Walker, T.G. A chromosome count for Macroglossum (Marattiales). Fern Gaz. 1981, 12, 161–163. [Google Scholar]
  81. Christenhusz, M.J.M. Evolutionary history and taxonomy of Neotropical marattioid ferns: Studies of an ancient lineage of plants. Ann. Univ. Turku. Ser. AII. 2007, 216, 1–78. [Google Scholar]
  82. Wang, K.; Lenstra, J.A.; Liu, L.; Hu, Q.-J.; Ma, T.; Qiu, Q.; Liu, J.-Q. Incomplete lineage sorting rather than hybridization explains the inconsistent phylogeny of the wisent. Commun. Biol. 2018, 1, 169. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  83. Li, J.-L.; Zhang, Y.-J.; Ruhsam, M.; Milne, R.I.; Wang, Y.; Wu, D.-Y.; Jia, S.-Y.; Tao, T.-Z.; Mao, K.-S. Seeing through the hedge: Phylogenomics of Thuja (Cupressaceae) reveals prominent incomplete lineage sorting and ancient introgression for Tertiary relict flora. Cladistics 2021, 38, 187–203. [Google Scholar] [CrossRef] [PubMed]
  84. Ma, J.-X.; Sun, P.-C.; Wang, D.-S.; Wang, Z.-Y.; Yang, J.; Li, Y.; Mu, W.-J.; Xu, P.-R.; Wu, Y.; Dong, C.-C.; et al. The Chloranthus sessilifolius genome provides insight into early diversification of angiosperms. Nat. Commun. 2021, 12, 6929. [Google Scholar] [CrossRef] [PubMed]
  85. Zhang, Q.; Zhao, L.; Folk, R.A.; Zhao, J.-L.; Zamora, N.A.; Yang, S.-X.; Soltis, D.E.; Soltis, D.E.; Gao, L.-M.; Peng, H.; et al. Phylotranscriptomics of Theaceae: Generic-level relationships; reticulation and whole-genome duplication. Ann. Bot. 2022, 129, 457–471. [Google Scholar] [CrossRef] [PubMed]
  86. Liu, B.-B.; Ren, C.; Kwak, M.; Hodel, R.G.; Xu, C.; He, J.; Zhou, W.-B.; Huang, C.-H.; Ma, H.; Qian, G.-Z.; et al. Phylogenomic conflict analyses in the apple genus Malus s.l. reveal widespread hybridization and allopolyploidy driving diversification; with insights into the complex biogeographic history in the Northern Hemisphere. J. Integr. Plant Biol. 2022, 64, 1020–1043. [Google Scholar] [CrossRef]
  87. Degnan, J.H.; Rosenberg, N.A. Gene tree discordance; phylogenetic inference and the multispecies coalescent. Trends Ecol. Evol. 2009, 24, 332–340. [Google Scholar] [CrossRef]
  88. Flouri, T.; Jiao, X.; Rannala, B.; Yang, Z. Species tree inference with BPP using genomic sequences and the multispecies coalescent. Mol. Biol. Evol. 2018, 35, 2585–2593. [Google Scholar] [CrossRef] [Green Version]
  89. Warren, H.; Wagner, J. Reticulate Evolution in the Appalachian Aspleniums. Evolution 1953, 8, 103–118. [Google Scholar]
  90. Barrington, D.S.; Haufler, C.H.; Werth, C.R. Hybridization; reticulation; and species concepts in the ferns. Am. Fern J. 1989, 79, 55–64. [Google Scholar] [CrossRef] [Green Version]
  91. Wood, T.E.; Takebayashi, N.; Barker, M.S.; Mayrose, I.; Greenspoon, P.B.; Rieseberg, L.H. The frequency of polyploid speciation in vascular plants. Proc. Natl. Acad. Sci. USA 2009, 106, 13875–13879. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  92. Whitney, K.D.; Ahern, J.R.; Cambell, L.G.; Albert, L.; King, M.S. Patterns of hybridization in plants. Perspect. Plant Ecol. Evol. Syst. 2010, 12, 175–182. [Google Scholar] [CrossRef]
  93. Sessa, E.B.; Zimmer, E.A.; Givnish, T.J. Unraveling reticulate evolution in north american Dryopteris (Dryopteridaceae). BMC Evol. Biol. 2012, 12, 104. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  94. Sigel, E.M. Genetic and genomic aspects of hybridization in ferns. J. Syst. Evol. 2016, 54, 638–655. [Google Scholar] [CrossRef]
  95. Christenhusz, M.J.M. Revision of the Neotropical fern genus Eupodium (Marattiaceae). Kew Bull. 2010, 65, 115–121. [Google Scholar] [CrossRef]
  96. Chen, S.-F.; Zhou, Y.-Q.; Chen, Y.-R.; Gu, J. fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef]
  97. Grabherr, M.G.; Haas, B.J.; Yassour, M.; Levin, J.Z.; Thompson, D.A.; Amit, I.; Adiconis, X.; Fan, L.; Raychowdhury, R.; Zeng, Q.; et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 2011, 29, 644–652. [Google Scholar] [CrossRef] [Green Version]
  98. Li, H. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics 2018, 34, 3094–3100. [Google Scholar] [CrossRef] [Green Version]
  99. Danecek, P.; Bonfield, J.K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M.O.; Whitwham, A.; Keane, T.; McCarthy, S.A.; Davies, R.M.; et al. Twelve years of SAMtools and BCFtools. GigaScience 2021, 10, 2. [Google Scholar] [CrossRef]
  100. Li, W.; Godzik, A. Cd-hit: A fast program for clusteringand comparing large sets of protein or nucleotide sequences. Bioinformatics 2006, 22, 1658–1659. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  101. Haas, B.J.; Papanicolaou, A.; Yassour, M.; Grabherr, M.; Blood, P.D.; Bowden, J.; Couger, M.B.; Eccles, D.; Li, B.; Lieber, M.; et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc 2013, 8, 1494–1512. [Google Scholar] [CrossRef] [PubMed]
  102. Emms, D.M.; Kelly, S. OrthoFinder: Solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 2015, 16, 157. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  103. Emms, D.M.; Kelly, S. OrthoFinder: Phylogenetic orthology inference for comparative genomics. Genome Biol. 2019, 20, 238. [Google Scholar] [CrossRef] [Green Version]
  104. Cosentino, S.; Iwasaki, W. SonicParanoid: Fast accurate and easy orthology inference. Bioinformatics 2019, 35, 149–151. [Google Scholar] [CrossRef] [Green Version]
  105. Katoh, K. MAFFT: A novel method for rapid multiple sequence alignment based on fast fourier transform. Nucleic Acids Res. 2002, 30, 3059–3066. [Google Scholar] [CrossRef] [Green Version]
  106. Katoh, K.; Standley, D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013, 30, 772–780. [Google Scholar] [CrossRef] [Green Version]
  107. Capella-Gutierrez, S.; Silla-Martinez, J.M.; Gabaldon, T. trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 2009, 25, 1972–1973. [Google Scholar] [CrossRef] [Green Version]
  108. Zhang, D.; Gao, F.; Jakovlić, I.; Zou, H.; Zhang, J.; Li, W.-X.; Wang, G.-T. PhyloSuite: An integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol. Ecol. Res. 2020, 20, 348–355. [Google Scholar] [CrossRef]
  109. Morales-Briones, D.F.; Kadereit, G.; Tefarikis, D.T.; Moore, M.J.; Smith, S.A.; Brockington, S.F.; Timoneda, A.; Yim, W.C.; Cushman, J.C.; Yang, Y. Disentangling sources of gene tree discordance in phylogenomic datasets: Testing ancient hybridizations in Amaranthaceae s.l. Syst. Biol. 2020, 70, 219–235. [Google Scholar] [CrossRef]
  110. Bruen, T.C.; Philippe, H.; Bryant, D. A simple and robust statistical test for detecting the presence of recombination. Genetics 2006, 172, 2665–2681. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  111. Nguyen, L.T.; Schmidt, H.A.; von Haeseler, A.; Minh, B.Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum likelihood phylogenies. Mol. Biol. Evol. 2015, 32, 268–274. [Google Scholar] [CrossRef] [PubMed]
  112. Hoang, D.T.; Chernomor, O.; Haeseler, A.V.; Minh, B.Q.; Vinh, L.S. UFBoot2: Improving the ultrafast bootstrap approximation. Mol. Biol. Evol. 2018, 35, 518–522. [Google Scholar] [CrossRef]
  113. Kalyaanamoorthy, S.; Minh, B.Q.; Wong, T.K.F.; von Haeseler, A.; Jermiin, L.S. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods 2017, 14, 587–589. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  114. Sayyari, E.; Mirarab, S. Fast coalescent-based computation of local branch support from quartet frequencies. Mol. Biol. Evol. 2016, 33, 1654–1668. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  115. Suyama, M.; Torrents, D.; Bork, P. PAL2NAL: Robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 2006, 34, W609–W612. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  116. Kearse, M.; Moir, R.; Wilson, A.; Stones-Havas, S.; Cheung, M.; Sturrock, S.; Buxton, S.; Cooper, A.; Markowitz, S.; Duran, C. Geneious basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, 28, 1647–1649. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  117. Salichos, L.; Stamatakis, A.; Rokas, A. Novel information theory-based measures for quantifying incongruence among phylogenetic trees. Mol. Biol. Evol. 2014, 31, 1261–1271. [Google Scholar] [CrossRef] [Green Version]
  118. Smith, S.A.; Moore, M.J.; Brown, J.W.; Yang, Y. Analysis of phylogenomic datasets reveals conflict; concordance; and gene duplications with examples from animals and plants. BMC Evol. Biol. 2015, 15, 150. [Google Scholar] [CrossRef] [Green Version]
  119. Pease, J.B.; Brown, J.W.; Walker, J.F.; Hinchliff, C.E.; Smith, S.A. Quartet Sampling distinguishes lack of support from conflicting support in the green plant tree of life. Am. J. Bot. 2018, 105, 385–403. [Google Scholar] [CrossRef] [Green Version]
  120. Sukumaran, J.; Holder, M.T. DendroPy: A Python library for phylogenetic computing. Bioinformatics 2010, 26, 1569–1571. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  121. Than, C.; Ruths, D.; Nakhleh, L. PhyloNet: A software package for analyzing and reconstructing reticulate evolutionary relationships. BMC Bioinform. 2008, 9, 322. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  122. Yu, Y.; Dong, J.; Liu, K.-J.; Nakhleh, L. Maximum likelihood inference of reticulate evolutionary histories. Proc. Natl. Acad. Sci. USA 2014, 111, 16448–16453. [Google Scholar] [CrossRef] [Green Version]
  123. Yu, Y.; Degnan, J.H.; Nakhleh, L. The probability of a gene tree topology within a phylogenetic network with applications to hybridization detection. PLoS Genet. 2012, 8, e1002660. [Google Scholar] [CrossRef] [Green Version]
  124. Huson, D.H.; Scornavacca, C. Dendroscope 3: An interactive tool for rooted phylogenetic trees and networks. Syst. Biol. 2012, 61, 1061–1067. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  125. Smith, S.A.; O’Meara, B.C. treePL: Divergence time estimation using penalized likelihood for large phylogenies. Bioinformatics 2012, 28, 2689–2690. [Google Scholar] [CrossRef] [Green Version]
  126. Lundblad, A.B. Studies in the Rhaeto-Liassie Flora of Sweden; 1. Pteridophyta; Pteridospermae and Cycadophyta from the mining district of NW Scania. K. Sven. Vetensk. Handl. Fjärde Ser. 1950, 1, 1–82. [Google Scholar]
  127. Harris, T.M. The Yorkshire Jurassic Flora. I. Thalophyta-Pteridophyta. Trustees of the British Museum (Natural History); India. Nat. Hist. 1961. [Google Scholar] [CrossRef]
  128. Maurin, J.L.K. An empirical guide for producing a dated phylogeny with treePL in a maximum likelihood framework. arXiv 2020, arXiv:2008.07054v2. [Google Scholar]
  129. Rambaut, A. FigTree-Version 1.4.3; A Graphical Viewer of Phylogenetic Tree. 2017. Available online: http://tree.bio.ed.ac.uk/software/figtree/ (accessed on 5 May 2023).
Figure 1. Summary of the topologies among Marattiales in previous studies.
Figure 1. Summary of the topologies among Marattiales in previous studies.
Plants 12 02237 g001
Figure 2. Phylogenetic analysis of species trees based on 26-S-aa in Marattiaceae. (a). Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) values are shown above branches, and numbers of gene trees (concordant/conflicting) are shown below branches. Pie charts present the proportion of congruent gene trees that supports that clade (blue), the proportion of discordant gene trees of the main alternative topology for that clade (green), the proportion of discordant trees for the remaining alternative topologies (red), the proportion of uninformative gene trees (dark gray; bootstrap support < 50%), and the proportion of missing data (light gray). (b). Maximum likelihood bootstrap support (ML-BS) is shown above the branches, and quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches.
Figure 2. Phylogenetic analysis of species trees based on 26-S-aa in Marattiaceae. (a). Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) values are shown above branches, and numbers of gene trees (concordant/conflicting) are shown below branches. Pie charts present the proportion of congruent gene trees that supports that clade (blue), the proportion of discordant gene trees of the main alternative topology for that clade (green), the proportion of discordant trees for the remaining alternative topologies (red), the proportion of uninformative gene trees (dark gray; bootstrap support < 50%), and the proportion of missing data (light gray). (b). Maximum likelihood bootstrap support (ML-BS) is shown above the branches, and quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches.
Plants 12 02237 g002
Figure 3. Phylogenomic analysis of species trees based on 16-O-cds and 16-S-cds datasets in Marattiaceae ((a,b). 16-O-cds datasets, (c,d). 16-S-cds datasets). (a,c). Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) values are shown above branches, and numbers of gene trees (concordant/conflicting) were shown below branches. Pie charts present the proportion of congruent gene trees that supports that clade (blue), the proportion of discordant gene trees of the main alternative topology for that clade (green), the proportion of discordant trees for the remaining alternative topologies (red), the proportion of uninformative gene trees (dark gray; bootstrap support <50%), and the proportion of missing data (light gray). (b,d). Maximum likelihood bootstrap support (ML-BS) is shown above the branches, and quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches.
Figure 3. Phylogenomic analysis of species trees based on 16-O-cds and 16-S-cds datasets in Marattiaceae ((a,b). 16-O-cds datasets, (c,d). 16-S-cds datasets). (a,c). Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) values are shown above branches, and numbers of gene trees (concordant/conflicting) were shown below branches. Pie charts present the proportion of congruent gene trees that supports that clade (blue), the proportion of discordant gene trees of the main alternative topology for that clade (green), the proportion of discordant trees for the remaining alternative topologies (red), the proportion of uninformative gene trees (dark gray; bootstrap support <50%), and the proportion of missing data (light gray). (b,d). Maximum likelihood bootstrap support (ML-BS) is shown above the branches, and quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches.
Plants 12 02237 g003
Figure 4. Simplified phylogeny of Marattiaceae based on different datasets. (a). Maximum likelihood phylogeny based on CT dataset. (b). Maximum likelihood phylogeny based on MT dataset. (c). Maximum likelihood phylogeny based on the combined dataset of CM. (d). Consensus of 40 species trees from all inferences with IQ-tree and Astral-II using single-copy nuclear genes. The sizes of hollow triangles are in proportion to the sampled sizes of individual clades. Support values are shown along the branches; values of 100 and 1 are not displayed.
Figure 4. Simplified phylogeny of Marattiaceae based on different datasets. (a). Maximum likelihood phylogeny based on CT dataset. (b). Maximum likelihood phylogeny based on MT dataset. (c). Maximum likelihood phylogeny based on the combined dataset of CM. (d). Consensus of 40 species trees from all inferences with IQ-tree and Astral-II using single-copy nuclear genes. The sizes of hollow triangles are in proportion to the sampled sizes of individual clades. Support values are shown along the branches; values of 100 and 1 are not displayed.
Plants 12 02237 g004
Figure 5. (a,b). Phylogenetic analysis of species tree with Family-O-aa dataset in Marattiaceae ((a). Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; (b) maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); (c) distributions of topology frequencies of observed and simulated gene trees based on Family-O-aa dataset; (d) distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; (e) species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Family-O-aa dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node.
Figure 5. (a,b). Phylogenetic analysis of species tree with Family-O-aa dataset in Marattiaceae ((a). Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; (b) maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); (c) distributions of topology frequencies of observed and simulated gene trees based on Family-O-aa dataset; (d) distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; (e) species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Family-O-aa dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node.
Plants 12 02237 g005
Figure 6. (a,b). Phylogenetic analysis of species tree with Genus-O-cds dataset in Marattiaceae ((a). Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; (b) maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); (c) distributions of topology frequencies of observed and simulated gene trees based on Genus-O-cds dataset; (d) distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; (e) species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Genus-O-cds dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node.
Figure 6. (a,b). Phylogenetic analysis of species tree with Genus-O-cds dataset in Marattiaceae ((a). Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; (b) maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); (c) distributions of topology frequencies of observed and simulated gene trees based on Genus-O-cds dataset; (d) distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; (e) species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Genus-O-cds dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node.
Plants 12 02237 g006
Figure 7. (a,b). Phylogenetic analysis of species tree with Species-O-aa dataset in Marattiaceae ((a). Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; (b) maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); (c) distributions of topology frequencies of observed and simulated gene trees based on Species-O-aa dataset; (d) distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; (e) species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Species-O-aa dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node.
Figure 7. (a,b). Phylogenetic analysis of species tree with Species-O-aa dataset in Marattiaceae ((a). Astral-II posterior probabilities (AS-PP) and internode certainty all (ICA) scores are shown above branches and numbers of gene trees (concordant/conflicting) are shown next to the nodes; (b) maximum likelihood bootstrap support (ML-BS) is shown above the branches. Quartet sampling internal node scores (quartet concordance/quartet differential/quartet informativeness) are shown below branches); (c) distributions of topology frequencies of observed and simulated gene trees based on Species-O-aa dataset; (d) distribution of Robinson-Foulds distances of the simulated and observed gene trees to the coalescent tree; (e) species network inferred from PhyloNet pseudolikelihood analyses with one to five hybridization events based on Species-O-aa dataset. Red and blue curved branches indicate the minor and major edges of hybrid nodes, respectively. Numbers next to curved branches indicate inheritance probabilities for each hybrid node.
Plants 12 02237 g007
Figure 8. Time-calibrated phylogeny of Marattiaceae. The times were inferred by treePL based on 16-O-cds dataset. The divergence times are shown behind the nodes, the 95% highest posterior densities are represented as blue bars, and branch length are proportional to relative ages (in millions of years; see scale at bottom of tree).
Figure 8. Time-calibrated phylogeny of Marattiaceae. The times were inferred by treePL based on 16-O-cds dataset. The divergence times are shown behind the nodes, the 95% highest posterior densities are represented as blue bars, and branch length are proportional to relative ages (in millions of years; see scale at bottom of tree).
Plants 12 02237 g008
Table 1. Voucher information of taxa included in this study.
Table 1. Voucher information of taxa included in this study.
FamilyGenera (PPG I)SpeciesVoucher DataLocations or Sample ProviderCitation
MarattiaceaeAngiopterisAn. caudatiformis HieronymusAN_CA (PYU)Yunnan, China, This study
MarattiaceaeAngiopterisAn. fokiensis HieronymusAN_FO (PYU)Yunnan, ChinaThis study
MarattiaceaeAngiopterisAn. hokouensis ChingAN_HO (PYU)Yunnan, ChinaThis study
MarattiaceaeAngiopterisAn. sparsisora ChingAN_SP (PYU)Yunnan, ChinaThis study
MarattiaceaeAngiopterisAn. wangii ChingAN_WA (PYU)Yunnan, ChinaThis study
MarattiaceaeAngiopterisAn. yunnanensis HieronymusAN_YN (PYU)Yunnan, ChinaThis study
MarattiaceaeAngiopteris* An. bipinnata (Ching) J.M. Camus (=Ar. bipinnata Ching)AN_BP (PYU)Yunnan, ChinaThis study
MarattiaceaeAngiopteris* An. latipinna Z.R. He, W.M. Chu & Christenhusz (=Ar. henryi Christ & Giesenhagen)AN_LA (PYU)Yunnan, ChinaThis study
MarattiaceaeAngiopteris* An. chingii J. M. Camus (=Ar. hokouensis Ching)AN_CH (PYU)Yunnan, ChinaThis study
MarattiaceaeAngiopteris* An. subrotundata (Ching) Z.R. He & Christenhusz (=Ar. subrotundata Ching)AN_SU (PYU)Yunnan, ChinaThis study
MarattiaceaeChristenseniaChristensenia aesculifolia (Blume) MaxonCH_AE (PYU)Yunnan, ChinaThis study
MarattiaceaeChristenseniaChristensenia aesculifolia (Blume) MaxonK024247 (KBCC)Dr. Ceceilia Koo of Botanic Conservation Center[34]
MarattiaceaeDanaeaDanaea nodosa (L.) Sm.Vasco 879 (NY)Sample Provider: Alejandra Vasco1 kp., 2019
MarattiaceaeDanaeaDanaea nodosa (L.) Sm.K034856(KBCC)Dr. Ceceilia Koo of Botanic Conservation Center[34]
MarattiaceaeMarattiaMarattia sp.NYBG1295/78-A (NYBG)Sample Provider: D. W. Stevenson1 kp., 2019
MarattiaceaePtisanaPtisana pellucida (C.Presl) MurdockK013208(FUS)Dr. Ceceilia Koo of Botanic Conservation Center[34]
EquisetaceaeEquisetumEquisetum arvense L.xp705
(FUS)
Zhejiang, China[34]
EquisetaceaeEquisetumEquisetum diffusum D.DonRS-107 (CSH)Sichuan, China[39]
PsilotaceaePsilotumPsilotum nudum (L.) P.Beauv.xp703
(FUS)
Guangxi, China[34]
OphioglossaceaeOphioglossumOphioglossum thermale Kom.xp746
(FUS)
Shanghai, China[34]
OphioglossaceaeOphioglossumOphioglossum vulgatum L.RS-84 (CSH)Guangdong, China[39]
OsmundaceaeOsmundastrumOsmundastrum cinnamomeum (L.) C.PreslUnknown (NYBG)Sample Provider: D. W. Stevenson1 kp., 2019
OsmundaceaeOsmundaOsmunda japonica Thunb.RS38 (CSH)Shanghai, China[39]
HymenophyllaceaeHymenophyllumHymenophyllum holochilum (Bosch) C.Chr.Kuo4290 (TAFI)Taiwan, China[34]
HymenophyllaceaeVandenboschiaVandenboschia speciosa (Willd.) G.KunkelUnknownAlcornocales Natural Park (Cádiz, Spain): Valdeinfierno (VDI).[69]
AmborellaceaeAmborellaAmborella trichopoda Baill.UnknownUnknownUnknown
Note: An. Angiopteris; Ar. Archangiopteris; * show these species have been recognized as the members of Archangiopteris historically.
Table 2. Datasets’ statistics.
Table 2. Datasets’ statistics.
Dataset26-O-aa26-O-cds26-S-aa26-S-cds16-O-aa16-O-cds16-S-aa16-S-cdsFamily-O-aaFamily-O-cds
Number of Taxon26262626161616161111
Number of Gene6580147162474595782947749834
Number of Sites 23,13678,82862,321194,713212,344679,941381,4131,189,191351,3991,093,200
Number of Informative sites 11,27843,17229,032103,05732,57783,06855,719137,762148,631523,878
Number of Invariable sites888826,92324,91269,474157,262541,874285,182955,281144,968416,330
Sequence Length23,13678,82862,321194,713212,344679,941381,4131,189,191351,3991,093,200
Pairwise Identity (%)71.1070.7069.5069.2081.3083.7078.9081.3062.4062.50
Identical Sites (%)14.4012.6012.1010.7027.1032.0025.4030.2027.9024.60
Average GC content (%)46.6046.4047.3046.9045.50
Max Sequence Length (bp)22,91877,97360,898190,133207,882665,631367,7271,147,431344,8231,072,395
Min Sequence Length (bp)11,41638,56025,97084,13893,695323,922159,932542,583296,857917,580
Mean coverage (%)25.98 26.00 25.99 25.99 16.00 16.00 16.00 16.00 11.00 11.00
DatasetFamily-S-aaFamily-S-cdsGenus-O-aaGenus-O-cdsGenus-S-aaGenus-S-cdsSpecies-O-aaSpecies-O-cdsSpecies-S-aaSpecies-S-cds
Number of Taxon1111999911111111
Number of Gene129313977009111105138954110586161216
Number of Sites 678,5282,102,052314,3251,032,691532,9631,702,007244,990967,773269,1501,094,673
Number of Informative sites289,3931,012,80545,580118,50473,153186,245485010,908551012,700
Number of Invariable sites 273,670785,106235,281830,260401,7291,375,289232,965939,305254,8611,060,028
Sequence Length678,5282,102,052314,3251,032,691532,9631,702,007244,990967,773269,1501,094,673
Pairwise Identity (%)60.7061.0071.8075.5069.9073.6084.9089.1084.6089.20
Identical Sites (%)25.9022.8027.3033.1026.5031.7038.8054.5042.8058.00
Average GC content (%)45.3046.9046.6047.5047.50
Max Sequence Length (bp)661,8312,050,271305,6141,005,724508,7291,631,249239,958952,194261,8921,073,154
Min Sequence Length (bp)575,1701,774,443133,950488,227221,802779,205111,063578,586136,543702,030
Mean coverage (%)11.00 11.00 9.00 9.00 9.00 9.00 11.00 11.00 11.00 11.00
Note: “O” and “S” mean performed by Orothofinder and SonicParanoid. “aa” and “cds” mean amino acid sequence and nucleotide sequence.
Table 3. Likelihood scores of networks corresponding to the best five network searches. The bold values show the best network.
Table 3. Likelihood scores of networks corresponding to the best five network searches. The bold values show the best network.
Total Log ProbabilityFamily-O-aaFamily-O-cdsFamily-S-aaFamily-S-cdsGenus-O-aaGenus-O-cds
Network 1−6190.5703−3501.058081−11295.45488−5950.690602−744.3201955−457.4195646
Network 2−6113.875518−3227.002002−11199.63915−5872.642211−743.1046667−487.2788227
Network 3−6231.19481−3287.813375−11385.39823−5950.690602−720.0360015−487.2788227
Network 4−5643.849581−3323.11141−10901.25444−5158.20478−743.3484675−452.0974283
Network 5−6165.723153−3484.299045−11033.73158−5699.184902−712.7726379−510.6857333
Total log probabilityGenus-S-aaGenus-S-cdsSpecies-O-aaSpecies-O-cdsSpecies-S-aaSpecies-S-cds
Network 1−1357.70562−772.4172772−247.1150431−199.7265792−328.9024602−267.86003727524957
Network 2−1358.256057−772.0158611−249.0607795−213.7213554−332.0709943−267.86003727524945
Network 3−1358.256057−773.8058876−247.117924−220.5996779−331.5036725−272.798912
Network 4−1377.690453−808.015031−249.0611516−199.8006603−331.5036725−268.2104459
Network 5−1389.124744−808.015031−249.0611364−226.4899819−328.9071446−300.2662086
Note: the bold values indicated the best network using the command “CalGTProb” in PhyloNet.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhao, J.; Zhou, X.; Fang, S.; Zhu, Z.; Li, Y.; Yu, H.; He, Z. Transcriptome-Based Study on the Phylogeny and Hybridization of Marattialean Ferns (Marattiaceae). Plants 2023, 12, 2237. https://doi.org/10.3390/plants12122237

AMA Style

Zhao J, Zhou X, Fang S, Zhu Z, Li Y, Yu H, He Z. Transcriptome-Based Study on the Phylogeny and Hybridization of Marattialean Ferns (Marattiaceae). Plants. 2023; 12(12):2237. https://doi.org/10.3390/plants12122237

Chicago/Turabian Style

Zhao, Jing, Xinmao Zhou, Shaoli Fang, Zhangming Zhu, Yuxin Li, Hong Yu, and Zhaorong He. 2023. "Transcriptome-Based Study on the Phylogeny and Hybridization of Marattialean Ferns (Marattiaceae)" Plants 12, no. 12: 2237. https://doi.org/10.3390/plants12122237

APA Style

Zhao, J., Zhou, X., Fang, S., Zhu, Z., Li, Y., Yu, H., & He, Z. (2023). Transcriptome-Based Study on the Phylogeny and Hybridization of Marattialean Ferns (Marattiaceae). Plants, 12(12), 2237. https://doi.org/10.3390/plants12122237

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop