Next Article in Journal
Characterization of Mucosal-Associated Invariant T Cells in Oral Lichen Planus
Next Article in Special Issue
Advances in Genetic Engineering in Improving Photosynthesis and Microalgal Productivity
Previous Article in Journal
New Monoterpenoid Indole Alkaloids from Tabernaemontana crassa Inhibit β-Amyloid42 Production and Phospho-Tau (Thr217)
Previous Article in Special Issue
Genome-Wide Characterization and Expression Profiling of the GRAS Gene Family in Salt and Alkali Stresses in Miscanthus sinensis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Comparative Analysis of the Fasciclin-like Arabinogalactan Proteins (FLAs) in Salicacea and Identification of Secondary Tissue Development-Related Genes

Key Laboratory for Tree Breeding and Germplasm Improvement, Southern Modern Forestry Collaborative Innovation Center, College of Forestry, Nanjing Forestry University, Nanjing 210037, China
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2023, 24(2), 1481; https://doi.org/10.3390/ijms24021481
Submission received: 2 December 2022 / Revised: 24 December 2022 / Accepted: 10 January 2023 / Published: 12 January 2023
(This article belongs to the Special Issue Functional Genomics of Energy Crops)

Abstract

:
Fasciclin-like arabinogalactan proteins (FLAs) are a subclass of arabinogalactan proteins (AGPs) containing both AGP-like glycated domains and fasciclin (FAS) domains, which are involved in plant growth and development and synthesis of the cell wall. However, these proteins have not been identified or analyzed in willow, Salix, the sister genus of Populus. In this study, we performed a whole genome study of the FLA gene family of Salix suchowensis and compared it with the FLA gene family of Populus deltoides. The results showed the presence of 40 and 46 FLA genes in P. deltoides and S. suchowensis, distributed on 17 and 16 chromosomes, respectively. Four pairs of tandem repeat genes were found in willow, while poplar had no tandem repeat genes. Twelve and thirteen pairs of duplicated gene fragments were identified in poplar and willow, respectively. The multispecies phylogenetic tree showed that the FLA gene family could be divided into four groups (I–IV), with Group 1 showing significant expansion in woody plants. A gene expression analysis showed that PdeFLA19/27 in Group I of poplar was highly expressed, specifically during the secondary growth period of the stem and the rapid elongation of seed hairs. In the Group I genes of S. suchowensis, SsuFLA25/26/28 was also highly expressed during the secondary growth period, whereas increased expression of SsuFLA35 was associated with seed hair tissue. These results provide important clues about the differences in the FLA gene family during the evolution of herbs and woody plants, and suggest that the FLA gene family may play an essential role in regulating the secondary growth of woody plants. It also provides a reference for further studies on the regulation of secondary growth and seed hair development by FLA genes in poplar and willow.

1. Introduction

The hydroxyproline-rich glycoproteins (HRGPs) are a class of structural proteins usually found in the cell walls of higher plants [1,2,3,4]. Arabinogalactan proteins (AGPs) are a subfamily of HRGPs involved in many aspects of plant development, including tissue differentiation, reproduction, plant hormone interactions, responses to biotic stress, cell expansion, and secondary cell wall deposition [5,6]. AGPs are subdivided according to the structure of their protein backbone into classical, lysine-rich, AG peptide, fasciclin-like (FLAs), non-classical, and chimeric AGPs [2,7,8]. The fasciclin-like arabinogalactan proteins (FLAs) are characterized by having fasciclin (FAS) domains which were first detected in fruit flies (Drosophila melanogaster) and later in other organisms including algae, lichens, plants, and animals. FLAs are found in numerous species. Twenty-one FLAs have been identified in Arabidopsis thaliana [9], 30 in Musa spp. [10], 33 in Brassica rapa L. [11], 23 in Cannabis sativa L. [12], 19 in Corchorus olitorius [13], 19 in Gossypium hirsutum [14], 18 in Eucalyptus grandis [15], 38 in Nicotiana benthamiana [16], 35 in Populus trichocarpa [17], 27 in Oryza sativa [3], 34 in Triticum aestivum [18], and 17 in Cuscuta campestris [19].
Studies on plant FLAs have focused on tissue-specific functions, pollination, and embryogenesis, as well as general responses to biotic and abiotic stresses [6,15,17,20]. In addition, numerous studies have demonstrated the importance of FLAs in aspects related to cell wall biosynthesis. For example, in Arabidopsis, AtFLA4 induced aberrant cell expansion, adhesion, cell wall synthesis, and seed coat pectin mucilage [21]. AtFLA11 and AtFLA12 played a role in stem tensile strength, biomechanics, and elastic modulus, thus affecting the composition and structure of the cell walls [20]. Studies of AtFLA16 mutants showed that deletion of AtFLA16 resulted in reduced stem length and altered biomechanical properties [22]. Studies on poplar showed that the PtFLA6 gene was associated with the poplar’s xylem fiber cells, cell wall composition, and stem biomechanics [23,24]. Woody plants have more developed lignified stems than herbaceous plants, and their lignified stems are mainly composed of secondary wall-thickened cells. Therefore, the prediction and study of FLA gene functions in woody plants could provide essential clues to improve the properties of wood.
Populus and Salix are sister genera belonging to the Salicaceae family. They are useful species for maintaining the ecological environment and for solving the shortage of wood [25,26]. Although the FLA gene family has been identified and analyzed in poplar [17], it has not been analyzed in willow. Here, we performed a genome-wide comparative analysis of the FLAs in Populus and Salix. A multi-species phylogenetic tree of the FLA gene family to explore differences in gene expansion in herbaceous and woody plants was constructed. The expression patterns of FLA genes in different tissues of both poplar and willow were combined and those that were strongly and specifically expressed during stem lignification and seed hair development were identified, thus providing reliable candidate genes for subsequent verification of the gene functions.

2. Results and Discussion

2.1. Genome-Wide Identification and Sequence Analysis of Populus and Salix FLA genes

Based on the sequence structural characteristics of the FLA gene family, 40 and 46 FLA gene family members were identified in the genomes of P. deltoides and S. suchowensis, respectively. They were named PdeFLA1-PdeFLA40 and SsuFLA1-SsuFLA46, respectively, according to their positions on the chromosomes. All the genes contained the conserved FLA structural domain (Supplementary Tables S1 and S2). In P. deltoides, 29 proteins were predicted by the BIG-PI online program to have C-terminal glycophosphatidylinositol (GPI) anchors, and 34 N-terminal signal peptides were predicted by the SignalP online program (Supplementary Table S3). The proteins contained between 176 and 466 amino acids, with isoelectric points in the range of 4.50–10.77 and molecular weights of between 18,211.68 and 51,122.1 Da. In S. suchowensis, 32 proteins were predicted to have C-terminal GPI anchors by the BIG-PI prediction software, and 41 to were predicted to have N-terminal signal peptides by the SignalP online program (Supplementary Table S4). The amino acid lengths ranged from 206 to 468, the isoelectric points ranged from 4.33 to 10.81, and the molecular weights were between 21,708.68 and 51,646.93 Da. The gene names, gene IDs, chromosomal location, amino acid lengths, isoelectric point (pIs), and molecular weights (Mw) are listed in Table 1. This study comprehensively analyzed the FLA gene family in P. deltoides and S. suchowensis, identifying 40 and 46 genes, respectively. These gene numbers were 1.9-fold and 2.2-fold higher than the number of FLA genes in Arabidopsis [9]. Previous research has shown that early flowering species have a faster replacement rate than species with long generations [27,28]. This suggests that these two species have more genes than Arabidopsis, either because the poplar and willow genomes were larger or the replacement rate was much lower.

2.2. Chromosomal Distribution and Identification of Gene Duplication in Populus and Salix FLA Genes

To determine the distribution of FLA gene family members on the chromosomes of the P. deltoides and S. suchowensis genomes, and to evaluate the gene duplication relationships among the genes, we mapped the 40 poplar and 46 willow FLA genes onto the 19 chromosomes of their respective genomes using nucleotide sequence alignment. Four poplar genes and two willow genes failed to map to chromosomes (Figure 1), but were mapped to unassembled scaffold fragments, probably due to incomplete genome assembly in these plant tissues.
We found that the distribution of FLA genes on the chromosomes in the two studied genomes was uneven. The maximum number of FLA gene members distributed on the poplar chromosomes I, VI, XIII, and XIX was four, and no FLA genes were distributed on chromosomes III and VII (Figure 1A). There were relatively more FLA genes distributed on willow chromosomes VI, XIII, and XIX, with 7, 8, and 6 members (Figure 1B), respectively. No FLA genes were distributed on chromosomes II, III, and V. It was also found that the same number of FLA genes were distributed in the same positions on chromosomes I, X, XI, XII, XIV, and XVII of the two genomes, and it was speculated that they were orthologous genes. We analyzed the orthologous relationship between P. deltoides and S. suchowensis. Thirty-six pairs of orthologous genes were identified (Supplementary Table S5). The synteny analysis showed that 28 pairs of FLA genes were distributed in the same chromosomal position of P. deltoides and S. suchowensis, five pairs of genes were distributed in different positions, and three pairs of genes could not be determined linearly at the time because they were not yet mounted to the chromosome (Figure 2A). Our results demonstrated that the distribution of FLA genes is similar overall on both genomes, although there were large distribution differences on certain chromosomes. For example, compared with the poplar genome, there were no genes on chromosome II in the willow genome (Figure 1B), which might be because the evolution of willow species was faster than that of poplar species, and some genes that were not adapted to the environment were gradually eliminated.
Segmental and tandem duplication are the main drivers of gene family expansion. We found four pairs of tandem duplicated genes (SsuFLA10, SsuFLA11 and SsuFLA12; SsuFLA14 and SsuFLA15; SsuFLA25, SsuFLA26, SsuFLA27, SsuFLA28 and SsuFLA29; and SsuFLA43, SsuFLA44, and SsuFLA45) in S. suchowensis, distributed on chromosomes VI, VII, XIII, and XIX (Figure 1B). No tandem duplicated genes were found in P. deltoides. Twelve segmental duplication events were identified in P. deltoides, involving 23 genes (Figure 2B and Supplemental Table S6). Thirteen segmental duplication events were identified in S. suchowensis, involving 25 genes (Figure 2C and Supplemental Table S6). All gene pairs, apart from SsuFLA43/45 in S. suchowensis, which showed complete sequence identity, had Ka/Ks values less than 1 (Supplemental Table S6), indicating that the FLA gene family underwent purifying selection. A comparison of the gene duplication results between poplar and willow showed that the number of segmental duplication events was similar, but the location of segmental duplication differed greatly, with only six segmental duplications having the same chromosomal location. In addition, tandem duplication of FLA genes was more frequent in willow than poplar. The above results suggest that FLA genes may have undergone six segmental duplications within the common ancestral species of poplar and willow. After divergence of the two species, the average rate of gene replacement was significantly higher in the willow genome than in the poplar genome [27,28], and the rate of evolution was also higher [26,29]. This rapid expansion allowed the preservation of these newly generated species. This may also be why willow trees had more FLA genes than poplars.

2.3. Phylogeny, Conserved Gene Structures, and Protein Motif Analysis of Populus and Salix FLA Genes

Both poplar and willow evolved from the same ancestral species, and during their respective evolution after species divergence, both retained orthologous genes inherited from their ancestors. At the same time, replication occurred in both genomes, generating new paralogous genes. To compare the sequences and structures of these homologous genes, we constructed a phylogenetic tree based on the full-length sequences of the FLA genes. We combined gene structure and conserved sequence prediction to present a comprehensive picture of the homology of FLA genes between P. deltoides and S. suchowensis. The analysis showed that there were 22 pairs of P. deltoides and S. suchowensis FLA genes at the extreme ends of the evolutionary tree branches, including PdeFLA2 and SsuFLA46, PdeFLA35 and SsuFLA27, and PdeFLA18 and SsuFLA15 (Figure 3A). In addition, multiple willow FLA genes were found to cluster with one poplar FLA gene on one evolutionary branch (Figure 3A). For example, SsuFLA43, SsuFLA44, and SsuFLA45 clustered with PdeFLA36 on the same branch, and this, combined with their chromosomal distributions, confirmed that SsuFLA43, SsuFLA44, and SsuFLA45 were duplicates of genes generated by tandem repeats during willow evolution.
Normally, the FLA binding domain shows that introns were lost during plant evolution. Nevertheless, in our analysis, we found that 34.88% (30/86) of P. deltoides and S. suchowensis FLA genes contained one intron, 11.63% (10/86) had two introns, and 2.33% (2/86) had more than two introns, with only 51.16% (44/86) having no introns (Figure 3B). Overall, the intron structure varied widely among the FLA gene family members. However, combined with the phylogenetic tree analysis, our results indicated that genes on the same evolutionary branches of the P. deltoides and S. suchowensis FLA gene families had similar intron patterns. Furthermore, while some of the closest genes showed similar structures, a small subset showed different intron-exon arrangements. For example, SsuFLA6 had no introns, while its nearby homologous gene, PdeFLA7, contained three, even though their evolutionary relationships reached 99% bootstrap values, respectively.
We further predicted the conserved motifs in the P. deltoides and S. suchowensis FLA proteins using the MEME online program to determine the specific regions of the FLA proteins. A comparison of the 20 conserved motifs found in P. deltoides and S. suchowensis (Figure 3C) revealed that the major motifs of the two species were similar, and the composition of the motifs was not significantly different. The lengths of the 20 conserved motifs ranged from 11 to 50 amino acids (Supplementary Table S7), and the number of motifs in each FLA protein ranged from 1 to 11. Most FLA proteins had motifs 1, 2, 3, 4, 8, and 9 (Figure 3C). Motifs 1, 2, 3, and 4 were included within the fasciclin domain, and motifs 8 and 9 were observed in the AGP-like glycosylation regions. However, while motifs 1, 2, 3, and 4 followed the fasciclin domain in the sequence and were particularly conserved in most clades, some motifs were selectively distributed between specific clades of the phylogenetic tree. For example, PdeFLA32 and SsuFLA40 contained only motif 4, and SsuFLA41, SsuFLA30, and PdeFLA33 contained only motif 18. These results suggested that these conserved motifs played key roles in specific or similar functions.

2.4. Phylogenetic Analysis and Functional Prediction of the FLA Gene Family

We have investigated the phylogenetic relationships of the FLA proteins in different plants and constructed a phylogenetic tree including five woody plants (P. deltoides, S. suchowensis, Betula platyphylla, Quercus rubra L., and Cinnamomum kanehirae) and five herbaceous plants (Setaria italica, Zea mays L., Cucumis sativus L., Oryza sativa L., and Arabidopsis thaliana). In all, 277 FLA gene members were classified using the neighbor-joining (NJ) method. The phylogenetic tree showed that the FLA gene family was divided into four evolutionary branches, which were named Group I, II, III, and IV, respectively (Figure 4).
The statistics of the gene members in each group showed that Group I was the largest group, containing 123 genes, while Group IV was the smallest, with 26 genes. The number of genes in each group was counted for each species used in the construction of the evolutionary tree, and it was found that the woody plants had more Group I genes than herbaceous plants. Group I genes made up an average of 50.49% of the FLA genes in woody plants, while the average proportion in herbaceous plants was 33.16% (Supplementary Table S8). Additionally, Group I FLA made up an average of 0.04584% of all genes in the genomes of woody plants, but only 0.02118% in herbaceous plants (Supplementary Table S8). The differences in these proportions between woody and herbaceous plants were found to be highly significant (Supplementary Figure S1). This suggests that among the plants selected for evolutionary analysis, FLA genes in the Group I branch of woody plants underwent a major expansion. It also implies that FLA genes play important roles in secondary growth in woody plants.
Combined with the functional conservation of homologous genes, genes with similar or identical functions were classified into the same groups, providing a reliable reference basis for predicting the functions of related genes in the gene family. To predict the biological functions of the poplar and willow FLA genes, we referred to the functions of FLA genes that have been verified in Arabidopsis. The gene members in Group I belonged to the same subgroup as the AtFLA11 and AtFLA12 genes that are associated with xylem development. Related studies have found a correlation between the abundance of AtFLA11 and AtFLA12 transcripts containing a single FAS domain and the onset of secondary cell wall cellulose synthase expression in Arabidopsis stems [30,31]. In addition, the phenotypes of AtFLA11 mutants showed the presence of a mild collapsed vessel phenotype and reduced stem cellulose content [30,31,32]. These analyses indicated that the FLA members in Group I were associated with secondary wall and cellulose synthesis in the stem. Group II proteins fall into the same subgroup as AtFLA1. Several studies have described a T-DNA insertion mutant in the AtFLA1 gene under standard growth conditions, showing that although the FLA1-1 mutants had no distinct phenotype, the gene may play a role in the ability of the callus to induce regeneration, shown by in vitro shoot induction assays [33]. Therefore, we speculate that the Group II proteins might be involved in the development of buds. AtFLA16 belongs to Group III, and studies on AtFLA16 mutants showed that the deletion of the gene resulted in reduced stem lengths and altered biomechanical properties [22]. We thus speculate that Group III members are responsible for maintaining and regulating stem strength and support. Group IV includes AtFLA3, and overexpression of the gene led to defective elongation of stamen filaments and reduced female fertility resulting in siliques with low seed settings, suggesting that AtFLA3 was involved in microspore development and might affect the pollen lining by participating in cellulose deposition [34]. Therefore, we speculate that the members of Group IV might have functions involved in the development of microspores.
A combination of the phylogenetic analysis and functional predictions of FLA genes strongly suggested that genes in Group I were likely to be associated with cell wall synthesis and secondary growth in plants. In particular, two sets of tandem repeat genes from willow (SsuFLA10, SsuFLA11, and SsuFLA12 and SsuFLA43, SsuFLA44, and SsuFLA45) were included in Group I (Figure 4). In addition, it has been shown that a subset of FLAs containing a single FAS domain contributed to plant stem strength by affecting cellulose deposition. It has also been suggested that this influences the stem elastic modulus by affecting the integrity of the cell wall matrix [20]. We further found that FLA genes in Group I contained only one FAS domain and had undergone duplication. Thus, these analyses supported our hypothesis that Group I genes were involved in plant cellulose and cell wall formation, were significantly amplified in woody plants, and were involved in wood formation. These results suggested that these genes were good candidates for further functional verification and phylogenetic analysis.

2.5. Identification of FLA Candidate Genes Related to Stem Development in Populus and Salix

Wood production relies on the activity of vascular bundle-forming layers. A primary vascular system consists of a discrete set of vascular bundles that contain bundle-forming layers, primary phloem, and primary xylem. The primary vascular bundle originates from proto-formation layer cells on the periphery of the rib region of the shoot apical meristem (SAM). In perennial woody plants, secondary growth arises from the meristem activity of the vascular formation layer, and the bundle-like formation layer located at the center of the primary vascular bundle extends to the interbundled region, tangentially generating an interbundled formation layer to form a vascular formation layer ring [35,36]. The meristem activity of the vascular forming layer then leads to the continuous production of cylindrical secondary vascular tissue (wood). Therefore, we selected tissues with different degrees of lignification in P. deltoides and S. suchowensis stems and analyzed the expression patterns of Group I members in the FLA gene family. Since some genes had high sequence similarity, it was impossible to design specific primers to distinguish them, so universal primers were designed for these genes to analyze their expression patterns. For example, PdeFLA19 and PdeFLA27 were 91.42% similar; PdeFLA35 and PdeFLA38 were 96.09% similar; SsuFLA43, SsuFLA44 and SsuFLA45 were 97.99% alike; and SsuFLA25, SsuFLA26 and SsuFLA28 were 93.38% identical.
In P. deltoides, five parts of the stem, internodes 1-2, 3-4, 5-7, 8-9, and 10-11, were selected from the stem tip downwards and named P_S1, P_S2, P_S3, P_S4, and P_S5, respectively (Figure 5A). Expression patterns were analyzed for members of some FLA gene families in Group I. RT-qPCR analysis showed (Figure 5C) that PdeFLA2/15/19/20/22/27/35/36/38 were expressed significantly in the elongated region of the stem (P_S3) but were significantly reduced in the tip of the stem (P_S1) and the secondary growth region of the stem (P_S4). On the other hand, PdeFLA7/14/37 was expressed mainly in young stems (P_S2) and elongated regions of stems (P_S3). In particular, PdeFLA19/27 had the highest relative expression. These results suggested that PdeFLA19/27 was associated with secondary thickening growth and could thus serve as a major candidate gene for later studies on poplar wood growth and development.
We selected tissues from S. suchowensis, as we did in poplar. They were named S_S1 to S_S5 and were analyzed by RT-qPCR to determine the expression patterns of the Group I FLA family members. The results of the stem cross-sections showed that the willow had a faster lignification rate than the poplar in the same nodal tissues, especially in S4 and S5, where the secondary vascular tissue formed a thicker structure (Figure 5B). The RT-qPCR analysis showed that a total of 17 genes (Figure 5D), all members of the Group I branch in willow, were positively correlated with stem node lignification and had a very low expression in shoot leaves and stem tips (S_S1). The expression of SsuFLA5/6/18/23/27 increased gradually from S_S1 to S_S5, while the expression levels of SsuFLA21/22/25/26/28/29/34/35/43/44/45/46 were highest in S_S4 tissue, but tended to decrease in S_S5 tissue (Figure 5D). Therefore, it was inferred that rapid lignification of willow stems occurred at internodes 8-9 (S_S4). The Group I FLA genes showed greater functional redundancy in the regulation of stem lignification function. Of these, the three tandem repeats of SsuFLA25/26/28 were expressed at high levels and played a dominant role in willow stem lignification. These results suggested that these could be important candidate genes for further investigation of wood development and formation in willow.

2.6. Identification of FLA Candidate Genes Related to Seed Hair Development in Populus and Salix

Seed hairs are composed of flocculent fibers produced by poplar and willow trees to assist in seed dispersal after reaching sexual maturity. They are formed by the projection of placentation epidermal cells and their cell walls are largely composed of cellulose, hemicellulose, and pectin. As in the investigation of the regulatory roles of FLA genes in plant cell walls, we collected tissues from different periods of poplar seed hair development to analyze the expression of the Group I genes. The results showed that poplar seed hairs produced obvious flocculent fibers that elongated rapidly 3 days after pollination (DAP). Furthermore, the length of seed hairs was sufficient to completely wrap the ovules by 5 DAP, indicating that the 3–5 DAP period was associated with the rapid elongation of seed hairs (Figure 6A).
Furthermore, the RT-qPCR analysis showed that PdeFLA2/18/19/27 genes were highly expressed specifically at 3 DAP, which was consistent with the rapid elongation development of seed hairs. The highest relative expression was seen in the PdeFLA19/27 genes (Figure 6C). We therefore speculate that these genes have important regulatory roles in the development of cell walls during the rapid elongation of poplar seed hairs.
In S. suchowensis, we collected pollinated and unpollinated florets in the same catkin to analyze the expression of Group I FLA genes. The results showed that pollinated florets developed numerous seed hairs on 7 DAP, while unpollinated florets did not produce seed hairs (Figure 6B). Furthermore, RT-qPCR analysis showed that 15 of the 17 gene Group I genes were expressed at significantly higher levels in pollinated florets than in unpollinated tissue, except for SsuFLA22/34 (Figure 6D). Of these, the SsuFLA35 gene showed the highest relative expression in pollinated florets, and it is speculated that it may be involved in regulating the development of willow seed hair tissue.

3. Materials and Methods

3.1. Genome-Wide Identification and Sequence Analysis of Populus and Salix FLA Genes

The genome sequences of P. deltoides (PRJNA598948) and S. suchowensis (ASM1755242v1) were downloaded from the NCBI website (https://www.ncbi.nlm.nih.gov/ accessed on 8 March 2021). The fasciclin domain genome sequence (PF02469) was downloaded from the Pfam database (http://pfam.xfam.org/ accessed on 8 March 2021). HMM3.0 software was used to screen proteins containing this domain in the genome sequences of P. deltoides and S. suchowensis [37]. In addition, a local database was constructed from the whole genome sequences of P. deltoides and S. suchowensis, and a BLASTP search was performed using an FLA protein sequence of A. thaliana. The candidate protein sequences identified by the above two methods were combined to remove duplicates, incomplete sequencing, and protein sequences without complete coding frames. Candidate proteins were analyzed using the Pfam (http://pfam.xfam.org/ accessed on 8 March 2021, PF02469) [38] and SMART (http://smart.embl.de/ accessed on 8 March 2021) [39] databases, and protein sequences with incomplete domains were removed.
Prediction of the N-terminal signal peptides was performed on the SignalP 5.0 website (https://services.healthtech.dtu.dk/service.php?SignalP-5.0 accessed on 8 March 2021) [40]. Prediction of the attachment sites of the C-terminal GPI anchor was performed in the BIG-PI database (http://mendel.imp.ac.at/sat/gpi/gpi_server.html accessed on 8 March 2021) [41]. The fasciclin domain, N-terminal signal peptide, and C-terminal GPI anchor attachment site were removed for further screening for AGP-like glycosylation sites. Finally, the sequences containing the fasciclin domain and AGP-like glycosylation regions were considered the P. deltoides and S. suchowensis FLA gene family sequences. ExPASy (https://web.expasy.org/compute_pi accessed on 8 March 2021) was used for the prediction of the pI (isoelectric point) and molecular weight (Mw) [42].
The nucleotide and protein sequences of the identified FLA gene family members were aligned with the whole genome sequences of P. deltoides and S. suchowensis using BLAST, and the corresponding genome sequences, chromosomal positions, and exon/intron distribution patterns of each member were obtained. In addition, the genome sequences of six other species were downloaded from different websites, including Corylus avellana (https://hardwoodgenomics.org/bio_data/4302786?tripal_pane=group_downloads accessed on 14 May 2021), Quercus rubra L. (https://phytozome-next.jgi.doe.gov/info/Qrubra_v2.1 accessed on 14 May 2021), Cinnamomum kanehirae (https://phytozome-next.jgi.doe.gov/info/Ckanehirae_v3 accessed on 14 May 2021), Setaria italica (https://phytozome-next.jgi.doe.gov/info/Sitalica_v2.2. accessed on 14 May 2021), Zea mays L. (http://plants.ensembl.org/Zea_mays/Info/Index accessed on 14 May 2021), and Cucumis sativus L. (http://cucurbitgenomics.org/ftp/genome/cucumber/Chinese_long/v3/ accessed on 14 May 2021), and their respective FLA gene families were identified. Members of the FLA gene family of A. thaliana (21 sequences) [9] and Oryza sativa L. (27 sequences) [3] were also downloaded (Supplementary Sequence S1).

3.2. Chromosomal Distribution of FLA Genes

Using the annotation files of the P. deltoides and S. suchowensis genomes, the chromosome location information of the FLA gene family was extracted, and the length of each chromosome was obtained. MapChart software was used to map the chromosomal locations and relative distances in the FLA genes. Tandem duplications of FLA genes were confirmed based on two criteria: (1) genes distributed in the 100 kb range on chromosomes and (2) similarity of >70% of the sequence alignment of the two genes [43,44]. Synteny analysis between species and gene duplication within species were performed using MCScanX software. Visual mapping was completed using Tbtools and KaKs_Calculator 2.0 was used to calculate synonymous (Ks) and non-synonymous (Ka) substitution of FLA gene pairs.

3.3. Phylogenetic Analysis and Functional Prediction of FLA Genes

Multiple sequence comparisons of FLA full-length protein sequences of P. deltoides and S. suchowensis were performed using ClustalX 2.1 with the default parameters [45]. Neighbor-joining (NJ) phylogenetic trees were constructed using MAGE 11 with parameters set to Poisson correction, pairwise deletion, and bootstrap analysis with 1000 replicates [46]. All FLA protein sequences from P. deltoides, S. suchowensis, C. avellana, Q. rubra L., C. kanehirae, S. italica, Z. mays L., C. sativus L., O. sativa L., and A. thaliana were clustered by MEGA 11 software for multiple sequence alignment and construction of the phylogenetic tree. The phylogenetic tree was visualized using the online program iTOL (https://itol.embl.de/login.cgi accessed on 11 August 2021), and a folded phylogenetic tree consisting of 10 species was constructed [47].

3.4. Analysis of Gene Structure and Protein Motif Identification

The intron and exon positions of the FLA genes were extracted, and the number and length of introns and exons were counted in generic feature format (GFF) files released for the whole genomes of P. deltoides and S. suchowensis. The exon and intron structures were displayed through the online program Gene Structure Display Server (http://gsds.gao-lab.org/index.php accessed on 8 March 2021) [48]. The MEME online program (https://meme-suite.org/meme/tools/meme accessed on 8 March 2021) was used to identify conserved motifs in the FLA proteins. The applied inclusion criteria were: (1) each FLA protein sequence appeared with at least 1 and a maximum of 20 motifs; (2) each motif had a minimum of 2 and a maximum of 250 amino acids in the sequence; and (3) other default parameters were used. The IDs of FLA members were kept the same as previously named.

3.5. Preparation of Plant Materials

Stems with different degrees of lignification were collected. Six-month-old P. deltoides and S. suchowensis were selected at the Plant Growth Laboratory of Nanjing Forestry University (China, Nanjing). Five parts of the stem, internodes 1–2, 3–4, 5–7, 8–9, and 10–11, were selected from the stem tip downwards and named S1, S2, S3, S4, and S5. Tissues at different stages of poplar seed hair development were also collected; all were from female P. deltoides flowering branches. Hydroponics was used to promote flowering. Pollination was performed with the stigma fully expanded, termed 0 days after pollination (0 DAP). Inflorescence ovaries were collected at −1 DAP, 0 DAP, 3 DAP, and 5 DAP in sequence. For the collection of willow seed hair tissue, male and female flowering branches of S. suchowensis were selected to promote flowering using hydroponics. After opening of the female inflorescences, some of the florets were pollinated. Seven days after pollination, pollinated and unpollinated florets in the same inflorescence were collected separately. Three replicates of all samples were collected and were immediately frozen in liquid nitrogen after collection and stored at −80 ℃ for RNA extraction.

3.6. Primer Design, RNA Extraction, and qRT-PCR

The primers used for analyzing the expression of FLA genes in P. deltoides and S. suchowensis were designed using Primer 5.0. The specificity of the primers was verified by agarose gel electrophoresis and the primer sequences are shown in Supplementary Table S10. PtUBQ (Potri.015G013600.1) [49] and ACT7 (SapurV1A.0231s0320.1) [50] were used as internal reference genes for P. deltoides and S. suchowensis, respectively. Total RNA was extracted from the P. deltoides and S. suchowensis tissues using the RNA Easy Fast Plant Tissue Kit (TIANGEN, Nanjing, China). One microgram of RNA from each sample was reverse transcribed to cDNA using the One-Step gDNA Removal and cDNA Synthesis SuperMix (TIANGEN, Nanjing, China). Quantitative RT-PCR was performed on a 7500 Fast Real-Time PCR System (Applied Biosystems, USA). The amplification reaction system was as follows: 1 µL each of forward and reverse primers (10 umol/L), 2 µL cDNA (5-fold dilution), 10 µL PowerUpTM SYBRTM Green Master Mix (Applied Biosystems), and RNase-free water was added to make the solution up to 20 µL. The amplification procedure was as follows: 95 °C for 15 s, 60 °C for 30 s, and 72 °C for 30 s, with 40 cycles. Finally, the relative expression of each FLA gene was calculated using the CT method [51].

4. Conclusions

In this study, we performed a systematic analysis of the FLA gene family in P. deltoides and S. suchowensis, including the identification of the FLA genes and analysis of their structure and chromosomal distribution, as well as the identification of conserved domains and the evolution of the gene family in multiple species. The analysis revealed that FLA genes on the Group I branch of the phylogenetic tree were involved in the regulation of secondary growth processes, such as lignification in poplar and willow, and further showed that these genes had undergone significant expansion in woody plants, suggesting that this group of genes may have been important in the evolution of woody plants from herbaceous plants. Finally, analysis of the expression of the Group I genes in different tissues confirmed that the PdeFLA19/27 gene was highly expressed in poplar, specifically during stem lignification and seed hair development, suggesting its important regulatory role in stem or seed hair development. Furthermore, it was verified that a set of duplicated genes, SsuFLA25/26/28, in willow were important candidates for stem lignification development, and that SsuFLA35 may be involved in the regulation of germplasm hair development. In summary, this study priovides a multi-faceted reference for subsequent studies on the functions of FLA genes in poplar and willow.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms24021481/s1.

Author Contributions

Conceptualization, H.W. (Huaitong Wu) and T.Y.; software, Y.Z. and F.Z.; validation, Y.Z., H.W. (Hui Wang) and Y.C.; data curation, Y.Z. and H.W. (Huaitong Wu); writing—original draft preparation, Y.Z.; writing—review and editing, H.W. (Huaitong Wu); visualization, Y.Z.; supervision, H.W. (Huaitong Wu); project administration, H.W. (Huaitong Wu); funding acquisition, T.Y., Y.C., and H.W(Huaitong Wu). All authors have read and agreed to the published version of the manuscript.

Funding

This study was financially supported by the Key Research and Development Project of Jiangsu Province, China (grant number BE2021366); the Natural Science Foundation of China (grant numbers 32071795); and the Youth Science and Technology Innovation Fund of Nanjing Forestry University (grant number CX2018008).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is contained within the article or Supplementary Material.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kieliszewski, M.J.; Shpak, E. Synthetic genes for the elucidation of glycosylation codes for arabinogalactan-proteins and other hydroxyproline-rich glycoproteins. Cell. Mol. Life Sci. 2001, 58, 1386–1398. [Google Scholar] [CrossRef] [PubMed]
  2. Schultz, C.J.; Rumsewicz, M.P.; Johnson, K.L.; Jones, B.J.; Gaspar, Y.M.; Bacic, A. Using genomic resources to guide research directions. The arabinogalactan protein gene family as a test case. Plant Physiol. 2002, 129, 1448–1463. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Ma, H.; Zhao, J. Genome-wide identification, classification, and expression analysis of the arabinogalactan protein gene family in rice (Oryza sativa L.). J. Exp. Bot. 2010, 61, 2647–2668. [Google Scholar] [CrossRef]
  4. Showalter, A.M.; Keppler, B.; Lichtenberg, J.; Gu, D.; Welch, L.R. A bioinformatics approach to the identification, classification, and analysis of hydroxyproline-rich glycoproteins. Plant Physiol. 2010, 153, 485–513. [Google Scholar] [CrossRef] [Green Version]
  5. Seifert, G.J.; Roberts, K. The biology of arabinogalactan proteins. Annu. Rev. Plant Biol. 2007, 58, 137–161. [Google Scholar] [CrossRef]
  6. Pereira, A.M.; Pereira, L.G.; Coimbra, S. Arabinogalactan proteins: Rising attention from plant biologists. Plant Reprod. 2015, 28, 1–15. [Google Scholar] [CrossRef]
  7. Gaspar, Y.; Johnson, K.L.; McKenna, J.A.; Bacic, A.; Schultz, C.J. The complex structures of arabinogalactan-proteins and the journey towards understanding function. Plant Mol. Biol. 2001, 47, 161–176. [Google Scholar] [CrossRef]
  8. Tan, L.; Leykam, J.F.; Kieliszewski, M.J. Glycosylation motifs that direct arabinogalactan addition to arabinogalactan-proteins. Plant Physiol. 2003, 132, 1362–1369. [Google Scholar] [CrossRef] [Green Version]
  9. Johnson, K.L.; Jones, B.J.; Bacic, A.; Schultz, C.J. The fasciclin-like arabinogalactan proteins of Arabidopsis. A multigene family of putative cell adhesion molecules. Plant Physiol. 2003, 133, 1911–1925. [Google Scholar] [CrossRef] [Green Version]
  10. Meng, J.; Hu, B.; Yi, G.; Li, X.; Chen, H.; Wang, Y.; Yuan, W.; Xing, Y.; Sheng, Q.; Su, Z.; et al. Genome-wide analyses of banana fasciclin-like AGP genes and their differential expression under low-temperature stress in chilling sensitive and tolerant cultivars. Plant Cell Rep. 2020, 39, 693–708. [Google Scholar] [CrossRef]
  11. Jun, L.; Xiaoming, W. Genome-wide identification, classification and expression analysis of genes encoding putative fasciclin-like arabinogalactan proteins in Chinese cabbage (Brassica rapa L.). Mol. Biol. Rep. 2012, 39, 10541–10555. [Google Scholar] [CrossRef] [PubMed]
  12. Guerriero, G.; Mangeot-Peter, L.; Legay, S.; Behr, M.; Lutts, S.; Siddiqui, K.S.; Hausman, J.F. Identification of fasciclin-like arabinogalactan proteins in textile hemp (Cannabis sativa L.): In silico analyses and gene expression patterns in different tissues. BMC Genom. 2017, 18, 1–13. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Hossain, M.; Ahmed, B.; Ullah, M.; Aktar, N.; Haque, M.; Islam, M. Genome-wide identification of fasciclin-like arabinogalactan proteins in jute and their expression pattern during fiber formation. Mol. Biol. Rep. 2020, 47, 7815–7829. [Google Scholar] [CrossRef] [PubMed]
  14. Huang, G.Q.; Xu, W.L.; Gong, S.Y.; Li, B.; Wang, X.L.; Xu, D.; Li, X.B. Characterization of 19 novel cotton FLA genes and their expression profiling in fiber development and in response to phytohormones and salt stress. Physiol. Plant. 2008, 134, 348–359. [Google Scholar] [CrossRef] [PubMed]
  15. MacMillan, C.P.; Taylor, L.; Bi, Y.; Southerton, S.G.; Evans, R.; Spokevicius, A. The fasciclin-like arabinogalactan protein family of Eucalyptus grandis contains members that impact wood biology and biomechanics. New Phytol. 2015, 206, 1314–1327. [Google Scholar] [CrossRef]
  16. Wu, X.; Lai, Y.; Lv, L.; Ji, M.; Han, K.; Yan, D.; Lu, Y.; Peng, J.; Rao, S.; Yan, F.; et al. Fasciclin-like arabinogalactan gene family in Nicotiana benthamiana: Genome-wide identification, classification and expression in response to pathogens. BMC Plant Biol. 2015, 20, 1–15. [Google Scholar] [CrossRef]
  17. Zang, L.; Zheng, T.; Chu, Y.; Ding, C.; Zhang, W.; Huang, Q.; Su, X. Genome-wide analysis of the fasciclin-like arabinogalactan protein gene family reveals differential expression patterns, localization, and salt stress response in Populus. Front Plant Sci. 2015, 6, 1140. [Google Scholar] [CrossRef] [Green Version]
  18. Faik, A.; Abouzouhair, J.; Sarhan, F. Putative fasciclin-like arabinogalactan-proteins (FLA) in wheat (Triticum aestivum) and rice (Oryza sativa): Identification and bioinformatic analyses. Mol. Genet. Genom. 2006, 276, 478–494. [Google Scholar] [CrossRef]
  19. Hozumi, A.; Bera, S.; Fujiwara, D.; Obayashi, T.; Yokoyama, R.; Nishitani, K.; Aoki, K. Arabinogalactan proteins accumulate in the cell walls of searching hyphae of the stem parasitic plants, Cuscuta campestris and Cuscuta japonica. Plant Cell Physiol. 2017, 58, 1868–1877. [Google Scholar] [CrossRef] [Green Version]
  20. MacMillan, C.P.; Mansfield, S.D.; Stachurski, Z.H.; Evans, R.; Southerton, S.G. Fasciclin-like arabinogalactan proteins: Specialization for stem biomechanics and cell wall architecture in Arabidopsis and Eucalyptus. Plant J. 2010, 62, 689–703. [Google Scholar] [CrossRef]
  21. Shi, H.; Kim, Y.; Guo, Y.; Stevenson, B.; Zhu, J.K. The Arabidopsis SOS5 locus encodes a putative cell surface adhesion protein and is required for normal cell expansion. Plant Cell 2003, 15, 19–32. [Google Scholar] [CrossRef] [Green Version]
  22. Liu, E.; MacMillan, C.P.; Shafee, T.; Ma, Y.; Ratcliffe, J.; Van de Meene, A.; Bacic, A.; Humphries, J.; Johnson, K.L. Fasciclin-like arabinogalactan-protein 16 (FLA16) is required for stem development in Arabidopsis. Front. Plant Sci. 2020, 11, 615392. [Google Scholar] [CrossRef]
  23. Wang, H.; Jiang, C.; Wang, C.; Yang, Y.; Yang, L.; Gao, X.; Zhang, H. Antisense expression of the fasciclin-like arabinogalactan protein FLA6 gene in Populus inhibits expression of its homologous genes and alters stem biomechanics and cell wall composition in transgenic trees. J. Exp. Bot. 2015, 66, 1291–1302. [Google Scholar] [CrossRef] [Green Version]
  24. Wang, H.; Jin, Y.; Wang, C.; Li, B.; Jiang, C.; Sun, Z.; Zhang, Z.; Kong, F.; Zhang, H. Fasciclin-like arabinogalactan proteins, PtFLAs, play important roles in GA-mediated tension wood formation in Populus. Sci. Rep. 2017, 7, 6182. [Google Scholar] [CrossRef]
  25. Zhang, L.; Xi, Z.; Wang, M.; Guo, X.; Ma, T. Plastome phylogeny and lineage diversification of Salicaceae with focus on poplars and willows. Ecol. Evol. 2018, 8, 7817–7823. [Google Scholar] [CrossRef] [Green Version]
  26. Hou, J.; Wei, S.; Pan, H.; Zhuge, Q.; Yin, T. Uneven selection pressure accelerating divergence of Populus and Salix. Hortic. Res. 2019, 6, 37. [Google Scholar] [CrossRef] [Green Version]
  27. Gaut, B.; Yang, L.; Takuno, S.; Eguiarte, L.E. The patterns and causes of variation in plant nucleotide substitution rates. Annu. Rev. Ecol. Evol. Syst. 2011, 42, 245–266. [Google Scholar] [CrossRef] [Green Version]
  28. Smith, S.A.; Donoghue, M.J. Rates of molecular evolution are linked to life history in flowering plants. Science 2008, 322, 86–89. [Google Scholar] [CrossRef] [Green Version]
  29. Dai, X.; Hu, Q.; Cai, Q.; Feng, K.; Ye, N.; Tuskan, G.A.; Milne, R.; Chen, Y.; Wan, Z.; Wang, Z.; et al. The willow genome and divergent evolution from poplar after the common genome duplication. Cell Res. 2014, 24, 1274–1277. [Google Scholar] [CrossRef]
  30. Brown, D.M.; Zeef, L.A.; Ellis, J.; Goodacre, R.; Turner, S.R. Identification of novel genes in Arabidopsis involved in secondary cell wall formation using expression profiling and reverse genetics. Plant Cell 2005, 17, 2281–2295. [Google Scholar] [CrossRef]
  31. Persson, S.; Wei, H.; Milne, J.; Page, G.P.; Somerville, C.R. Identification of genes required for cellulose synthesis by regression analysis of public microarray data sets. Proc. Natl. Acad. Sci. USA 2005, 102, 8633–8638. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Ito, S.; Suzuki, Y.; Miyamoto, K.; Ueda, J.; Yamaguchi, I. AtFLA11, a fasciclin-like arabinogalactan-protein, specifically localized in screlenchyma cells. Biosci. Biotechnol. Biochem. 2005, 69, 1963–1969. [Google Scholar] [CrossRef] [PubMed]
  33. Johnson, K.L.; Kibble, N.A.; Bacic, A.; Schultz, C.J. A Fasciclin-Like A rabinogalactan-Protein (FLA) Mutant of Arabidopsis thaliana, fla1, Shows Defects in Shoot Regeneration. PLoS ONE 2011, 6, e25154. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Li, J.; Yu, M.; Geng, L.L.; Zhao, J. The fasciclin-like arabinogalactan protein gene, FLA3, is involved in microspore development of Arabidopsis. Plant J. 2010, 64, 482–497. [Google Scholar] [CrossRef]
  35. Nieminen, K.; Blomster, T.; Helariutta, Y.; Mähönen, A.P. Vascular cambium development. Arab. Book 2015, 2015, 13. [Google Scholar] [CrossRef] [Green Version]
  36. Sehr, E.M.; Agusti, J.; Lehner, R.; Farmer, E.E.; Schwarz, M.; Greb, T. Analysis of secondary growth in the Arabidopsis shoot reveals a positive role of jasmonate signalling in cambium formation. Plant J. 2010, 63, 811–822. [Google Scholar] [CrossRef]
  37. Eddy, S.R. Profile hidden Markov models. Bioinformatics 1998, 14, 755–763. [Google Scholar] [CrossRef] [Green Version]
  38. Finn, R.D.; Mistry, J.; Schuster-Böckler, B.; Griffiths-Jones, S.; Hollich, V.; Lassmann, T.; Moxon, S.; Marshall, M.; Khanna, A.; Durbin, R.; et al. Pfam: Clans, web tools and services. Nucleic Acids Res. 2006, 34, D247–D251. [Google Scholar] [CrossRef] [Green Version]
  39. Letunic, I.; Doerks, T.; Bork, P. SMART 7: Recent updates to the protein domain annotation resource. Nucleic Acids Res. 2012, 40, D302–D305. [Google Scholar] [CrossRef]
  40. Almagro Armenteros, J.J.; Tsirigos, K.D.; Sønderby, C.K.; Petersen, T.N.; Winther, O.; Brunak, S.; Von Heijne, G.; Nielsen, H. SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat. Biotechnol. 2019, 37, 420–423. [Google Scholar] [CrossRef] [Green Version]
  41. Eisenhaber, B.; Wildpaner, M.; Schultz, C.J.; Borner, G.H.; Dupree, P.; Eisenhaber, F. Glycosylphosphatidylinositol lipid anchoring of plant proteins. Sensitive prediction from sequence-and genome-wide studies for Arabidopsis and rice. Plant Physiol. 2003, 133, 1691–1701. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Artimo, P.; Jonnalagedda, M.; Arnold, K.; Baratin, D.; Csardi, G.; De Castro, E.; Duvaud, S.; Flegel, V.; Fortier, A.; Gasteiger, E.; et al. ExPASy: SIB bioinformatics resource portal. Nucleic Acids Res. 2012, 40, W597–W603. [Google Scholar] [CrossRef]
  43. Gu, Z.; Cavalcanti, A.; Chen, F.C.; Bouman, P.; Li, W.H. Extent of gene duplication in the genomes of Drosophila, nematode, and yeast. Mol. Biol. Evol. 2002, 19, 256–262. [Google Scholar] [CrossRef] [Green Version]
  44. Yang, S.; Zhang, X.; Yue, J.X.; Tian, D.; Chen, J.Q. Recent duplications dominate NBS-encoding gene expansion in two woody species. Mol. Genet. Genom. 2008, 280, 187–198. [Google Scholar] [CrossRef]
  45. Thompson, J.D.; Higgins, D.G.; Gibson, T.J. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22, 4673–4680. [Google Scholar] [CrossRef] [Green Version]
  46. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018, 35, 1547. [Google Scholar] [CrossRef]
  47. Subramanian, B.; Gao, S.; Lercher, M.J.; Hu, S.; Chen, W.H. Evolview v3: A webserver for visualization, annotation, and management of phylogenetic trees. Nucleic Acids Res. 2019, 47, W270–W275. [Google Scholar] [CrossRef]
  48. Hu, B.; Jin, J.; Guo, A.Y.; Zhang, H.; Luo, J.; Gao, G. GSDS 2.0: An upgraded gene feature visualization server. Bioinformatics 2015, 31, 1296–1297. [Google Scholar] [CrossRef] [Green Version]
  49. Gutierrez, L.; Mauriat, M.; Guénin, S.; Pelloux, J.; Lefebvre, J.F.; Louvet, R.; Rusterucci, C.; Moritz, T.; Guerineau, F.; Bellini, C.; et al. The lack of a systematic validation of reference genes: A serious pitfall undervalued in reverse transcription-polymerase chain reaction (RT-PCR) analysis in plants. Plant Biotechnol. J. 2008, 6, 609–618. [Google Scholar] [CrossRef] [Green Version]
  50. Li, J.; Jia, H.; Han, X.; Zhang, J.; Sun, P.; Lu, M.; Hu, J. Selection of reliable reference genes for gene expression analysis under abiotic stresses in the desert biomass willow, Salix psammophila. Front. Plant Sci. 2016, 7, 1505. [Google Scholar] [CrossRef]
  51. Livak, K.J.; Schmittgen, T.D. Analysis of relative gene expression data using real-time quantitative PCR and the 2−ΔΔCT method. Methods 2001, 25, 402–408. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Location of FLA genes on chromosomes. Distribution of FLA gene family members on chromosomes in P. deltoides (A) and S. suchowensis (B). Genes are named according to their order on the chromosomes. Identified tandem repeat genes are highlighted using a colored background.
Figure 1. Location of FLA genes on chromosomes. Distribution of FLA gene family members on chromosomes in P. deltoides (A) and S. suchowensis (B). Genes are named according to their order on the chromosomes. Identified tandem repeat genes are highlighted using a colored background.
Ijms 24 01481 g001
Figure 2. Synteny analysis of FLA genes. Segmental duplication of FLA gene pairs in P. deltoides (A) and S. suchowensis (B), with red lines indicating the correspondence of duplicated genes. (C) Gene duplication and synteny relationship of FLA genes between P. deltoides and S. suchowensis. The red lines indicate the syntenic FLA gene pairs, and the light gray indicates collinear blocks.
Figure 2. Synteny analysis of FLA genes. Segmental duplication of FLA gene pairs in P. deltoides (A) and S. suchowensis (B), with red lines indicating the correspondence of duplicated genes. (C) Gene duplication and synteny relationship of FLA genes between P. deltoides and S. suchowensis. The red lines indicate the syntenic FLA gene pairs, and the light gray indicates collinear blocks.
Ijms 24 01481 g002
Figure 3. Phylogenetic relationships, gene structures, and motif compositions of FLA genes in P. deltoides and S. suchowensis. (A) Phylogenetics of FLA genes. (B) Genetic structure of exons and introns in each FLA gene. The yellow boxes represent exons and the single black lines represent introns. (C) The conserved motifs contained in each FLA protein. (D) Continuation of (A). (E) Continuation of (B). (F) Continuation of (C). Each motif is indicated using colored boxes and single black lines indicate non-conserved sequences. The bottom scale allows estimation of the size of each motif.
Figure 3. Phylogenetic relationships, gene structures, and motif compositions of FLA genes in P. deltoides and S. suchowensis. (A) Phylogenetics of FLA genes. (B) Genetic structure of exons and introns in each FLA gene. The yellow boxes represent exons and the single black lines represent introns. (C) The conserved motifs contained in each FLA protein. (D) Continuation of (A). (E) Continuation of (B). (F) Continuation of (C). Each motif is indicated using colored boxes and single black lines indicate non-conserved sequences. The bottom scale allows estimation of the size of each motif.
Ijms 24 01481 g003
Figure 4. The phylogenetic tree of FLA proteins from P. deltoides, S. suchowensis, Corylus avellana L., Quercus rubra L., Cinnamomum kanehirae, Setaria italica, Zea mays L., Cucumis sativus L., Oryza sativa L., and Arabidopsis thaliana. A total of 277 FLA proteins were analyzed using ClustalX 2.1, and the Neighbor-Joining tree was constructed using MEGA 11 software. The bootstrap value was 1000 replicates. The FLA proteins were clustered into four distinct groups (I–IV), represented by different colors.
Figure 4. The phylogenetic tree of FLA proteins from P. deltoides, S. suchowensis, Corylus avellana L., Quercus rubra L., Cinnamomum kanehirae, Setaria italica, Zea mays L., Cucumis sativus L., Oryza sativa L., and Arabidopsis thaliana. A total of 277 FLA proteins were analyzed using ClustalX 2.1, and the Neighbor-Joining tree was constructed using MEGA 11 software. The bootstrap value was 1000 replicates. The FLA proteins were clustered into four distinct groups (I–IV), represented by different colors.
Ijms 24 01481 g004
Figure 5. Expression analysis of Group I members of the FLA gene family in P. deltoides and S. suchowensis at different stages of stem lignification development. (A,B) Observations of P. deltoides and S. suchowensis at stem lignification development sites. The red arrow points to the xylem. Scale bar = 0.5 mm. (C,D) Expression pattern analysis of Group I members in P. deltoides and S. suchowensis. The X-axis represents different tissues and the Y-axis represents relative expression. Values shown are means ± SD of three biological replicates.
Figure 5. Expression analysis of Group I members of the FLA gene family in P. deltoides and S. suchowensis at different stages of stem lignification development. (A,B) Observations of P. deltoides and S. suchowensis at stem lignification development sites. The red arrow points to the xylem. Scale bar = 0.5 mm. (C,D) Expression pattern analysis of Group I members in P. deltoides and S. suchowensis. The X-axis represents different tissues and the Y-axis represents relative expression. Values shown are means ± SD of three biological replicates.
Ijms 24 01481 g005
Figure 6. Analysis of the expression of P. deltoides and S. suchowensis FLA Group I genes involved in seed hair development. (A) Observation of different stages of seed hair development in P. deltoides. Scale bar = 0.5 mm. (B) Observation of seed hair phenotypes of S. suchowensis florets in pollinated versus unpollinated treatments. Scale bar = 0.5 mm. (C,D) Expression patterns of Group I FLA genes in P. deltoides and S. suchowensis (p-values are provided in Supplementary Table S9). The X-axis represents different tissues and the Y-axis represents relative expression. Values shown are means ± SD of three biological replicates (** p  <  0.01, two-sided Student’s t-test).
Figure 6. Analysis of the expression of P. deltoides and S. suchowensis FLA Group I genes involved in seed hair development. (A) Observation of different stages of seed hair development in P. deltoides. Scale bar = 0.5 mm. (B) Observation of seed hair phenotypes of S. suchowensis florets in pollinated versus unpollinated treatments. Scale bar = 0.5 mm. (C,D) Expression patterns of Group I FLA genes in P. deltoides and S. suchowensis (p-values are provided in Supplementary Table S9). The X-axis represents different tissues and the Y-axis represents relative expression. Values shown are means ± SD of three biological replicates (** p  <  0.01, two-sided Student’s t-test).
Ijms 24 01481 g006
Table 1. Putative FLA genes identified in the genomes of P. deltoides and S. suchowensis.
Table 1. Putative FLA genes identified in the genomes of P. deltoides and S. suchowensis.
Gene SymbolGene IDChrStartEndAmino AcidpIMw
PdeFLA1EVM0033614chr12863,9342,864,9082817.7928,804.55
PdeFLA2EVM0013530chr131,953,85931,954,5892135.5822,696.96
PdeFLA3EVM0016507chr137,804,82737,807,4804065.4943,144.42
PdeFLA4EVM0019309chr146,418,16246,419,3553975.0642,295.82
PdeFLA5EVM0012430chr214,552,75514,553,7951766.3818,211.68
PdeFLA6EVM0003247chr214,743,91114,744,7072544.5026,606.07
PdeFLA7EVM0003107chr420,721,63920,727,6893238.5934,386.37
PdeFLA8EVM0000914chr56,121,3736,122,7014426.0347,578.60
PdeFLA9EVM0035476chr610,905,26310,906,7242274.7524,112.12
PdeFLA10EVM0035420chr617,439,68017,441,3424265.2344,980.52
PdeFLA11EVM0030864chr620,717,49720,720,6964666.0750,934.06
PdeFLA12EVM0005454chr626,560,55426,561,8103948.9241,861.21
PdeFLA13EVM0010041chr8598,740601,8784405.9848,356.80
PdeFLA14EVM0021839chr92,003,0762,004,1252638.4428,699.16
PdeFLA15EVM0017387chr92,005,8472,007,2332698.4028,404.59
PdeFLA16EVM0034822chr1021,572,86321,578,2064606.2150,769.91
PdeFLA17EVM0012659chr1111,492,40511,495,5604085.6943,677.05
PdeFLA18EVM0036111chr121,843,8291,848,1432297.7723,800.59
PdeFLA19EVM0016248chr1214,176,83714,177,5592405.3325,352.79
PdeFLA20EVM0035889chr131,002,3731,003,1722557.0227,077.81
PdeFLA21EVM0037601chr1313,975,54913,976,5102416.5525,114.65
PdeFLA22EVM0020019chr1315,988,60216,005,8272696.6428,265.15
PdeFLA23EVM0022991chr1316,027,08116,028,1423539.3239,230.62
PdeFLA24EVM0034823chr145,118,9545,120,7954235.3643,390.45
PdeFLA25EVM0004019chr1412,708,76412,710,7212625.5627,645.30
PdeFLA26EVM0001785chr1413,240,88113,242,1433767.6540,562.30
PdeFLA27EVM0017855chr1513,612,96513,614,2882406.7225,379.90
PdeFLA28EVM0008767chr164,738,8454,742,3704665.9051,122.10
PdeFLA29EVM0000252chr167,252,9817,254,1422395.7325,172.56
PdeFLA30EVM0030626chr1712,475,60512,476,6643394.5037,001.51
PdeFLA31EVM0029903chr1811,808,90111,810,4964275.2944,971.60
PdeFLA32EVM0013339chr191,874,3781,875,22628210.7730,578.54
PdeFLA33EVM0014374chr199,899,0089,899,7662527.7627,151.55
PdeFLA34EVM0017850chr1915,221,09315,221,9922456.7225,421.84
PdeFLA35EVM0023928chr1917,054,29217,069,4772616.5027,443.27
PdeFLA36EVM0017906chr1917,076,48917,081,7362129.6822,669.99
PdeFLA37EVM0003708Contig003071379872116.1822,266.27
PdeFLA38EVM0009936Contig0030714,01615,1792636.5027,647.50
PdeFLA39EVM0004735Contig0034228,12729,7633975.5142,607.59
PdeFLA40EVM0030058Contig02018240831392439.0425,611.18
SsuFLA1EVM0026394chr16,636,8046,639,1784546.2949,805.69
SsuFLA2EVM0024166chr19,407,9539,408,9902405.2925,207.58
SsuFLA3EVM0006591chr125,816,88525,817,3283405.1935,939.01
SsuFLA4EVM0006450chr132,542,61132,543,8134005.4742,028.31
SsuFLA5EVM0021770chr415,902,01515,902,9392428.8926,127.29
SsuFLA6EVM0006635chr415,905,28815,906,1032718.5528,341.48
SsuFLA7EVM0003490chr68,865,5038,866,4582416.5525,146.69
SsuFLA8EVM0000130chr610,047,97810,048,8402577.8527,187.19
SsuFLA9EVM0014792chr612,547,68412,549,0683885.1640,792.62
SsuFLA10EVM0022300chr614,817,54414,820,0394687.7751,646.93
SsuFLA11EVM0039496chr614,859,43314,862,8304446.0648,952.77
SsuFLA12EVM0030612chr614,870,41714,872,8284656.2551,410.46
SsuFLA13EVM0002771chr620,137,15620,138,0042828.9829,097.38
SsuFLA14EVM0038485chr79,014,7919,016,1434505.9548,762.99
SsuFLA15EVM0004828chr79,076,3609,077,7124505.9548,775.05
SsuFLA16EVM0004641chr8467,485470,6094285.7847,346.78
SsuFLA17EVM0030085chr87,324,4777,325,4273164.6834,441.38
SsuFLA18EVM0025887chr91,053,5461,054,7112696.4128,307.37
SsuFLA19EVM0026596chr1016,349,98016,352,8014615.6150,880.71
SsuFLA20EVM0032967chr119,355,6319,358,6613446.1437,231.76
SsuFLA21EVM0003643chr12927,908931,2482756.4128,692.16
SsuFLA22EVM0039047chr129,693,3419,693,3562065.7821,712.59
SsuFLA23EVM0028769chr13792,822793,7822577.7427,166.11
SsuFLA24EVM0029643chr1312,512,02512,512,7432114.7321,708.68
SsuFLA25EVM0007673chr1314,193,25814,194,0712456.5125,665.23
SsuFLA26EVM0039302chr1314,197,60414,198,3982647.8727,766.61
SsuFLA27EVM0003847chr1314,200,80414,202,0102458.6725,785.40
SsuFLA28EVM0021248chr1314,285,37314,286,2762657.8827,869.72
SsuFLA29EVM0029154chr1314,289,87114,290,9232597.0627,479.26
SsuFLA30EVM0019512chr1314,308,00914,309,0703538.9139,166.45
SsuFLA31EVM0039706chr144,440,3094,441,5734115.3742,242.28
SsuFLA32EVM0024403chr1410,667,36110,669,3072604.7327,416.04
SsuFLA33EVM0016971chr1411,035,38011,037,0153917.0941,605.74
SsuFLA34EVM0008734chr15781,936782,9812676.5028,227.71
SsuFLA35EVM0034733chr1511,756,92411,758,5262406.5725,349.91
SsuFLA36EVM0008341chr162,217,7682,218,7453256.5733,166.50
SsuFLA37EVM0021655chr1710,904,74810,905,7943254.3335,075.06
SsuFLA38EVM0000963chr18324,610325,4102665.4527,430.66
SsuFLA39EVM0000740chr189,619,7929,620,5533974.9642,023.18
SsuFLA40EVM0022185chr193,251,0163,251,87928710.8131,346.24
SsuFLA41EVM0019293chr199,639,2749,639,9962408.4125,832.92
SsuFLA42EVM0007092chr1914,655,29614,656,3292436.2725,492.98
SsuFLA43EVM0013439chr1916,018,46416,019,5992636.9727,756.63
SsuFLA44EVM0030289chr1916,037,70416,038,7932636.5827,414.32
SsuFLA45EVM0013597chr1916,041,72016,042,5112636.9727,756.63
SsuFLA46EVM0006563Contig0427921,08121,8022237.0624,277.03
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, Y.; Zhou, F.; Wang, H.; Chen, Y.; Yin, T.; Wu, H. Genome-Wide Comparative Analysis of the Fasciclin-like Arabinogalactan Proteins (FLAs) in Salicacea and Identification of Secondary Tissue Development-Related Genes. Int. J. Mol. Sci. 2023, 24, 1481. https://doi.org/10.3390/ijms24021481

AMA Style

Zhang Y, Zhou F, Wang H, Chen Y, Yin T, Wu H. Genome-Wide Comparative Analysis of the Fasciclin-like Arabinogalactan Proteins (FLAs) in Salicacea and Identification of Secondary Tissue Development-Related Genes. International Journal of Molecular Sciences. 2023; 24(2):1481. https://doi.org/10.3390/ijms24021481

Chicago/Turabian Style

Zhang, Yingying, Fangwei Zhou, Hui Wang, Yingnan Chen, Tongming Yin, and Huaitong Wu. 2023. "Genome-Wide Comparative Analysis of the Fasciclin-like Arabinogalactan Proteins (FLAs) in Salicacea and Identification of Secondary Tissue Development-Related Genes" International Journal of Molecular Sciences 24, no. 2: 1481. https://doi.org/10.3390/ijms24021481

APA Style

Zhang, Y., Zhou, F., Wang, H., Chen, Y., Yin, T., & Wu, H. (2023). Genome-Wide Comparative Analysis of the Fasciclin-like Arabinogalactan Proteins (FLAs) in Salicacea and Identification of Secondary Tissue Development-Related Genes. International Journal of Molecular Sciences, 24(2), 1481. https://doi.org/10.3390/ijms24021481

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop