Next Article in Journal
Trajectories of Kidney Function in Patients with ATTRv Treated with Gene Silencers
Previous Article in Journal
Plastidial Phosphoglucomutase (pPGM) Overexpression Increases the Starch Content of Transgenic Sweet Potato Storage Roots
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Identification and Analysis of DOF Gene Family in Eugenia uniflora L. (Myrtaceae)

by
Edgar Luis Waschburger
1,
Frank Guzman
2 and
Andreia Carina Turchetto-Zolet
1,*
1
Programa de Pós-Graduação em Genética e Biologia Molecular (PPGBM), Departamento de Genética, Instituto de Biociências, UFRGS, Av. Bento Gonçalvez 9500, Porto Alegre 91509-900, Brazil
2
Escuela de Medicina, Universidad Científica del Sur, Lima 15067, Peru
*
Author to whom correspondence should be addressed.
Genes 2022, 13(12), 2235; https://doi.org/10.3390/genes13122235
Submission received: 30 October 2022 / Revised: 16 November 2022 / Accepted: 19 November 2022 / Published: 28 November 2022
(This article belongs to the Section Plant Genetics and Genomics)

Abstract

:
Eugenia uniflora is a Brazilian native plant species with great ecological and economic importance. It is distributed throughout the Atlantic forest, where two distinct populations show local adaptation to the contrasting conditions of restinga and riparian forest. Among various TFs described in plants, the DOF TF family has been reported to affect flowering and vascular development, making them promising candidates for characterization in E. uniflora. In this study, 28 DOF genes were identified by a genome-wide analysis, of which 20 were grouped into 11 MCOGs by Bayesian phylogeny, suggesting a shared functionallity between members. Based on RNA-seq experiments, we have detected eight drought responsive genes, and SNPs identification revealed population unique polymorphisms, implying a role in local adapatation mechanisms. Finally, analysis of conserved motifs through MEME revealed 15 different protein motifs, and a promoter region analysis returned 40 enriched TF binding motifs, both reporting novel biological functions circa the DOF gene family. In general, the DOF family is found to be conserved both in sequence and expression. Furthermore, this study contributes to both DOF literature and the genetic exploration of native species, elucidating their genetic potential and bringing to light new research topics, paving the way to future studies.

1. Introduction

The Atlantic Forest (AF) is a South American biome estimated to host over 15,000 different plant species [1], most of them belonging to the Myrtaceae botanical family [2], which includes the Eugenia, Psidium, Syzigium and Eucalyptus genera. One of its representatives, Eugenia uniflora L., also known as “Brazilian cherry tree”, is a model plant for genetic and biodiversity studies [3,4]. It is a small- to medium-sized woody tree reaching up to 12 m under optimal environmental conditions. Blooming mainly in spring, it gives rise to small, white flowers, which can undergo cross-pollination or self-fertilization, yielding ripe fruits of yellow, red and black colors. The leaves and fruits of E. uniflora are rich in vitamin A, B and C, lycopene, calcium, iron, phosphorus, anthocyanins and flavonols. The extracts of these tissues are widely used in folk medicine and have many biological activities described in the literature [5,6,7,8,9]. Eugenia uniflora presents local adaptation to different ecoregions with contrasting characteristics within the AF (Figure 1C) [10]. It presents itself as a shrub in the Brazilian Restinga (RE), coastal location of sandy and saline terrain, strong insolation and intense gusts of wind (Figure 1A), while growing to be 5 m tall in the Riparian Forest (RF), characterized by closed, humid and tropical vegetation (Figure 1B). This disparity between populations was also confirmed at the genetic level, where individuals from the RE had much smaller genetic variation compared to RF individuals, possibly due to a recent population expansion of individuals adapted to the conditions of the RE [11].
DNA-binding with one finger (DOF) proteins are part of a transcription factor (TF) family particular to the Viridiplantae clade [12]. They have been identified in numerous species, including monocots (rice, maize and sorghum) [13,14,15], eudicots (arabidopsis, cotton and tomato) [16,17,18], ferns (Selaginella moellendorffi) [12], mosses, (Physcomitrella patens) [19], and algae (Chlorella vulgaris and Chlamydomonas reinhardtii [12,20]). These proteins are mainly expressed in vascular tissues, and DOF over-expression studies have reported higher fruit yields, making them compelling candidates for characterization studies [21]. DOF TFs have a highly variable number of representatives, ranging from 1 in C. reinhardtii to more than 100 in polyploid species, such as cotton [12,17]. This diversity is the result of multiple duplication events at the genetic and genomic levels [17]. Sequences range from 200 to 500 amino-acids and are characterized by a C2C2 zinc-finger DNA-binding domain [22], that specifically binds to 5’-(A/T)AAAG-3’ motifs [23] at the gene promoter region, and at least one other domain for protein interaction. They have already been described as regulators of several physiological functions, such as flowering [24], lipid metabolism [20], starch synthesis [25], regulation of development in response to abiotic stresses [26,27], acclimation to cold [28] and others [29,30,31].
As aforementioned, E. uniflora individuals exhibit contrasting phenological characteristics among their populations (RE and RF), some of which are maintained even when greenhouse-grown (e.g., different flowering periods). Thus, the characterization of DOF TFs (described as actuators in flowering and abiotic stresses), and their comparison of different populations is important both from an ecological and economical points of view (conservation, gene flow between populations and, since this species is cultivated, a phenological standardization is essential). The objective of the present study was to carry out a genomic characterization of the DOF gene family in E. uniflora, and as result, we have identified 28 DOF members in E. uniflora’s genome, further classified into 11 Majorly Conserved Orthologous Groups (MCOGs) by a phylogenetic approach. These MCOGs returned new phylogenetic groups not yet described, and brought to light the confounding DOF classification system used in the current literature. As for E. uniflora populations, DOF genes appear to be conserved both in sequence and expression profiles, serving as actuators in local adaptation.

2. Materials and Methods

2.1. Identification of DOF Genes in Eugenia uniflora

To identify DOF genes in E. uniflora, all DOF gene sequences from its closest phylogenetic species—Eucalyptus grandis—were used as query for BLASTp [32] search against the contigs of the E. uniflora genome (SUB10728242). Hits obtained, with an e-value below 1.00 5 , were validated by manual filtering using transcript visualization and ORF prediction with Unipro UGENE [33], ab initio gene prediction with FGENESH [34], and visualization of hits of RNA-seq experiment alignment files with TABLET [35]. All sequences obtained were compared with the databases available at NCBI (ncbi.nlm.nih.gov, accessed on 29 October 2022) and Phytozome (phytozome-next.jgi.doe.gov, accessed on 29 October 2022) by BLASTp to confirm their existence.

2.2. In Silico Predictions

The Expasy server’s ProtParam online tool (web.expasy.org/protparam, accessed on 29 October 2022) was used to compute the chemical characteristics of E. uniflora. The subcellular location prediction was performed on the DeepLoc1.0 server [36]. The gene structure was manually assembled in Adobe Illustrator CC (adobe.com/products/illustrator.html, accessed on 29 October 2022). The MEME tool from the MEME-suite [37] was used to identify conserved protein motifs, 15 motifs between 6 and 100 amino-acids with one or no occurrence were predicted from all sequences present in the phylogenetic analysis.

2.3. Alignment and Phylogeny Reconstruction

Sequences of DOF genes were gathered from 16 species that have already been phylogenetically characterized. Included species in this study are: Arabidopsis thaliana [16], Cajanus cajan [38], Camellia sinensis [39], Citrullus lanatus [40], Cucumis sativus [41], Daucus carota [42], Eucalyptus grandis [43], Manihot esculenta [44], Medicago truncatula [45], Musa acuminata [46], Oryza sativa [13], Populus trichocarpa [47], Selaginella moellendorffii [12], Solanum lycopersicum [18], Vitis vinifera [48] and Zea mays [14]. The alignment used in the phylogeny used the protein sequences of the DOF genes and was generated by the MAFFT [49] tool. Not all sequences were kept for future analysis (Supplementary File S1). MEGAX [50] was used to visualize and remove sequences without the presence of the DOF domain. Only regions referring to the DOF domain and adjacent regions with a possible phylogenetic signal were considered for alignment. Columns with high representation of gaps were excluded from the final alignment. Gaps were converted to missing data. The most suitable evolutionary model for the resulting alignment was predicted by ModelFinder, included in the IQTree [51] package. The phylogenetic tree was constructed by Bayesian analysis in BEAST [52] using 50,000,000 generations, while discarding 10% of the most discrepant trees. The FigTree program (tree.bio.ed.ac.uk/software/figtree, accessed on 29 October 2022) was used to visualize and manipulate the trees.

2.4. Differential Expression Analysis, SNP Indentification and Promoter Motif Enrichment Analysis

The raw transcriptome datasets of E. uniflora leaves were recovered from the NCBI Sequence Read Archive at Bioproject PRJNA784246 (Turchetto-Zolet et al. unpublished). Drought experiment reads of plants in natura, controlled conditions, and under drought treatment were aligned against E. uniflora genome contigs using the STAR software package [53]. SNP identification was performed using Picard (broadinstitute.github.io/picard, accessed on 29 October 2022) followed by the GATK best practice workflow for variant identification (gatk.broadinstitute.org/hc/en-us, accessed on 29 October 2022). Differential expression analysis was performed using DESeq2 package [54] and promoter region analysis was performed on the NewPlace database [55]. Heatmap plots were assembled from Fragments per kilobase per million (FPKM) values—differential expression analysis (Figure 6)—and raw number of motifs found in the 2 kpb upstream region of DOF genes starting from the coding sequence (CDS)—promoter motif analysis (Figure 7). Differentially expressed genes were selected according to their fold-change (FC) values > 2. Only motifs that presented more than 8 copies were included in the promoter analysis heatmap. Both axes were grouped by the minimum distances of their profiles using the UPGMA algorithm. The Python data visualization toolkit Matplotlib [56] and Adobe Illustrator CC illustration were used to generate the figures.

3. Results

3.1. Identification and Classification of E. uniflora DOF Genes

A total of 32 genes were identified in E. uniflora, of which 28 remained after manual filtering steps (Table 1). Proteins cover sizes between 186 and 507 residues (g8059 and 2186), with most proteins being between 250 and 350 residues. The majority of proteins had their isoelectric point (pI) above 7.0, with the exception of 8 proteins (g1532, g2186, g3994, g5186, g8476, g15014, g23438, g28923). All proteins were predicted to be located in the cell nucleus, as to be expected from typical transcription factors.
A total of 647 sequences covering the 16 species studied were recovered from the different databases used in each study, of which 32 sequences were removed for various reasons (Supplementary File S1)—including the lack of the DOF domain—leaving 615 sequences for further analyses. The total size of the alignment was manually curated to include only 61 sites, covering the DOF domain and adjacent regions with a possible phylogenetic signal. The tree generated by Bayesian statistics (Figure 2) returned 12 different MCOGs with high support—Posterior Probability (PP) > 0.94—and representativeness of sequences from different species.
Among the recovered groups, the largest was subdivided into two groups (Group I, representing the Cycling DOF Factors (CDF) subfamily of DOF genes, and II). Group II, even though paraphyletic, was considered to be just one group. The reasoning was based on species representativity (each of its divisions represents only monocot or eudicot species) and a PP equal to 1 on the branch that refers to Groups I and II. Groups VI and XII also presented PP equal to 1. Group III is another group with only monocot species. In general, it was not possible to determine the evolutionary relationships between MCOGs, given by the low support values between the groups, with the exception of Groups I and II. Of the 28 genes of E. uniflora, 20 were grouped into 11 MCOGs (Groups I–II and IV–XII), with 5 sequences present in the CDF group, while 8 were not grouped (g25726, g6872, g5186, g1532, g14258, g32327, g3898 and g16418). Among the sequences present in the conserved groups, g2186, g6740, g15014, g6300, g25319 and g8476 did not have their closest ortholog being an Eucalyptus grandis gene. Regarding the ungrouped sequences, g32327 and g16418 also did not cluster with Eucalyptus grandis.

3.2. Gene Structure and Domain Conservation of E. uniflora DOF Genes

Among the 28 genes identified in E. uniflora, none had more than 1 intron in their gene structure, while 13 had no introns at all (g1433, g4036, g5301, g6740, g8059, g8476, g13301, g14258, g15014, g25339, g25726, g28923, g32327) (Supplementary Figure S1). The CDF subfamily returned the largest intron sizes, with a mean of nearly 1000 nucleotides. Some sequences had exons with only a few dozen nucleotides, mainly in the N-terminal region, with the exception of the g1532 sequence, which had a small exon in its C-terminal region. In general, the different MCOGs found seem to be represented in the genetic structure of their representative sequences.
Of the 15 protein motifs identified in the protein sequences of DOF genes (Figure 3 and Figure 4), sizes vary, from small motifs of 11 amino-acids (motifs 2, 9, 10 and 11) up to motifs with 46 amino-acids (motif 1).
The vast majority of proteins had 3 or 4 motifs found in their sequence, with the exception of CDF proteins (g2186, g10059, g17019, g23438, g32088), which had up to 11 motifs found. Motifs 3, 9, 10 and 11 are typical motifs present in CDF proteins and have already been characterized as responsible for protein-protein interaction with other participants in the flowering regulation process, such as FLAVIN-BINDING KELCH REPEAT F BOX PROTEIN (FKF1), TOPLESS (TPL) and GIGANTEA (GI). Motifs 5, 6, 13 and 14 were also found exclusively in the CDF proteins and in more than one representative, although their functions are unknown. Motif 12 appears to be conserved in sequences belonging to Group XI, also with unknown function. Apart from motif 15, all the other motifs showed considerable conservation of their residues.

3.3. E. uniflora Populations and Their DOF Gene Arsenals

A total of 30 high quality SNPs were identified from expression data of the two populations (Figure 3 and Figure 5), but with only 9 SNPs resulting in non-synonymous mutations. Of the 28 genes, two (g2186 and g15014) had 6 SNPs each, the highest number identified among all sequences. SNPs were identified in four of the five genes representing Group I. Seventeen SNPs are specific to only one of the populations (12 from the RF population and 5 from the RE population). Almost two thirds (18) of the identified SNPs are found in the third position of the codons referring to the reading phase of the DOF genes, all being synonymous mutations, except for the second SNP of the g17019 gene. Of the remaining 13 SNPs, 5 are located in the regions of the motifs found (g21423;4, g6300;1, g2186;6, g17019;1/2). Three SNPs are exclusive to RF individuals (g32327, g21423;4, g15014;1) and 1 to RE individuals (g2186;6).
Regarding the differential expression experiment, of the 28 genes, 4 (g6740, g8476, g23438 and g28923) were not expressed in at least one analyzed condition studied, and 4 genes (g1433, g4036, g8476 and g23438) did not have their expression altered between the two populations or between treatments (Figure 6A). Of the 24 differentially expressed genes (DEGs) in at least one of the conditions, 8 have a clear drought responsive profile, 5 of which are upregulated (g3893, g3994, g5186, g6300, g13301) and 3 are downregulated (g2186, g5301, g25726). Genes g5301, g13301 and g25726 are the three most differentially expressed, with FC values around 4.9, 4.7 and 6.9 respectively. Overall, control conditions (RF C and RE C ) and the in natura RF (RF N ) have little variance amongst each other, also true for both RF and RE drought stress conditions (RF S and RE S ) (Figure 6B). The highest number of DEGs was found when comparing RE S with RF N (19).
Figure 6. Differential gene expression analysis of Eugenia uniflora DOF genes. (A) Each gene phylogenetic groups is indicated on its left. Ungrouped genes are indicated by a crossed-out “O”. Asterisks represent statistically significant FCs, while red lines represent no expression. Each column refers to a sample condition: Riparian Forest in natura (RF N ), RF Control (RF C ), RF Stressed (RF S ), Restinga (RE) N , RE C , and RE S . Each gene has six comparisons on every column, exemplified by the scheme over the FC color gradient. (B) DEGs comparison amongst every condition. Thicker lines represent more DEGs.
Figure 6. Differential gene expression analysis of Eugenia uniflora DOF genes. (A) Each gene phylogenetic groups is indicated on its left. Ungrouped genes are indicated by a crossed-out “O”. Asterisks represent statistically significant FCs, while red lines represent no expression. Each column refers to a sample condition: Riparian Forest in natura (RF N ), RF Control (RF C ), RF Stressed (RF S ), Restinga (RE) N , RE C , and RE S . Each gene has six comparisons on every column, exemplified by the scheme over the FC color gradient. (B) DEGs comparison amongst every condition. Thicker lines represent more DEGs.
Genes 13 02235 g006

3.4. Promoter Region Analysis

All Eugenia uniflora DOF genes had their upstream 2 kbp, starting from the CDS region, used for promoter analysis with the exception of two genes: g32088 and 32327, which were recovered only 1327 and 138 bp respectively due to their close location to the 5’ region of the genome contigs.
A total of 290 unique motifs were identified, of which 40 were left after filtration steps (Figure 7). The number of copies per motif varies from 0 to 55 copies, the case of the CTRMCAMV35S motif. Among the motifs found in the promoter region of DOF genes, sequences related elements of the transcription machinery, such as TATABox and CAATBox, were enriched. Recognition sequences of different transcription factors were also well represented, some already mentioned in both in silico and in vivo studies of the promoter region (WRKY, ABRE, MYB, BELL) [30,41,47,57,58,59], with the vast majority yet to be confirmed (CAMTA3, BBR/BPC, SURE). Among the tissue-specific-expression elements found, primary and secondary root, vascular tissues, leaves, seeds, flowers and pollen are already mentioned in the literature [60], however we also identified a nodule specific motif. As for the biological function, three different motifs for light induction were found (PRECONSCRHSP70A, INRNTPSADB and GT1CONSENSUS). Other motifs include responses to sulfur, calcium, copper, chlorophyll precursors, etiolation, drought, salinity, pathogen induction, and vernalization. Interestingly, the motif that was the most represented over all DOF genes was not related to classical transcriptional machinery, such as CAATBox, CCAATBox or TATABox, but the DOF recognition motif itself.
Figure 7. Motif enrichment analysis of the promoter region of Eugenia uniflora DOF genes. Rows represent DOF genes while columns are motifs found. Each gene has its phylogenetic group represented on its left. Crossed-out “O”s represent ungrouped genes. Both genes and motifs were clustered using their count profiles, denoted by the cladograms on the bottom and right.
Figure 7. Motif enrichment analysis of the promoter region of Eugenia uniflora DOF genes. Rows represent DOF genes while columns are motifs found. Each gene has its phylogenetic group represented on its left. Crossed-out “O”s represent ungrouped genes. Both genes and motifs were clustered using their count profiles, denoted by the cladograms on the bottom and right.
Genes 13 02235 g007

4. Discussion

4.1. Phylogenetic Relationships Reveal DOF Genes Acting in Similar Pathways

Twenty eight DOF genes were identified in E. uniflora with their physicochemical characteristics shown in Table 1. Their genetic structure was reconstructed and, overall, sequences shared a similar structure among orthologous groups identified in the phylogeny (Supplementary Figure S1 and Figure 2). Motifs 1 and 2, referring to the DOF domain, are present in all sequences; why they appear separately is not well understood, but it is a recurrent phenomenon in the prediction of conserved DOF sequence motifs through MEME [61,62]. From the reconstruction of phylogenetic relationships, eight E. uniflora DOF genes were not grouped into orthologous groups. However, the other 20 genes are present in clades with high support (PP > 0.94) and their functions could be inferred (Figure 2). The composition of the other groups vary greatly in the literature. Some are preserved, such as the case of Group VI [16,63], and the vast majority end up being subdivisions of larger groups, such as Groups X, XI (the second largest group) and XII [12,63]. These inconsistencies arise due to many methodological choices (e.g., the inclusion of only a few species, utilization of less refined phylogenetic methods neighbor-joining, and labelling of poorly supported groups, PP < 80), stating a need for more evolutionary focused studies circa the DOF TF family.
Five genes (g2186, g10059, g17019, g23438 and g32088) were grouped together with Group II, referring to the CDF subfamily of DOF genes, which represents the average number of CDF genes found by species, with some exceptions [64]. CDFs are DOF genes that act in the repression of CONSTANS (CO) and FLOWERING LOCUS T (FT) and thus in flowering inhibition. They have also been characterized to respond to abiotic stresses [65]. Their protein interactions and regulation networks, such as ubiquitination by the GI/FKF1 complex, are well explored in the literature [66]. CDF grouped genes of E. uniflora also contain domains 3, 9, 10 and 11, for interaction with FKF1, TPL and GI respectively, with only the function of motif 11 yet to be discovered [64,67,68,69], corroborating with their possible role as flowering regulators (Figure 3 and Figure 4). Interestingly, 3 other motifs with no described function (5, 6 and 13) also appear conserved in E. uniflora CDF genes. The same three motifs are organized in such a way that there are 4 represented combinations among the 5 CDF genes, (No domains, 6, 6 + 13, and 5 + 6 + 13). This may be a diversification of CDF genes to interact with other proteins and act in other signaling pathways that are not flowering, as all have been described as pleiotropic genes in CO and FT inhibition [24]. Group I holds genes orthologous to CDF genes, but with very different functionalities. In past phylogenetic studies, Groups I and II were considered as a single group [13,16], but recently, phylogenies with higher sequence numbers present these groups separately, even if in lower support clades [19,63]. In this study, Group I was found with a PP equal to 1, indicating that these genes, despite having a probable common ancestor with the CDF genes, are not part of the same functional group. The g8059 gene was grouped into Group I, together with sequences from A. thaliana, such as AtDOF1.5 (AT1G29160/COG1) and AtDOF2.3 (AT2G34140/CDF4), the only ones experimentally characterized between eudicots and monocots. COG1 and CDF4 are transcription factors involved in the inhibition of phytochrome response pathways, both phytochrome A and B (phyA and pyhB), and the transduction of these signals to hormonal pathways, such as the synthesis of gibberellic acid (GA), abscisic acid (ABA) and brassionosteroids (BRs) [70,71,72,73]. Motif 9, referring to the TPL corepressor protein binding domain, was also identified as conserved in Group I sequences, probably acting as a key domain for the action of its repressor activities. The presence of this domain in both Group I and Group II genes is further evidence of their likely common ancestry. The most likely hypothesis is the formation of Group I from gene duplications in the ancestors of these sequences and subsequent loss of its C-terminal region by negative or neutral selection, thus maintaining the TPL-binding domain. As with Eucalyptus grandis, M. truncatula and S. moellendorffii, only one sequence was identified in Group I, indicating a convoluted evolutionary history, since other species usually present two, possibly marked by pseudogenization and gene duplication events.
Group III, also reconstituted by other phylogenies (d3) [13], presents only monocot sequences, which may represent an evolutionary novelty exclusive to this clade. Although one M. acuminata sequence is present, this clade might be an exclusivity to Poales, after all, the genes of M. acuminata tend to be grouped in a greater number of sequences. A phylogeny with a greater number of species representing monocots would be ideal for a proper investigation. Only the rice OsDOF18 gene (LOC_Os08g38220) was characterized in this group, acting on the inhibition of genes of the jasmonic acid (JA) biosynthesis pathway and chlorophyll degradation in leaves and on inducing the expression of ammonia transporters in roots [31,74]. Although they seem to be two uncorrelated pathways, studies evaluating soil nitrogen availability and JA biosynthesis found changes in the expression of the same genes that OsDOF18 regulates [75]. On the other hand, Group IV did not present any sequence of monocots, only eudicots and the lycophyte S. moellendorffii, indicating a possible loss of this MCOG in cereals. The gene g6740 was grouped with the sequence DOF5.8 (AT5G66940), characterized by the expression in procambium and provascular cells, related to the presence of auxin in the plant, since DOF5.8 is regulated by MONOPTEROS [76,77]. DOF5.8 also regulates the expression of ANAC069, an auxin-responsive factor responsible for controlling responses to abiotic stresses such as salinity [78]. Group V has two E. uniflora, g4036 and g5031, grouped with two other A. thaliana genes, AtDOF1.7 (AT1G51700/ADOF1) and AtDOF3.1 (AT3G21270/ADOF2). Although there is not much information about the function of these two genes in specific characterization studies, multi omics-data have returned both genes as differentially overexpressed in the overexpression of factors involved in cell division and vascular tissue development (TDIF-PXY) [79]. The same study also detected genes regulated by ADOF1 (LBD4, PHB, PHV and REV) and by ADOF2 (REV) from a Y1H analysis. Apart from the LBD4 gene, the genes regulated by ADOFs belong to the HD Zip III family, responsible for mediating auxin responses, formation of leaf axes and maintenance of the shoot apical meristem. Furthermore, ADOF1 was found to be overexpressed in roots of plant subjected to salinity stress [80]. Peculiarly, another study identified ADOF1 mRNA to be expressed mainly in the aerial tissue, later mobilized to the root system [81].
Group VI has one E. uniflora gene (g13301) and no A. thaliana sequences, the same case for Group VIII with the g25339 gene. The only characterized gene in Group VI is OsDOF28 (LOC_Os03g55610), responsible for root apical meristem cell division by inhibition of OsACS1, responsible for ethylene biosynthesis [26]. In contrast to the promotion of cell division in the root of Group VI, Group VII groups genes g15014 and AtDOF5.4 (AT5G60850), the later functionally characterized as an inhibitor of cell cycle progression in callous cells by repression of expansins, cyclins, kinase-dependent cyclins and xyloglucan endo-transglycosylation enzymes [82]. It has also been shown to be induced by ABA and responsible for inhibiting the growth of secondary roots by regulating RSL280. Group IX has genes g6300 and AtDOF1.4 (AT1G28310), however no sequences are characterized. Another gene related to vascular development is AtDOF5.6 (AT5G62940/HCA2) present in Group X, together with the sequence g3994. The generation of dominant mutants for the HCA2 led to malformation of the interfascicular cambium and periclinal divisions of its cells [83].
Group XI has five A. thaliana genes AtDOF1.8 (AT1G64620), AthDOF2.5 (AT2G46590/DAG2), AthDOF3.7 (AT3G61850/DAG1), AtDOF4.1 (AT4G00940/ITD1) and AtDOF4.6 (AT4G24060) grouped with four genes from E. uniflora (g1433, g2006, g21423 and g25319). DAG1 and DAG2 were characterized in mutants with altered germination times. Mutants for DAG1 show a lower need for incidence of red light for germination, while mutants for DAG2 demonstrate the opposite [84]. Phytochrome Interacting Factor3-Like 5 (PIL5), responsible for inhibiting the germination process, is degraded via phyB-mediated ubiquitination and forwarding to the 26S proteasome. PIL5 induces the expression of members of the DELLA family, such as GA Insensitive (GAI) and Repressor of ga1-3 (RGA), inhibitors of the GA synthesis pathway and essential for the maintenance of endogenous levels of ABA [85,86]. The DAG1 gene has been shown to be related to the expression of PIL5, suggesting an induction of DAG1 by PIL5 [87]. It is known that the interaction between DAG1 and GAI is necessary for the repression of AtGA3ox1, a gene of the GA biosynthesis pathway [88]. DAG1 has also been shown to inhibit the expression of CYP707A2, the ABA catabolic pathway gene, demonstrating similarity with DELLA proteins functions [89]. On the other hand, DAG2 is inhibited by the expression of PIL5 and DAG1. As for the genes regulated by DAG2, RGA appears to be negatively regulated, while the GA biosynthesis genes: GA3ox1 and GA3ox2 are shown to be induced, however it is not known whether this interaction is direct or not [90]. In general, it is proposed that the DAG1 and DAG2 genes act in the inhibition of germination by GA catalysis and inhibition of ABA catalysis. As for the ITD1 gene, its function is not known, but its expression pattern shows localization in the cells of the cortex and endoderm of the root [91]. Furthermore, it has an intercellular traffic domain that overlaps with the DOF domain, allowing its dispersion from the expression region to nearby tissues by diffusion via plasmodesmata, a fundamental characteristic for the performance of some DOF genes involved in the development of vascular tissues [92]. All sequences present in Group XI showed domain 12. The AtDOF1.8 and AtDOF4.6 genes are not characterized. Group XII has two A. thaliana genes: AtDOF1.2 (AT1G21340) and AtDOF3.5 (AT3G52440) together with E. uniflora genes (g8476 and g28923).

4.2. DOF Genes as Drivers for Local Adaptation

Monitoring greenhouse-grown E. uniflora plants, individuals coming from the RE showed a shorter flowering time when compared to individuals coming from the RF. In order to search for genetic variants linked to E. uniflora populations and that are also related to alterations in the functionality of DOF genes, 30 SNPs were identified, of which 2 population specific markers resulted in non-synonymous mutations (Figure 5). The g21423;4 SNP, exclusive to RF individuals, is located in motif 15, a highly variable region between the DOF sequences and most likely does not cause a functional change in the context of the g21423 gene protein, belonging to Group XI of genes related to germination. In similar terms, the g2186;6 SNP, even being present in a CDF protein, occurs in a motif not yet characterized, and in a variable region within this motif. Both g17019 SNPs were obtained within the FKF1 binding domain and correspond to conserved amino-acid changes within the motif, however they are not unique to any population. The SNP g17019;1 was found only in heterozygosity, which may suggest a greater importance of conservation of the first lysine residue of this motif when compared to aspartic acid residues, but even if the binding function to FKF1 is compromised, the overlapping function of CDF genes would probably be enough to cover early flowering phenotypes. At first, the greenhouse observations cannot be explained by genetic variants present in DOF gene sequences. Perhaps a search for variants in regulatory regions such as the promoter will result in findings, but with a lack of motif characterization, further studies become difficult.
In order to understand the possible modulations of DOF genes, a gene expression analysis of plants under drought conditions was performed (Figure 6). DOF genes with the highest differential expression between control and drought-exposed conditions were genes g5301, g13301 and g25726. Gene g25726 was grouped with the Egr.F00039 gene from Eucalyptus grandis (PP = 1), its closest ortholog and without known function. These genes may be a novelty exclusive to the Myrtaceae family, considering that both sequences were not present in conserved clades. The gene g13301, belonging to Group VI, was overexpressed under conditions exposed to drought. Ethylene, even being a phytohormone related to stress signals and orchestration of adaptive responses, is an auxin antagonist and its presence in the root stops the growth of its cells. In the context of physiological responses to drought, a stressful condition leads to a search for water by the roots. Inhibition of ethylene synthesis and promotion of apical root cell division in search for water sources may be one of the reasons for g13301 differential expression, if its function is similar to its ortholog (OsDOF28). The third differentially expressed gene was g5301, representative of Group V. According to the previous gene, cell division of meristematic cells present in the root must be followed by mechanisms for differentiation and maturation, such as regulation of vascular development genes. Interestingly, its g4036 ortholog was not differentially expressed, which may be due to a mechanism analogous to the specificity of the AtDOF1.7 salinity response [80]. Although some sequences were not detected in leaf tissues, in general, CDF genes appear to be repressed in the drought treatment. Other members of conserved groups, such as g25339 (Group VIII), g15014 (Group VII), g8059 (Group I), g6300 (Group IX) and g3994 (Group X) also showed changes in their expression by drought treatment. As for differences between populations in their natural environment, only genes g10059 (Group II) and g13301 (Group IV) showed differential expression. Possibly related to abiotic stresses of the RE environment, but with so many possibilities (salinity, drought, insolation) it is difficult to correlate with a specific response. Individuals from E. uniflora populations do not have alterations in the expression of DOF genes when greenhouse-grown, and based on their overall expression profiles, DOF genes seem to act as conserved tools to combat possible stresses present in the environment, rather than specialized pathways that confer an evolutionary advantage to individuals in a population.
The promoter region analysis (Figure 7) returned a total of 40 motifs described in the literature, with 9 repeats or more. The motif patterns found for each gene were grouped by clustering and the conserved orthologous groups were not recovered, indicating a high diversification of the promoter regions and their binding factors. The clustering of motif profiles returned two main groups: a group of 5 motifs with high representation in all DOF sequences (DOFCOREZM, CACTFPCCA1, GT1CONSENSUS, CAATBOX and ARR1AT), with an average of 20 or more repetitions, and another group representing the other 35 reasons and high variability. Among the most represented motifs, the presence of the DOFCOREZM and ARR1AT motifs indicate a high self-regulation of DOF genes, and a regulation by cytokinin-responsive factors, such as ARR1. The CACTFPCCA1 motif, specific to leaf mesophilic cells, and the GT1CONSENSUS motif, light signal transduction pathway regulation—such as phyA—indicate a wide expression of DOF genes in leaves. Finally, the CAATBOX motif is related to the recruitment of other transcription factors to the promoter region. Motifs for specific factors, such as MYC and WRKY genes, were highly represented, as were root and pollen tissue specific motifs. Some motifs have multiple copies in only a few genes, such as the BBR/BPC binding motif in the g4036 and g6300 gene, as well as the copper-response motifs present in the g25339 and g28923 genes.

5. Conclusions

In conclusion, 28 DOF genes were identified in E. uniflora, of which 20 were grouped into 11 phylogenetically conserved groups with a high probability for conservation of functionality. The eight remaining genes require a more robust phylogenetic analysis for proper grouping, yielding higher branch supports. The addition of more sequences would also be of interest for explaining MCOGs evolutionary relationships, since our results did not manage to do so. The analysis of conserved motifs corroborated with the data presented in the phylogeny, identifying new DOF protein motifs. It would be compelling to functionally characterize these same motifs and genes, possibly revealing new protein interactions and functionalities. With the differential expression analysis, we found eight drought-responsive DOF genes with little expression changes among populations. Hence appearing to be well-preserved tools that E. uniflora populations use to respond to environmental stresses. Future studies utilizing other tissues (e.g., roots and fruits) would also be of interest for providing a full picture of the local adaptation process. Finally, a comprehensive analysis of the promoter region revealed the diversity of regulatory regions of E. uniflora DOF genes, including previously unreported motifs, such as nodule and pollen specific which could be of interest for gene reported and promoter engineering studies. Overall, this work identified new DOF genes and brought to light aspects of their evolutionary history not yet addressed, presenting new avenues of research for the analysis of this family of genes in future analyzes.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes13122235/s1, Figure S1: E. uniflora DOF genes genetic structure.

Author Contributions

Conceptualization, A.C.T.-Z. and E.L.W.; methodology, all authors; software, E.L.W.; validation, all authors; formal analysis, E.L.W. and F.G.; investigation, E.L.W.; resources, A.C.T.-Z. and F.G.; data curation, all authors; writing—original draft preparation, E.L.W.; writing—review and editing, A.C.T.-Z.; visualization, E.L.W.; supervision, A.C.T.-Z.; project administration, A.C.T.-Z. and E.L.W.; funding acquisition, A.C.T.-Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was financially supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq; Grant number: 306202/2016-6 and 308135/2020-2), Coordenação de Aperfeiçoamento de Pessoal de Nível Superior, and Fundação de Amparo à Pesquisa do Estado do Rio Grande do Sul (Grant number: 16/491-9). https://search.crossref.org/funding, any errors may affect your future funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

As of writing this manuscript, both genome and raw transcript data used in this study are under process of submission to NCBI. The genomic contigs are under submission “SUB10728242” and the raw transcript data under bioproject “PRJNA784246”.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AAAmino-acid
AFAtlantic Forest
CDSCoding sequence
DEGDifferentialy expressed genes
DOFDNA-binding with one-finger
CDFCycling-dof-factor
FCFold-change
PPPosterior probability
RERestinga
RFRiparian Forest
TFTranscription Factor
MCOGMajorly Conserved Orthologous Group

References

  1. Scarano, F. Biomas Brasileiros: Retrato de um País Plural; Casa da Palavra: Rio de Janeiro, Brazil, 2012. [Google Scholar]
  2. Lucas, E.J.; Bünger, M.O. Myrtaceae in the Atlantic forest: Their role as a ‘model’ group. Biodivers. Conserv. 2015, 24, 2165–2180. [Google Scholar] [CrossRef]
  3. Anton, D.B.; Guzman, F.L.; Vetö, N.M.; Krause, F.A.; Kulcheski, F.R.; Coelho, A.P.D.; Duarte, G.L.; Margis, R.; Dillenburg, L.R.; Turchetto-Zolet, A.C. Characterization and expression analysis of P5CS (Δ1-pyrroline-5-carboxylate synthase) gene in two distinct populations of the Atlantic Forest native species Eugenia uniflora L. Mol. Biol. Rep. 2020, 47, 1033–1043. [Google Scholar] [CrossRef] [PubMed]
  4. De Souza Neto, J.D.; Dos Santos, E.K.; Lucas, E.; Vetö, N.M.; Barrientos-Diaz, O.; Staggemeier, V.G.; Vasconcelos, T.; Turchetto-Zolet, A.C. Advances and perspectives on the evolutionary history and diversification of Neotropical Myrteae (Myrtaceae). Bot. J. Linn. Soc. 2022, 199, 173–195. [Google Scholar] [CrossRef]
  5. Falcão, T.R.; de Araújo, A.A.; Soares, L.A.L.; de Moraes Ramos, R.T.; Bezerra, I.C.F.; Ferreira, M.R.A.; de Souza Neto, M.A.; Melo, M.C.N.; de Araújo, R.F.; de Aguiar Guerra, A.C.V.; et al. Crude extract and fractions from Eugenia uniflora Linn leaves showed anti-inflammatory, antioxidant, and antibacterial activities. BMC Complement. Altern. Med. 2018, 18, 84. [Google Scholar] [CrossRef] [Green Version]
  6. Silva-Rocha, W.P.; de Brito Lemos, V.L.; Ferreira, M.R.A.; Soares, L.A.L.; Svidzisnki, T.I.E.; Milan, E.P.; Chaves, G.M. Effect of the crude extract of Eugenia uniflora in morphogenesis and secretion of hydrolytic enzymes in Candida albicans from the oral cavity of kidney transplant recipients. BMC Complement. Altern. Med. 2015, 15, 6. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Santos, K.K.A.; Matias, E.F.F.; Tintino, S.R.; Souza, C.E.S.; Braga, M.F.B.M.; Guedes, G.M.M.; Rolón, M.; Vega, C.; de Arias, A.R.; Costa, J.G.M.; et al. Anti-Trypanosoma cruzi and cytotoxic activities of Eugenia uniflora L. Exp. Parasitol. 2012, 131, 130–132. [Google Scholar] [CrossRef] [Green Version]
  8. Rattmann, Y.D.; de Souza, L.M.; Malquevicz-Paiva, S.M.; Dartora, N.; Sassaki, G.L.; Gorin, P.A.J.; Iacomini, M. Analysis of Flavonoids from Eugenia uniflora Leaves and Its Protective Effect against Murine Sepsis. Evid. Based Complement. Altern. Med. 2012, 2012, 623940. [Google Scholar] [CrossRef] [Green Version]
  9. Sobeh, M.; El-Raey, M.; Rezq, S.; Abdelfattah, M.; Petruk, G.; Osman, S.; El-Shazly, A.; El-Beshbishy, H.; Mahmoud, M.; Wink, M. Chemical profiling of secondary metabolites of Eugenia uniflora and their antioxidant, anti-inflammatory, pain killing and anti-diabetic activities: A comprehensive approach. J. Ethnopharmacol. 2019, 240, 111939. [Google Scholar] [CrossRef]
  10. Vetö, N.M.; Postolache, D.; Guzman Escudero, F.L.; Vajana, E.; Burgo Braga, R.; Salgueiro, F.; Margis, R.; Vendramin, G.G.; Turchetto-Zolet, A.C. Population structure and signals of local adaptation in Eugenia uniflora (Myrtaceae), a widely distributed species in the Atlantic Forest. Bot. J. Linn. Soc. 2022, boac012. [Google Scholar] [CrossRef]
  11. Turchetto-Zolet, A.C.; Salgueiro, F.; Turchetto, C.; Cruz, F.; Veto, N.M.; Barros, M.J.F.; Segatto, A.L.A.; Freitas, L.B.; Margis, R. Phylogeography and ecological niche modelling in Eugenia uniflora (Myrtaceae) suggest distinct vegetational responses to climate change between the southern and the northern Atlantic Forest. Bot. J. Linn. Soc. 2016, 182, 670–688. [Google Scholar] [CrossRef]
  12. Moreno-Risueno, M.Á.; Martínez, M.; Vicente-Carbajosa, J.; Carbonero, P. The family of DOF transcription factors: From green unicellular algae to vascular plants. Mol. Genet. Genom. 2007, 277, 379–390. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Lijavetzky, D.; Carbonero, P.; Vicente-Carbajosa, J. Genome-wide comparative phylogenetic analysis of the rice and Arabidopsis Dof gene families. BMC Evol. Biol. 2003, 3, 17. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Chen, Y.; Cao, J. Comparative Analysis of Dof Transcription Factor Family in Maize. Plant Mol. Biol. Report. 2015, 33, 1245–1258. [Google Scholar] [CrossRef]
  15. Kushwaha, H.; Gupta, S.; Singh, V.K.; Rastogi, S.; Yadav, D. Genome wide identification of Dof transcription factor gene family in sorghum and its comparative phylogenetic analysis with rice and Arabidopsis. Mol. Biol. Rep. 2011, 38, 5037–5053. [Google Scholar] [CrossRef]
  16. Yanagisawa, S. The Dof family of plant transcription factors. Trends Plant Sci. 2002, 7, 555–560. [Google Scholar] [CrossRef] [PubMed]
  17. Li, Y.; Liu, Z.; Zhang, K.; Chen, S.; Liu, M.; Zhang, Q. Genome-wide analysis and comparison of the DNA-binding one zinc finger gene family in diploid and tetraploid cotton (Gossypium). PLoS ONE 2020, 15, e0235317. [Google Scholar] [CrossRef]
  18. Cai, X.; Zhang, Y.; Zhang, C.; Zhang, T.; Hu, T.; Ye, J.; Zhang, J.; Wang, T.; Li, H.; Ye, Z. Genome-wide Analysis of Plant-specific Dof Transcription Factor Family in Tomato. J. Integr. Plant Biol. 2013, 55, 552–566. [Google Scholar] [CrossRef] [PubMed]
  19. Shigyo, M.; Tabei, N.; Yoneyama, T.; Yanagisawa, S. Evolutionary Processes During the Formation of the Plant-Specific Dof Transcription Factor Family. Plant Cell Physiol. 2007, 48, 179–185. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Tokunaga, S.; Sanda, S.; Uraguchi, Y.; Nakagawa, S.; Sawayama, S. Overexpression of the DOF-Type Transcription Factor Enhances Lipid Synthesis in Chlorella vulgaris. Appl. Biochem. Biotechnol. 2019, 189, 116–128. [Google Scholar] [CrossRef] [PubMed]
  21. Renau-Morata, B.; Carrillo, L.; Cebolla-Cornejo, J.; Molina, R.V.; Martí, R.; Domínguez-Figueroa, J.; Vicente-Carbajosa, J.; Medina, J.; Nebauer, S.G. The targeted overexpression of SlCDF4 in the fruit enhances tomato size and yield involving gibberellin signalling. Sci. Rep. 2020, 10, 10645. [Google Scholar] [CrossRef] [PubMed]
  22. Umemura, Y.; Ishiduka, T.; Yamamoto, R.; Esaka, M. The Dof domain, a zinc finger DNA-binding domain conserved only in higher plants, truly functions as a Cys2/Cys2 Zn finger domain. Plant J. 2004, 37, 741–749. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Yanagisawa, S.; Schmidt, R.J. Diversity and similarity among recognition sequences of Dof transcription factors. Plant J. 1999, 17, 209–214. [Google Scholar] [CrossRef] [PubMed]
  24. Fornara, F.; Panigrahi, K.C.; Gissot, L.; Sauerbrunn, N.; Rühl, M.; Jarillo, J.A.; Coupland, G. Arabidopsis DOF Transcription Factors Act Redundantly to Reduce CONSTANS Expression and Are Essential for a Photoperiodic Flowering Response. Dev. Cell 2009, 17, 75–86. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Fang, W.; Zhang, Y.; Zhang, W.; Gu, J.; Xiong, F.; An, G.; Wu, Y. Rice transcription factor OsDOF18 enlarges the starch granule size by cytokinin. Curr. Plant Biol. 2022, 31, 100253. [Google Scholar] [CrossRef]
  26. Qin, H.; Wang, J.; Chen, X.; Wang, F.; Peng, P.; Zhou, Y.; Miao, Y.; Zhang, Y.; Gao, Y.; Qi, Y.; et al. Rice OsDOF15 contributes to ethylene-inhibited primary root elongation under salt stress. New Phytol. 2019, 223, 798–813. [Google Scholar] [CrossRef] [PubMed]
  27. Zhang, Z.; Yuan, L.; Liu, X.; Chen, X.; Wang, X. Evolution analysis of Dof transcription factor family and their expression in response to multiple abiotic stresses in Malus domestica. Gene 2018, 639, 137–148. [Google Scholar] [CrossRef]
  28. Chawade, A.; Bräutigam, M.; Lindlöf, A.; Olsson, O.; Olsson, B. Putative cold acclimation pathways in Arabidopsis thaliana identified by a combined analysis of mRNA co-expression patterns, promoter motifs and transcription factors. BMC Genom. 2007, 8, 304. [Google Scholar] [CrossRef] [Green Version]
  29. Cooper, B.; Clarke, J.D.; Budworth, P.; Kreps, J.; Hutchison, D.; Park, S.; Guimil, S.; Dunn, M.; Luginbühl, P.; Ellero, C.; et al. A network of rice genes associated with stress response and seed development. Proc. Natl. Acad. Sci. USA 2003, 100, 4945–4950. [Google Scholar] [CrossRef] [Green Version]
  30. Zou, X.; Neuman, D.; Shen, Q.J. Interactions of two transcriptional repressors and two transcriptional activators in modulating gibberellin signaling in aleurone cells. Plant Physiol. 2008, 148, 176–186. [Google Scholar] [CrossRef] [Green Version]
  31. Shim, Y.; Kang, K.; An, G.; Paek, N.C. Rice DNA-Binding One Zinc Finger 24 (OsDOF24) Delays Leaf Senescence in a Jasmonate-Mediated Pathway. Plant Cell Physiol. 2019, 60, 2065–2076. [Google Scholar] [CrossRef]
  32. Camacho, C.; Coulouris, G.; Avagyan, V.; Ma, N.; Papadopoulos, J.; Bealer, K.; Madden, T.L. BLAST+: Architecture and applications. BMC Bioinform. 2009, 10, 421. [Google Scholar] [CrossRef] [Green Version]
  33. Okonechnikov, K.; Golosova, O.; Fursov, M.; UGENE team. Unipro UGENE: A unified bioinformatics toolkit. Bioinformatics 2012, 28, 1166–1167. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Solovyev, V.; Kosarev, P.; Seledsov, I.; Vorobyev, D. Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome Biol. 2006, 7, S10. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Milne, I.; Stephen, G.; Bayer, M.; Cock, P.J.; Pritchard, L.; Cardle, L.; Shaw, P.D.; Marshall, D. Using Tablet for visual exploration of second-generation sequencing data. Briefings Bioinform. 2013, 14, 193–202. [Google Scholar] [CrossRef]
  36. Almagro Armenteros, J.J.; Sønderby, C.K.; Sønderby, S.K.; Nielsen, H.; Winther, O. DeepLoc: Prediction of protein subcellular localization using deep learning. Bioinformatics 2017, 33, 3387–3395. [Google Scholar] [CrossRef] [PubMed]
  37. Bailey, T.L.; Boden, M.; Buske, F.A.; Frith, M.; Grant, C.E.; Clementi, L.; Ren, J.; Li, W.W.; Noble, W.S. MEME SUITE: Tools for motif discovery and searching. Nucleic Acids Res. 2009, 37, W202–W208. [Google Scholar] [CrossRef] [PubMed]
  38. Malviya, N.; Gupta, S.; Singh, V.K.; Yadav, M.K.; Bisht, N.C.; Sarangi, B.K.; Yadav, D. Genome wide in silico characterization of Dof gene families of pigeonpea (Cajanus cajan (L) Millsp.). Mol. Biol. Rep. 2015, 42, 535–552. [Google Scholar] [CrossRef] [PubMed]
  39. Yu, Q.; Li, C.; Zhang, J.; Tian, Y.; Wang, H.; Zhang, Y.; Zhang, Z.; Xiang, Q.; Han, X.; Zhang, L. Genome-wide identification and expression analysis of the Dof gene family under drought stress in tea (Camellia sinensis). PeerJ 2020, 8, e9269. [Google Scholar] [CrossRef]
  40. Zhou, Y.; Cheng, Y.; Wan, C.; Li, J.; Yang, Y.; Chen, J. Genome-wide characterization and expression analysis of the Dof gene family related to abiotic stress in watermelon. PeerJ 2020, 8, e8358. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. Wen, C.l.; Cheng, Q.; Zhao, L.; Mao, A.; Yang, J.; Yu, S.; Weng, Y.; Xu, Y. Identification and characterisation of Dof transcription factors in the cucumber genome. Sci. Rep. 2016, 6, 23072. [Google Scholar] [CrossRef] [PubMed]
  42. Huang, W.; Huang, Y.; Li, M.y.; Wang, F.; Xu, Z.s.; Xiong, A.s. Dof transcription factors in carrot: Genome-wide analysis and their response to abiotic stress. Biotechnol. Lett. 2016, 38, 145–155. [Google Scholar] [CrossRef] [PubMed]
  43. d’Almeida, G.S.; Breton, M.; Camargo, S.; Frazzon, J.; Pasquali, G. Phylogenetic comparative and expression analysis of genes encoding dof transcription factors from Eucalyptus grandis. BMC Proc. 2011, 5, 1–2. [Google Scholar] [CrossRef] [Green Version]
  44. Zou, Z.; Zhu, J.; Zhang, X. Genome-wide identification and characterization of the Dof gene family in cassava (Manihot esculenta). Gene 2019, 687, 298–307. [Google Scholar] [CrossRef] [PubMed]
  45. Shu, Y.J.; Song, L.L.; Zhang, J.; Liu, Y.; Guo, C.H. Genome-wide identification and characterization of the Dof gene family in Medicago truncatula. Genet. Mol. Res. 2015, 14, 10645–10657. [Google Scholar] [CrossRef]
  46. Dong, C.; Hu, H.; Xie, J. Genome-wide analysis of the DNA-binding with one zinc finger (Dof) transcription factor family in bananas. Genome 2016, 59, 1085–1100. [Google Scholar] [CrossRef] [PubMed]
  47. Wang, H.; Zhao, S.; Gao, Y.; Yang, J. Characterization of Dof Transcription Factors and Their Responses to Osmotic Stress in Poplar (Populus trichocarpa). PLoS ONE 2017, 12, e0170210. [Google Scholar] [CrossRef] [Green Version]
  48. da Silva, D.C.; da Silveira Falavigna, V.; Fasoli, M.; Buffon, V.; Porto, D.D.; Pappas, G.J.; Pezzotti, M.; Pasquali, G.; Revers, L.F. Transcriptome analyses of the Dof-like gene family in grapevine reveal its involvement in berry, flower and seed development. Hortic. Res. 2016, 3, 16042. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. Katoh, K.; Rozewicki, J.; Yamada, K.D. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 2019, 20, 1160–1166. [Google Scholar] [CrossRef] [Green Version]
  50. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef] [PubMed]
  51. Kalyaanamoorthy, S.; Minh, B.Q.; Wong, T.K.F.; von Haeseler, A.; Jermiin, L.S. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods 2017, 14, 587–589. [Google Scholar] [CrossRef] [PubMed]
  52. Suchard, M.A.; Lemey, P.; Baele, G.; Ayres, D.L.; Drummond, A.J.; Rambaut, A. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 2018, 4, vey016. [Google Scholar] [CrossRef] [Green Version]
  53. Dobin, A.; Davis, C.A.; Schlesinger, F.; Drenkow, J.; Zaleski, C.; Jha, S.; Batut, P.; Chaisson, M.; Gingeras, T.R. STAR: Ultrafast universal RNA-seq aligner. Bioinformatics 2013, 29, 15–21. [Google Scholar] [CrossRef]
  54. Love, M.I.; Huber, W.; Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014, 15, 550. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Higo, K.; Ugawa, Y.; Iwamoto, M.; Korenaga, T. Plant cis-acting regulatory DNA elements (PLACE) database: 1999. Nucleic Acids Res. 1999, 27, 297–300. [Google Scholar] [CrossRef] [Green Version]
  56. Hunter, J.D. Matplotlib: A 2D Graphics Environment. Comput. Sci. Eng. 2007, 9, 90–95. [Google Scholar] [CrossRef]
  57. Deeba, F.; Sultana, T.; Mahmood, T.; O’Shea, C.; Skriver, K.; Naqvi, S.M.S. Involvement of WRKY, MYB and DOF DNA-binding proteins in interaction with a rice germin-like protein gene promoter. Acta Physiol. Plant. 2017, 39, 189. [Google Scholar] [CrossRef]
  58. Lorrai, R.; Gandolfi, F.; Boccaccini, A.; Ruta, V.; Possenti, M.; Tramontano, A.; Costantino, P.; Lepore, R.; Vittorioso, P. Genome-wide RNA-seq analysis indicates that the DAG1 transcription factor promotes hypocotyl elongation acting on ABA, ethylene and auxin signaling. Sci. Rep. 2018, 8, 15895. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  59. Kondhare, K.R.; Vetal, P.V.; Kalsi, H.S.; Banerjee, A.K. BEL1-like protein (StBEL5) regulates CYCLING DOF FACTOR1 (StCDF1) through tandem TGAC core motifs in potato. J. Plant Physiol. 2019, 241, 153014. [Google Scholar] [CrossRef] [PubMed]
  60. Gupta, S.; Malviya, N.; Kushwaha, H.; Nasim, J.; Bisht, N.C.; Singh, V.K.; Yadav, D. Insights into structural and functional diversity of Dof (DNA binding with one finger) transcription factor. Planta 2015, 241, 549–562. [Google Scholar] [CrossRef] [PubMed]
  61. Yu, H.; Ma, Y.; Lu, Y.; Yue, J.; Ming, R. Expression profiling of the Dof gene family under abiotic stresses in spinach. Sci. Rep. 2021, 11, 14429. [Google Scholar] [CrossRef]
  62. Hamdi, J.; Kmeli, N.; Bouktila, D. Genome-wide survey of sugar beet (Beta vulgaris subsp. vulgaris) Dof transcription factors reveals structural diversity, evolutionary expansion and involvement in taproot development and biotic stress response. Biologia 2021, 76, 2421–2436. [Google Scholar] [CrossRef]
  63. Noguero, M.; Atif, R.M.; Ochatt, S.; Thompson, R.D. The role of the DNA-binding One Zinc Finger (DOF) transcription factor family in plants. Plant Sci. 2013, 209, 32–45. [Google Scholar] [CrossRef]
  64. Renau-Morata, B.; Carrillo, L.; Dominguez-Figueroa, J.; Vicente-Carbajosa, J.; Molina, R.V.; Nebauer, S.G.; Medina, J. CDF transcription factors: Plant regulators to deal with extreme environmental conditions. J. Exp. Bot. 2020, 71, 3803–3815. [Google Scholar] [CrossRef]
  65. Corrales, A.R.; Nebauer, S.G.; Carrillo, L.; Fernández-Nohales, P.; Marqués, J.; Renau-Morata, B.; Granell, A.; Pollmann, S.; Vicente-Carbajosa, J.; Molina, R.V.; et al. Characterization of tomato Cycling Dof Factors reveals conserved and new functions in the control of flowering time and abiotic stress responses. J. Exp. Bot. 2014, 65, 995–1012. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  66. Seaton, D.D.; Smith, R.W.; Song, Y.H.; MacGregor, D.R.; Stewart, K.; Steel, G.; Foreman, J.; Penfield, S.; Imaizumi, T.; Millar, A.J.; et al. Linked circadian outputs control elongation growth and flowering in response to photoperiod and temperature. Mol. Syst. Biol. 2015, 11, 776. [Google Scholar] [CrossRef] [PubMed]
  67. Imaizumi, T.; Schultz, T.F.; Harmon, F.G.; Ho, L.A.; Kay, S.A. FKF1 F-Box Protein Mediates Cyclic Degradation of a Repressor of CONSTANS in Arabidopsis. Science 2005, 309, 293–297. [Google Scholar] [CrossRef] [PubMed]
  68. Goralogia, G.S.; Liu, T.K.; Zhao, L.; Panipinto, P.M.; Groover, E.D.; Bains, Y.S.; Imaizumi, T. CYCLING DOF FACTOR 1 represses transcription through the TOPLESS co-repressor to control photoperiodic flowering in Arabidopsis. Plant J. 2017, 92, 244–262. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  69. Kloosterman, B.; Abelenda, J.A.; Gomez, M.d.M.C.; Oortwijn, M.; de Boer, J.M.; Kowitwanich, K.; Horvath, B.M.; van Eck, H.J.; Smaczniak, C.; Prat, S.; et al. Naturally occurring allele diversity allows potato cultivation in northern latitudes. Nature 2013, 495, 246–250. [Google Scholar] [CrossRef] [PubMed]
  70. Bueso, E.; Muñoz-Bertomeu, J.; Campos, F.; Martínez, C.; Tello, C.; Martínez-Almonacid, I.; Ballester, P.; Simón-Moya, M.; Brunaud, V.; Yenush, L.; et al. Arabidopsis COGWHEEL1 links light perception and gibberellins with seed tolerance to deterioration. Plant J. 2016, 87, 583–596. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  71. Park, D.H.; Lim, P.O.; Kim, J.S.; Cho, D.S.; Hong, S.H.; Nam, H.G. The Arabidopsis COG1 gene encodes a Dof domain transcription factor and negatively regulates phytochrome signaling. Plant J. 2003, 34, 161–171. [Google Scholar] [CrossRef] [PubMed]
  72. Xu, P.; Chen, H.; Cai, W. Transcription factor CDF4 promotes leaf senescence and floral organ abscission by regulating abscisic acid and reactive oxygen species pathways in Arabidopsis. EMBO Rep. 2020, 21, e48967. [Google Scholar] [CrossRef] [PubMed]
  73. Wei, Z.; Yuan, T.; Tarkowská, D.; Kim, J.; Nam, H.G.; Novák, O.; He, K.; Gou, X.; Li, J. Brassinosteroid Biosynthesis Is Modulated via a Transcription Factor Cascade of COG1, PIF4, and PIF5. Plant Physiol. 2017, 174, 1260–1273. [Google Scholar] [CrossRef] [Green Version]
  74. Wu, Q.; Liu, X.; Yin, D.; Yuan, H.; Xie, Q.; Zhao, X.; Li, X.; Zhu, L.; Li, S.; Li, D. Constitutive expression of OsDof4, encoding a C2-C2 zinc finger transcription factor, confesses its distinct flowering effects under long- and short-day photoperiods in rice (Oryza sativa L.). BMC Plant Biol. 2017, 17, 166. [Google Scholar] [CrossRef] [Green Version]
  75. Deepika; Singh, A. Expression dynamics indicate the role of Jasmonic acid biosynthesis pathway in regulating macronutrient (N, P and K+) deficiency tolerance in rice (Oryza sativa L.). Plant Cell Rep. 2021, 40, 1495–1512. [Google Scholar] [CrossRef] [PubMed]
  76. Konishi, M.; Yanagisawa, S. Transcriptional repression caused by Dof5.8 is involved in proper vein network formation in Arabidopsis thaliana leaves. J. Plant Res. 2015, 128, 643–652. [Google Scholar] [CrossRef] [PubMed]
  77. Konishi, M.; Donner, T.J.; Scarpella, E.; Yanagisawa, S. MONOPTEROS directly activates the auxin-inducible promoter of the Dof5.8 transcription factor gene in Arabidopsis thaliana leaf provascular cells. J. Exp. Bot. 2015, 66, 283–291. [Google Scholar] [CrossRef] [Green Version]
  78. He, L.; Su, C.; Wang, Y.; Wei, Z. ATDOF5.8 protein is the upstream regulator of ANAC069 and is responsive to abiotic stress. Biochimie 2015, 110, 17–24. [Google Scholar] [CrossRef] [PubMed]
  79. Smit, M.E.; McGregor, S.R.; Sun, H.; Gough, C.; Bågman, A.M.; Soyars, C.L.; Kroon, J.T.; Gaudinier, A.; Williams, C.J.; Yang, X.; et al. A PXY-Mediated Transcriptional Network Integrates Signaling Mechanisms to Control Vascular Development in Arabidopsis. Plant Cell 2020, 32, 319–335. [Google Scholar] [CrossRef] [Green Version]
  80. Ma, S.; Gong, Q.; Bohnert, H.J. Dissecting salt stress pathways. J. Exp. Bot. 2006, 57, 1097–1107. [Google Scholar] [CrossRef] [Green Version]
  81. Thieme, C.J.; Rojas-Triana, M.; Stecyk, E.; Schudoma, C.; Zhang, W.; Yang, L.; Miñambres, M.; Walther, D.; Schulze, W.X.; Paz-Ares, J.; et al. Endogenous Arabidopsis messenger RNAs transported to distant tissues. Nat. Plants 2015, 1, 15025. [Google Scholar] [CrossRef]
  82. Xu, P.; Chen, H.; Ying, L.; Cai, W. AtDOF5.4/OBP4, a DOF Transcription Factor Gene that Negatively Regulates Cell Cycle Progression and Cell Expansion in Arabidopsis thaliana. Sci. Rep. 2016, 6, 27705. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  83. Guo, Y.; Qin, G.; Gu, H.; Qu, L.J. Dof5.6/HCA2, a Dof Transcription Factor Gene, Regulates Interfascicular Cambium Formation and Vascular Tissue Development in Arabidopsis. Plant Cell 2009, 21, 3518–3534. [Google Scholar] [CrossRef] [Green Version]
  84. Gualberti, G.; Papi, M.; Bellucci, L.; Ricci, I.; Bouchez, D.; Camilleri, C.; Costantino, P.; Vittorioso, P. Mutations in the Dof Zinc Finger Genes DAG2 and DAG1 Influence with Opposite Effects the Germination of Arabidopsis Seeds. Plant Cell 2002, 14, 1253–1263. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  85. Oh, E.; Yamaguchi, S.; Hu, J.; Yusuke, J.; Jung, B.; Paik, I.; Lee, H.S.; Sun, T.P.; Kamiya, Y.; Choi, G. PIL5, a Phytochrome-Interacting bHLH Protein, Regulates Gibberellin Responsiveness by Binding Directly to the GAI and RGA Promoters in Arabidopsis Seeds. Plant Cell 2007, 19, 1192–1208. [Google Scholar] [CrossRef] [Green Version]
  86. Piskurewicz, U.; Turečková, V.; Lacombe, E.; Lopez-Molina, L. Far-red light inhibits germination through DELLA-dependent stimulation of ABA synthesis and ABI3 activity. EMBO J. 2009, 28, 2259–2271. [Google Scholar] [CrossRef] [Green Version]
  87. Gabriele, S.; Rizza, A.; Martone, J.; Circelli, P.; Costantino, P.; Vittorioso, P. The Dof protein DAG1 mediates PIL5 activity on seed germination by negatively regulating GA biosynthetic gene AtGA3ox1: DAG1 represses seed germination via PIL5 signalling. Plant J. 2009, 61, 312–323. [Google Scholar] [CrossRef]
  88. Boccaccini, A.; Santopolo, S.; Capauto, D.; Lorrai, R.; Minutello, E.; Serino, G.; Costantino, P.; Vittorioso, P. The DOF protein DAG1 and the DELLA protein GAI cooperate in negatively regulating the AtGA3ox1 gene. Mol. Plant 2014, 7, 1486–1489. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  89. Boccaccini, A.; Lorrai, R.; Ruta, V.; Frey, A.; Mercey-Boutet, S.; Marion-Poll, A.; Tarkowská, D.; Strnad, M.; Costantino, P.; Vittorioso, P. The DAG1 transcription factor negatively regulates the seed-to-seedling transition in Arabidopsis acting on ABA and GA levels. BMC Plant Biol. 2016, 16, 198. [Google Scholar] [CrossRef] [Green Version]
  90. Santopolo, S.; Boccaccini, A.; Lorrai, R.; Ruta, V.; Capauto, D.; Minutello, E.; Serino, G.; Costantino, P.; Vittorioso, P. DOF AFFECTING GERMINATION 2 is a positive regulator of light-mediated seed germination and is repressed by DOF AFFECTING GERMINATION 1. BMC Plant Biol. 2015, 15, 72. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  91. Chen, H.; Ahmad, M.; Rim, Y.; Lucas, W.J.; Kim, J. Evolutionary and molecular analysis of Dof transcription factors identified a conserved motif for intercellular protein trafficking. New Phytol. 2013, 198, 1250–1260. [Google Scholar] [CrossRef]
  92. Miyashima, S.; Roszak, P.; Sevilem, I.; Toyokura, K.; Blob, B.; Heo, J.o.; Mellor, N.; Help-Rinta-Rahko, H.; Otero, S.; Smet, W.; et al. Mobile PEAR transcription factors integrate positional cues to prime cambial growth. Nature 2019, 565, 490–494. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Eugenia uniflora individuals along the Atlantic Forest. (A) Restinga. (B) Riparian Forest. (C) Atlantic Forest dispersion along the Brazilian coast. Sampling points are depicted by colored dots.
Figure 1. Eugenia uniflora individuals along the Atlantic Forest. (A) Restinga. (B) Riparian Forest. (C) Atlantic Forest dispersion along the Brazilian coast. Sampling points are depicted by colored dots.
Genes 13 02235 g001
Figure 2. Phylogenetic tree of 615 DOF amino-acid sequences. Conserved clades with species representativity are colored, together with their branches. Outer circle colors denote sequences derived from monocots (dark grey), eudicots (light grey), and Selaginella moellendorffii (white). Black dots indicate Eugenia uniflora sequences.
Figure 2. Phylogenetic tree of 615 DOF amino-acid sequences. Conserved clades with species representativity are colored, together with their branches. Outer circle colors denote sequences derived from monocots (dark grey), eudicots (light grey), and Selaginella moellendorffii (white). Black dots indicate Eugenia uniflora sequences.
Genes 13 02235 g002
Figure 3. Conserved protein motifs and SNPs from Eugenia uniflora DOF genes. The phylogenetic relationships between DOF genes are shown on the left. Colored boxes represent protein motifs. Black lines represent protein sequences without annotated motifs. Triangles indicate identified SNPs locations. On the right, a comparison of SNPs between the two populations is shown. Filled slices represent homozigote polymorphisms and outlined slices heterozigotes ones.
Figure 3. Conserved protein motifs and SNPs from Eugenia uniflora DOF genes. The phylogenetic relationships between DOF genes are shown on the left. Colored boxes represent protein motifs. Black lines represent protein sequences without annotated motifs. Triangles indicate identified SNPs locations. On the right, a comparison of SNPs between the two populations is shown. Filled slices represent homozigote polymorphisms and outlined slices heterozigotes ones.
Genes 13 02235 g003
Figure 4. Conserved protein motifs identified in Eugenia uniflora DOF genes. Motifs are ordered based on false discovery rate, from lowest to highest.
Figure 4. Conserved protein motifs identified in Eugenia uniflora DOF genes. Motifs are ordered based on false discovery rate, from lowest to highest.
Genes 13 02235 g004
Figure 5. Characteristics of the 30 identified DOF SNPs in Eugenia uniflora. For every SNP, the gene, CDS mutation position, nucleotide mutation, codon position, and amino-acid changes are shown. The last four columns indicate, in order, a non-synonymous mutation ( Δ ), a mutation inside a motif (D), RE specificity, and RF specificity.
Figure 5. Characteristics of the 30 identified DOF SNPs in Eugenia uniflora. For every SNP, the gene, CDS mutation position, nucleotide mutation, codon position, and amino-acid changes are shown. The last four columns indicate, in order, a non-synonymous mutation ( Δ ), a mutation inside a motif (D), RE specificity, and RF specificity.
Genes 13 02235 g005
Table 1. Physicochemical properties and predicted subcellular location of Eugenia uniflora DOF proteins. Protein size is in amino-acid residues (Aa), molecular weight in kilo Daltons (kDa), and isoelectric point (pI) in pH.
Table 1. Physicochemical properties and predicted subcellular location of Eugenia uniflora DOF proteins. Protein size is in amino-acid residues (Aa), molecular weight in kilo Daltons (kDa), and isoelectric point (pI) in pH.
ProteinLenght (Aa)Weight (kDa)pISubcellular Location
g143327629,8228.85Nucleus
g153233136,4456.52Nucleus
g200630733,0478.79Nucleus
g218650755,0496.10Nucleus
g389833235,5719.19Nucleus
g399434036,6876.72Nucleus
g403619321,3087.55Nucleus
g518630632,9716.74Nucleus
g530124326,1118.54Nucleus
g630038141,2568.70Nucleus
g674025926,0478.45Nucleus
g687226828,3319.38Nucleus
g805918620,7279.62Nucleus
g847629531,8755.87Nucleus
g1005949152,6228.37Nucleus
g1330136038,3709.07Nucleus
g1425837038,9398.82Nucleus
g1501432534,7056.51Nucleus
g1641836238,3999.59Nucleus
g1701945849,6386.87Nucleus
g2142325827,4089.55Nucleus
g2343846750,3785.23Nucleus
g2531930832,8588.50Nucleus
g2533930333,5788.45Nucleus
g2572625325,7829.13Nucleus
g2892326929,0615.05Nucleus
g3208844748,0748.89Nucleus
g3232729131,4519.76Nucleus
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Waschburger, E.L.; Guzman, F.; Turchetto-Zolet, A.C. Genome-Wide Identification and Analysis of DOF Gene Family in Eugenia uniflora L. (Myrtaceae). Genes 2022, 13, 2235. https://doi.org/10.3390/genes13122235

AMA Style

Waschburger EL, Guzman F, Turchetto-Zolet AC. Genome-Wide Identification and Analysis of DOF Gene Family in Eugenia uniflora L. (Myrtaceae). Genes. 2022; 13(12):2235. https://doi.org/10.3390/genes13122235

Chicago/Turabian Style

Waschburger, Edgar Luis, Frank Guzman, and Andreia Carina Turchetto-Zolet. 2022. "Genome-Wide Identification and Analysis of DOF Gene Family in Eugenia uniflora L. (Myrtaceae)" Genes 13, no. 12: 2235. https://doi.org/10.3390/genes13122235

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop