Next Article in Journal
Differential Expression Proteins Contribute to Race-Specific Resistant Ability in Rice (Oryza sativa L.)
Previous Article in Journal
Autotoxicity of Diterpenes Present in Leaves of Cistus ladanifer L.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification and Characterization of Mitogen-Activated Protein Kinase (MAPK) Genes in Sunflower (Helianthus annuus L.)

1
Department of Biology and Microbiology, South Dakota State University, Brookings, SD 57007, USA
2
Department of Agronomy, Horticulture and Plant Science, South Dakota State University, Brookings, SD 57007, USA
*
Author to whom correspondence should be addressed.
Plants 2019, 8(2), 28; https://doi.org/10.3390/plants8020028
Submission received: 16 December 2018 / Revised: 7 January 2019 / Accepted: 16 January 2019 / Published: 22 January 2019

Abstract

:
Mitogen-Activated Protein Kinase (MAPK) genes encode proteins that regulate biotic and abiotic stresses in plants through signaling cascades comprised of three major subfamilies: MAP Kinase (MPK), MAPK Kinase (MKK), and MAPKK Kinase (MKKK). The main objectives of this research were to conduct genome-wide identification of MAPK genes in Helianthus annuus and examine functional divergence of these genes in relation to those in nine other plant species (Amborella trichopoda, Aquilegia coerulea, Arabidopsis thaliana, Daucus carota, Glycine max, Oryza sativa, Solanum lycopersicum, Sphagnum fallax, and Vitis vinifera), representing diverse taxonomic groups of the Plant Kingdom. A Hidden Markov Model (HMM) profile of the MAPK genes utilized reference sequences from A. thaliana and G. max, yielding a total of 96 MPKs and 37 MKKs in the genomes of A. trichopoda, A. coerulea, C. reinhardtii, D. carota, H. annuus, S. lycopersicum, and S. fallax. Among them, 28 MPKs and eight MKKs were confirmed in H. annuus. Phylogenetic analyses revealed four clades within each subfamily. Transcriptomic analyses showed that at least 19 HaMPK and seven HaMKK genes were induced in response to salicylic acid (SA), sodium chloride (NaCl), and polyethylene glycol (Peg) in leaves and roots. Of the seven published sunflower microRNAs, five microRNA families are involved in targeting eight MPKs. Additionally, we discussed the need for using MAP Kinase nomenclature guidelines across plant species. Our identification and characterization of MAP Kinase genes would have implications in sunflower crop improvement, and in advancing our knowledge of the diversity and evolution of MAPK genes in the Plant Kingdom.

1. Introduction

Plant responses to abiotic and biotic stresses involve protein kinases that are crucial to signal transduction pathways [1]. The protein kinases are involved in a phosphorylation of Serine/Threonine and Tyrosine sidechains of proteins [2]. Among these protein kinases, Mitogen-Activated Protein Kinase (MAPK) cascade genes are key components of signal transduction pathways in animals, plants, and fungi [3] that help transduce extracellular signals to intracellular responses [4]. Discovered in 1986, the MAPK gene family was originally found in animal cells as a microtubule-associated protein kinase [5]. The first reports of plant MAPK gene family in 1993, identified MsERK1 in alfalfa [6] and D5 kinase in pea [7]. MsERK1 is believed to play a role as an inducer of mitosis in root nodules during symbiosis by Rhizobium and D5 kinase as a cell cycle regulator in pea [6,7]. In addition to such roles in cell proliferation and cell differentiation, MAPK genes are involved in regulating various biotic (e.g., bacteria, fungi, viruses) and abiotic (e.g., light, drought, UV, salinity, pH, cold) stress responses [8].
Stress signals trigger the MAPK cascade, which is composed of reversibly phosphorylated kinases such as MAP Kinase (MAPK, MPK), MAPK Kinase (MAP2K, MAPKK, MKK), and MAPKK Kinase (MAP3K, MAPKKK, MKKK) [9,10]. The MKKKs constitute a relatively larger gene family, constituting three sub-groups of genes: the MEKKs, Rafs, and ZIKs [11]. Each of these proteins in the cascade is activated through the recognition and phosphorylation of a specific serine/threonine amino acid motif [12]. An external or internal stimulus triggers the first step, the activation of an MKKK member, through receptor-mediated phosphorylation or intermediate bridging factors or interlinking MKKKs [10]. The phosphorylated MKKK member induces the activation of MKK through the phosphorylation of two serine or threonine amino acid residues in the conserved motif S/TxxxxxS/T [10]. The activated MKKs, which are dual-specificity kinases, in turn, trigger the phosphorylation of MPKs at the Thr-Asp/Glu-Tyr [T(D/E)Y] motif located in the activation loop (T-loop) between kinase subdomains VII and VIII [3,10,13]. Apart from T(D/E)Y motif in many plant species, some other variants such as T(Q/V/S)Y, T(/Q/R)M, MEY, TEC in the activation loop have also been reported [1]. The MPK members phosphorylate a variety of substrates, including transcription factors, protein kinases, and cytoskeleton proteins [10,14]. The activation of the MAPK cascade genes induces the translocation from the cytoplasm to the nucleus [15], further enacting the specific cellular response to the external stimuli through gene activation and inactivation. The detailed illustration of the MAP Kinase signaling pathway in response to diverse abiotic and biotic stresses in plants is represented in Figure S1 adapted from various studies [16,17,18,19,20,21,22,23,24].
The advent of sequencing technologies and rapid progress in bioinformatics tools has assisted the sequencing of the plant genomes at a faster pace. Genome-wide identification of MPKs and MKKs has been documented in various plant species, including both model and crop species [14,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39]. Previous identification and characterization of MAP Kinase cascade proteins in rice, Arabidopsis, and other plants [4,39,40] provide a wealth of information for comparative analyses of these proteins in species that have yet to be studied. The availability of the complete genome sequences from each of the major plant groups such as Asterids (Daucus carota [41], Helianthus annuus [42], Solanum lycopersicum [43]), Amborellales (Amborella trichopoda [44]), Ranunculales (Aquilegia coerulea [45]), Bryophyte (Sphagnum fallax [46]), and Algae (Chlamydomonas reinhardtii [47]) allowed us to identify the MPK and MKK genes of these species and assess phylogenetic relationships. Domesticated sunflower is the fourth most important oilseed crop in the world (http://www.fao.org/) and can adapt to diverse environmental conditions such as drought and maintain the stable yields [48]. Thus, the MAPK gene family might play an important role in helping sunflower adapt and survive in different environmental conditions. This research was carried out with two major objectives: (a) detailed identification and functional characterization of MPK and MKK genes in H. annuus; and (b) assess phylogenetic relationships of MPK and MKK genes of H. annuus with that of A. coerulea, A. trichopoda, C. reinhardtii, D. carota, S. fallax, and S. lycopersicum and including the homologs from relatively better-studied plant species from Rosids (A. thaliana, G. max, and V. vinifera) and a monocot (O. sativa). Findings from this study might support further efforts in crop improvement focused on the development of cultivars that maintain yield when challenged by biotic and abiotic stresses as well as understand the evolution pattern of MAPK gene family in sunflower and other plant species.

2. Materials and Methods

2.1. Retrieval and Identification of Putative MAP Kinase Cascade Genes

Genome-wide identification of MPK and MKK cascade genes was performed using protein sequences of A. coerulea (v 3.1), A. trichopoda (v 1.0), C. reinhardtii (v 5.5), D. carota (v 2.0), H. annuus (r 1.2), S. fallax (v 0.5), and S. lycopersicum (iTAG2.4) obtained from the Phytozome database [45]. Sunflower protein sequences from INRA inbred genotype XRQ whose genome is 3.6 gigabases and encodes 52,243 proteins distributed over 17 chromosomes [42] were analyzed in the present study. The 20 MPK and ten MKK sequences of A. thaliana [25] along with 38 MPK and 11 MKK sequences of G. max [26] were used as reference sequences for the identification of MPK and MKK proteins. The multiple sequence alignment of these reference sequences was employed in HMM profiling using the program HMMER (version 3.1b2) [49] at a threshold e-value of 0.01. MPK and MKK genes were further identified using InterProScan (version 5.27) [50], Pfam ID [51], and PROSITE ID (http://prosite.expasy.org/). The proteins with PfamID of MAPK domain (PS01351), ATP-binding domain (PS00107), protein kinase domain (PS50011), and serine/threonine protein kinase active site (PS00108) were used for identification of corresponding MPK and MKK proteins (Figure 1). Multiple expectation maximization for motif elicitation (MEME) [52] and multiple sequence alignment analysis was performed to confirm the presence of the following signature motifs: (a) the phosphate binding P-loop, GxGxxG [1], where ATP binds in protein kinases; (b) the catalytic C-loop, D(L/I/V)K, found within the S/T PK active site signature; and (c) the activation- or T-loop, T(D/E)Y in MPK and GTxxYMSPER in MKK proteins. The following parameters for MEME were employed: maxsize: 100,000; mod: zoops; nmotifs: 10; minw: 6; and maxw: 25. Furthermore, MKK genes were identified using BLAST [53], with an e-value cutoff of 0.01, in which A. thaliana MKK sequences were used as a query, and the top ten hits for each A. thaliana MKK query sequence were employed for MKK gene identification. The protein theoretical molecular weight and isoelectric point were predicted using compute pI/Mw tool available in ExPASy (http://au.expasy.org/tools). Subcellular localization of the putative MPK and MKK genes of sunflower were analyzed using TargetP 1.1 [54].

2.2. Phylogenetic Tree Construction and Homology Assessment

The multiple sequence alignment of identified MPK and MKK proteins of H. annuus and other species used in this study was performed using CLUSTALW [55] and MUSCLE [56] in Geneious [57] and subjected to phylogenetic analysis employing the maximum likelihood (ML; with 100 replicates) using MEGA (version 7.0.14) [58]. The phylogenetic analyses employed an evolutionary model ‘Jones-Taylor-Thornton with gamma distribution and invariant sites (JTT+G+I)’, the best evolutionary model resulted from the ModelTest analysis using MEGA7. The trees using MPK and MKK sequences were rooted with corresponding human MAPK proteins [HsMAPK1 (GenBank: NP_002736.3) and HsMAPKK1 (GenBank: AAI37460.1), respectively] as an outgroup. Timetree was constructed using the Reltime method [59] from MEGA7 to study the evolutionary divergence of MKK3 proteins belonging to all species under study. The following criteria were used for the construction of Timetree: constraints used: 3 [Divergence time: O. sativa and A. trichopoda (168–194 MYA), G. max and H. annuus (110–124 MYA), and V. vinifera and A. thaliana (105–115 MYA), obtained from http://www.timetree.org/ [60]]; variance estimation method: analytical; statistical method: maximum likelihood; substitution model: JTT; rates among sites: 5 categories (+G, parameter = 0.6307); rate variation model allowed: ([+I], 0.00% sites); amino acids involved: 11; and total positions: 574 positions. Homology to MPKs and MKKs of other plants was assessed using the BLASTp top-hit approach (https://blast.ncbi.nlm.nih.gov/Blast.cgi) with non-redundant protein sequences (nr) database.

2.3. Chromosomal Locations and Gene Structure

All 17 chromosome sequences of H. annuus accessed from the Phytozome database were uploaded into the program Geneious [57]. The chromosome locations of MPK and MKK genes of sunflower were visualized using annotation file in Generic Feature Format (GFF) obtained from the annotation database of Phytozome. The exon-intron distribution pattern was obtained by the Gene Structure Display Server [61].

2.4. Nomenclature of MPKs and MKKs

Nomenclature of sunflower MPKs and MKKs was carried out using MAPK gene nomenclature guidelines [3,4]. The nomenclature uses the following format: (a) the first letter (upper case) of the genus name followed by two to three letters of species (lower case) was used; (b) a number was provided based on the homology to the Arabidopsis MAPK cascade genes; and (c) the number was followed by a hyphen and a number if paralogs were present. Such guidelines for nomenclature of MPKs and MKKs have been employed in many studies [1,4,26,27,33,34,35,36,62,63,64,65]. In this study, we renamed GSVIVT01005924001 (VvMPK2) and GSVIVT0102277001 (VvMPK10), identified by Cakir and Kılıçkaya 2015 [37], as VvMPK22 and VvMPK23, respectively, which were not identified in a study by Mohanta et al. 2015 [1].

2.5. Expression Analysis and miRNA Prediction of Sunflower MPKs and MKKs

The expression pattern of sunflower MPKs and MKKs was investigated using data accessed from NCBI SRA SRP092742 [SRR4996815 (polyethylene glycol or peg)-treated pooled root samples), SRR4996819 (NaCl-treated pooled root samples), SRR4996823 (Peg-treated pooled leaf samples), SRR4996828 (pooled control root samples), SRR4996834 (NaCl-treated pooled leaf samples), SRR4996836 (pooled control leaf samples), SRR4996839 (salicylic acid-treated pooled leaf samples), and SRR4996847 (salicylic acid-treated pooled root samples)]. These data are the result of the application of one hormone treatment (0.05 µM SA), two abiotic stresses [Peg 6000 (100g/l), which creates osmotic stress, and NaCl (100mM) for salt stress], and control [dimethyl sulfoxide (DMSO) only] collected from root and leaf samples. The detailed experiment is described in Badouin et al. 2017 [42]. Briefly, roots and first leaves were collected after six hours of treatment (SA, Peg, NaCl, and DMSO), and applied to two-week-old sunflower seedlings (INRA inbred genotype XRQ) grown in a hydroponic system. The collection was repeated three times and was pooled after separate RNA extractions in equimolar concentration. RNA sequencing of root and leaf samples was performed as non-oriented pair end libraries (2*76bp for roots and 2*100 for leaves). The quality control of these reads was accessed by running the FastQC program (version 0.11.3) [66], and trimming was done using Btrim64 (version 0.2.0) [67] to remove low-quality bases (QC value > 20; 5-bp window size). High-quality pair-end reads were mapped against the coding sequences of H. annuus (Hannuus: Hannuus_494_r1.2.transcript.fa.gz) obtained from the Phytozome database using the Salmon (version 0.9.1) [68] in Bioconda [69]. The codes that were used for data processing are available as Supplementary File S1. The obtained transcript estimated quantification reads for each treatment were compared with their respective reads from the controls to calculate the log2Fold Change (log2FC) and visualized using integrated Differential Expression and Pathway analysis (iDEP 0.81 R/Bioconductor packages; http://bioinformatics.sdstate.edu/idep/) [70]. The heatmap was generated using the following criteria: distance–correlation, linkage–average and cut-off Z score–4 to study the hierarchical clustering and expression pattern of MPK and MKK genes in different tissues under different treatments. k-means clustering was done using the standardization normalization technique. For identifying the potential miRNA targeting sites, the nucleotide sequences of the identified sunflower MPKs and MKKs were subjected to a plant small RNA (psRNATarget) target analysis server [71] against seven published H. annuus microRNAs, selecting Schema V2 (2017 release) as a scoring option.

2.6. Tajima’s Relative Rate and Neutrality Test

Tajima’s relative rate test [72] was conducted to study the statistical significance of variations in molecular evolution in a different group of plants. The same MEGA files used in phylogenetic tree construction were used in the program MEGA7. In this test, three random sequences of either MPKs or MKKs of different plant species were selected, considering one of the sequences as the outgroup, and the χ2 test statistic was applied. A p-value of less than 0.05 was used to reject the null hypothesis of equal rates of evolution between selected sequences of different plant groups. All positions containing gaps and missing data were eliminated. Tajima’s test of neutrality [73] was performed to understand and distinguish the evolutionary pattern of randomly evolved MPKs or MKKs with non-randomly evolving MPKs or MKKs. During the neutrality test, all positions with less than 95% site coverage were eliminated. Therefore, fewer than 5% alignment gaps, missing data, and ambiguous bases were allowed at any position. The groupings of A, B, and C represent the statistical groups, which should not be confused with MPK or MKK clades.

3. Results

3.1. The Diversity of MPK and MKK Genes in Sunflower Relative to Other Species

After a careful examination of the signature motifs of the 2,419 sequences resulting from the HMM profiling using reference sequences of A. thaliana and G. max against 52,243 protein sequences of sunflower, we identified 28 MPKs (filtered from 244 possible MPKs) and eight MKKs (filtered from 100 possible MKKs) (Table 1 and Table 2). We also used protein sequences of A. coerulea, A. trichopoda, C. reinhardtii, D. carota, S. fallax, and S. lycopersicum and identified their MPKs and MKKs, which are shown in Tables S1 and S2. The protein sequences identified, including reference sequences used in this study and their identity in percentage, are presented in Supplementary File S2. The abundance of MPK and MKK genes in the genomes of A. coerulea (306.5 Mb), A. trichopoda (706 Mb), C. reinhardtii (111 Mb), D. carota (421 Mb), H. annuus (3600 Mb), S. lycopersicum (900 Mb), and S. fallax (395 Mb) shares no apparent correlation with genome size (Table 1).

3.2. Gene Location, Subcellular Localization and Structural Variation of MPKs and MKKs in H. annuus

The MPK and MKK genes were distributed on all chromosomes of sunflower, with the highest of five genes in chromosome 3. The MPK genes were absent in chromosomes 2, 7, 10, 12, 10, 16, and 17; whereas, MKK genes were absent in chromosomes 1, 2, 5, 6, 7, 8, 11, 13, 15, 16, and 17. Both MPK and MKK genes are completely absent in chromosomes 2, 7, 16, and 17. One HaMPK gene each was found in chromosome 1 and 14; two HaMPKs each in chromosome 4, 11, 13, and 15; three HaMPKs each in chromosome 5 and 9, and four HaMPKs each in chromosome 3, 6 and 8 (Figure 2). Eight paralog pairs HaMPK3-1/3-2, HaMPK6-1/6-2, HaMPK9-1/9-2, HaMPK11-1/11-2, HaMPK13-1/13-2, HaMPK 16-1/16-2, HaMPK19-1/19-2, and HaMPK23-2/23-4 were located on different chromosomes. Only one paralog pair (HaMPK23-1/23-3) was present in the same chromosome (i.e., chromosome 3). Likewise, only one MKK gene was present in chromosomes 3, 4, 9, 12, and 14, while three MKKs were present in chromosome 10. The only paralog pair, HaMKK6-1/6-2 was present in different chromosomes. TargetP analysis showed that the proteins encoded by three MPKs (HaMPK11-1/11-2 and HaMPK4) and two MKKs (HaMKK9 and HaMKK3) were predicted to localize in mitochondria, two MKKs (HaMKK4 and HaMKK5) in the chloroplast, and the rest in subcellular locations other than mitochondria or the chloroplast (Table 2). Regarding the structural variation due to exons and introns, the number of exons in MPKs ranged from two (HaMPK4) to 18 (HaMPK22, HaMPK23-4/23-2) with an average of 8.9 exons per gene (Table 2, Figure S2). The number of exons in MKKs ranged from one (HaMKK9, HaMKK4, and HaMKK5) to 12 (HaMKK3), with an average of 6.25 exons per gene (Table 2, Figure S3).

3.3. Phylogenetic Analyses

Full-length amino acid sequences of MPKs and MKKs of sunflower, Arabidopsis and soybean were employed for evaluating evolutionary relationships, as well as for nomenclature of the MPKs and MKKs of species under study. These sequences were subjected to multiple sequence alignment and subsequent phylogenetic analyses. Phylogenetic analyses included MPK and MKK gene sequences from, A. coerulea, A. thaliana, A. trichopoda, C. reinhardtii, D. carota, G. max, H. annuus, O. sativa, S. lycopersicum, and V. vinifera.

3.3.1. MPKs

Sunflower MPK (HaMPK) protein sequence length ranged from 349 to 588 amino acid (aa), except for HaMPK4, which was only 157 aa. The average length of MPKs was 425 aa, with isoelectric points ranging from 5.22 (HaMPK13-1) to 9.65 (HaMPK23-1) and a predicted average molecular mass of 48523.772 Da (Table 1). Twenty-eight HaMPKs identified in this study were nested into four clades (A–D; each with bootstrap support > 70%) (Figure S4), which corresponded to their homologs in A. thaliana and G. max, except for the Clade C MPK members (Table S3). The Clade A members in this study include the previously identified group A and B members of A. thaliana MPKs [3,4]. Likewise, Clade B consists of previously identified group C members of A. thaliana MPKs. In addition, Clade C includes the members identified in group E of soybean MPKs [26]. The number of HaMPKs in Clades A, B, C, and D were nine, four, five, and ten, respectively. Sunflower MPK Clade C included five members with HaMPK22 (a homolog to GmMPK22-1 and GmMPK22-2) and HaMPK23-1/23-2/23-3/23-4 (homologs to the corresponding GmMPK23-1/23-2/23-4/23-4). Clade A and B consisted of members with phosphorylation motif TEY (except for HaMPK23-1 and HaMPK23-2 that are nested within Clade C), while those with the TDY motif were found in Clade C and D. The sunflower MPK orthologs are shown in Table S4. The phosphate-binding P-loop, the catalytic C-loop, D(L/I/V)K, and the activation- or T-loop, TxY, in MPKs were defined as (I/V/L)GxGx(S/F/G)GxV, HRD(L/I)KPxN and T(D/E)Y in sunflower, respectively. The protein sequence of HaMAPK23-3 had a variation in catalytic C-loop, D(L/I/V)K motif, as it possessed ‘Phenylalanine (F)’ instead of ‘Leucine/ Isoleucine/Valine (L/I/V)’. Other additional motifs, such as VAIKKIxxxF, were defined as VA(I/V/M)KK(I/M)xxx(F/Y) in the protein sequences of MPKs. The MPKs that belonged to Clade C possessed VA(I/V/M)KKMxxxY. The motifs ‘DFGLAR’ and ‘TRWYRAPE’ were found conserved in all of the MPKs of sunflower. HaMPK4 was the only member that lacked phosphate binding P-loop and VAIKKIxxxF motif. The structural analyses mapped onto phylogeny provided important insights into the duplication events. In the HaMPK gene family, the number of introns ranged from one (HaMPK4) to 17 [three members from Clade C (HaMPK22, HaMPK23-4/23-2)]. The gene members showed a similar pattern of exon/intron structure within the clades. The majority of the HaMPKs (seven) in Clade A consist of six exons, and members, HaMPK13-1 and HaMPK4 had seven and two exons, respectively. In Clade B, all three members consisted of three exons. Three of the five members in Clade C possessed 18 exons, and HaMPK23-1 and HaMPK23-3 possessed 15 and 16 exons, respectively. Likewise, half of the gene members in Clade D (five) possessed ten exons, two (HaMPK19-2 and HaMPK18) possessed nine exons, and three genes (HaMPK8, HaMPK15, and HaMPK9-1) possessed 11 exons (Figure S1).
Phylogenetic analysis of full-length protein sequences was conducted to study evolutionary patterns of the MPKs in ten plant species with sequences of C. reinhardtii (Figure 3). The MPKs were nested in four clades (Clade A–D; Table S3). Clade A is the second largest clade, consisting of 64 MPKs of MPK3/6/4/11/5/13/10 of all species under the study. Clade B consists of 29 MPKs of MPK1/2/7 and 14. In the cases of S. lycopersicum and V. vinifera, these two species contain MPK1 and MPK7 in Clade B. Thus, MPK2 and MPK14 are absent in two species, but not only MPK2. In addition, A. trichopoda only has AmtMPK14 in Figure 3. Therefore, MPK1/2/7 of A. trichopoda is absent. The MPK14 of V. vinifera and D. carota, MPK2 of S. lycopersicum and V. vinifera, and MPK7 of A. trichopoda are absent. The smallest clade, Clade C, consists of 18 members of MPK22 and MPK23 from H. annuus, G. max, S. lycopersicum, V. vinifera, S. fallax, and C. reinhardtii. All the members of Clade A and B consist of the TEY motif, whereas some members of Clade C (HaMPK23-1/23-4, GmMAPK23-1/23-2/23-3/23-4, and VvMPK22) consist of the TEY motif. The largest clade, Clade D, consists of 70 MPKs of MPK16/18/19/20/21/17/9/8/15, and MPK13 of C. reinhardtii. All clades had moderate to strong support (bootstrap values ranging from 80 to 100%). Figure 4a and Supplementary File S3 show the motifs related to the P-loop, catalytic C-loop, and activation or T-loop, representing variations in clades A–D, including other predicted conserved domains of MPK group proteins. In addition, the clade divergence was also based on the common docking site, which is important for downstream target proteins. Clade A consisted of K-M-L-V-F-D-P-N-K-R-I-V-E-E-A-L, Clade B consisted of K-M-L-V-F-D-P-S-K-R-I-S-V-T-E-A-L, Clade C consisted of S-L-C-S-W-D-P-C-K-R-P-T-A-E-E-A-L, and Clade D consisted of R-L-L-A-F-D-P-K-D-R-P-T-A-E-E-A-L consensus common docking sites (Table 3).

3.3.2. MKKs

Sunflower HaMKK protein sequence length ranged from 308 to 520 aa. The average length of proteins for MKKs was 372 aa, with isoelectric points ranging from 5.43 (HaMKK2) to 9.25 (HaMKK5) and a predicted average molecular mass of 42688.86 (Table 1). Corresponding with their homologs in Arabidopsis and G. max, the eight identified HaMKKs are divided into four distinct clades (Figure S5). The MKK homologs of MKK1/2/6-1/6-2/3/4/5/9 were only found in sunflower. The clades’ divergence followed serine/threonine amino acid motif patterns in sunflower. For example, Clade A contained SxxxxxS/TxxxxxT, Clade B with SxxxxxTxxxxxT, Clade C with SxxxxxTxxxxxS, and D with SxxxxxSxxxxxT. The HaMKKs in Clades A, B, C, and D were four, one, two, and one, respectively (Table S5). The orthologs of identified MKKs of sunflower in different plant species are represented in Table S6. In the HaMKK gene family, the number of introns ranged from zero (HaMKK9, HaMKK4, and HaMKK5) to 11 (HaMKK3) (Table 2, Figure S3). Clade A members HaMKK6-1 and HaMKK6-2 consisted of eight exons and are paralogs to each other. The remaining Clade A members, HaMKK2 and HaMKK1, consisted of nine and ten exons, respectively. The only member of Clade B, HaMKK3, consisted of twelve exons. Interestingly, the members of Clade C and D (HaMKK9, HaMKK4, and HaMKK5) had no introns.
Phylogenetic analysis of full-length MKK amino acid sequences from the plant species with sequences of C. reinhardtii under this study revealed four distinct clades (Clades A–D, Figure 5). Figure 4b and Supplementary File S4 show the motifs related to P-loop, catalytic C-loop, and activation or GTxxYMSPER, representing variations in Clades A–D, including other predicted conserved domains of MKK group proteins. The largest clade, Clade A, consisted of 26 MKKs, belonging to MKK1, MKK2, and MKK6. While MKK3 orthologs formed Clade B, consisting 12 MKKs, MKK4 and MKK5, with 16 members, formed Clade C. Gene MKK4 is absent in S. lycopersicum, V. vinifera, D. carota, and C. reinhardtii species. MKK7, MKK8, MKK9, and MKK10 formed Clade D, which consisted of 16 of the total MKKs under study. With respect to all MKKs belonging to ten species, the phosphate-binding P-loop, the catalytic C-loop, D(L/I/V)K, and activation- or T-loop, (S/T)xxxxx(S/T) varied across the clades. The GTxxYMSPER motif was well conserved in all species except for the OsMAPKK6 and AmtMKK6 with GTxxYMAPER in Clade A and OsMAPKK10-1 in Clade D with GTxxYMSPEK. The ATP binding signature in MKK of sunflower terminates with ALK except for GmMAPKK6-1 (completely absent), CrMKK3 with AVK, VvMKK4 with ANT, OsMAPKK10-1 (completely absent), and OsMAPKK10-1 with AVK. The Timetree based on the 11 MKK3 (each MKK3 protein from all species belonging to Clade B) sequences shows the evolutionary divergence across all species under study. Upon use of three constraints of divergence between O. sativa and A. trichopoda (168–194 MYA), G. max and H. annuus (110–124 MYA), and V. vinifera and A. thaliana (105–115 MYA), the approximate divergence of these MKK3 proteins across species has been found. For instance, DcMKK3 and SlMKK3 diverged 90.70 MYA from HaMKK3 (Figure S6).

3.4. Expression Analysis and miRNA Prediction of Sunflower MPKs and MKKs

The functional analysis of both HaMPKs and HaMKKs was studied using RNA-seq data available in NCBI. Since the sunflower genome was recently available, the expression data for pathogen stress were not available in the public databases. We investigated the expression pattern of MPKs and MKKs in leaves and roots treated with one hormone treatment (SA) and two abiotic stresses (NaCl and Peg). We did observe expression patterns for all HaMPKs and HaMKKs except for HaMPK4 (Supplementary File S5). The k-means clustering result showed that the HaMPKs and HaMKKs were clustered into four groups (Figure S7 and Table S7). Cluster A consisted of seven HaMPKs (from Clades A, B, and D) and four HaMKKs (from Clades A, B, and C). Cluster B consisted of three HaMKK genes (from Clades A and D) and two HaMPK genes (from Clade A). Cluster C consisted of three genes belonging to both HaMPKs (from Clades A and D) and one HaMKK (from Clade C). Cluster D consisted of 15 genes belonging to HaMPKs (belonging to clades A–D). The log2FC for each gene and hierarchical clustering of HaMPKs and HaMKKs representing the functional divergence of these genes are represented in Figure S8 and Figure 6, respectively. Some genes were upregulated in response to the treatments compared to the control of their respective tissues. For instance, in leaves, HaMKK5, HaMKK6-2, HaMPK3-2, HaMPK11-1, HaMPK14, HaMPK1, HaMPK6-2, HaMPK19-1, and HaMPK18 showed log2FC > 1 in response to Peg; HaMKK5, HaMKK6-2, HaMPK11-1, HaMPK14 showed log2FC > 1 in response to NaCl; HaMPK11-1 showed log2FC > 1 in response to SA. In roots, HaMKK4, HaMKK1, HaMKK2, HaMPK3-2, HaMPK13-2, HaMPK23-2, HaMPK9-2 and HaMPK11-2 showed log2FC > 1 in response to Peg; HaMKK9, HaMPK13-2, HaMPK6-1, and HaMPK3-1 showed log2FC in range of 0.7 to 1.45 in response to SA; HaMPK6-1, HaMPK2, HaMPK23-2, and HaMPK17 showed log2FC > 0.9 in response to NaCl. In contrast, some genes were downregulated in response to the treatments compared to the control of their respective tissues. For example, in leaves, HaMKK9, HaMKK2, and HaMPK13-2 showed log2FC in a range of −0.6 to −0.8 in response to Peg; HaMKK9, HaMPK7, HaMPK23-1 showed log2FC in a range of −0.6 to −0.8 in response to NaCl; HaMKK4, HaMPK7, and HaMPK11-2 showed log2fold change in a range of −0.58 to −2.11 in response to SA. Likewise, in roots, HaMPK14 showed log2FC of −0.53 in response to Peg; HaMKK6-2, HaMPK13-2, HaMPK14, and HaMPK9-2 showed log2fold change in a range of −0.62 to −1.50 in response to NaCl; HaMPK14, HaMPK19-1, and HaMPK9-2 showed log2FC in a range of −0.68 to −1.6 in response to SA. In addition, the expression of HaMPKs, HaMKKs showed functional divergence in response to stresses as the clustering of these genes in a heatmap was not in accordance with the nesting pattern within clades in phylogenetic trees. The potential miRNA target sites in MPKs and MKKs identified using psRNATarget server revealed five (han-miR156a/b/c, han-miR160a, han-miR3630-5p) of seven miRNA families that may be involved in targeting sunflower MPKs only (Table S8). HaMPK16-2, HaMPK11-1, and HaMPK23-3 were found to be targeted by both miRNAs (han-miR156a/b).

3.5. Tajima’s Relative Rate and Neutrality Tests on MPKs and MKKs

Separate statistical analyses were performed selecting three random sequences from MPKs and MKKs group. For Tajima’s relative rate test for MPKs and MKKs, the sequences were selected from the species representing a diverse taxonomic group: monocot, a dicot, basal angiosperm, bryophytes, and algae. For the analysis of MPK genes following a group of sequences were selected: (a) OsMAPK4 (monocot) and HaMPK6 (dicot) with AmtMPK13-1 (basal angiosperm); (b) OsMAPK4 (monocot) and HaMPK6 (dicot) with sequence SfMPK4-1 (bryophyte); and (c) OsMAPK4 (monocot) and HaMPK16-1 (dicot), with sequence CreMPK2 (algae) (Table S9). The plant group combination in column 1, 2 and 3 of MPKs resulted in p-values of 0.01, 0.0053, and 0.0007, with χ2 values of 6.54, 7.78 and 11.46, respectively. For MKKs, the following group of sequences were selected: (a) OsMAPKK5 (monocot) and HaMKK6-1 (dicot) with AmtMKK6 (basal angiosperm); (b) OsMAPKK5 (monocot) and HaMKK6-1 (dicot) with sequence SfMKK3 (bryophyte); and (c) OsMAPKK5 (monocot) and HaMKK6-1 (dicot) with CreMKK3 (algae) (Table S10). The plant group combination in columns 1, 2 and 3 of MKKs resulted in p-values of 0, 0.04965 and 0.05687 with χ2 values of 100.55, 3.85 and 3.36, respectively. Tajima’s Relative Rate test is commonly used to analyze variation in both DNA and amino acid sequences [78]. This test has been applied to various genes belonging to different gene families, such as MAPKs and WRKY transcription factors [1,78]. In this study, the p-value (less than 0.05) and χ2 statistic showed randomly selected sequences of MPKs and MKKs of different plant groups to be statistically significant, rejecting the null hypothesis of equal rates between selected sequences of different plant groups. The interpretation of Tajima’s D is as follows: D = 0 (observed variation is similar to the expected variation, which shows evidence of no selection), D < 0 (presence of excessive rare alleles, suggesting recent selection sweep and recent population expansion), and D > 0 (lack of rare alleles, suggesting balanced selection and sudden population contraction) [72,73]. The values in the ranges greater than 2 or less than −2 are considered to be statistically significant [72,73]. In our study, Tajima’s neutrality test statistics (D) were found to be 5.391062 for MPKs and 5.928839 for MKKs (Table S11). This suggests that both MPKs and MKKs have undergone a balanced selection with contraction in gene family size. Also, the average heterozygosity of both MAPKs and MKKs is more than those of the segregating sites, suggesting a high frequency of polymorphism.

4. Discussion

MAPK signaling in plants plays important roles in multifaceted biological processes such as growth, development, and regulation of various environmental stresses [4,34,36,79,80,81,82,83,84,85,86,87,88,89,90]. The MPK and MKK genes have been strong candidates for studying the evolution of gene families in plant species as well [27,28,39,91]. In this study, the HMM analysis of protein sequences and examination of the signature motifs resulted in the identification of 96 MPK and 37 MKK genes in A. coerulea, A. trichopoda, C. reinhardtii, D. carota, H. annuus, S. fallax, and S. lycopersicum.

4.1. Nomenclature of MPKs and MKKs

A recent study on various Triticeae species (wheat, barley, rye, and triticale) by Goyal et al. 2018 [35] reported numerous discrepancies in MAPK nomenclature of wheat and barley and suggested a new name based on sequence homology. A consistent nomenclature of proteins belonging to the same gene family across species based on orthology facilitates easy prediction and understanding of the function of a particular protein [92]. Cakir and Kılıçkaya 2015 [37] reported MAP kinase cascade genes in V. vinifera and confirmed the orthology of VvMPK14, VvMPK12, VvMPK11, VvMPK13, VvMPK7, VvMPK3, VvMKK5, VvMKK3, and VvMKK2 to Arabidopsis AtMPK6, AtMPK3, AtMPK13, AtMPK12, AtMPK16, AtMPK9, AtMKK3, AtMKK6, and AtMKK2, respectively. Likewise, MAP Kinase cascade genes analyses in Ziziphus jujuba [30] provided nomenclature of MAP kinase cascade genes based on the order of appearance in different groups in the phylogenetic tree, and not based on orthology (or sequence homology) to Arabidopsis MAP Kinase cascade genes. The proper nomenclature of these MAP Kinase cascade genes should be used following an orthology or sequence homology-based MAPK gene nomenclature guidelines to maintain consistency across the plant kingdom.

4.2. Diversity and the Phylogenetic Relationship of MPKs

Our identification of MPKs yielded a slight variation in the number of genes from the previous studies; for example, we identified 15 MPKs in S. lycopersicum, which is different from Kong et al. 2012 [93], who reported 16 MPKs, and Mohanta et al. 2015 [1], who found 17 MPKs in the tomato genome. The number of AcMPKs in this study was 11, whereas Mohanta et al. 2015 [1] reported only 10 AcMPKs. In C. reinhardtii, six CreMPKs identified in this study were consistent with Mohanta et al. 2015 [1], whereas Dóczi et al. 2012 [39] reported only five CreMPKs. The variation in several genes within the same species in different studies might come as a result of different statistical and stringency parameters employed during HMM profiling and further downstream motif analysis. The detailed study of MPKs of D. carota, A. trichopoda, S. fallax, and H. annuus has never been reported in previous studies. The number of MPK genes in sunflower is higher than that previously identified in numerous other plant species, such as Arabidopsis (119Mb) [3] and rice (420Mb) [94], and lower than in soybean (1100Mb) [26]. Even the size of the sunflower genome, which is believed to have undergone the first whole genome triplication approximately 38–50 MYA, and whole genome duplication approximately 29 MYA, is about 3.5 times larger [95] than that of the soybean genome: the number of MPKs is lower in sunflower than soybean. Soybean has undergone two polyploidization events, approximately 59 and 13 MYA [75,96]. Thus, recent polyploidy in plants has resulted in extra copies of genes to their genome [97,98]. The slightly lower number of MPKs in sunflower might be due to past polyploidization events and the recent amplification of repetitive elements causing highly similar and related sequences [99]; the sunflower genome also encodes 52,243 proteins [42], which is slightly fewer than the soybean genome (56,044 proteins) [75].
Phylogenetic analysis of HaMPKs revealed four distinct clades, which were consistent to the MPKs previously identified in Arabidopsis [100], poplar [101], rice [102], Brachypodium distachyon [33], Malus domestica [32], Ziziphus jujuba [30], Triticeae species [35], Brassica rapa [28], and Fragaria vesca [103]. In Clade A, sunflower has one extra copy of MPK3, MPK6, MPK11, and MPK13 genes that might be because of duplications after the divergence from Arabidopsis. Such extra copies of these genes have also been observed in soybean [26]. The two copies of MPK3 and MPK6 were also found in D. carota. The nesting pattern of sunflower and other species’ MPK genes with the characterized Arabidopsis MPKs suggest their potential role in respective functions. AtMPK3 is involved in various signaling pathways related to various stresses, such as wounding and hypersensitive responses elicited by Avr-R gene interaction [8,104]. The MAP kinase genes, IbMPK3 and IbMPK6, in sweet potato (Ipomoea batatas), and homologs of AtMPK3 and AtMPK6, provide resistance to Pseudomonas syringae pv. tabaci (Pta) bacteria in tobacco leaves, and were induced in various abiotic stresses, as well [84]. In maize, ZmMPK3, a homolog of AtMPK3 is induced in response to various environmental stresses [105]. Similarly, AtMPK4 and AtMPK6 are involved in response to abiotic and biotic stress such as cold, drought, touch and wounding, resulting in the production of reactive oxygen species in Arabidopsis [106,107]. AtMPK4 is phosphorylated and activated by the upstream components AtMEKK1 and AtMKK2 upon cold and salt stress signaling in Arabidopsis [107,108]. Clade A also consists of AtMPK5, the homolog of which in rice, OsMPK5, is well characterized to regulate stress responses [109]. All copies of MPK1/2, MPK7/14 are retained in soybean in sunflower, soybean, and Arabidopsis. Among them, AtMPK1, AtMPK2, AtMPK7, AtMPK14 are phosphorylated by AtMKK3 upon abscisic acid application in A. thaliana plantlets [110]. AtMPK1 is induced upon salt stress, whereas some MPKs in rice and alfalfa such as BWMK1 and TDY1, respectively, are activated upon wounding by pathogens [111,112,113]. G. max MAP kinase 1 (GMK1), a homolog of AtMAPK1, is activated in response to salt stress in soybean [114]. Likewise, a homolog of AtMPK7 in maize, ZmMPK7 is involved in the removal of reactive oxygen species upon induction by abscisic acid and hydrogen peroxide in maize [115]. Another homolog of AtMPK1 in Hordeum vulgare (HvMPK4) showed enhanced resistance to Magnaporthe grisea and enhanced tolerance to salt stress [85]. Clade C members include the homologs of G. max GmMAPK22-1/22-2 and GmMAPK23-1/23-2/23-3/23-4 [26] with no MPKs in Arabidopsis. A single copy of GmMAPK22-1/GmMAPK22-2 ortholog is retained in sunflower, and hence it is named HaMPK22. Meanwhile, all copies of GmMAPK23-1/23-2/23-3/23-4 are retained in sunflower and are hence named HaMPK23-1/23-2/23-2/23-3/23-4. All the members of Clade D consist of the TDY motif in the T-loop and are homologs to various Arabidopsis and soybean MPKs belonging to MPK16/19/18/8/15/17/9.
Gene members HaMPK3-1/3-2, HaMPK6-1/6-2, HaMPK9-1/9-2, HaMPK11-1/11-2, HaMPK13-1/13-2, HaMPK16-1/16-2, HaMPK19-1/19-2, and HaMPK23-2/23-4 are present on different chromosomes, while only paralogs HaMPK23-1/23-3 are present on the same chromosome 3. Other MPKs, such as AcMPK3-1/3-2, AcMPK2-1/2-2, DcMPK3-1/3-2, DcMPK6-1/6-2, DcMPK8-1/8-2/8-3, DcMPK9-1/9-2, SfMPK4-1/4-2, SfMPK20-1/20-2, SfMPK23-1/23-2, SlMPK4-1/4-2, SlMPK17-1/17-2, SlMPK9-1/9-2, AmtMPK13-1/13-2, are present on different chromosomes. The only AmtMPK11-1/11-2 pair is present in the same scaffold (AmTr_v1.0_scaffold00001) (Table S1). This suggests a potentially crucial role of segmental duplications and transposition events in the evolution of MAPKs in sunflower and other plant species, except for the HaMPK23-1/23-3 and AmtMPK11-1/11-2 pairs, in which tandem duplication might have been involved. Such features of segmental and tandem duplications in MPKs are also evidently seen in many plant species such as soybean [26], apple [32], cotton [116]. Such duplications are the major reason for the expansion of the many gene families, such as nucleotide-binding site-leucine-rich repeat (NBS-LRR), cytochrome P450 family, transcription factors and many more [117].

4.3. Diversity and Phylogenetic Relationship of MKKs

Sunflower MKKs also formed four distinct clades (A–D) with previously identified MKKs of Arabidopsis and soybean. These four clades (A–D) are consistent with the MKKs of various plant species such as Arabidopsis [100], rice [102], poplar [101], B. distachyon [33] and apple [32]. MKK clades consist of well-characterized MKK proteins such as AtMKK1/2/3/4/5 [118,119,120,121]. Clade A consists of HaMKKs grouped with AtMKK1/6/2, GmMAPKK6-1/6-2, GmMAPKK1, GmMAPK2-1/2-2. Sunflower and soybean have extra one copy of MKK6 compared to Arabidopsis and other plant species under study, including S. fallax. This suggests that the extra one copy of MKK6 was not seen until soybean diverged from Arabidopsis. Also, the retention of at least one copy of MKK6 in all species suggests its important role in signaling mechanisms during various stresses. We did not find a copy of MKK2-2 in sunflower, as is found in soybean (GmMAPKK2-2). The characterized AtMKK1 protein (orthologue of HaMPKK1) is induced upon the application of various stresses such as wounding, drought, cold, and high salinity in Arabidopsis seedlings [118]. AtMKK2 (ortholog of HaMKK2) is activated upon cold and salt stress signaling in Arabidopsis and mediates the phosphorylation of downstream MPKs [107]. Clade B consists of MKKs from the MKK3 proteins across all species under study, including C. reinhardtii. All species have a single copy of the MKK3 proteins except G. max, with two copies (GmMAPKK3-1/3-2). Two copies of MKK3 proteins in soybean is expected, as they underwent two duplication events to become a tetraploid. A divergence-time estimation based on Clade B sequences (each from all species) revealed how MKK3 proteins are conserved and retained in Algae, Bryophyte, Amborellales, Monocots, Ranunculales, Rosids, and Asterids. The divergence time analysis of MKK3 with CreMKK3 as the outgroup showed bryophyte and Amborellales being sister to the land plants and other extant species, which is consistent to previous studies [122,123] and follows the evolutionary history inferred on the Angiosperm Phylogeny Website [124]. One of these MKK3s, AtMKK3 is activated upon exposure to various stresses, such as cold, salt, hyperosmotic and ABA treatments [120]. This suggests the potential role of HaMKK3 in such stresses. Clade C consists of both copies of AtMKK4 and AtMKK5 only in A. trichopoda, O. sativa, and sunflower. However, V. vinifera, S. lycopersicum, and D. carota consist copies of MKK5 (MKK4 group absent). AtMKK4 and AtMKK5 are activated in Arabidopsis, mediating cell death and production of hydrogen peroxide [119]. In Clade D, the orthologs for MKK9 were found in all angiosperms except in soybean and O. sativa. Interestingly, we found three copies of MKK10 in S. fallax, as in O. sativa, and one copy of MKK10 in basal angiosperm, A. trichopoda, and Ranunculales, A. coerulea. We did not find any copy of MKK10 in sunflower, S. lycopersicum, D. carota, or V. vinifera. We observed HaMKK4/5/9 with one exon each that correlates to the At1g51660 (AtMKK4), At3g21220 (AtMKK5), and At1g73500 (AtMKK9), consisting of one exon per gene (https://www.Arabidopsis.org/index.jsp). Also, members belonging to Clade C and D in Gossypium raimondii had one exon in each [116]. This suggests that gene members belonging to Clade C and D encode proteins that are well conserved across plant species. Altogether, the diversity in the exon-intron structures might imply that duplication events caused the evolution of these genes under different environmental conditions. Also, AtMKK1 and AtMKK2 are involved in maintaining ROS homeostasis in Arabidopsis [121]. Since the paralog pairs, HaMKK6-1/6-2 and SfMKK10-1/10-2, are present on their different respective chromosomes, we infer a possible role of segmental duplications.

4.4. Expression Analysis and miRNA Prediction

In this study, we explored the expression pattern of MPKs and MKKs of sunflower under one hormone treatment, SA and two simulated abiotic stresses: NaCl for salinity, and Peg for osmotic stress in leaves and roots from the publicly available RNA-seq data. The expression of all sunflower MPKs and MKKs was detected in both leaves and roots, except for HaMPK4. In response to hormone SA, HaMPK11-1 was upregulated in leaves; HaMKK9, HaMPK13-2, HaMPK6-1, and HaMPK3-1 were upregulated in roots; HaMKK4, HaMPK7, and HaMPK11-2 were downregulated in leaves; HaMPK19-1, HaMPK14, and HaMPK9-2 were downregulated in roots. It has been established that SA is directly involved in MAPK phosphorylation [125]. SA-induced protein kinase (SIPK; AtMPK6) and wound-induced protein kinase (WIPK; AtMPK3) are important in balancing salicylic acid or jasmonic acid during herbivore wounding [126]. In Arabidopsis, AtMKK9 and AtMPK6 play important role in leaf senescence, which is a complex process caused by various factors including salicylic acid [127]. Also, ZmMPK3 in Zea mays is activated upon the application of SA hormone [128]. Thus, HaMPK3-1, HaMKK9, and HaMPK6-1 might play an important role in leaf senescence and salicylic acid pathways in sunflower. In response to NaCl, HaMKK5, HaMKK6-2, HaMPK11-1 were upregulated in leaves; HaMPK14, HaMPK6-1, HaMPK2, HaMPK23-2, and HaMPK17 were upregulated in roots; HaMKK9, HaMPK7, HaMPK23-1 were downregulated in leaves; HaMKK6-2, HaMPK13-2, HaMPK14, and HaMPK9-2 were downregulated in roots. Among them, HaMPK17 play an important role under salinity stress, as its ortholog in Gossypium hirsutum, GhMPK17, was induced by salt, osmosis and abscisic acid [129]. The expression pattern of some genes depended on different parts of the plant, for example, HaMKK6-2 was upregulated in leaves and downregulated in roots in response to NaCl. In response to Peg, HaMKK5, HaMKK6-2, HaMPK3-2, HaMPK11-1, HaMPK14, HaMPK1, HaMPK6-2, HaMPK19-1, and HaMPK18 were upregulated in leaves; HaMKK4, HaMKK1, HaMKK2, HaMPK3-2, HaMPK13-2, HaMPK23-2, HaMPK9-2, and HaMPK11-2 were upregulated in roots; HaMKK9, HaMKK2, and HaMPK13-2 were downregulated in leaves; HaMPK14 was downregulated in roots. This reveals that at least 19 HaMPK and seven MKK genes were induced upon these treatments, as compared to the control. Among them, some genes are induced upon multiple treatments. For example, HaMKK4 and HaMKK6-2 were induced upon both NaCl and Peg; HaMPK6-1 was induced upon NaCl and Peg; HaMPK16-2 was induced upon both SA and NaCl. The functional divergence can be observed on both HaMPKs and HaMKKs, as the hierarchical clustering patterns of expression of these genes do not follow the nesting pattern within clades in the phylogenetic trees, except for a few genes. For example, in MPKs, HaMPK22/23-3 that belonged to Clade C, HaMPK3-1/3-2/11-2 that belonged to Clade A, and HaMPK9-2/16-2/17 that belonged to clade D showed hierarchical clustering for expression of these genes. However, only HaMKK6-1/6-2 that belonged to Clade A of the MKK subgroup showed hierarchical clustering for expression of these genes. This shows the functional divergence and convergence of the HaMPK and HaMKK genes within and among the clades under different stress responses. Among seven published H. annuus microRNAs, five families of miRNAs are involved in possibly targeting eight MPKs. We did not find any miRNAs targeting HaMKK genes. Previous studies have reported the role of miRNAs in MAPK signaling pathways of animal systems in chronic myeloid leukemia [130], papillary thyroid carcinoma [131], Caenorhabditis elegans [132]. Not only in animals, but studies also reported the prediction of miRNAs targeting MAPK genes of plants such as Gossypium hirsutum (ghr-miR5272a regulating MAPKK6) [133] and Oryza sativa (miR1429_5p targeting MPK17-1 and miR531 families targeting various MKKK transcripts) [134].

5. Conclusions

This study represents the first genome-wide identification, analysis and nomenclature of MPKs and MKKs in H. annuus, D. carota and, S. fallax, as well as reassessment of these genes in A. coerulea, A. trichopoda, C. reinhardtii, and S. lycopersicum. We identified 28 MPKs and eight MKKs in sunflower, and studied their genomic architecture, phylogenetic relationships, and functions in relation to nine other plant species (including A. thaliana, G. max, O. sativa, and V. vinifera). While the 3.6 gigabase sunflower genome is one of the largest among plants with available complete genome sequences, more MPKs and MKKs were found in soybean, which has a genome size of 975 Mbs. Analyses of P-loop, catalytic C-loop, and T-loop showed that HaMPKs and HaMKKs could be classified into four clades, which are comparable to those groups identified in A. thaliana and G. max. However, clades such as Clade A, B, and C of MPKs consisted of different group members of A. thaliana and G. max. Among the MPK and MKK genes studied, the MKK3 proteins were well- conserved and retained in all species included in this study, including the outgroup C. reinhardtii, which warrants further exploration of these proteins across a wide array of species. The transcriptome data generated under hormone and abiotic stress treatments revealed diverse expression patterns of sunflower MPKs and MKKs exhibiting a dynamic role in adaptation to changing environmental conditions. We observed functional divergence of the HaMPK and HaMKK genes within the gene members of the same clade. The results of this study are important for understanding diversity and evolution of the MAPK gene family in plants and enhancing our knowledge of MAPK signaling pathways in sunflower. These findings can inform cultivar improvement in sunflower through stress-tolerance breeding.

Supplementary Materials

Supplementary files are available at https://www.mdpi.com/2223-7747/8/2/28/s1, Table S1: MPKs identified in different species in the present study. Table S2: MKKs identified in different species in the present study. Table S3: Abundance of the MPK gene family of sunflower and ten other species. Table S4: Sunflower MPKs with their orthologs in other plant species. Table S5: Abundance of the MKK gene family of sunflower and ten other species. Table S6: Sunflower MKKs with their orthologs in other plant species. Supplementary Table S7: k-means clustering of HaMPK and HaMKK genes. Table S8: miRNAs involved in targeting MPKs and MKKs of Sunflower. Table S9: Tajima’s relative rate test of MPKs. Table S10: Tajima’s relative rate test of MKKs. Table S11: Tajima’s Neutrality Test of MPKs and MKKs. Supplementary Figure S1: Illustrating a MAP Kinase signaling pathway in response to abiotic and biotic stresses in plants adapted from various studies. Supplementary Figure S2: Exon/intron architecture of HaMPK genes. The blue boxes and black lines indicate the exons and introns, respectively. Supplementary Figure S3: Exon/intron architecture of HaMKK genes. The blue boxes and black lines indicate the exons and introns, respectively. Supplementary Figure S4: Maximum Likelihood analysis of full-length MPK amino acid sequences from A. thaliana (At), G. max (Gm) and H. annuus (Ha) using MEGA7 program with 100 bootstrap replicates. Homo sapiens, HsMAPK1 (GenBank: NP_002736.3) was used as an outgroup. MPK clades are labeled as A, B, C and D. Supplementary Figure S5: ML tree of full-length MKK amino acid sequences from A. thaliana (At), G. max (Gm) and H. annuus (Ha) in the program MEGA7, with 100 bootstrap replicates. Homo sapiens, HsMAPKK1 [GenBank: AAI37460.1] was used as an outgroup. MKK clades are labeled as A, B, C, and D. Supplementary Figure S6: Divergence time estimation of MKK3 sequences from all species used in this study. Supplementary Figure S7: Clustering of sunflower MPK and MKK genes in response to diverse conditions as simulated by salicylic acid (SA), salt (NaCl) and polyethylene glycol (Peg) in leaves and roots. k-means clustering method was employed for clustering of genes. Supplementary Figure S7: The log2FC values for each HaMPK and HaMKK genes in samples treated with salicylic acid (SA), salt (NaCl) and polyethylene glycol (Peg) in leaves and roots Supplementary File S1: Codes used for RNA-seq data processing. Supplementary File S2: Sequences of the identified MPKs and MKKs including the reference sequences used in this study. Supplementary File S3: MEME predicted conserved motifs in MPK proteins belonging to different phylogenetic clades. Supplementary File S4: MEME predicted conserved motifs in MKK proteins belonging to different phylogenetic clades. Supplementary File S5: Estimated transcript reads for each treatment and their log2 fold change values.

Author Contributions

S.N. and S.E.S. performed gene identification and downstream analyses and wrote the manuscript. M.P.N. conceived and supervised the project and guided S.N. in drafting and revision of the manuscript. A.N., E.J.A., A.F., and R.Z. provided useful insights on data analysis, interpretation, and revision of the manuscript.

Funding

This work was supported by the USDA National Institute of Food and Agriculture, Hatch project SD00H659-18.

Conflicts of Interest

The authors declare no competing interests.

References

  1. Mohanta, T.K.; Arora, P.K.; Mohanta, N.; Parida, P.; Bae, H. Identification of new members of the MAPK gene family in plants shows diverse conserved domains and novel activation loop variants. BMC Genom. 2015, 16, 58. [Google Scholar] [CrossRef] [PubMed]
  2. Wang, Z.; Cole, P.A. Catalytic mechanisms and regulation of protein kinases. In Methods Enzymolgy; Elsevier: Amsterdam, The Netherlands, 2014; Volume 548, pp. 1–21. [Google Scholar]
  3. Ichimura, K.; Shinozaki, K.; Tena, G.; Sheen, J.; Henry, Y.; Champion, A.; Kreis, M.; Zhang, S.; Hirt, H.; Wilson, C. Mitogen-activated protein kinase cascades in plants: A new nomenclature. Trends Plant Sci. 2002, 7, 301–308. [Google Scholar]
  4. Hamel, L.-P.; Nicole, M.-C.; Sritubtim, S.; Morency, M.-J.; Ellis, M.; Ehlting, J.; Beaudoin, N.; Barbazuk, B.; Klessig, D.; Lee, J. Ancient signals: Comparative genomics of plant MAPK and MAPKK gene families. Trends Plant Sci. 2006, 11, 192–198. [Google Scholar] [CrossRef] [PubMed]
  5. Sturgill, T.W.; Ray, L.B. Muscle proteins related to microtubule associated protein-2 are substrates for an insulin-stimulatable kinase. Biochem. Biophys. Res. Commun. 1986, 134, 565–571. [Google Scholar] [CrossRef]
  6. Duerr, B.; Gawienowski, M.; Ropp, T.; Jacobs, T. MsERK1: A mitogen-activated protein kinase from a flowering plant. Plant Cell 1993, 5, 87–96. [Google Scholar] [CrossRef]
  7. Stafstrom, J.P.; Altschuler, M.; Anderson, D.H. Molecular cloning and expression of a MAP kinase homologue from pea. Plant Mol. Biol. 1993, 22, 83–90. [Google Scholar] [CrossRef]
  8. Nakagami, H.; Pitzschke, A.; Hirt, H. Emerging MAP kinase pathways in plant stress signalling. Trends Plant Sci. 2005, 10, 339–346. [Google Scholar] [CrossRef]
  9. Tatebayashi, K.; Takekawa, M.; Saito, H. A docking site determining specificity of Pbs2 MAPKK for Ssk2/Ssk22 MAPKKKs in the yeast HOG pathway. EMBO J. 2003, 22, 3624–3634. [Google Scholar] [CrossRef] [Green Version]
  10. Jonak, C.; Okresz, L.; Bogre, L.; Hirt, H. Complexity, cross talk and integration of plant MAP kinase signalling. Curr. Opin. Plant Biol. 2002, 5, 415–424. [Google Scholar] [CrossRef]
  11. Tena, G.; Asai, T.; Chiu, W.-L.; Sheen, J. Plant mitogen-activated protein kinase signaling cascades. Curr. Opin. Plant Biol. 2001, 4, 392–400. [Google Scholar] [CrossRef]
  12. Bardwell, L. Mechanisms of MAPK Signalling Specificity; Portland Press Limited: London, UK, 2006. [Google Scholar]
  13. Lewis, T.S.; Shapiro, P.S.; Ahn, N.G. Signal transduction through MAP kinase cascades. In Advances in Cancer Research; Elsevier: Amsterdam, The Netherlands, 1998; Volume 74, pp. 49–139. [Google Scholar]
  14. Janitza, P.; Ullrich, K.K.; Quint, M. Toward a comprehensive phylogenetic reconstruction of the evolutionary history of mitogen-activated protein kinases in the plant kingdom. Front. Plant Sci. 2012, 3, 271. [Google Scholar] [CrossRef] [PubMed]
  15. Marshall, C. Specificity of receptor tyrosine kinase signaling: Transient versus sustained extracellular signal-regulated kinase activation. Cell 1995, 80, 179–185. [Google Scholar] [CrossRef] [Green Version]
  16. Kissoudis, C.; van de Wiel, C.; Visser, R.G.; van der Linden, G. Enhancing crop resilience to combined abiotic and biotic stress through the dissection of physiological and molecular crosstalk. Front. Plant Sci. 2014, 5, 207. [Google Scholar] [CrossRef] [PubMed]
  17. Ranty, B.; Aldon, D.; Cotelle, V.; Galaud, J.-P.; Thuleau, P.; Mazars, C. Calcium sensors as key hubs in plant responses to biotic and abiotic stresses. Front. Plant Sci. 2016, 7, 327. [Google Scholar] [CrossRef] [PubMed]
  18. Pitzschke, A.; Schikora, A.; Hirt, H. MAPK cascade signalling networks in plant defence. Curr. Opin. Plant Biol. 2009, 12, 421–426. [Google Scholar] [CrossRef] [PubMed]
  19. Müller-Xing, R.; Xing, Q.; Goodrich, J. Footprints of the sun: Memory of UV and light stress in plants. Front. Plant Sci. 2014, 5, 474. [Google Scholar]
  20. Davis, P.K.; Brachmann, R.K. Chromatin remodeling and cancer. Cancer Biol. Ther. 2003, 2, 23–30. [Google Scholar] [CrossRef]
  21. Rejeb, I.B.; Pastor, V.; Mauch-Mani, B. Plant responses to simultaneous biotic and abiotic stress: Molecular mechanisms. Plants 2014, 3, 458–475. [Google Scholar] [CrossRef]
  22. Meyers, B.C.; Kozik, A.; Griego, A.; Kuang, H.; Michelmore, R.W. Genome-wide analysis of NBS-LRR-encoding genes in Arabidopsis. Plant Cell 2003, 15, 809–834. [Google Scholar] [CrossRef]
  23. Shao, Z.-Q.; Xue, J.-Y.; Wu, P.; Zhang, Y.-M.; Wu, Y.; Hang, Y.-Y.; Wang, B.; Chen, J.-Q. Large-scale analyses of angiosperm nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes reveal three anciently diverged classes with distinct evolutionary patterns. Plant Physiol. 2016. [Google Scholar] [CrossRef]
  24. Ramirez-Prado, J.S.; Abulfaraj, A.A.; Rayapuram, N.; Benhamed, M.; Hirt, H. Plant immunity: From signaling to epigenetic control of defense. Trends Plant Sci. 2018, 9, 833–844. [Google Scholar] [CrossRef] [PubMed]
  25. Asai, T.; Tena, G.; Plotnikova, J.; Willmann, M.R.; Chiu, W.-L.; Gomez-Gomez, L.; Boller, T.; Ausubel, F.M.; Sheen, J. MAP kinase signalling cascade in Arabidopsis innate immunity. Nature 2002, 415, 977–983. [Google Scholar] [CrossRef] [PubMed]
  26. Neupane, A.; Nepal, M.P.; Piya, S.; Subramanian, S.; Rohila, J.S.; Reese, R.N.; Benson, B.V. Identification, nomenclature, and evolutionary relationships of mitogen-activated protein kinase (MAPK) genes in soybean. Evol. Bioinform. 2013, 9, 363. [Google Scholar] [CrossRef] [PubMed]
  27. Neupane, A.; Nepal, M.P.; Benson, B.V.; MacArthur, K.J.; Piya, S. Evolutionary history of mitogen-activated protein kinase (MAPK) genes in Lotus, Medicago, and Phaseolus. Plant Signal. Behav. 2013, 8, e27189. [Google Scholar] [CrossRef] [PubMed]
  28. Wu, P.; Wang, W.; Li, Y.; Hou, X. Divergent evolutionary patterns of the MAPK cascade genes in Brassica rapa and plant phylogenetics. Hort. Res. 2017, 4, 17079. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Wu, J.; Wang, J.; Pan, C.; Guan, X.; Wang, Y.; Liu, S.; He, Y.; Chen, J.; Chen, L.; Lu, G. Genome-wide identification of MAPKK and MAPKKK gene families in tomato and transcriptional profiling analysis during development and stress response. PLoS ONE 2014, 9, e103032. [Google Scholar] [CrossRef] [PubMed]
  30. Liu, Z.; Zhang, L.; Xue, C.; Fang, H.; Zhao, J.; Liu, M. Genome-wide identification and analysis of MAPK and MAPKK gene family in Chinese jujube (Ziziphus jujuba Mill.). BMC Genom. 2017, 18, 855. [Google Scholar] [CrossRef] [PubMed]
  31. Kong, X.; Pan, J.; Zhang, D.; Jiang, S.; Cai, G.; Wang, L.; Li, D. Identification of mitogen-activated protein kinase kinase gene family and MKK–MAPK interaction network in maize. Biochem. Biophys. Res. Commun. 2013, 441, 964–969. [Google Scholar] [CrossRef] [PubMed]
  32. Zhang, S.; Xu, R.; Luo, X.; Jiang, Z.; Shu, H. Genome-wide identification and expression analysis of MAPK and MAPKK gene family in Malus domestica. Gene 2013, 531, 377–387. [Google Scholar] [CrossRef] [PubMed]
  33. Chen, L.; Hu, W.; Tan, S.; Wang, M.; Ma, Z.; Zhou, S.; Deng, X.; Zhang, Y.; Huang, C.; Yang, G. Genome-wide identification and analysis of MAPK and MAPKK gene families in Brachypodium distachyon. PLoS ONE 2012, 7, e46744. [Google Scholar] [CrossRef]
  34. Piao, Y.; Jin, K.; He, Y.; Liu, J.; Liu, S.; Li, X.; Piao, Z. Genome-wide identification and role of MKK and MPK gene families in clubroot resistance of Brassica rapa. PLoS ONE 2018, 13, e0191015. [Google Scholar] [CrossRef] [PubMed]
  35. Goyal, R.K.; Tulpan, D.; Chomistek, N.; Fundora, D.G.-P.; West, C.; Ellis, B.E.; Frick, M.; Laroche, A.; Foroud, N.A. Analysis of MAPK and MAPKK gene families in wheat and related Triticeae species. BMC Genom. 2018, 19, 178. [Google Scholar] [CrossRef] [PubMed]
  36. Wang, L.; Hu, W.; Tie, W.; Ding, Z.; Ding, X.; Liu, Y.; Yan, Y.; Wu, C.; Peng, M.; Xu, B. The MAPKKK and MAPKK gene families in banana: Identification, phylogeny and expression during development, ripening and abiotic stress. Sci. Rep. 2017, 7, 1159. [Google Scholar] [CrossRef] [PubMed]
  37. Cakir, B.; Kılıçkaya, O. Mitogen-activated protein kinase cascades in Vitis vinifera. Front. Plant Sci. 2015, 6, 556. [Google Scholar] [CrossRef] [PubMed]
  38. Zhang, X.; Mi, X.; Chen, C.; Wang, H.; Guo, W. Identification on mitogen-activated protein kinase signaling cascades by integrating protein interaction with transcriptional profiling analysis in cotton. Sci. Rep. 2018, 8, 8178. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  39. Dóczi, R.; Ökrész, L.; Romero, A.E.; Paccanaro, A.; Bögre, L. Exploring the evolutionary path of plant MAPK networks. Trends Plant Sci. 2012, 17, 518–525. [Google Scholar] [CrossRef]
  40. Widmann, C.; Gibson, S.; Jarpe, M.B.; Johnson, G.L. Mitogen-activated protein kinase: Conservation of a three-kinase module from yeast to human. Physiol. Rev. 1999, 79, 143–180. [Google Scholar] [CrossRef]
  41. Iorizzo, M.; Ellison, S.; Senalik, D.; Zeng, P.; Satapoomin, P.; Huang, J.; Bowman, M.; Iovene, M.; Sanseverino, W.; Cavagnaro, P. A high-quality carrot genome assembly provides new insights into carotenoid accumulation and asterid genome evolution. Nat. Genet. 2016, 48, 657. [Google Scholar] [CrossRef]
  42. Badouin, H.; Gouzy, J.; Grassa, C.J.; Murat, F.; Staton, S.E.; Cottret, L.; Lelandais-Brière, C.; Owens, G.L.; Carrère, S.; Mayjonade, B. The sunflower genome provides insights into oil metabolism, flowering and Asterid evolution. Nature 2017, 546, 148. [Google Scholar] [CrossRef]
  43. Consortium, T.G. The tomato genome sequence provides insights into fleshy fruit evolution. Nature 2012, 485, 635. [Google Scholar] [CrossRef]
  44. Albert, V.A.; Barbazuk, W.B.; Der, J.P.; Leebens-Mack, J.; Ma, H.; Palmer, J.D.; Rounsley, S.; Sankoff, D.; Schuster, S.C.; Soltis, D.E. The Amborella genome and the evolution of flowering plants. Science 2013, 342, 1241089. [Google Scholar]
  45. Goodstein, D.M.; Shu, S.; Howson, R.; Neupane, R.; Hayes, R.D.; Fazo, J. Phytozome: A comparative platform for green plant genomics. Nucleic Acids Res. 2012, 40, D1178–D1186. [Google Scholar] [CrossRef] [PubMed]
  46. Shaw, A.; Schmutz, J.; Devos, N.; Shu, S.; Carrell, A.; Weston, D. The Sphagnum Genome Project: A new model for ecological and evolutionary genomics. In Advances in Botanical Research; Elsevier: Amsterdam, The Netherlands, 2016; Volume 78, pp. 167–187. [Google Scholar]
  47. Merchant, S.S.; Prochnik, S.E.; Vallon, O.; Harris, E.H.; Karpowicz, S.J.; Witman, G.B.; Terry, A.; Salamov, A.; Fritz-Laylin, L.K.; Maréchal-Drouard, L. The Chlamydomonas genome reveals the evolution of key animal and plant functions. Science 2007, 318, 245–250. [Google Scholar] [CrossRef] [PubMed]
  48. Kane, N.C.; Rieseberg, L.H.J.G. Selective sweeps reveal candidate genes for adaptation to drought and salt tolerance in common sunflower, Helianthus annuus. Genetics 2007, 175, 1823–1834. [Google Scholar] [CrossRef]
  49. Finn, R.D.; Clements, J.; Arndt, W.; Miller, B.L.; Wheeler, T.J.; Schreiber, F.; Bateman, A.; Eddy, S.R. HMMER web server: 2015 update. Nucleic Acids Res. 2015, 43, W30–W38. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  50. Jones, P.; Binns, D.; Chang, H.-Y.; Fraser, M.; Li, W.; McAnulla, C.; McWilliam, H.; Maslen, J.; Mitchell, A.; Nuka, G. InterProScan 5: Genome-scale protein function classification. Bioinformatics 2014, 30, 1236–1240. [Google Scholar] [CrossRef]
  51. Finn, R.D.; Bateman, A.; Clements, J.; Coggill, P.; Eberhardt, R.Y.; Eddy, S.R.; Heger, A.; Hetherington, K.; Holm, L.; Mistry, J. Pfam: The protein families database. Nucleic Acids Res. 2013, 42, W30–W38. [Google Scholar]
  52. Bailey, T.L.; Elkan, C. Fitting a mixture model by expectation maximization to discover motifs in bipolymers. Proc. Int. Conf. Intell. Syst. Mol. Biol. 1994, 2, 28–36. [Google Scholar] [PubMed]
  53. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef]
  54. Emanuelsson, O.; Brunak, S.; von Heijne, G.; Nielsen, H. Locating proteins in the cell using TargetP, SignalP and related tools. Nat. Prot. 2007, 2, 953–971. [Google Scholar] [CrossRef] [PubMed]
  55. Thompson, J.D.; Higgins, D.G.; Gibson, T.J. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22, 4673–4680. [Google Scholar] [CrossRef] [PubMed]
  56. Edgar, R.C. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32, 1792–1797. [Google Scholar] [CrossRef] [PubMed]
  57. Kearse, M.; Moir, R.; Wilson, A.; Stones-Havas, S.; Cheung, M.; Sturrock, S.; Buxton, S.; Cooper, A.; Markowitz, S.; Duran, C. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, 28, 1647–1649. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  58. Kumar, S.; Stecher, G.; Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 2016, 33, 1870–1874. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  59. Tamura, K.; Battistuzzi, F.U.; Billing-Ross, P.; Murillo, O.; Filipski, A.; Kumar, S. Estimating divergence times in large molecular phylogenies. Proc. Natl. Acad. Sci. USA 2012, 109, 19333–19338. [Google Scholar] [CrossRef] [Green Version]
  60. Kumar, S.; Stecher, G.; Suleski, M.; Hedges, S.B. Evolution TimeTree: A resource for timelines, timetrees, and divergence times. Mol. Biol. Evol. 2017, 34, 1812–1819. [Google Scholar] [CrossRef]
  61. Hu, B.; Jin, J.; Guo, A.-Y.; Zhang, H.; Luo, J.; Gao, G. GSDS 2.0: An upgraded gene feature visualization server. Bioinformatics 2015, 31, 1296–1297. [Google Scholar] [CrossRef]
  62. He, S. Genome-wide identification and transcriptional expression analysis of mitogen-activated protein kinase and mitogen-activated protein kinase kinase genes in Capsicum annuum. Front. Plant Sci. 2015, 6, 780. [Google Scholar]
  63. Liang, W.; Yang, B.; Yu, B.-J.; Zhou, Z.; Li, C.; Jia, M.; Sun, Y.; Zhang, Y.; Wu, F.; Zhang, H. Identification and analysis of MKK and MPK gene families in canola (Brassica napus L.). BMC Genom. 2013, 14, 392. [Google Scholar] [CrossRef]
  64. Wang, J.; Pan, C.; Wang, Y.; Ye, L.; Wu, J.; Chen, L.; Zou, T.; Lu, G. Genome-wide identification of MAPK, MAPKK, and MAPKKK gene families and transcriptional profiling analysis during development and stress response in cucumber. BMC Genom. 2015, 16, 386. [Google Scholar] [CrossRef]
  65. Liu, Y.; Zhang, D.; Wang, L.; Li, D. Genome-Wide Analysis of Mitogen-Activated Protein Kinase Gene Family in Maize. Plant Mol. Biol. Rep. 2013, 31, 1446–1460. [Google Scholar] [CrossRef]
  66. Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data; Babraham Institute: Cambridge, UK, 2010. [Google Scholar]
  67. Kong, Y.J.G. Btrim: A fast, lightweight adapter and quality trimming program for next-generation sequencing technologies. Genomics 2011, 98, 152–153. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  68. Patro, R.; Duggal, G.; Love, M.I.; Irizarry, R.A.; Kingsford, C. Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods 2017, 14, 417–419. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  69. Grüning, B.; Dale, R.; Sjödin, A.; Chapman, B.A.; Rowe, J.; Tomkins-Tinch, C.H.; Valieris, R.; Köster, J.; Bioconda, T. Bioconda: Sustainable and comprehensive software distribution for the life sciences. Nat. Methods 2018, 15, 475–476. [Google Scholar] [CrossRef] [PubMed]
  70. Ge, S.X. iDEP: An integrated web application for differential expression and pathway analysis. bioRxiv 2017. [Google Scholar] [CrossRef]
  71. Dai, X.; Zhao, P.X. psRNATarget: A plant small RNA target analysis server. Nucleic Acids Res. 2011, 39, W155–W159. [Google Scholar] [CrossRef]
  72. Tajima, F. Simple methods for testing the molecular evolutionary clock hypothesis. Genetics 1993, 135, 599–607. [Google Scholar]
  73. Tajima, F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 1989, 123, 585–595. [Google Scholar]
  74. Lamesch, P.; Berardini, T.Z.; Li, D.; Swarbreck, D.; Wilks, C.; Sasidharan, R.; Muller, R.; Dreher, K.; Alexander, D.L.; Garcia-Hernandez, M. The Arabidopsis Information Resource (TAIR): Improved gene annotation and new tools. Nucleic Acids Res. 2011, 40, D1202–D1210. [Google Scholar] [CrossRef]
  75. Schmutz, J.; Cannon, S.B.; Schlueter, J.; Ma, J.; Mitros, T.; Nelson, W.; Hyten, D.L.; Song, Q.; Thelen, J.J.; Cheng, J. Genome sequence of the palaeopolyploid soybean. Nature 2010, 463, 178–183. [Google Scholar] [CrossRef]
  76. Ouyang, S.; Zhu, W.; Hamilton, J.; Lin, H.; Campbell, M.; Childs, K.; Thibaud-Nissen, F.; Malek, R.L.; Lee, Y.; Zheng, L. The TIGR rice genome annotation resource: Improvements and new features. Nucleic Acids Res. 2006, 35, D883–D887. [Google Scholar] [CrossRef] [PubMed]
  77. Jaillon, O.; Aury, J.-M.; Noel, B.; Policriti, A.; Clepet, C.; Casagrande, A.; Choisne, N.; Aubourg, S.; Vitulo, N.; Jubin, C. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 2007, 449, 463. [Google Scholar] [PubMed]
  78. Mohanta, T.K.; Park, Y.-H.; Bae, H. Novel genomic and evolutionary insight of WRKY transcription factors in plant lineage. Sci. Rep. 2016, 6, 37309. [Google Scholar] [CrossRef] [PubMed]
  79. Xu, J.; Zhang, S. Mitogen-activated protein kinase cascades in signaling plant growth and development. Trends Plant Sci. 2015, 20, 56–64. [Google Scholar] [CrossRef] [PubMed]
  80. De Smet, I.; Voß, U.; Jürgens, G.; Beeckman, T. Receptor-like kinases shape the plant. Nat. Cell Biol. 2009, 11, 1166. [Google Scholar] [CrossRef]
  81. Li, J.; Tax, F.E. Receptor-like kinases: Key regulators of plant development and defense. J. Integr. Plant Biol. 2013, 55, 1184–1187. [Google Scholar] [CrossRef] [PubMed]
  82. Lee, J.S.; Wang, S.; Sritubtim, S.; Chen, J.G.; Ellis, B.E. Arabidopsis mitogen-activated protein kinase MPK12 interacts with the MAPK phosphatase IBR5 and regulates auxin signaling. Plant J. 2009, 57, 975–985. [Google Scholar] [CrossRef] [PubMed]
  83. Dai, Y.; Wang, H.; Li, B.; Huang, J.; Liu, X.; Zhou, Y.; Mou, Z.; Li, J. Increased expression of MAP KINASE KINASE7 causes deficiency in polar auxin transport and leads to plant architectural abnormality in Arabidopsis. Plant Cell 2006, 18, 308–320. [Google Scholar] [CrossRef] [PubMed]
  84. Kim, H.S.; Park, S.-C.; Ji, C.Y.; Park, S.; Jeong, J.C.; Lee, H.-S.; Kwak, S.-S. Molecular characterization of biotic and abiotic stress-responsive MAP kinase genes, IbMPK3 and IbMPK6, in sweetpotato. Plant Physiol. Biochem. 2016, 108, 37–48. [Google Scholar] [CrossRef] [PubMed]
  85. Abass, M.; Morris, P.C. The Hordeum vulgare signalling protein MAP kinase 4 is a regulator of biotic and abiotic stress responses. J. Plant Physiol. 2013, 170, 1353–1359. [Google Scholar] [CrossRef]
  86. Jalmi, S.K.; Sinha, A.K. ROS mediated MAPK signaling in abiotic and biotic stress-striking similarities and differences. Front. Plant Sci. 2015, 6, 769. [Google Scholar] [CrossRef] [PubMed]
  87. Jaggi, M.; Gupta, M.; Tuteja, N.; Sinha, A.K. Mitogen-Activated Protein Kinases in Abiotic Stress Tolerance in Crop Plants:“-Omics”Approaches. Improv. Crop Prod. Sustain. Agric. 2013, 107–132. [Google Scholar]
  88. Taj, G.; Agarwal, P.; Grant, M.; Kumar, A. MAPK machinery in plants: Recognition and response to different stresses through multiple signal transduction pathways. Plant Signal. Behav. 2010, 5, 1370–1378. [Google Scholar] [CrossRef] [PubMed]
  89. Mishra, N.S.; Tuteja, R.; Tuteja, N. Signaling through MAP kinase networks in plants. Arch. Biochem. Biophys. 2006, 452, 55–68. [Google Scholar] [CrossRef] [PubMed]
  90. Rasmussen, M.W.; Roux, M.; Petersen, M.; Mundy, J. MAP kinase cascades in Arabidopsis innate immunity. Front. Plant Sci. 2012, 3, 169. [Google Scholar] [CrossRef]
  91. Kültz, D. Evolution of osmosensory MAP kinase signaling pathways. Am. Zool. 2001, 41, 743–757. [Google Scholar] [CrossRef]
  92. Lee, D.; Redfern, O.; Orengo, C. Predicting protein function from sequence and structure. Nat. Rev. Mol. Cell Biol. 2007, 8, 995. [Google Scholar] [CrossRef]
  93. Kong, F.; Wang, J.; Cheng, L.; Liu, S.; Wu, J.; Peng, Z.; Lu, G. Genome-wide analysis of the mitogen-activated protein kinase gene family in Solanum lycopersicum. Gene 2012, 499, 108–120. [Google Scholar] [CrossRef]
  94. Liu, Q.; Xue, Q. Computational identification and phylogenetic analysis of the MAPK gene family in Oryza sativa. Plant Physiol. Biochem. 2007, 45, 6–14. [Google Scholar] [CrossRef]
  95. Barker, M.S.; Li, Z.; Kidder, T.I.; Reardon, C.R.; Lai, Z.; Oliveira, L.O.; Scascitelli, M.; Rieseberg, L.H. Most Compositae (Asteraceae) are descendants of a paleohexaploid and all share a paleotetraploid ancestor with the Calyceraceae. Am. J. Bot. 2016, 103, 1203–1211. [Google Scholar] [CrossRef] [Green Version]
  96. Gill, N.; Findley, S.; Walling, J.G.; Hans, C.; Ma, J.; Doyle, J.; Stacey, G.; Jackson, S.A. Molecular and chromosomal evidence for allopolyploidy in soybean. Plant Physiol. 2009, 151, 1167–1174. [Google Scholar] [CrossRef] [PubMed]
  97. Soltis, P.S.; Soltis, D.E. The role of genetic and genomic attributes in the success of polyploids. Proc. Natl. Acad. Sci. USA 2000, 97, 7051–7057. [Google Scholar] [CrossRef] [Green Version]
  98. Gottlieb, L. Conservation and duplication of isozymes in plants. Science 1982, 216, 373–380. [Google Scholar] [CrossRef]
  99. Seiler, G. Wild annual Helianthus anomalus and H. deserticola for improving oil content and quality in sunflower. Ind. Crops Prod. 2007, 25, 95–100. [Google Scholar] [CrossRef]
  100. Colcombet, J.; Hirt, H. Arabidopsis MAPKs: A complex signalling network involved in multiple biological processes. Biochem. J. 2008, 413, 217–226. [Google Scholar] [CrossRef] [PubMed]
  101. Nicole, M.-C.; Hamel, L.-P.; Morency, M.-J.; Beaudoin, N.; Ellis, B.E.; Séguin, A. MAP-ping genomic organization and organ-specific expression profiles of poplar MAP kinases and MAP kinase kinases. BMC Genom. 2006, 7, 223. [Google Scholar] [CrossRef] [PubMed]
  102. Rao, K.P.; Richa, T.; Kumar, K.; Raghuram, B.; Sinha, A.K. In silico analysis reveals 75 members of mitogen-activated protein kinase kinase kinase gene family in rice. DNA Res. 2010, 17, 139–153. [Google Scholar] [CrossRef] [PubMed]
  103. Wei, W.; Chai, Z.; Xie, Y.; Gao, K.; Cui, M.; Jiang, Y.; Feng, J.J.P.o. Bioinformatics identification and transcript profile analysis of the mitogen-activated protein kinase gene family in the diploid woodland strawberry Fragaria vesca. PLoS ONE 2017, 12, e0178596. [Google Scholar] [CrossRef] [PubMed]
  104. Zhang, S.; Liu, Y.; Klessig, D.F. Multiple levels of tobacco WIPK activation during the induction of cell death by fungal elicitins. Plant J. 2000, 23, 339–347. [Google Scholar] [CrossRef] [Green Version]
  105. Wang, J.; Ding, H.; Zhang, A.; Ma, F.; Cao, J.; Jiang, M. A novel mitogen-activated protein kinase gene in maize (Zea mays), ZmMPK3, is involved in response to diverse environmental cues. J. Integr. Plant Biol. 2010, 52, 442–452. [Google Scholar] [CrossRef] [PubMed]
  106. Yuasa, T.; Ichimura, K.; Mizoguchi, T.; Shinozaki, K. Oxidative stress activates ATMPK6, an Arabidopsis homologue of MAP kinase. Plant Cell Physiol. 2001, 42, 1012–1016. [Google Scholar] [CrossRef] [PubMed]
  107. Ichimura, K.; Mizoguchi, T.; Yoshida, R.; Yuasa, T.; Shinozaki, K. Various abiotic stresses rapidly activate Arabidopsis MAP kinases ATMPK4 and ATMPK6. Plant J. 2000, 24, 655–665. [Google Scholar] [CrossRef] [PubMed]
  108. Teige, M.; Scheikl, E.; Eulgem, T.; Dóczi, R.; Ichimura, K.; Shinozaki, K.; Dangl, J.L.; Hirt, H. The MKK2 pathway mediates cold and salt stress signaling in Arabidopsis. Mol. Cell 2004, 15, 141–152. [Google Scholar] [CrossRef] [PubMed]
  109. Rohila, J.S.; Yang, Y. Rice mitogen-activated protein kinase gene family and its role in biotic and abiotic stress response. J. Integr. Plant Biol. 2007, 49, 751–759. [Google Scholar] [CrossRef]
  110. Danquah, A.; Zélicourt, A.; Boudsocq, M.; Neubauer, J.; Frei dit Frey, N.; Leonhardt, N.; Pateyron, S.; Gwinner, F.; Tamby, J.P.; Ortiz-Masia, D. Identification and characterization of an ABA-activated MAP kinase cascade in Arabidopsis thaliana. Plant J. 2015, 82, 232–244. [Google Scholar] [CrossRef] [PubMed]
  111. Cheong, Y.H.; Moon, B.C.; Kim, J.K.; Kim, C.Y.; Kim, M.C.; Kim, I.H.; Park, C.Y.; Kim, J.C.; Park, B.O.; Koo, S.C. BWMK1, a rice mitogen-activated protein kinase, locates in the nucleus and mediates pathogenesis-related gene expression by activation of a transcription factor. Plant Physiol. 2003, 132, 1961–1972. [Google Scholar] [CrossRef] [PubMed]
  112. Schoenbeck, M.A.; Samac, D.A.; Fedorova, M.; Gregerson, R.G.; Gantt, J.S.; Vance, C.P. The alfalfa (Medicago sativa) TDY1 gene encodes a mitogen-activated protein kinase homolog. Mol. Plant-Microbe Interact. 1999, 12, 882–893. [Google Scholar] [CrossRef] [PubMed]
  113. Mizoguchi, T.; Irie, K.; Hirayama, T.; Hayashida, N.; Yamaguchi-Shinozaki, K.; Matsumoto, K.; Shinozaki, K. A gene encoding a mitogen-activated protein kinase kinase kinase is induced simultaneously with genes for a mitogen-activated protein kinase and an S6 ribosomal protein kinase by touch, cold, and water stress in Arabidopsis thaliana. Proc. Natl. Acad. Sci. USA 1996, 93, 765–769. [Google Scholar] [CrossRef] [PubMed]
  114. Im, J.H.; Lee, H.; Kim, J.; Kim, H.B.; Seyoung, K.; Kim, B.M.; An, C.S. A salt stress-activated mitogen-activated protein kinase in soybean is regulated by phosphatidic acid in early stages of the stress response. J. Plant Biol. 2012, 55, 303–309. [Google Scholar] [CrossRef]
  115. Zong, X.-j.; Li, D.-p.; Gu, L.-k.; Li, D.-q.; Liu, L.-x.; Hu, X.-l. Abscisic acid and hydrogen peroxide induce a novel maize group C MAP kinase gene, ZmMPK7, which is responsible for the removal of reactive oxygen species. Planta 2009, 229, 485. [Google Scholar] [CrossRef] [PubMed]
  116. Zhang, X.; Xu, X.; Yu, Y.; Chen, C.; Wang, J.; Cai, C.; Guo, W.J.S.r. Integration analysis of MKK and MAPK family members highlights potential MAPK signaling modules in cotton. Sci. Rep. 2016, 6, 29781. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  117. Lespinet, O.; Wolf, Y.I.; Koonin, E.V.; Aravind, L. The role of lineage-specific gene family expansion in the evolution of eukaryotes. Genome Res. 2002, 12, 1048–1059. [Google Scholar] [CrossRef] [PubMed]
  118. Matsuoka, D.; Nanmori, T.; Sato, K.i.; Fukami, Y.; Kikkawa, U.; Yasuda, T. Activation of AtMEK1, an Arabidopsis mitogen-activated protein kinase kinase, in vitro and in vivo: Analysis of active mutants expressed in E. coli and generation of the active form in stress response in seedlings. Plant J. 2002, 29, 637–647. [Google Scholar] [CrossRef] [PubMed]
  119. Ren, D.; Yang, H.; Zhang, S. Cell death mediated by MAPK is associated with hydrogen peroxide production in Arabidopsis. J. Biol. Chem. 2002, 277, 559–565. [Google Scholar] [CrossRef] [PubMed]
  120. Hwa, C.-M.; Yang, X.-C. The AtMKK3 pathway mediates ABA and salt signaling in Arabidopsis. Acta Physiol. Planta 2008, 30, 277–286. [Google Scholar] [CrossRef]
  121. Liu, Y. Roles of mitogen-activated protein kinase cascades in ABA signaling. Plant Cell Rep. 2012, 31, 1–12. [Google Scholar] [CrossRef]
  122. Zhong, B.; Xi, Z.; Goremykin, V.V.; Fong, R.; Mclenachan, P.A.; Novis, P.M.; Davis, C.C.; Penny, D. Streptophyte algae and the origin of land plants revisited using heterogeneous models with three new algal chloroplast genomes. Mol. Biol. Evol. 2013, 31, 177–183. [Google Scholar] [CrossRef]
  123. Jansen, R.K.; Cai, Z.; Raubeson, L.A.; Daniell, H.; Leebens-Mack, J.; Müller, K.F.; Guisinger-Bellian, M.; Haberle, R.C.; Hansen, A.K.; Chumley, T.W.; et al. Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns. Proc. Natl. Acad. Sci. USA 2007, 104, 19369–19374. [Google Scholar] [CrossRef] [Green Version]
  124. Stevens, P.F.; Davis, H. Angiosperm Phylogeny Website; Missouri Botanical Garden: St. Louis, MO, USA, 2001. [Google Scholar]
  125. Mockaitis, K.; Howell, S.H.J.T.P.J. Auxin induces mitogenic activated protein kinase (MAPK) activation in roots of Arabidopsis seedlings. Plant J. 2000, 24, 785–796. [Google Scholar] [CrossRef]
  126. Seo, S.; Katou, S.; Seto, H.; Gomi, K.; Ohashi, Y. The mitogen-activated protein kinases WIPK and SIPK regulate the levels of jasmonic and salicylic acids in wounded tobacco plants. Plant J. 2007, 49, 899–909. [Google Scholar] [CrossRef] [Green Version]
  127. Zhou, C.; Cai, Z.; Guo, Y.; Gan, S. An Arabidopsis mitogen-activated protein kinase cascade, MKK9-MPK6, plays a role in leaf senescence. Plant Physiol. 2009, 150, 167–177. [Google Scholar] [CrossRef] [PubMed]
  128. Wu, T.; Kong, X.P.; Zong, X.J.; Li, D.P.; Li, D.Q. Expression analysis of five maize MAP kinase genes in response to various abiotic stresses and signal molecules. Mol. Biol. Rep. 2011, 38, 3967–3975. [Google Scholar] [CrossRef] [PubMed]
  129. Zhang, J.; Zou, D.; Li, Y.; Sun, X.; Wang, N.-N.; Gong, S.-Y.; Zheng, Y.; Li, X.B. GhMPK17, a cotton mitogen-activated protein kinase, is involved in plant response to high salinity and osmotic stresses and ABA signaling. PLoS ONE 2014, 9, e95642. [Google Scholar]
  130. Chakraborty, C.; Sharma, A.R.; Patra, B.C.; Bhattacharya, M.; Sharma, G.; Lee, S.-S. MicroRNAs mediated regulation of MAPK signaling pathways in chronic myeloid leukemia. Oncotarget 2016, 7, 42683. [Google Scholar] [CrossRef] [PubMed]
  131. Liu, Z.; Zhang, J.; Gao, J.; Li, Y. MicroRNA-4728 mediated regulation of MAPK oncogenic signaling in papillary thyroid carcinoma. Saudi J. Biol. Sci. 2018, 25, 986–990. [Google Scholar] [CrossRef]
  132. Johnson, S.M.; Grosshans, H.; Shingara, J.; Byrom, M.; Jarvis, R.; Cheng, A.; Labourier, E.; Reinert, K.L.; Brown, D.; Slack, F.J. RAS is regulated by the let-7 microRNA family. Cell 2005, 120, 635–647. [Google Scholar] [CrossRef]
  133. Li, S.; Yu, B. miRNA limits MAP kinase-mediated immunity: Optimization of plant fitness. J. Exp. Bot. 2017, 68, 5685. [Google Scholar] [CrossRef]
  134. Raghuram, B.; Sheikh, A.H.; Sinha, A.K. Regulation of MAP kinase signaling cascade by microRNAs in Oryza sativa. Plant Signal. Behav. 2014, 9, e972130. [Google Scholar] [CrossRef]
Figure 1. Schematic representation of in silico approaches used in the identification of MPK and MKK genes in seven plant species and their downstream analyses.
Figure 1. Schematic representation of in silico approaches used in the identification of MPK and MKK genes in seven plant species and their downstream analyses.
Plants 08 00028 g001
Figure 2. Chromosomal distribution of MPK and MKK genes in sunflower (n = 17). Color-coded arrows represent MAP Kinase gene types and their orientation on the chromosome indicated by the black line.
Figure 2. Chromosomal distribution of MPK and MKK genes in sunflower (n = 17). Color-coded arrows represent MAP Kinase gene types and their orientation on the chromosome indicated by the black line.
Plants 08 00028 g002
Figure 3. Maximum Likelihood (ML) tree constructed using full length amino acid sequences from Amborella trichopoda (Amt), Arabidopsis thaliana (At), Aquilegia coerulea (Ac), Chlamydomonas reinhardtii (Cre), Daucus carota (Dc), Glycine max (Gm), Helianthus annuus (Ha), Oryza sativa (Os), Solanum lycopersicum (Sl), and Sphagnum fallax (Sf), and Vitis vinifera (Vv) MPK proteins. Phylogenetic analysis with 100 bootstrap replicates was performed in the program MEGA7. Homo sapiens, HsMAPK1 (GenBank: NP_002736.3) was used as an outgroup. Different species are color-coded, and the MPK clades are labeled A–D. The Clade A members include the previously identified group A (MPK3, MPK6, MPK10) and B (MPK4, MPK5, MPK11, MPK12, MPK13) members of A. thaliana MPKs [3,4]. The Clade B members include the previously identified group C (MPK1, MPK2, MPK7, and MPK14) members of A. thaliana MPKs [3,4]. The Clade C members include the previously identified group E (MPK22 and MPK23) members of soybean MPKs [26]. The Clade D members include the previously identified group D (MPK8, MPK9, MPK16, MPK17, MPK18, MPK19, MPK20, and MPK21) members of A. thaliana MPKs [3,4].
Figure 3. Maximum Likelihood (ML) tree constructed using full length amino acid sequences from Amborella trichopoda (Amt), Arabidopsis thaliana (At), Aquilegia coerulea (Ac), Chlamydomonas reinhardtii (Cre), Daucus carota (Dc), Glycine max (Gm), Helianthus annuus (Ha), Oryza sativa (Os), Solanum lycopersicum (Sl), and Sphagnum fallax (Sf), and Vitis vinifera (Vv) MPK proteins. Phylogenetic analysis with 100 bootstrap replicates was performed in the program MEGA7. Homo sapiens, HsMAPK1 (GenBank: NP_002736.3) was used as an outgroup. Different species are color-coded, and the MPK clades are labeled A–D. The Clade A members include the previously identified group A (MPK3, MPK6, MPK10) and B (MPK4, MPK5, MPK11, MPK12, MPK13) members of A. thaliana MPKs [3,4]. The Clade B members include the previously identified group C (MPK1, MPK2, MPK7, and MPK14) members of A. thaliana MPKs [3,4]. The Clade C members include the previously identified group E (MPK22 and MPK23) members of soybean MPKs [26]. The Clade D members include the previously identified group D (MPK8, MPK9, MPK16, MPK17, MPK18, MPK19, MPK20, and MPK21) members of A. thaliana MPKs [3,4].
Plants 08 00028 g003
Figure 4. P-loop, catalytic C-loop, and activation or T-loop motifs representing variations in Clades A–D. (a) MPK; (b) MKK.
Figure 4. P-loop, catalytic C-loop, and activation or T-loop motifs representing variations in Clades A–D. (a) MPK; (b) MKK.
Plants 08 00028 g004
Figure 5. Maximum Likelihood (ML) tree constructed using full-length MKK amino acid sequences from Amborella trichopoda (Amt), Arabidopsis thaliana (At), Aquilegia coerulea (Ac), Chlamydomonas reinhardtii (Cre), Daucus carota (Dc), Glycine max (Gm), Helianthus annuus (Ha), Oryza sativa (Os), Solanum lycopersicum (Sl), and Sphagnum fallax (Sf), and Vitis vinifera (Vv). Phylogenetic analysis with 100 bootstrap replicates was performed in the program MEGA7. Homo sapiens, HsMAPKK1 (GenBank: AAI37460.1) was used as an outgroup. Different species are color-coded, and the MKK clades are labeled A–D. Clade A, B, C, and D members include the previously identified group A (MKK1, MKK2, and MKK6), group B (MKK3), group C (MKK4, MKK5), and group D (MKK7, MKK8, MKK9, and MKK10) members of A. thaliana MKKs, respectively [3,4].
Figure 5. Maximum Likelihood (ML) tree constructed using full-length MKK amino acid sequences from Amborella trichopoda (Amt), Arabidopsis thaliana (At), Aquilegia coerulea (Ac), Chlamydomonas reinhardtii (Cre), Daucus carota (Dc), Glycine max (Gm), Helianthus annuus (Ha), Oryza sativa (Os), Solanum lycopersicum (Sl), and Sphagnum fallax (Sf), and Vitis vinifera (Vv). Phylogenetic analysis with 100 bootstrap replicates was performed in the program MEGA7. Homo sapiens, HsMAPKK1 (GenBank: AAI37460.1) was used as an outgroup. Different species are color-coded, and the MKK clades are labeled A–D. Clade A, B, C, and D members include the previously identified group A (MKK1, MKK2, and MKK6), group B (MKK3), group C (MKK4, MKK5), and group D (MKK7, MKK8, MKK9, and MKK10) members of A. thaliana MKKs, respectively [3,4].
Plants 08 00028 g005
Figure 6. Expression profile of sunflower MPK and MKK genes visualized as a heatmap, with clade information. The heatmap was generated using log2FC values. The expression pattern is in response to salicylic acid (SA), salt (NaCl) and polyethylene glycol (Peg) in leaves and roots. The RNA-seq data was accessed from NCBI SRA SRP092742 [SRR4996815 (Roots Peg), SRR4996819 (Roots_NaCl), SRR4996823 (Leaves_Peg), SRR4996828 (Roots_Control), SRR4996834 (Leaves_NaCl), SRR4996836 (Leaves_Contol), SRR4996839 (Leaves_SA), and SRR4996847 (Roots_SA)].
Figure 6. Expression profile of sunflower MPK and MKK genes visualized as a heatmap, with clade information. The heatmap was generated using log2FC values. The expression pattern is in response to salicylic acid (SA), salt (NaCl) and polyethylene glycol (Peg) in leaves and roots. The RNA-seq data was accessed from NCBI SRA SRP092742 [SRR4996815 (Roots Peg), SRR4996819 (Roots_NaCl), SRR4996823 (Leaves_Peg), SRR4996828 (Roots_Control), SRR4996834 (Leaves_NaCl), SRR4996836 (Leaves_Contol), SRR4996839 (Leaves_SA), and SRR4996847 (Roots_SA)].
Plants 08 00028 g006
Table 1. Abundance of MPK and MKK genes in the genomes of 11 species used in this study.
Table 1. Abundance of MPK and MKK genes in the genomes of 11 species used in this study.
Plant SpeciesPloidySize of Genome (Mbs) γNo. of loci γMPKMKK
Amborella trichopoda Diploid7062684687
Aquilegia coerulea Diploid30224823115
Arabidopsis thalianaDiploid1352741620 a10 a
Chlamydomonas reinhardtii Haploid111.11774161
Daucus carota Diploid42132,113175
Glycine maxTetraploid9755604438 b11 b
Helianthus annuus Diploid360052243288
Oryza sativaDiploid3723904916 c8 c
Solanum lycopersicum Diploid90034727155
Sphagnum fallax Haploid/Diploid39526939116
Vitis viniferaDiploid4872634614 d5 d
= Plant species with MPKs and MKKs identified or revisited in this study; γ = References on the size of genome and number of loci Amborella trichopoda [44], Arabidopsis thaliana [74], Aquilegia coerulea [45], Chlamydomonas reinhardtii [47], Daucus carota [41], Glycine max [75], Helianthus annuus [42], Oryza sativa [76], Solanum lycopersicum [43], and Sphagnum fallax [46], and Vitis vinifera [77]; a = [10], b = [26], c = [4], d = [37]
Table 2. Sunflower MPK and MKK genes with their proposed name, GeneID, chromosomal location (Chr), strand direction (Str), start and end position of the genes on chromosome, protein length (PL), number of exon (Exo) and intron (Int), subcellular localization [Sl; M = Mitochondria and C = Chloroplast, - = Subcellular locations other than mitochondria or the chloroplast), isoelectric points (pI) and molecular weight (Mw)].
Table 2. Sunflower MPK and MKK genes with their proposed name, GeneID, chromosomal location (Chr), strand direction (Str), start and end position of the genes on chromosome, protein length (PL), number of exon (Exo) and intron (Int), subcellular localization [Sl; M = Mitochondria and C = Chloroplast, - = Subcellular locations other than mitochondria or the chloroplast), isoelectric points (pI) and molecular weight (Mw)].
NameGene IDChrStrStartEndPL ExoIntSlpIMw
MPK
HaMPK6-1HanXRQChr01g0023391Ha1-13030168613029296535965-5.8541581.61
HaMPK16-1HanXRQChr03g0071491Ha3-7737813777372246564109-9.1764059.43
HaMPK7HanXRQChr03g0074811Ha3+10241016110240616935332-7.6240274.83
HaMPK23-1HanXRQChr03g0081221Ha3+1299784431299734524531514-9.6550392.28
HaMPK23-3HanXRQChr03g0081391Ha3+1305061621305000134231615-8.9147648.79
HaMPK22HanXRQChr04g0108301Ha4-77321727773159704321817-5.4649633.87
HaMPK11-1HanXRQChr04g0121371Ha4+15878145115877822135865M6.4241228.21
HaMPK3-1HanXRQChr05g0133161Ha5+210642252106108935865-5.6841323.35
HaMPK8HanXRQChr05g0143371Ha5-1167746381167679235051110-6.857051.89
HaMPK2HanXRQChr05g0151241Ha5-16957475016957160934932-6.5440295.67
HaMPK11-2HanXRQChr06g0167011Ha6-7104659709987035965M6.2541336.17
HaMPK4HanXRQChr06g0170261Ha6+168942921689310015721M8.3617702.54
HaMPK13-1HanXRQChr06g0175501Ha6-346352513463152836376-5.2241353.31
HaMPK9-1HanXRQChr06g0183531Ha6+90706107906993124781110-6.5354442.91
HaMPK23-4HanXRQChr08g0226701Ha8+84318787843083814421817-9.5249480.06
HaMPK15HanXRQChr08g0227231Ha8+87599490875915775011110-8.5357073.07
HaMPK3-2HanXRQChr08g0229941Ha8-10101312710100986435865-5.5841298.31
HaMPK13-2HanXRQChr08g0230171Ha8-10280822910280425236265-5.8541552.83
HaMPK14HanXRQChr09g0243011Ha9+346731543466929236232-5.5741423.42
HaMPK16-2HanXRQChr09g0248301Ha9+7621239876202758559109-9.0763370.4
HaMPK1HanXRQChr09g0269211Ha9-18508634718508382536132-6.6441831.44
HaMPK19-2HanXRQChr11g0330461Ha11+437913214378498957498-9.3365344.85
HaMPK6-2HanXRQChr11g0343001Ha11-12596786612596337435965-5.841553.72
HaMPK19-1HanXRQChr13g0389781Ha13-1904831519044532588109-9.0666613.36
HaMPK23-2HanXRQChr13g0411961Ha13-1426344421426255114591817-9.6350984.95
HaMPK9-2HanXRQChr14g0432771Ha14-4968329049679650484109-6.5755530.13
HaMPK17HanXRQChr15g0484561Ha15-84424855844206534291110-6.2449909.6
HaMPK18HanXRQChr15g0495321Ha15-16015501216014927356398-9.4764374.62
MKK
HaMKK9HanXRQChr03g0087071Ha3-14842490214842582530810M6.7534332.34
HaMKK4HanXRQChr04g0094171Ha4+47174347281635110C9.0438917.18
HaMKK6-1HanXRQChr09g0238861Ha9+9311933932291635787-6.7639934.36
HaMKK5HanXRQChr10g0311571Ha10+21960489921960600435510C9.2539840.46
HaMKK6-2HanXRQChr10g0318871Ha10+24405604424406418535587-7.1339751.09
HaMKK2HanXRQChr10g0319531Ha10-24531827424532411837198-5.4340967.01
HaMKK1HanXRQChr12g0354521Ha12-12362781243005358109-5.7739199.81
HaMKK3HanXRQChr14g0450561Ha14-1415791161415871705201211M5.7968568.6
Table 3. Consensus common docking sites in the MPK proteins belonging to clades A–D.
Table 3. Consensus common docking sites in the MPK proteins belonging to clades A–D.
CladesConsensus Common Docking Sites
Clade AK-M-L-V-F-D-P-N-K-R-I-V-E-E-A-L
Clade BK-M-L-V-F-D-P-S-K-R-I-S-V-T-E-A-L
Clade CS-L-C-S-W-D-P-C-K-R-P-T-A-E-E-A-L
Clade DR-L-L-A-F-D-P-K-D-R-P-T-A-E-E-A-L

Share and Cite

MDPI and ACS Style

Neupane, S.; Schweitzer, S.E.; Neupane, A.; Andersen, E.J.; Fennell, A.; Zhou, R.; Nepal, M.P. Identification and Characterization of Mitogen-Activated Protein Kinase (MAPK) Genes in Sunflower (Helianthus annuus L.). Plants 2019, 8, 28. https://doi.org/10.3390/plants8020028

AMA Style

Neupane S, Schweitzer SE, Neupane A, Andersen EJ, Fennell A, Zhou R, Nepal MP. Identification and Characterization of Mitogen-Activated Protein Kinase (MAPK) Genes in Sunflower (Helianthus annuus L.). Plants. 2019; 8(2):28. https://doi.org/10.3390/plants8020028

Chicago/Turabian Style

Neupane, Surendra, Sarah E. Schweitzer, Achal Neupane, Ethan J. Andersen, Anne Fennell, Ruanbao Zhou, and Madhav P. Nepal. 2019. "Identification and Characterization of Mitogen-Activated Protein Kinase (MAPK) Genes in Sunflower (Helianthus annuus L.)" Plants 8, no. 2: 28. https://doi.org/10.3390/plants8020028

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop