Next Article in Journal
Mixed-Ploidy and Dysploidy in Hypericum perforatum: A Karyomorphological and Genome Size Study
Next Article in Special Issue
The Integration of Data from Different Long-Read Sequencing Platforms Enhances Proteoform Characterization in Arabidopsis
Previous Article in Journal
Towards a Discovery of a Zinc-Dependent Phosphate Transport Road in Plants
Previous Article in Special Issue
Investigation of Roles of TaTALE Genes during Development and Stress Response in Bread Wheat
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multiomic Data Integration in the Analysis of Drought-Responsive Mechanisms in Quercus ilex Seedlings

by
Víctor M. Guerrero-Sánchez
,
Cristina López-Hidalgo
,
María-Dolores Rey
,
María Ángeles Castillejo
,
Jesús V. Jorrín-Novo
and
Mónica Escandón
*,‡
Agroforestry and Plant Biochemistry, Proteomics and Systems Biology, Department of Biochemistry and Molecular Biology, University of Cordoba, UCO-CeiA3, 14014 Cordoba, Spain
*
Author to whom correspondence should be addressed.
Present address: Vascular Pathophysiology Area, Cardiovascular Proteomics Laboratory, Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), 28029 Madrid, Spain.
Present address: Plant Physiology, Department of Organisms and Systems Biology, Faculty of Biology and Biotechnology Institute of Asturias, University of Oviedo, 33071 Oviedo, Spain.
Plants 2022, 11(22), 3067; https://doi.org/10.3390/plants11223067
Submission received: 13 October 2022 / Revised: 3 November 2022 / Accepted: 10 November 2022 / Published: 12 November 2022

Abstract

:
The integrated analysis of different omic layers can provide new knowledge not provided by their individual analysis. This approach is also necessary to validate data and reveal post-transcriptional and post-translational mechanisms of gene expression regulation. In this work, we validated the possibility of applying this approach to non-model species such as Quercus ilex. Transcriptomics, proteomics, and metabolomics from Q. ilex seedlings subjected to drought-like conditions under the typical summer conditions in southern Spain were integrated using a non-targeted approach. Two integrative approaches, PCA and DIABLO, were used and compared. Both approaches seek to reduce dimensionality, preserving the maximum information. DIABLO also allows one to infer interconnections between the different omic layers. For easy visualization and analysis, these interconnections were analyzed using functional and statistical networks. We were able to validate results obtained by analyzing the omic layers separately. We identified the importance of protein homeostasis with numerous protease and chaperones in the networks. We also discovered new key processes, such as transcriptional control, and identified the key function of transcription factors, such as DREB2A, WRKY65, and CONSTANS, in the early response to drought.

Graphical Abstract

1. Introduction

Holm oak (Quercus ilex L.) is the dominant tree species in the Mediterranean forest and the key element of the Spanish agrosilvopastoral “dehesa”, with both ecological and economic importance [1]. Currently, several anthropogenic and environmental factors, such as aging trees, overexploitation, and abiotic and biotic stresses (drought, Phytophthora cinnamomi, etc.), threaten the sustainability and survival of Q. ilex and related ecosystems [2,3]. A combination of these stresses triggers the so-called Holm oak decline syndrome, leading to an increase in tree mortality and reducing the forested area. Holm oak is quite tolerant to drought and high temperatures [4]. However, the current climate change scenario, characterized by a drastic decrease in rainfall and a severe rise in temperatures, means that many areas where this species is found would no longer be suitable for its survival [5,6]. In addition, seedlings are less tolerant than adult trees, making water deprivation the main cause of the failure of reforestation programs [4]. For this reason, identifying and characterizing phenotypes more resilient to drought is a priority in plant breeding, management, and conservation programs in the Mediterranean area.
Since holm oak is a non-domesticated species and considering its biological characteristics (long-lived cycle, allogamy, and pollen dispersal by wind), the only viable alternative for its breeding is to exploit the biodiversity to select elite resilient genotypes [7,8]. For this purpose, it is necessary to characterize its inter- and intrapopulation variability and study its molecular mechanisms of resisting and tolerating biotic and abiotic stresses [9,10]. Although holm oak is a non-model species, the first draft of the genome has been recently published [11]. Furthermore, in recent years, great advances have been made through omic techniques [10,12,13], for example, a reference transcriptome with a fairly complete collection of holm oak genes [13] and a protein database that allows proteomic analysis through mass spectrometry technologies [14] have been obtained. In addition, the acorn and leaf metabolomes have been recently described [8,15].
When studying biological systems, traditionally, the different molecular levels analyzed by each omic technology have been studied by applying a reductionist and independent vision [16,17]. However, although each omic technology offers a lot of information, individual analysis cannot describe the entire biological complexity resulting from the relationships between the different molecular layers [18]. In addition, the molecular levels are influenced by the environment [19]. For this reason, the current trend is to combine different omic technologies to get a holistic view of the studied system [16,20]. This new approach provides a better understanding of the biology of the species and allows one to select stress-related resilience genes or gene products as markers in breeding programs that can help in the conservation of an endangered species, such as the holm oak.
The development of bioinformatics software has allowed us to carry out a joint analysis of the large amount of data generated by omic technologies to establish statistically significant relationships between the different molecular layers [21]. However, interpreting the results of such analysis remains a challenge even in model species. Some molecular analyses have been performed on holm oak [12] using a non-targeted omic analysis [8,22,23,24] or a non-targeted multiomic approach [14,25]. Our group has previously analyzed the effect of drought on Q. ilex and its response to drought by using morphology, classic biochemistry, physiology, and omic approaches [7,8,14,22,26,27,28,29]. Despite numerous studies, a global non-targeted multiomic analysis of the drought response of Q. ilex species has not been performed to date.
In this work, using a systems biology approach, we have evaluated the effect of drought stress on six-month-old seedlings to characterize the molecular mechanisms triggered under the stress response (Figure 1). With that aim, we performed a non-targeted integrative multiomic analysis (transcriptomics, proteomics, and metabolomics) of the omic data generated by our group in previous studies [8,14,22], carrying out complementary interaction network analysis. In addition, a multivariate principal component analysis (PCA) was performed for comparison with the non-targeted integrative multiomic analysis (transcriptome, proteome, and metabolome) and to supplement it. Non-targeted integrative multiomic analysis allowed us to evaluate and categorize the transcripts, proteins, and metabolites that define the response to drought in Q. ilex seedlings and to obtain potential biomarkers of resilience to be validated and employed in selecting elite, resilient genotypes for breeding and reforestation/conservation programs.

2. Results and Discussion

2.1. Comparison of Principal Component Analysis (PCA) and DIABLO Multiomic Data Analysis

To assess the response of holm oak to drought from a systems biology perspective and to see how a non-targeted multiomic approach can provide new knowledge on the biology of this species, we carried out different multivariate analyses. The first dimensionality reduction approach was based on the popular principal component analysis (PCA) using the entire dataset (47,569 transcripts; 3312 proteins; 342 metabolites) (Figure 2A, Table S1), and the other PCA was based on only those with differences between treatments with a p-value lower than 0.05 (4574 transcripts; 562 proteins; 81 metabolites) (Figure 2B, Table S1) in the three datasets. The number of variables was reduced to less than 6000 using the significant variables, reducing the noise level. However, the initial proportion of each omic layer was still retained. In the PCA plots (Figure 2A,B), well-watered seedlings (control) and seedlings exposed to drought-like conditions for two lengths of time (D1, 17 days and D2, 24 days) were separated when the original dataset was filtered for differential variables with p-value < 0.05 (Figure 2B). In both analyses, transcripts were the variables with more weight in components 1 and 2, whereas proteins and metabolites were represented within these variables with more weight in component 2 (Supplementary Table S1).
In the second dimensionality reduction approach, we performed a DIABLO (Data Integration Analysis for Biomarker discovery using Latent variable approaches for Omics studies) analysis, a method of integration and interrelation of multiple omic layers (Figure 2C and the partial DIABLO plots of each omic dataset). In DIABLO, the variables with the highest sample discrimination power are selected and the optimum number of variables within each omic dataset is chosen, thus reducing the complexity of the thousands of variables in the omic data. In our data, the optimal number of variables within the three analyzed omic layers for each component were: 350 transcripts (250 in component 1 (comp1) and 100 in component 2 (comp2)) out of a total of 47,569; 514 proteins (300 in comp1 and 300 in comp2, with 86 in both components) out of a total of 3312; and 71 metabolites (62 in comp1 and 10 in comp2, with 1 in both components) out of a total of 342 (Supplementary Table S2). Comparing the variables selected by DIABLO with the variables with the highest loadings in PCA, we could observe that the three omic datasets were more equally represented in DIABLO, which did not show a bias toward the largest dataset, i.e., transcriptomics.
DIABLO analysis also succeeded in separating the seedlings exposed to drought-like conditions for the two lengths of time (D1 and D2); Figure 2C. Comp1 separated D2 from the rest, and comp2 separated D1. Controls, on the other hand, were similar, unlike in PCA. Furthermore, in DIABLO analysis (Figure 2C), we can see how much variance was explained by each omic layer separately, seeing that the variables selected can do a relatively good job of separating the samples in the individual omic layers (Figure 2C, partial DIABLO plots). Metabolites were the major contributors in comp1 (39.41%), whereas transcripts were the major contributors in comp2 (38.68%). Proteins presented similar percentages (around 30–35%) in both components. Finally, the integration of the three omic layers allowed a better grouping between replicates, as seen in the boxes in Figure 2C.
To identify the dominant trends in the variables selected by DIABLO for treatment separation, a heatmap was created (Figure 3; Supplementary Figure S1). We can observe that the magnitude of the molecular changes due to the response to drought stress clustered treatments and replicates. Five clear clusters can be observed (Figure 3, from top to bottom): (i) early transient overexpression/accumulation in seedlings exposed to drought-like conditions (D1, early response); (ii) overexpression/accumulation in controls (C1 and C2); (iii) transient overexpression/accumulation at a later time in seedlings exposed to drought-like conditions (D2, later response); (iv) overexpression/accumulation at time 2 in both control seedlings and seedlings exposed to drought-like conditions (C2 and D2); and (v) permanent changes with overexpression/accumulation at both times in seedlings exposed to drought-like conditions (D1 and D2, permanent stress changes). Among these clusters, drought clusters are the most interesting and form the most abundant groups (D1, D1 and D2, and D2, from the highest to the lowest). In these main clusters, we can see a high percentage of transcripts and proteins related to cellular homeostasis, such as proteases and chaperones, necessary to maintain cellular protein function during drought stress [30]. D1 and D2 (permanent stress changes) is the group with the highest number of variables included in cellular homeostasis (small heat shock proteins, ClpBs, heat shock 70/80/90 Kda proteins, molecular chaperone GroEl, LEA, etc.), followed by the D2 (later response) group (ClpB2, small heat shock proteins, heat shock 70 Kda proteins, and DNAJ heat shock family proteins) and to a lesser extent the D1 (early response) group (heat shock 70/83 Kda proteins). In addition, redox homeostasis shows a higher percentage of variables in the D1 and D2 (permanent stress changes) group (thioredoxin H2/H9/F-type or reductase 1, nucleoredoxins, and several glutathione S-transferases or L-ascorbate peroxidases). Protein and redox homeostasis are key to the survival of plants under stress [31,32,33]. Heat shock proteins (HSPs) and proteases play a fundamental role in protecting plants under stress since they are responsible for maintaining protein conformation and folding or degrading damaged proteins [32,34]. In drought, in particular, the importance of these transcripts and proteins had already been seen in previous studies [14]. Moreover, antioxidant machinery activation is key to counteracting the overaccumulation of reactive oxygen species during stress [33,35].
At the metabolomic level, we have observed a slight metabolite accumulation in the later response (D2) group, where more metabolites were included, showing a higher abundance of several amino acids and peptides (Figure 3; Supplementary Figure S1), highlighting L-proline and L-phenylalanine. L-proline accumulation is a common response to several abiotic stresses, functioning as an osmotic agent and radical scavenger [36]. L-phenylalanine accumulation is related to drought response, and its accumulation has already been observed in other species of the genus Quercus, such as Q. suber [37]. This compound is an essential precursor of numerous secondary metabolites (flavonoids, coumarins, monolignols, etc.) as the initiator of the core phenylpropanoid biosynthesis pathway. This may be due to the importance of phenolic compounds in the biology of this species, being the most represented compounds in the Q. ilex metabolome [8].

2.2. DIABLO and STITCH Interaction Networks and Their Biological Interpretation

To visualize the interactions between the different omic layers, two types of interaction networks were created: a statistical one, obtained by DIABLO (Figure 4), and a functional one, using STITCH (chemical–protein interaction network) (Figure 5) [38]. Both networks were built with the variables selected by comp1 and comp2 of DIABLO (350 transcripts, 514 proteins, and 71 metabolites), finding a high percentage of homologs in the STRING/STITCH databases (309 transcripts, 501 proteins, and 71 metabolites). The networks presented were reduced to the strongest interactions (DIABLO) or the highest confidence (STITCH), established at 0.9, in both cases. At these thresholds, the variables selected in the two networks were different, providing two complementary views of the Q. ilex drought response. The DIABLO network contains mainly early response variables (Figure 4B), whereas STITCH contains mainly permanent changes during drought (D1 and D2) (Figure 5B).
Next, we discuss the key gene products and interconnections obtained from the integrated analysis and potentially of relevance in the response to drought.
As shown in Figure 4B and Figure 5B, in both networks, protein biosynthesis and homeostasis are the main processes, forming clear sub-networks made up of a large number of variables. STITCH shows a clear main ribosomal network, another one including components of the proteasome complex, and a third one including chaperones (Figure 5B). In DIABLO, there is no clear distribution except for the small network on the right, associated with stress and cellular homeostasis and including chaperones and proteases (Figure 4B). Stress response and protein homeostasis processes show variables in both networks, with several candidates in both interaction networks (such as the 17.5 kDa HSP family protein, several HSP70, and proteasome subunits). This group of proteins accumulated in D1, D1 and D2, and D2 (Figure 4B) has also been described in previous work [14], with a greater number of proteins of this family identified in this work, giving a more complete vision of the HSPs implicated in drought stress. In the small DIABLO network on the right, HSPs along with other stress proteins are interconnected by some metabolites, such as emodin and amino acids (L-proline and homoarginine). The anthraquinone emodin is a secondary compound affected by abiotic environmental factors, such as osmotic stresses (drought and salinity) and light intensity, [39] and its external application triggers a higher expression of HSPs [40]. L-proline (accumulated in D2; Figure 4B and Figure 5B) acts not only as an osmoticum (a substance that acts to supplement osmotic pressure in a plant) [36] but also as a molecular chaperone, preserving protein integrity and speeding up the activity of many enzymes [41], which could explain the direct relationship shown in the DIABLO network between these metabolites and HSPs. Furthermore, proline residues at the N-terminus of alpha helices have been shown to contribute to protein thermostability [42].
With respect to protease activity, we found two casein lytic proteinases, ClpB2 (qilexprot_8964), common in the two networks, and ClpB3 (contig-10000000), in the DIABLO network. An ATP-dependent Clp protease (qilexprot_22489) was also observed in the DIABLO network. These chaperones were highly represented in response to drought, and some of them were selected as candidates for drought tolerance in a study by Guerrero-Sanchez et al. [14]. Furthermore, these chaperones, together with other types of proteins, such as FTSH6 (qilexprot_61641, present in the small DIABLO network, and the other candidate proposed by Guerrero-Sanchez et al. [14]), have been described to be involved in the degradation of the light-harvesting complex of photosystem II (LHC II) during senescence or high light acclimation [43,44]. Loss of water could result in the plant being unable to maintain an adequate temperature and resist the high irradiance, because of which, it is forced to activate defense mechanisms against this type of stress resulting from drought stress in these environmental conditions.
Regarding the protein biosynthesis process, ribosomal proteins can be found as candidates in both networks (as protein and transcript variables) but are more represented in the STITCH network (the main circular network in Figure 5). Ribosomal proteins, such as 60S ribosomal protein L14-2 (qilexprot_78276), are implicated in several regulatory processes, such as drought [45]. The DIABLO network (Figure 4) showed two transcripts belonging to this group, 40S RIBOSOMAL PROTEIN S13, and 60S RIBOSOMAL PROTEIN L34, upregulated in D1. Both transcripts shared the same positive interconnections (as with elongation factors related to the biosynthesis process) and a negative one with 3-coumaric acid (precursor of the phenylpropanoid pathway or the phenylpropanoid–acetate pathway).
In addition to the predominant processes mentioned above, which to a greater or lesser extent have been seen in the previous analysis of the different omic layers [8,14,22], we were able to observe important new pathways or processes related to the response to drought, which will be discussed next.
When a plant faces unfavorable conditions, for example, drought, to survive it must modulate gene expression to optimize the response to overcome such conditions. This task is performed by transcription factors (TFs), which trigger the conversion of stress-induced signals into cellular responses [46]. A novelty in the DIABLO network, we can see a high number of TFs upregulated in early response (D1, Figure 4B), some related to stress (DREB2A-interacting protein 2 isoform 1, probable WRKY transcription factor 65, and heat stress transcription factor A2-b) [47], some related to the circadian clock (CONSTANS-like zinc finger protein, zinc finger protein ZPR1, and Zf-ZPR1 domain-containing protein) [48,49], and some related to growth and development processes (transcription factor bHLH74 and zinc finger BED domain-containing protein DAYSLEEPER) [50,51].
DREB2A (contig-1429000013) is a transcription factor that interacts with a cis-acting dehydration-responsive element (DRE) sequence and activates the expression of downstream genes involved in drought– and salt–stress response in Arabidopsis thaliana [52]. DREB2A, whose tendency changes after 17 days of drought, shows a relatively rapid plant response by triggering stress response mechanisms before the fluorescence drops below 50%. Furthermore, Arabidopsis DREB2A has a dual role in combating both water and heat-shock stress responses [52]. Drought and heat stress are highly interrelated. Both stresses are often combined under field conditions, and between them, an overlap of heat- and drought-responsive genes has been observed [53]. Therefore, the presence of heat stress TFs (such as heat stress transcription factor A2-b) is common in drought. Heat stress transcription factors are important transcription factors in both heat and drought stresses [47]. The transcriptional expression level of HSPs is regulated by heat stress TFs, as well as regulating other TFs, such as DREB2A [54]. However, little is known about heat stress transcription factor A2-b and its involvement in drought.
Lastly, among stress TFs, WRKY is a large plant family of TFs, which is induced by numerous abiotic stresses, such as drought and heat stress. In the DIABLO network, probable WRKY transcription factor 65 showed many connections with other elements in the upregulated network in the early response (D1, Figure 4B), positioned at the top of the network. Several WRKYs have been identified that enhance tolerance [55,56,57,58]. Recently, Huo et al. [59] found that ZmWRKY65, a gene in the maize, enhanced drought tolerance in Arabidopsis, proposed as a multifunctional factor that can integrate different stress signals in plants.
Candidates related to flowering and the circadian clock appear to be key in the short-time drought response (D1). Transcript factor CONSTANS (contig-4162000011) (Figure 4) promotes flowering and has been found to be related to the control of flowering and the circadian clock [48]. Moreover, in the network, we can see the presence of GIGANTEA (contig-291000017) (Figure 4 and Figure 5) linked with adagio protein 3 (contig-440000008) (Figure 5). The complex forming by GIGANTEA and adagio protein 3 regulates CONSTANS expression [60]. Given the age of the plants, the flowering process is not a viable hypothesis, being more related to the involvement of CONSTANS in regulating other genes, such as P5CS2 related to proline biosynthesis (over-accumulated in later response D2). Furthermore, GIGANTEA is a well-known protein with a role in drought tolerance. GIGANTEA is a key regulator of the photoperiod and the circadian cycle and is also involved in the signaling pathways for various abiotic stresses, such as the regulation of ABA synthesis [61].
Finally, we can see that an antioxidant response is necessary for the detoxification of the ROS produced during the whole drought stress (D1 and D2). This increase in antioxidant enzymes had been seen in previous studies [8,24], but the candidates responsible for this response were not clearly identified. In the DIABLO network, the presence of relevant enzymes, such as APX2 (qilexprot_28343) and glutathione S-transferase (qilexprot_71801), was observed. Ascorbate peroxidase (APX) isoenzymes play a key role in the ascorbate–glutathione cycle, which is the major hydrogen peroxide detoxifying system [62]. Rossel et al. [63] showed a constitutively higher expression of cytosolic-APX2-enhanced tolerance to drought in an Arabidopsis mutant. The other enzyme, glutathione S-transferase, is a protein involved in the metabolism of oxidative stress caused by the ROS produced because of water deprivation and high irradiance [64]. This protein is related to several proteins according to the DIABLO network (Figure 4), where we can see proteins related to secondary metabolism with possible antioxidant capacity, such as terpenes ((+)-neomenthol dehydrogenase and carotenoid cleavage dioxygenase 4) and flavonoids (NAD(P)H-dependent 6′-deoxychalcone synthase).

3. Materials and Methods

3.1. Plant Material and Experiment Description

The drought experiment was performed with seedlings of the drought-tolerant Quercus ilex species [7]. Six-month-old seedlings obtained from healthy open-pollinated acorns from trees located in Almadén de la Plata (Seville, Andalusia, Spain; 37°52′ N, 6°28′ W) were used. These seedlings had a size of about 10–15 cm and had 8 to 10 adult leaves. The experiment was performed under the typical summer conditions in Andalusia, (mean temperature 37 °C, maximum temperature 44 °C, minimum temperature 19 °C, and relative humidity 40%) under severe drought conditions (withholding water for 28 days) [7]. Well-watered plants (C) were kept at 100% substrate moisture throughout the experiment [22]. Leaves were collected when the chlorophyll fluorescence in seedlings subjected to drought-like conditions decreased by 20% (17 days of drought, D1, early response) and 45% (24 days of drought, D2, late response) with respect to the control, well-watered, seedlings [7]. Three asymptomatic seedlings were analyzed at each sampling point (C1,D1 and C2,D2), each seedling being a biological replicate. Adult leaves without stress symptoms were washed, frozen in liquid nitrogen, and stored at −80 until extraction (metabolites, proteins, and RNA). Detailed information about plant material and the experiment can be found elsewhere [22].

3.2. Non-Targeted Transcriptomic Analysis

RNA extraction has been described in Ref. [24], and the high-throughput sequencing Illumina Hiseq 4000 platform (Illumina, San Diego, CA, USA) was used to obtain the libraries. Previously, a de novo transcriptome for Q. ilex was generated from raw data obtained from Illumina and Ion Torrent [13,24]. Such raw data, together with the datasets for well-watered seedlings and seedlings exposed to drought-like conditions, were used to assemble all clean Q. ilex reads into contigs by using RAY v. 2.3.1 [65]. The assembly structure was evaluated with QUAST v. 5.0.0 [66]. This new version of the Q. ilex transcriptome was annotated using Sma3s v. 2 [67] and the UniRef90 database (www.uniprot.org, accessed on 29 June 2019). To quantify the expression of transcripts, RSEM [68] was used to map the filtered reads to the reference transcriptome, obtaining the abundance of each transcript. The complete transcript dataset used in the analysis is available in Supplementary Table S3a. The transcriptome raw data are available in the NCBI’s Gene Expression Omnibus repository (GEO, accession number GSE145009, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE145009, accessed on 15 October 2020).

3.3. Non-Targeted Proteomic Analysis

Protein extraction, separation, and mass spectrometry analysis have been described in Ref. [22]. Briefly, proteins were extracted by following the trichloroacetic acid (TCA)/acetone–phenol protocol according to Wang et al. [69]. Protein samples were run 1 cm in 12% sodium dodecyl sulfate–polyacrylamide gel electrophoresis (SDS–PAGE), and the resulting single bands were trypsin-digested and subjected to shotgun proteomics analysis (nano-LC-MS-UHPLC Ultimate 3000-Orbitrap Fusion, Thermo Scientific, San Jose, CA, USA) according to Gómez-Gálvez et al. [70]. MS/MS data were processed employing Proteome Discoverer 2.3 (Thermo, San Jose, CA, USA), the proteins were identified using the SEQUEST algorithm with the species-specific Q. ilex transcriptome database [13,24], and the data were enriched with the RNA-seq data generated in this experiment, as described above. Proteins were quantified as described in Ref. [71]. The complete proteomic dataset used in the analysis is available in Supplementary Table S3b. The protein raw mass spectrometry was deposited in the ProteomeXchange Consortium via the PRIDE partner repository [72] with the dataset identifier PXD017493 (http://proteomecentral.proteomexchange.org/cgi/GetDataset?ID=PXD017493, accessed on 15 October 2020).

3.4. Non-Targeted Metabolomics Analysis

Metabolite extraction, separation, and mass spectrometry analysis have been described in Ref. [8]. Briefly, metabolites were extracted as described by Valledor et al. [73] using three replicates of each treatment and a quality control mix. Dried extracts were re-dissolved in 1 mL of 50% methanol and 0.1% formic acid. After filtration, 5 μL of the sample was subjected to chromatographic separation with a Dionex Ultimate 3000 RS UHPLC system (Thermo Fisher Scientific, Bremen, Germany) equipped with an Acquity UPLC BEH (bridged ethyl hybrid) C18 column (1.7 μm; 100 × 2.1 mm) (Waters Corporation, Manchester, UK) held at 40 °C. Chromatography parameters and analysis have been described in Ref. [8]. The software Compound discoverer® (v. 3.1) (Thermo Scientific, San Jose, CA, USA) was used to quantify and identify each feature. The area under the curve of the peaks was used for the relative quantification. Identification was performed by comparing with three different databases included in the software (mzCloud, ChemSpider, and KEGG). The complete metabolomic dataset used in the analysis is available in Supplementary Table S3c. Metabolite raw data were deposited in the NIH Common Fund’s National Metabolomics Data Repository (NMDR) website, the MetabolomicsWorkbench, https://www.metabolomicsworkbench.org (accessed on 13 September 2020), where it has been assigned the project ID PR001361. The data can be accessed directly via the project DOI 10.21228/M8SQ6J.

3.5. Data Preprocessing and Statistical Analysis

Previous preprocessed data of transcripts [14], proteins [14,22], and metabolites [8] were used (Supplementary Table S3). Scaled and centered values (z-scores) were subjected to multivariate analysis with the integration of omic levels. Principal component analysis (PCA) and DIABLO (Data Integration Analysis for Biomarker discovery using Latent variable approaches for Omics studies) [74] were conducted with pRocessomic v.1.8 (available on http://github.com/Valledor/pRocessomics, accessed on 29 January 2020). Both analyses are a dimensional reduction approach, but DIABLO selects the optimal number of variables to obtain the maximum discriminant power and provides the inter-relationship between the different omic layers. The autotune() function was used in DIABLO to select the optimal number of variables to include from the three integrated omic layers (with significant variable features p < 0.05). This function involves comparing multiple models by selecting different numbers of variables within each dataset, choosing the number of variables that best predict their own accuracy [75]. So, 350 transcripts (250 in component 1 (comp1) and 100 in component 2 (comp2)), 514 proteins (300 in comp1 and 300 in comp2, with 86 in both components), and 71 metabolites (62 in comp1 and 10 comp2, with 1 in both components) were selected and included in DIABLO analysis. PCA was performed first with all variables in the datasets (47,569 transcripts; 3312 proteins; 342 metabolites) and then with only the significant variables at p < 0.05 (4574 transcripts; 567 proteins; 81 metabolites). Heatmap and plots were made using pRocessomic v.1.8. The networks and visualization were generated by pRocessomic and Cytoscape program 3.9 [76], using DIABLO analysis (DIABLO network, Figure 4) and the StringApp plugin (STITCH network, Figure 5), using the model species A. thaliana [77]. The networks were simplified to interactions with a threshold of ±0.9 or higher. In the DIABLO network, −0.97 to −0.9 (negative) and 0.9 to 0.97 (positive) represent the strongest correlations between variables, and in the STITCH network, 0.9 to 1 represents the interactions with the highest confidence.

4. Conclusions

We have used an integrated multiomics analysis to generate new information beyond that obtained by using transcriptomic, proteomic, and metabolomic approaches independently of each other, contributing to the knowledge of the responses of Q. ilex to drought. In the study of the response to drought, DIABLO provided a better model than PCA and allowed us to interrelate the different omic layers. Within the interaction networks, we confirm the importance of previous candidates in the response to drought, such as the FTsH6 protease, involved in the degradation of the light-harvesting complex of photosystem II, and the ClpB proteases, with several members of this family within both networks. These processes appear to be key in the permanent changes to stress. On the other side, early responses are characterized by the appearance of transcription factors (DEBR2A, WRK65, CONSTANT, etc.) up-regulating stress-related genes. The appearance of these TFs could be related to the prioritization of the gene response in the early stress stage and to the increase in the osmoprotectant proline observed in later stages, among others. At the spatiotemporal level, we have seen that some responsive gene products or metabolites are transiently accumulated in early stages (DEBR2A, WRK65, and CONSTANT) and later stages (such as L-proline), whereas others (such as FTSH6, APX2, and glutathione S-transferase) are permanently accumulated in both sampling times. The latter ones can be proposed as markers of resilience; one of them has been proposed in previous studies.
We have gone one step ahead in the molecular study of the responses of an orphan tree species, Q. ilex, to drought. We have shown that it is possible to take the systems biology approach not only in model plants, such as A. thaliana, but also in non-model, orphan, and recalcitrant organisms. However, the systems biology approach with non-domesticated and poorly characterized species is challenging and with limitations since the results depend, largely, on the homology of species-specific genes to those of model species, such as Arabidopsis.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/plants11223067/s1: Figure S1: Heatmap of the variables selected for DIABLO: 350 transcripts (250 in comp1 and 100 in comp2), 514 proteins (300 in comp1 and 300 in comp2, with 86 in both components), and 71 metabolites (62 in comp1 and 10 in comp2, with 1 in both components), Table S1: principal component analysis (PCA); (1) all variables of omics datasets, (2) only omics variables with p-value < 0.05: (a) explained variance, (b) loadings are shown; Table S2: identification (ID), description and contribution to component 1 and 2 of DIABLO selected variables: (a) transcripts, (b) proteins, (c) metabolites; Table S3: datasets used for integrative multiomics analysis: (a) transcripts dataset, (b) proteins dataset, (c) metabolites dataset.

Author Contributions

Conceptualization, J.V.J.-N. and M.E.; methodology, M.E. and V.M.G.-S.; dataset resources, C.L.-H., V.M.G.-S., M.-D.R., M.Á.C. and J.V.J.-N.; data curation, V.M.G.-S.; writing—original draft preparation, M.E. and V.M.G.-S.; writing—review and editing, M.E., M.-D.R., M.Á.C. and J.V.J.-N.; supervision, M.E. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the project ENCINOMICS-2 PID2019-109038RB-I00 of the Spanish Ministry of Economy and Competitiveness. V.M.G.-S. was supported by the Youth Employment Operational Program (EJI-17-AGR-164), co-financed by the Regional Government of Andalusia and the European Social Fund (ESF). M.-D.R., M.Á.C. and M.E. are grateful for being awarded the Juan de la Cierva-Incorporación (IJC2018-035272-I), Ramon y Cajal (RYC-2017-23706), and Juan de la Cierva-Formación (FJCI-2017-31613) contracts, respectively, by the Spanish Ministry of Science, Innovation and Universities. C.L.-H. is grateful to the Regional Government of Asturias, Spain, for being awarded the Severo Ochoa contract BP17-112.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All datasets are available in the Quercus ilex web of our group (https://www.uco.es/probiveag/holm-oak-database.html) and in the supplementary material of this article. The transcriptome raw data are available in the NCBI’s Gene Expression Omnibus repository (GEO, accession number GSE145009). The protein raw mass spectrometry was deposited in the ProteomeXchange Consortium via the PRIDE partner repository [72] with the dataset identifier PXD017493. Metabolite raw data were deposited in the NIH Common Fund’s National Metabolomics Data Repository (NMDR) website, the MetabolomicsWorkbench, https://www.metabolomicsworkbench.org, where it has been assigned the project ID PR001361. The data can be accessed directly via the project DOI 10.21228/M8SQ6J.

Acknowledgments

We would like to thank Laura Lamelas (University of Oviedo) for all the support provided for the analyses with the statistical package pRocessomic.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Patón, D.; García-Herrera, R.; Cuenca, J.; Galavis, M.; Roig, F. Influence of Climate on Radial Growth of Holm Oaks (Quercus ilex Subsp. Ballota Desf) from SW Spain. Geochronometria 2010, 34, 49–56. [Google Scholar]
  2. Haavik, L.J.; Billings, S.A.; Guldin, J.M.; Stephen, F.M. Emergent insects, pathogens and drought shape changing patterns in oak decline in North America and Europe. For. Ecol. Manag. 2015, 354, 190–205. [Google Scholar]
  3. Natalini, F.; Alejano, R.; Vázquez-Piqué, J.; Cañellas, I.; Gea-Izquierdo, G. The role of climate change in the widespread mortality of holm oak in open woodlands of Southwestern Spain. Dendrochronologia 2016, 38, 51–60. [Google Scholar] [CrossRef] [Green Version]
  4. Villar-Salvador, P.; Planelles, R.; Oliet, J.; Peñuelas-Rubira, J.L.; Jacobs, D.F.; González, M. Drought tolerance and transplanting performance of holm oak (Quercus ilex) seedlings after drought hardening in the nursery. Tree Physiol. 2004, 24, 1147–1155. [Google Scholar]
  5. Keenan, T.F.; Serra, J.M.; Lloret, F.; Ninyerola, M.; Sabaté, S. Predicting the future of forests in the Mediterranean under climate change, with niche- and process-based models: CO2 matters! Glob. Chang. Biol. 2011, 17, 565–579. [Google Scholar]
  6. Ogaya, R.; Liu, D.; Barbeta, A.; Peñuelas, J. Stem Mortality and Forest Dieback in a 20-Years Experimental Drought in a Mediterranean Holm Oak Forest. Front. For. Glob. Chang. 2020, 2, 89. [Google Scholar] [CrossRef]
  7. San-Eufrasio, B.; Sánchez-Lucas, R.; López-Hidalgo, C.; Guerrero-Sánchez, V.M.; Castillejo, M.; Maldonado-Alconada, A.M.; Jorrín-Novo, J.V.; Rey, M.-D. Responses and differences in tolerance to water shortage under climatic dryness conditions in seedlings from Quercus spp. and Andalusian Q. ilex populations. Forests 2020, 11, 707. [Google Scholar]
  8. Tienda-Parrilla, M.; López-Hidalgo, C.; Guerrero-Sanchez, V.M.; Infantes-González, Á.; Valderrama-Fernández, R.; Castillejo, M.-Á.; Jorrín-Novo, J.V.; Rey, M.-D. Untargeted MS-Based Metabolomics Analysis of the Responses to Drought Stress in Quercus ilex L. Leaf Seedlings and the Identification of Putative Compounds Related to Tolerance. Forests 2022, 13, 551. [Google Scholar] [CrossRef]
  9. Rey, M.-D.; Castillejo, M.Á.; Sánchez-Lucas, R.; Guerrero-Sanchez, V.M.; López-Hidalgo, C.; Romero-Rodríguez, C.; Valero-Galván, J.; Sghaier-Hammami, B.; Simova-Stoilova, L.; Echevarría-Zomeño, S.; et al. Proteomics, holm oak (Quercus ilex L.) and other recalcitrant and orphan forest tree species: How do they see each other? Int. J. Mol. Sci. 2019, 20, 692. [Google Scholar] [CrossRef] [Green Version]
  10. Maldonado-Alconada, A.M.; Castillejo, M.Á.; Rey, M.-D.; Labella-Ortega, M.; Tienda-Parrilla, M.; Hernández-Lao, T.; Honrubia-Gómez, I.; Ramírez-García, J.; Guerrero-Sanchez, V.M.; López-Hidalgo, C.; et al. Multiomics Molecular Research into the Recalcitrant and Orphan Quercus ilex Tree Species: Why, What for, and How. Int. J. Mol. Sci. 2022, 23, 9980. [Google Scholar] [CrossRef]
  11. Rey, M.; Labella-Ortega, M.; Guerrero-Sánchez, V.; Carleial, R.; Castillejo, M.; Rodríguez-Franco, A.; Buggs, R.; Ruggieri, V.; Jorrín-Novo, J. A first draft genome of Holm oak (Quercus ilex L.), the most representative species of the Mediterranean forest and the Spanish agrosilvopastoral ecosystem “dehesa”. bioRxiv 2022. bioRxiv: 511480. [Google Scholar]
  12. Escandón, M.; Castillejo, M.Á.; Jorrín-Novo, J.V.; Rey, M.-D. Molecular research on stress responses in Quercus spp.: From classical biochemistry to systems biology through Omics analysis. Forests 2021, 12, 364. [Google Scholar] [CrossRef]
  13. Guerrero-Sánchez, V.M.; Maldonado-Alconada, A.M.; Amil-Ruiz, F.; Verardi, A.; Jorrín-Novo, J.V.; Rey, M.-D. Ion Torrent and lllumina, two complementary RNA-seq platforms for constructing the Holm oak (Quercus ilex) transcriptome. PLoS ONE 2019, 14, e0210356. [Google Scholar] [CrossRef] [PubMed]
  14. Guerrero-Sánchez, V.M.; Castillejo, M.Á.; López-Hidalgo, C.; Alconada, A.M.M.; Jorrín-Novo, J.V.; Rey, M.-D. Changes in the transcript and protein profiles of Quercus ilex seedlings in response to drought stress. J. Proteomics 2021, 243, 104263. [Google Scholar] [CrossRef]
  15. López-Hidalgo, C.; Trigueros, M.; Menéndez, M.; Jorrin-Novo, J.V. Phytochemical composition and variability in Quercus ilex acorn morphotypes as determined by NIRS and MS-based approaches. Food Chem. 2021, 338, 127803. [Google Scholar] [CrossRef]
  16. Karahalil, B. Overview of Systems Biology and Omics Technologies. Curr. Med. Chem. 2016, 23, 4221–4230. [Google Scholar] [CrossRef]
  17. Tebani, A.; Afonso, C.; Marret, S.; Bekri, S. Omics-Based Strategies in Precision Medicine: Toward a Paradigm Shift in Inborn Errors of Metabolism Investigations. Int. J. Mol. Sci. 2016, 17, 1555. [Google Scholar] [CrossRef] [Green Version]
  18. Picard, M.; Scott-Boyer, M.-P.; Bodein, A.; Périn, O.; Droit, A. Integration strategies of multi-omics data for machine learning analysis. Comput. Struct. Biotechnol. J. 2021, 19, 3735–3746. [Google Scholar] [CrossRef]
  19. Fabres, P.J.; Collins, C.; Cavagnaro, T.R.; Rodríguez López, C.M. A Concise Review on Multi-Omics Data Integration for Terroir Analysis in Vitis vinifera. Front. Plant Sci. 2017, 8, 1065. [Google Scholar] [CrossRef] [Green Version]
  20. Yan, J.; Risacher, S.L.; Shen, L.; Saykin, A.J. Network approaches to systems biology analysis of complex disease: Integrative methods for multi-omics data. Brief. Bioinform. 2018, 19, 1370–1381. [Google Scholar] [CrossRef] [Green Version]
  21. Krassowski, M.; Das, V.; Sahu, S.K.; Misra, B.B. State of the Field in Multi-Omics Research: From Computational Needs to Data Mining and Sharing. Front. Genet. 2020, 11, 610798. [Google Scholar] [CrossRef] [PubMed]
  22. San-Eufrasio, B.; Bigatton, E.; Guerrero-Sanchez, V.; Chaturvedi, P.; Jorrin-Novo, J.; Rey, M.-D.; Castillejo, M. Proteomics data analysis for the identification of proteins and derived proteotypic peptides of potential use as putative drought tolerance markers for Quercus ilex. Int. J. Mol. Sci. 2021, 22, 3191. [Google Scholar] [PubMed]
  23. San-Eufrasio, B.; Castillejo, M.Á.; Labella-Ortega, M.; Ruiz-Gómez, F.J.; Navarro-Cerrillo, R.M.; Tienda-Parrilla, M.; Jorrín-Novo, J.V.; Rey, M.-D. Effect and Response of Quercus ilex subsp. ballota [Desf.] Samp. Seedlings From Three Contrasting Andalusian Populations to Individual and Combined Phytophthora cinnamomi and Drought Stresses. Front. Plant Sci. 2021, 12, 722802. [Google Scholar] [CrossRef] [PubMed]
  24. Guerrero-Sánchez, V.M.; Maldonado-Alconada, A.M.; Amil-Ruiz, F.; Jorrin-Novo, J. V Holm oak (Quercus ilex) transcriptome. De novo sequencing and assembly analysis. Front. Mol. Biosci. 2017, 4, 70. [Google Scholar] [CrossRef] [Green Version]
  25. López-Hidalgo, C.; Guerrero-Sánchez, V.M.; Gómez-Gálvez, I.; Sánchez-Lucas, R.; Castillejo-Sánchez, M.A.; Maldonado-Alconada, A.M.; Valledor, L.; Jorrín-Novo, J. V A multi-omics analysis pipeline for the metabolic pathway reconstruction in the orphan species Quercus ilex. Front. Plant Sci. 2018, 9, 935. [Google Scholar] [CrossRef] [Green Version]
  26. Echevarría-Zomeño, S.; Ariza, D.; Jorge, I.; Lenz, C.; del Campo, A.D.; Jorrín, J.; Navarro, R.M. Changes in the protein profile of Quercus ilex leaves in response to drought stress and recovery. J. Plant Physiol. 2009, 166, 233–245. [Google Scholar] [CrossRef]
  27. Valero-Galván, J.; González-Fernández, R.; Navarro-Cerrillo, R.; Gil-Pelegrin, E.; Jorrín-Novo, J. Physiological and proteomic analyses of drought stress response in Holm oak provenances. J. Proteome Res. 2013, 12, 5110–5123. [Google Scholar]
  28. Simova-Stoilova, L.P.; Romero-Rodríguez, M.C.; Sánchez-Lucas, R.; Navarro-Cerrillo, R.M.; Medina-Aunon, J.A.; Jorrín-Novo, J.V. 2-DE proteomics analysis of drought treated seedlings of Quercus ilex supports a root active strategy for metabolic adaptation in response to water shortage. Front. Plant Sci. 2015, 6, 627. [Google Scholar]
  29. Simova-Stoilova, L.P.; López-Hidalgo, C.; Sanchez-Lucas, R.; Valero-Galvan, J.; Romero-Rodríguez, C.; Jorrin-Novo, J.V. Holm oak proteomic response to water limitation at seedling establishment stage reveals specific changes in different plant parts as well as interaction between roots and cotyledons. Plant Sci. 2018, 276, 1–13. [Google Scholar] [CrossRef]
  30. Vaseva, I.; Sabotič, J.; Sustar-vozlic, J.; Meglic, V.; Kidrič, M.; Demirevska, K.; Simova, L. The response of plants to drought stress: The role of dehydrins, chaperones, proteases and protease inhibitors in maintaining cellular protein function. In Droughts: New Research; Neves, D.F., Sanz, J.D., Eds.; Nova Science Publishers, Inc.: Hauppauge, NY, USA, 2012; pp. 1–45. ISBN 978-1-62100-769-2. [Google Scholar]
  31. Heinemann, B.; Künzler, P.; Eubel, H.; Braun, H.-P.; Hildebrandt, T.M. Estimating the number of protein molecules in a plant cell: Protein and amino acid homeostasis during drought. Plant Physiol. 2021, 185, 385–404. [Google Scholar]
  32. Wang, W.; Vinocur, B.; Shoseyov, O.; Altman, A. Role of plant heat-shock proteins and molecular chaperones in the abiotic stress response Role of plant heat-shock proteins and molecular chaperones in the abiotic stress response. Trends Plant Sci. 2004, 9, 244–252. [Google Scholar] [CrossRef] [PubMed]
  33. Miller, G.; Suzuki, N.; Ciftci-Yilmaz, S.; Mittler, R. Reactive oxygen species homeostasis and signalling during drought and salinity stresses. Plant. Cell Environ. 2010, 33, 453–467. [Google Scholar] [CrossRef] [PubMed]
  34. Wahid, A.; Close, T.J. Expression of dehydrins under heat stress and their relationship with water relations of sugarcane leaves. Biol. Plant. 2007, 51, 104–109. [Google Scholar]
  35. Dvořák, P.; Krasylenko, Y.; Zeiner, A.; Šamaj, J.; Takáč, T. Signaling Toward Reactive Oxygen Species-Scavenging Enzymes in Plants. Front. Plant Sci. 2020, 11, 618835. [Google Scholar]
  36. Kavi Kishor, P.B.; Sreenivasulu, N. Is proline accumulation per se correlated with stress tolerance or is proline homeostasis a more critical issue? Plant. Cell Environ. 2014, 37, 300–311. [Google Scholar] [CrossRef]
  37. Almeida, T.; Pinto, G.; Correia, B.; Gonçalves, S.; Meijón, M.; Escandón, M. In-depth analysis of the Quercus suber metabolome under drought stress and recovery reveals potential key metabolic players. Plant Sci. 2020, 299, 110606. [Google Scholar] [CrossRef]
  38. Kuhn, M.; von Mering, C.; Campillos, M.; Jensen, L.J.; Bork, P. STITCH: Interaction networks of chemicals and proteins. Nucleic Acids Res. 2008, 36, D684–D688. [Google Scholar] [CrossRef] [Green Version]
  39. Izhaki, I. Emodin—A secondary metabolite with multiple ecological functions in higher plants. New Phytol. 2002, 155, 205–217. [Google Scholar] [CrossRef] [Green Version]
  40. Cui, Y.; Liu, B.; Xie, J.; Xu, P.; Zhang, Y.; Ming, J. Effect of enrofloxacin and emodin on heat-shock proteins’ expression in hepatic cells of grass carp (Ctenopharyngodon idellus). Aquac. Int. 2014, 22, 1067–1077. [Google Scholar] [CrossRef]
  41. Rajendrakumar, C.S.V.; Suryanarayana, T.; Reddy, A.R. DNA helix destabilization by proline and betaine: Possible role in the salinity tolerance process. FEBS Lett. 1997, 410, 201–205. [Google Scholar]
  42. Takano, K.; Higashi, R.; Okada, J.; Mukaiyama, A.; Tadokoro, T.; Koga, Y.; Kanaya, S. Proline effect on the thermostability and slow unfolding of a hyperthermophilic protein. J. Biochem. 2009, 145, 79–85. [Google Scholar] [PubMed]
  43. Zelisko, A.; García-Lorenzo, M.; Jackowski, G.; Jansson, S.; Funk, C. AtFtsH6 is involved in the degradation of the light-harvesting complex II during high-light acclimation and senescence. Proc. Natl. Acad. Sci. USA 2005, 102, 13699–13704. [Google Scholar] [CrossRef] [PubMed]
  44. Adam, Z.; Rudella, A.; van Wijk, K.J. Recent advances in the study of Clp, FtsH and other proteases located in chloroplasts. Curr. Opin. Plant Biol. 2006, 9, 234–240. [Google Scholar] [PubMed]
  45. Shiraku, M.L.; Magwanga, R.O.; Cai, X.; Kirungu, J.N.; Xu, Y.; Mehari, T.G.; Hou, Y.; Wang, Y.; Wang, K.; Peng, R.; et al. Knockdown of 60S ribosomal protein L14-2 reveals their potential regulatory roles to enhance drought and salt tolerance in cotton. J. Cott. Res. 2021, 4, 27. [Google Scholar] [CrossRef]
  46. Singh, D.; Laxmi, A. Transcriptional regulation of drought response: A tortuous network of transcriptional factors. Front. Plant Sci. 2015, 6, 895. [Google Scholar] [CrossRef] [Green Version]
  47. Manna, M.; Thakur, T.; Chirom, O.; Mandlik, R.; Deshmukh, R.; Salvi, P. Transcription factors as key molecular target to strengthen the drought stress tolerance in plants. Physiol. Plant. 2021, 172, 847–868. [Google Scholar] [CrossRef]
  48. Samach, A.; Onouchi, H.; Gold, S.E.; Ditta, G.S.; Schwarz-Sommer, Z.; Yanofsky, M.F.; Coupland, G. Distinct roles of CONSTANS target genes in reproductive development of Arabidopsis. Science 2000, 288, 1613–1616. [Google Scholar] [CrossRef] [Green Version]
  49. Kiełbowicz-Matuk, A.; Czarnecka, J.; Banachowicz, E.; Rey, P.; Rorat, T. Solanum tuberosum ZPR1 encodes a light-regulated nuclear DNA-binding protein adjusting the circadian expression of StBBX24 to light cycle. Plant. Cell Environ. 2017, 40, 424–440. [Google Scholar] [CrossRef]
  50. Bundock, P.; Hooykaas, P. An Arabidopsis hAT-like transposase is essential for plant development. Nature 2005, 436, 282–284. [Google Scholar] [CrossRef]
  51. Ikeda, M.; Fujiwara, S.; Mitsuda, N.; Ohme-Takagi, M. A triantagonistic basic helix-loop-helix system regulates cell elongation in Arabidopsis. Plant Cell 2012, 24, 4483–4497. [Google Scholar]
  52. Sakuma, Y.; Maruyama, K.; Qin, F.; Osakabe, Y.; Shinozaki, K.; Yamaguchi-Shinozaki, K. Dual function of an Arabidopsis transcription factor DREB2A in water-stress-responsive and heat-stress-responsive gene expression. Proc. Natl. Acad. Sci. USA 2006, 103, 18822–18827. [Google Scholar]
  53. Jia, J.; Zhou, J.; Shi, W.; Cao, X.; Luo, J.; Polle, A.; Luo, Z. Comparative transcriptomic analysis reveals the roles of responsive genes in poplars exposed to high temperature and drought. Nature 2017, 7, 43215. [Google Scholar]
  54. Yoshida, T.; Sakuma, Y.; Todaka, D.; Maruyama, K.; Qin, F.; Mizoi, J.; Kidokoro, S.; Fujita, Y.; Shinozaki, K.; Yamaguchi-Shinozaki, K. Functional analysis of an Arabidopsis heat-shock transcription factor HsfA3 in the transcriptional cascade downstream of the DREB2A stress-regulatory system. Biochem. Biophys. Res. Commun. 2008, 368, 515–521. [Google Scholar] [PubMed]
  55. Qiu, Y.; Yu, D. Over-expression of the stress-induced OsWRKY45 enhances disease resistance and drought tolerance in Arabidopsis. Environ. Exp. Bot. 2009, 65, 35–47. [Google Scholar]
  56. Raineri, J.; Ribichich, K.F.; Chan, R.L. The sunflower transcription factor HaWRKY76 confers drought and flood tolerance to Arabidopsis thaliana plants without yield penalty. Plant Cell Rep. 2015, 34, 2065–2080. [Google Scholar] [PubMed]
  57. Wang, C.-T.; Ru, J.-N.; Liu, Y.-W.; Yang, J.-F.; Li, M.; Xu, Z.-S.; Fu, J.-D. The Maize WRKY Transcription Factor ZmWRKY40 Confers Drought Resistance in Transgenic Arabidopsis. Int. J. Mol. Sci. 2018, 19, 2580. [Google Scholar]
  58. Wang, C.-T.; Ru, J.-N.; Liu, Y.-W.; Li, M.; Zhao, D.; Yang, J.-F.; Fu, J.-D.; Xu, Z.-S. Maize WRKY Transcription Factor ZmWRKY106 Confers Drought and Heat Tolerance in Transgenic Plants. Int. J. Mol. Sci. 2018, 19, 3046. [Google Scholar]
  59. Huo, T.; Wang, C.-T.; Yu, T.-F.; Wang, D.-M.; Li, M.; Zhao, D.; Li, X.-T.; Fu, J.-D.; Xu, Z.-S.; Song, X.-Y. Overexpression of ZmWRKY65 transcription factor from maize confers stress resistances in transgenic Arabidopsis. Sci. Rep. 2021, 11, 4024. [Google Scholar]
  60. Sawa, M.; Nusinow, D.A.; Kay, S.A.; Imaizumi, T. FKF1 and GIGANTEA complex formation is required for day-length measurement in Arabidopsis. Science 2007, 318, 261–265. [Google Scholar]
  61. Baek, D.; Kim, W.-Y.; Cha, J.-Y.; Park, H.J.; Shin, G.; Park, J.; Lim, C.J.; Chun, H.J.; Li, N.; Kim, D.H.; et al. The GIGANTEA-ENHANCED EM LEVEL Complex Enhances Drought Tolerance via Regulation of Abscisic Acid Synthesis. Plant Physiol. 2020, 184, 443–458. [Google Scholar]
  62. Sofo, A.; Scopa, A.; Nuzzaci, M.; Vitti, A. Ascorbate Peroxidase and Catalase Activities and Their Genetic Regulation in Plants Subjected to Drought and Salinity Stresses. Int. J. Mol. Sci. 2015, 16, 13561–13578. [Google Scholar] [CrossRef]
  63. Rossel, J.B.; Walter, P.B.; Hendrickson, L.; Chow, W.S.; Poole, A.; Mullineaux, P.M.; Pogson, B.J. A mutation affecting ASCORBATE PEROXIDASE 2 gene expression reveals a link between responses to high light and drought tolerance. Plant. Cell Environ. 2006, 29, 269–281. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  64. Liu, D.; Liu, Y.; Rao, J.; Wang, G.; Li, H.; Ge, F.; Chen, C. Overexpression of the glutathione S-transferase gene from Pyrus pyrifolia fruit improves tolerance to abiotic stress in transgenic tobacco plants. Mol. Biol. 2013, 47, 515–523. [Google Scholar] [CrossRef]
  65. Boisvert, S.; Laviolette, F.; Corbeil, J. Ray: Simultaneous assembly of reads from a mix of high-throughput sequencing technologies. J. Comput. Biol. 2010, 17, 1519–1533. [Google Scholar] [PubMed] [Green Version]
  66. Gurevich, A.; Saveliev, V.; Vyahhi, N.; Tesler, G. QUAST: Quality assessment tool for genome assemblies. Bioinformatics 2013, 29, 1072–1075. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  67. Casimiro-Soriguer, C.S.; Muñoz-Mérida, A.; Pérez-Pulido, A.J. Sma3s: A universal tool for easy functional annotation of proteomes and transcriptomes. Proteomics 2017, 17, 1700071. [Google Scholar]
  68. Li, B.; Dewey, C.N. RSEM: Accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics 2011, 12, 323. [Google Scholar]
  69. Wang, W.; Vignani, R.; Scali, M.; Cresti, M. A universal and rapid protocol for protein extraction from recalcitrant plant tissues for proteomic analysis. Electrophoresis 2006, 27, 2782–2786. [Google Scholar] [CrossRef]
  70. Gómez-Gálvez, I.; Sánchez-Lucas, R.; San-Eufrasio, B.; Rodríguez de Francisco, L.E.; Maldonado-Alconada, A.M.; Fuentes-Almagro, C.; Castillejo, M.A. Optimizing shotgun proteomics analysis for a confident protein identification and quantitation in orphan plant species: The case of Holm oak (Quercus ilex). In Plant Proteomics. Methods in Molecular Biology, Vol. 2139; Jorrin-Novo, J., Valledor, L., Castillejo, M.A., Rey, M., Eds.; Springer: New York, NY, USA, 2020; pp. 157–168. ISBN 978-1-0716-0527-1. [Google Scholar]
  71. Silva, J.C.; Gorenstein, M.V.; Li, G.-Z.; Vissers, J.P.C.; Geromanos, S.J. Absolute Quantification of Proteins by LCMSE: A Virtue of Parallel ms Acquisition *S. Mol. Cell. Proteomics 2006, 5, 144–156. [Google Scholar] [CrossRef] [Green Version]
  72. Perez-Riverol, Y.; Bai, J.; Bandla, C.; García-Seisdedos, D.; Hewapathirana, S.; Kamatchinathan, S.; Kundu, D.J.; Prakash, A.; Frericks-Zipper, A.; Eisenacher, M.; et al. The PRIDE database resources in 2022: A hub for mass spectrometry-based proteomics evidences. Nucleic Acids Res. 2022, 50, D543–D552. [Google Scholar]
  73. Valledor, L.; Escandón, M.; Meijón, M.; Nukarinen, E.; Cañal, M.J.; Weckwerth, W. A universal protocol for the combined isolation of metabolites, DNA, long RNAs, small RNAs, and proteins from plants and microorganisms. Plant J. 2014, 79, 173–180. [Google Scholar] [PubMed]
  74. Singh, A.; Shannon, C.P.; Gautier, B.; Rohart, F.; Vacher, M.; Tebbutt, S.J.; Lê Cao, K.-A. DIABLO: An integrative approach for identifying key molecular drivers from multi-omics assays. Bioinformatics 2019, 35, 3055–3062. [Google Scholar] [PubMed]
  75. Rohart, F.; Gautier, B.; Singh, A.; Lê Cao, K.-A. mixOmics: An R package for ‘omics feature selection and multiple data integration. PLoS Comput. Biol. 2017, 13, e1005752. [Google Scholar]
  76. Shannon, P.; Markiel, A.; Owen, O.; Baliga, N.S.; Wang, J.T.; Ramage, D.; Amin, N.; Schwikowski, B.; Ideker, T. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13, 2498–2504. [Google Scholar] [PubMed]
  77. Szklarczyk, D.; Gable, A.L.; Lyon, D.; Junge, A.; Wyder, S.; Huerta-Cepas, J.; Simonovic, M.; Doncheva, N.T.; Morris, J.H.; Bork, P.; et al. STRING v11: Protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019, 47, D607–D613. [Google Scholar] [CrossRef]
Figure 1. Systems biology data interpretation approach in holm oak. Integration with three omic layers for novel insights and the discovery of biomarkers. The red arrows indicate positive correlations, the blue arrows indicate negative correlations, and the dashed lines indicate interrelations between omic layers.
Figure 1. Systems biology data interpretation approach in holm oak. Integration with three omic layers for novel insights and the discovery of biomarkers. The red arrows indicate positive correlations, the blue arrows indicate negative correlations, and the dashed lines indicate interrelations between omic layers.
Plants 11 03067 g001
Figure 2. Multivariate PCA and DIABLO analysis of the proteomic, transcriptomic, and metabolomic datasets. (A) PCA plot of all variables, (B) PCA plot of significant variables, and (C) DIABLO distance Y plot and the partial DIABLO plots of each omic dataset of the selected variables (transcripts, proteins, and metabolites). D1 corresponds to exposure to drought-like conditions for 17 days (early response, when chlorophyll fluorescence decreases by 20%), and D2 corresponds to exposure to drought-like conditions for 24 days (later response, when chlorophyll fluorescence decreases by 45%). C1 and C2 are the controls taken at both times.
Figure 2. Multivariate PCA and DIABLO analysis of the proteomic, transcriptomic, and metabolomic datasets. (A) PCA plot of all variables, (B) PCA plot of significant variables, and (C) DIABLO distance Y plot and the partial DIABLO plots of each omic dataset of the selected variables (transcripts, proteins, and metabolites). D1 corresponds to exposure to drought-like conditions for 17 days (early response, when chlorophyll fluorescence decreases by 20%), and D2 corresponds to exposure to drought-like conditions for 24 days (later response, when chlorophyll fluorescence decreases by 45%). C1 and C2 are the controls taken at both times.
Plants 11 03067 g002
Figure 3. Heatmap of the variables selected for DIABLO: 350 transcripts (250 in comp1 and 100 in comp2), 514 proteins (300 in comp1 and 300 in comp2, with 86 in both components), and 71 metabolites (62 in comp1 and 10 in comp2, with 1 in both components). The five clear clusters have been framed in black (iv). The variables selected in the three omic layers are represented in pink (transcripts), orange (proteins), and blue (metabolites) in rows. D1 corresponds to 17 days of drought (early response, when chlorophyll fluorescence decreases by 20%) and D2 corresponds to 24 days of drought (later response, when chlorophyll fluorescence decreases by 45%). C1 and C2 are the controls taken at both times. The figure has been compressed. Supplementary Figure S1 corresponds to the same figure uncompressed with all variable names available.
Figure 3. Heatmap of the variables selected for DIABLO: 350 transcripts (250 in comp1 and 100 in comp2), 514 proteins (300 in comp1 and 300 in comp2, with 86 in both components), and 71 metabolites (62 in comp1 and 10 in comp2, with 1 in both components). The five clear clusters have been framed in black (iv). The variables selected in the three omic layers are represented in pink (transcripts), orange (proteins), and blue (metabolites) in rows. D1 corresponds to 17 days of drought (early response, when chlorophyll fluorescence decreases by 20%) and D2 corresponds to 24 days of drought (later response, when chlorophyll fluorescence decreases by 45%). C1 and C2 are the controls taken at both times. The figure has been compressed. Supplementary Figure S1 corresponds to the same figure uncompressed with all variable names available.
Plants 11 03067 g003
Figure 4. DIABLO network of the variables with more weight in the statistical analysis (threshold ± 0.9). (A) DIABLO network indicating the variables included in each omic layer. The three omic layers are represented in pink (transcripts), orange (proteins), and blue (metabolites). (B) DIABLO network indicating the variables included in the predominant trends. Positive connections are shown in red and negative connections in blue. D1 corresponds to 17 days of drought (early response, when chlorophyll fluorescence decreases by 20%), and D2 corresponds to 24 days of drought (later response, when chlorophyll fluorescence decreases by 45%). Predominant processes are rounded off with a dashed line.
Figure 4. DIABLO network of the variables with more weight in the statistical analysis (threshold ± 0.9). (A) DIABLO network indicating the variables included in each omic layer. The three omic layers are represented in pink (transcripts), orange (proteins), and blue (metabolites). (B) DIABLO network indicating the variables included in the predominant trends. Positive connections are shown in red and negative connections in blue. D1 corresponds to 17 days of drought (early response, when chlorophyll fluorescence decreases by 20%), and D2 corresponds to 24 days of drought (later response, when chlorophyll fluorescence decreases by 45%). Predominant processes are rounded off with a dashed line.
Plants 11 03067 g004
Figure 5. STITCH network of the variables with more weight in the functional analysis (confidence score > 0.9) using the Arabidopsis thaliana database. (A) STITCH network indicating the variables included in each omic layer. The three omic layers are represented in pink (transcripts), orange (proteins), and blue (metabolites). (B) STITCH network indicating the variables included in the predominant trends. The links between variables presented are the highest confidence (score > 0.9). The score is computed by combining the probabilities from the different evidence channels and corrected for the probability of randomly observing an interaction. D1 corresponds to 17 days of drought (early response, when chlorophyll fluorescence decreases by 20%), and D2 corresponds to 24 days of drought (later response, when chlorophyll fluorescence decreases by 45%). Predominant processes are rounded off with a dashed line.
Figure 5. STITCH network of the variables with more weight in the functional analysis (confidence score > 0.9) using the Arabidopsis thaliana database. (A) STITCH network indicating the variables included in each omic layer. The three omic layers are represented in pink (transcripts), orange (proteins), and blue (metabolites). (B) STITCH network indicating the variables included in the predominant trends. The links between variables presented are the highest confidence (score > 0.9). The score is computed by combining the probabilities from the different evidence channels and corrected for the probability of randomly observing an interaction. D1 corresponds to 17 days of drought (early response, when chlorophyll fluorescence decreases by 20%), and D2 corresponds to 24 days of drought (later response, when chlorophyll fluorescence decreases by 45%). Predominant processes are rounded off with a dashed line.
Plants 11 03067 g005
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Guerrero-Sánchez, V.M.; López-Hidalgo, C.; Rey, M.-D.; Castillejo, M.Á.; Jorrín-Novo, J.V.; Escandón, M. Multiomic Data Integration in the Analysis of Drought-Responsive Mechanisms in Quercus ilex Seedlings. Plants 2022, 11, 3067. https://doi.org/10.3390/plants11223067

AMA Style

Guerrero-Sánchez VM, López-Hidalgo C, Rey M-D, Castillejo MÁ, Jorrín-Novo JV, Escandón M. Multiomic Data Integration in the Analysis of Drought-Responsive Mechanisms in Quercus ilex Seedlings. Plants. 2022; 11(22):3067. https://doi.org/10.3390/plants11223067

Chicago/Turabian Style

Guerrero-Sánchez, Víctor M., Cristina López-Hidalgo, María-Dolores Rey, María Ángeles Castillejo, Jesús V. Jorrín-Novo, and Mónica Escandón. 2022. "Multiomic Data Integration in the Analysis of Drought-Responsive Mechanisms in Quercus ilex Seedlings" Plants 11, no. 22: 3067. https://doi.org/10.3390/plants11223067

APA Style

Guerrero-Sánchez, V. M., López-Hidalgo, C., Rey, M. -D., Castillejo, M. Á., Jorrín-Novo, J. V., & Escandón, M. (2022). Multiomic Data Integration in the Analysis of Drought-Responsive Mechanisms in Quercus ilex Seedlings. Plants, 11(22), 3067. https://doi.org/10.3390/plants11223067

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop