Next Article in Journal
Identification of Key Pathways and Candidate Genes Controlling Organ Size Through Transcriptome and Weighted Gene Co-Expression Network Analyses in Navel Orange Plants (Citrus sinensis)
Previous Article in Journal
Etiologies of Early-Onset Hearing Impairment in Rwanda
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

ACT2.6: Global Gene Coexpression Network in Arabidopsis thaliana Using WGCNA

by
Vasileios L. Zogopoulos
1,2,†,
Konstantinos Papadopoulos
1,2,†,
Apostolos Malatras
3,
Vassiliki A. Iconomidou
2 and
Ioannis Michalopoulos
1,*
1
Center of Systems Biology, Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece
2
Section of Cell Biology and Biophysics, Department of Biology, National and Kapodistrian University of Athens, 15701 Athens, Greece
3
Molecular Medicine Research Center, biobank.cy, Center of Excellence in Biobanking and Biomedical Research, University of Cyprus, 2109 Nicosia, Cyprus
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Genes 2025, 16(3), 258; https://doi.org/10.3390/genes16030258
Submission received: 17 January 2025 / Revised: 6 February 2025 / Accepted: 21 February 2025 / Published: 23 February 2025
(This article belongs to the Section Plant Genetics and Genomics)

Abstract

:
Background/Objectives: Genes with similar expression patterns across multiple samples are considered coexpressed, and they may participate in similar biological processes or pathways. Gene coexpression networks depict the degree of similarity between the expression profiles of all genes in a set of samples. Gene coexpression tools allow for the prediction of functional gene partners or the assignment of roles to genes of unknown function. Weighted Gene Correlation Network Analysis (WGCNA) is an R package that provides a multitude of functions for constructing and analyzing a weighted or unweighted gene coexpression network. Methods: Previously preprocessed, high-quality gene expression data of 3500 samples of Affymetrix microarray technology from various tissues of the Arabidopsis thaliana plant model species were used to construct a weighted gene coexpression network, using WGCNA. Results: The gene dendrogram was used as the basis for the creation of a new Arabidopsis coexpression tool (ACT) version (ACT2.6). The dendrogram contains 21,273 leaves, each one corresponding to a single gene. Genes that are clustered in the same clade are coexpressed. WGCNA grouped the genes into 27 functional modules, all of which were positively or negatively correlated with specific tissues. Discussion: Genes known to be involved in common metabolic pathways were discovered in the same module. By comparing the current ACT version with the previous one, it was shown that the new version outperforms the old one in discovering the functional connections between gene partners. ACT2.6 is a major upgrade over the previous version and a significant addition to the collection of public gene coexpression tools.

1. Introduction

Genes that exhibit similar expression patterns across multiple transcriptomic samples are considered as coexpressed [1]. Coexpressed genes tend to participate in similar biological processes, and gene coexpression patterns can provide insights into underlying cellular processes and can be used for the discovery of functional gene partners [2]. One of the most effective ways to study gene coexpression is known as ‘‘condition-independent’’ coexpression analysis, where the most representative transcriptomic samples of each tissue or cell type for a species of interest are selected [3], allowing for the study of global gene coexpression.
Networks represent interactions between nodes. Network-specific concepts, such as connectivity and modules, have proven valuable for the analysis of complex interactions [4]. The development of high-throughput technologies has allowed for network-based methods to be applied in many domains of biology. Gene coexpression is visualized through gene coexpression networks (GCNs), which are undirected graphs, depicting genes as nodes (vertices) and gene correlations as lines connecting gene pairs (edges) [5]. The Pearson correlation coefficient (PCC) is usually the metric of choice for measuring coexpression, and its thresholding results in unweighted networks known as “relevance networks” [4]. On the other hand, weighted networks are based on the strength of coexpression, through a “soft” thresholding. This involves defining adjacency functions to convert coexpression similarities into connection strengths and estimating parameters based on biological considerations [4]. Signed and unsigned networks may be produced, depending on whether the adjacency function retains the sign of gene correlations. By distinguishing between positive and negative correlations, signed networks provide a more nuanced view which can be valuable for understanding regulatory relationships and functional interactions in biological systems [6].
A. thaliana is a small, annual plant of the Brassicaceae family [7], valued as a model organism for genetic research. Its short life cycle, self-pollination ability, and small genome established it as a key subject for molecular studies [8]. Arabidopsis transitions from vegetative to reproductive growth [9], developing flowers through a complex process involving stem and floral tissue differentiation [10]. Its complete genome was sequenced in 2000 [7], aiding research on gene expression, development, and stress responses, with mutant screening [11] and transcriptome analyses [12], providing insights into phenotypic impacts and genetic functions. The influx of publicly available transcriptomic data has resulted in the development of many online resources for studying gene coexpression in A. thaliana. These include online gene coexpression tools and databases, which are often based on GCNs, such as ATTED-II [13], EXPath2.0 [14], ACT [15], AraNet [16], CORNET [17], GeneMANIA [18], ExpressionAngler [19], CoNekT [20], CoCoCoNet [21], etc. Furthermore, there are standalone pieces of software that enable GCN construction through the input of a user’s transcriptomic data, such as WGCNA [22], CoExpNetViz [23], scLink [24], LSTrAP [25], etc. These tools are not limited to a predetermined list of species and can be utilized to create A. thaliana-based GCNs using data originating from A. thaliana transcriptomic samples.
Weighted Gene Correlation Network Analysis (WGCNA) is a popular R package that contains methods for both weighted and unweighted gene coexpression network construction [26]. Weighted networks are preferred over unweighted networks, due to their capacity to preserve the intricate details present in the data, ensuring a more comprehensive representation of the relationships between genes [22]. Empirical and simulated studies have demonstrated the effectiveness of weighted network approaches in capturing meaningful biological relationships and producing more reliable results compared to unweighted networks, with WGNCA having numerous biological applications [27,28]. Moreover, WGCNA includes additional functionalities, such as grouping genes into modules and identifying intramodular hub genes. Consequently, a biological role may be attributed to a gene of unknown function based on the module to which it belongs [29], as gene members of a coexpression module may be involved in similar biological functions [22]. Finally, WGCNA is able to correlate gene expression patterns and sample traits [30].
This manuscript presents the new version (2.6) of the web-based Arabidopsis Coexpression Tool (ACT) which employs WGCNA to produce a microarray-based A. thaliana global coexpression landscape and discover gene coexpression modules. In the 2.6 version of ACT, the same gene expression data of the previous version (2.0) were re-analyzed through WGCNA, and ACT2.6 gene coexpression levels were calculated using the TOM-based similarity, instead of the Pearson correlation coefficient that was used in ACT2.0. The ACT2.6 web interface was updated to incorporate additional features which include gene coexpression module discovery, depiction of the module each gene belongs to, module-wide biological term enrichment analysis, association of modules with tissue traits, and identification of intramodular hub genes.

2. Materials and Methods

2.1. Data Acquisition

To study the global coexpression landscape in A. thaliana, a dataset of 3500 Affymetrix Arabidopsis ATH1 Genome Array chip microarray samples was used, as before [15,31]. In brief, samples analyzed with Affymetrix Arabidopsis ATH1 Genome Array chip were retrieved from the Gene Expression Omnibus (NCBI-GEO) [32], ArrayExpress (EMBL-EBI) [33], and NASCArrays [34] public repositories. Those samples were checked for possible duplicate or corrupted files, resulting in 19,887 unique samples. Raw “.cel” data were normalized via SCAN [35], a single-channel normalization algorithm, along with an updated BrainArray CDF file [36], with each sample including 21,273 probe sets that target unique genes. Samples that were retrieved from whole plants, retrieved from mutant plants, or low-quality were removed. Finally, the remaining samples were clustered according to the expression value of their genes, and using an iterative algorithm, similar samples were programmatically removed, down to having 3500 representative samples. This approach ensures a high signal-to-noise ratio within individual samples, decreased variation across samples, adjustment for batch and platform effects, and minimization of tissue bias.
The following biological terms were downloaded: gene descriptions from Thalemine [37], gene ontologies from Gene Ontology [38], plant ontologies from Planteome [39], biological and metabolical pathways from KEGG [40], AraCyc [41], and Wikipathways [42], transcription factor gene targets from AtRegNet [43] and Plant Cistrome Database [44], and protein domains from Pfam [45].
All downloaded and processed data, including normalized gene expression levels and metadata for each sample, were stored in a MySQL relational database.

2.2. WGCNA Analysis

To construct a weighted gene coexpression network, the WGCNA R package (version 1.71) was executed in the R console, running on a 14-core, 120 GB RAM, Linux Ubuntu 22.04 system. Gene expression data from the aforementioned 3500 representative microarray samples, as well as tissue metadata, were used as input for WGCNA. In order to be imported, tissue trait data for each sample were converted to a binary matrix. The WGCNA function pickSoftThreshold with parameter networkType = “signed” was used to estimate the value of soft-threshold power β as 14, which was used to calculate adjacency values and, subsequently, Topological Overlap Matrix (TOM) similarity and distances between all genes. Average linkage hierarchical clustering was used to create a gene dendrogram that was exported in Newick format [46]. Genes were grouped into modules with dynamicTreeCut [47] based on the coexpression gene dendrogram, with parameters minClusterSize set as 15 and deepSplit as 2. Modules were grouped into a dendrogram using average linkage, and modules with a height <0.25 were merged. The merged modules, represented as eigengenes, were associated with tissue traits by calculating PCCs between the eigengenes and samples from the same tissue type (trait).

2.3. Website Construction

The web server is hosted on a Linux Ubuntu 22.04, 16-core, 64 GB memory system and served through Apache2. The web application was developed using HTML5, CSS, Bootstrap 5, PHP, and Javascript. All programming scripts were developed in PHP.
Initially, the user selects an A. thaliana gene, called the “driver” gene, through an auto-completed field, based on the genes included in the webtool database. ACT2.6 outputs a gene coexpression clade which includes the driver gene and its coexpressed gene partners. The default clade size is defined using an iterative algorithm, pruning internal nodes until the coexpression clade is the closest to containing 25 genes. Trial and error showed that this choice of the number of genes is optimal. Nevertheless, the clade size may be modified by adding internal nodes, up to a maximum of 25% of the total genes, or removing them, down to a single clade. At the top of the clade depiction, a scale bar, which corresponds to TOM-based distances between genes, is displayed. The driver gene is highlighted in yellow. The user may change the driver gene by clicking on a different AGI code, while clicking on the gene symbol redirects to the gene’s page in Thalemine.
The webtool allows users to perform relevant gene term overrepresentation analyses by selecting an enrichment analysis type from a drop-down menu. These analyses are executed using as input the genes which are included in the currently selected subtree. The results, including overrepresented biological terms such as gene or plant ontologies, pathways, gene-targeting transcription factors, and protein domains, are summarized in a term enrichment table. p-values are calculated based on the Hypergeometric Distribution [48], and the terms are ranked according to their False Discovery Rate (FDR)-adjusted p-values [49]. Only terms with an FDR-adjusted p-value ≤ 0.05 are displayed.
For each enriched term, the analysis presents the hit percentage (the frequency of the term’s occurrence in the subtree compared to its overall occurrence in the dataset) and the overrepresentation rate (observed versus expected frequency). The size of the subtree influences the results, as increasing its size may reveal additional enriched terms not detected in smaller subtrees. A larger subtree may encompass gene subclades with diverse functions, whereas smaller subtrees yield more specialized enrichment results. Monitoring the variation in enrichment p-values can help identify the optimal subtree size for analysis.
A second table provides a comprehensive list of the genes in the subtree along with all associated terms within the selected category, with hyperlinks to their source databases. The gene list for the subtree can be downloaded for use in external tools such as WebGestalt [50] or redirected for further analysis via links to platforms like STRING [51], Thalemine, g:Profiler [52], and Flame [53].

2.4. New Features in ACT2.6

In ACT2.6, the module each gene belongs to is now displayed as a colored circle next to the gene name in the gene coexpression clade. By clicking on the gene module circle, the user is redirected to the corresponding module page. On that page, the statistically significant positive or negative correlations of the module’s eigengene with each tissue trait are displayed. In addition, the enriched biological terms of that module’s genes for each biological term category are shown. Finally, a table at the bottom of the page shows all the genes of the module, ranked by their “average ranking”. The average ranking of gene i  a v g R a n k i in a module is calculated as follows:
a v g R a n k i = n + 1 n 1 n n 1 / 2 + 1 j = 1 n R i , j
where i and j are genes of the module, n is the number of genes in the module, and R i , j is the rank of the distance between genes i and j in the list of distances between all n n 1 / 2 pairs of the genes of the same module. In each module, the top-ranking gene and the genes having an avgRank difference from that of the top-ranking gene <1 are considered as that module’s intramodular hub genes.

2.5. API Access

ACT2.6 offers public access to coexpression clades and enrichment analyses via a JSON-based Application Programming Interface (API). The API is keyed on an A. thaliana AGI code, a tree node number, and optionally, a two-character keyword representing an enrichment analysis category. For instance, the URL https://www.michalopoulos.net/act2.6/api/AT3G16920/5/bp (accessed on 8 January 2025) retrieves the coexpression clade in Newick format for AT3G16920 as the driver gene with 5 internal nodes, details about the driver gene, a list of genes within the coexpression clade, and enriched “Gene Ontology: Biological Process” terms ranked by adjusted p-value. If an incorrect or missing keyword is provided, the enrichment analysis will not be executed. Detailed instructions for API usage are available in the Help section of the ACT2.6 website.

2.6. Comparison Between the Current and Previous ACT Versions

ACT2.6 and ACT2.0 were benchmarked with the same 10 gene use cases (AT4G13170, HSP101, COR15A, CEV1, CTL2, PSB28, LHY, PSBT, AMS, and emb1692) that were originally presented in ACT2.0 [15]. For comparison impartiality, the same biological term database versions were used, while the number of coexpressed genes was kept as close as possible, since an identical number of resulting coexpressed genes between the two versions cannot always be achieved, due to the coexpression tree representation being pruned based on internal node number.

3. Results

3.1. Gene Module Generation

The resulting gene coexpression dendrogram contained 21,273 A. thaliana genes, originally grouped into 42 gene modules. After module merging, 28 modules (27 functional modules, with gene numbers ranging from 27 to 3809, and 1 module that contained 2 ungrouped genes) were generated (Figure 1).
The PCCs between the eigengenes describing each of the 28 merged modules and the 55 tissue traits were calculated (Figure 2), allowing for the overall expression pattern of the genes of each module to be associated with specific healthy tissues/plant parts.

3.2. Functional Exploration of ACT2.6 WGCNA Modules

The enrichment analysis of a series of general or plant-specific biological terms, using ACT2.6’s internal enrichment functionality, was performed for all 27 functional gene modules, with GOBP enrichment being mainly used to indicate each module’s predominant biological function. In 14 out of 27 modules, there is an accordance between the top GOBP term and the GOBP terms that characterize the hub gene, and in 17 out of 27 modules, there is bibliographic evidence for the overexpression of the hub gene, in the module’s overexpressed tissue (Table 1). All modules are described [54] (pp. 13–60). Six indicative modules of variable sizes are also described in this manuscript.

3.2.1. Lightsteelblue1 Module

The lightsteelblue1 module includes 27 genes. They are overexpressed in “Flower”, “Inflorescence”, “Nectary”, “Flower bud”, “Stamen”, “Pistil”, and “Gynoecium” tissues and underexpressed in “Seedling” and “Root” tissues. All of these genes are associated with flower and/or anthesis. “Stamen filament development”, “jasmonic acid mediated signaling pathway”, “terpene biosynthetic process” and “nectar secretion” GO terms are significantly overrepresented (Table 2).
MYB21 (myb domain protein 21) and AT5G44630 (Terpenoid cyclases/Protein prenyltransferases superfamily protein) showed up as intramodular hub genes. MYB21 and MYB24 (myb domain protein 24), two transcription factors belonging to the lightsteelblue1 module, are the only known regulators of jasmonate, which is necessary for the development of stamen and pollen in Arabidopsis [55]. A MYB21 mutant plant showed reduced male fertility, delayed anther dehiscence, and shorter anther filaments. Although the MYB24 mutant plant appeared normal, the double MYB21/MYB24 mutant showed severe defects in all three aspects of stamen development. Exogenous jasmonate was ineffective at restoring male fertility in either the MYB21 or MYB21/MYB24 mutant plants. MYB21 and MYB24 are induced by jasmonate and play a crucial role in regulating various aspects of stamen development in Arabidopsis [55].
AT5G44630, the other hub gene of this module, is one of the main genes responsible for the production of sesquiterpenes that are emitted from Arabidopsis flowers [73]. TPS14 (terpene synthase 14) and AT3G25810 (Terpenoid cyclases/Protein prenyltransferases superfamily protein) are terpene synthases that are responsible for the synthesis of alcohol linalool and monoterpene volatile products, respectively, in flowers (Table 2) [73]. Another two genes of the module, CYP76C3 (cytochrome P450, family 76, subfamily C, polypeptide 3) and CYP71B31 (cytochrome P450, family 71, subfamily B, polypeptide 31), code for P450 cytochrome enzymes which metabolize the two linalool enantiomers to form hydroxylated or epoxidized compounds [74]. CYP76C3 and CYP71B31 are shown to be coexpressed with TPS10 (terpene synthase 10) and TPS14 [74], with the latter two being terpene synthases of this module.
MYB21 was used as input for ACT2.6, and the produced coexpression clade was set to 13 internal nodes, containing 24 genes (Figure 3). All genes of this clade belong to the lightsteelblue1 module, including the two hub genes of this module (MYB21 and AT5G44630), which are located close to each other. GO enrichment analysis revealed enriched terms related to terpene biosynthesis, in accordance with the enriched terms of lightsteelblue1 module (Table 2), as 24 out of the 27 genes of this module are part of this clade.

3.2.2. Yellowgreen Module

The yellowgreen module includes 45 genes, all of which are found in the same clade of the gene coexpression tree. These genes are exclusively located on the chloroplast chromosome. This module is overexpressed in “Rosette leaf”, “Leaf”, “Aerial tissue”, and “Seedling” tissues and underexpressed in “Root”, “Cell culture”, “Root tip”, “Seed”, “Starch sheath”, “Pollen”, and “Lateral root” tissues. The enrichment analysis of this module indicated the prevalence of specific biological process terms, such as “photosynthesis” and “electron transport chain”, which were significantly overrepresented (Table 3).
Hub genes YCF4 and ATPF are the only plastid-related genes that are overexpressed in multiple cases of transgenic plants that overexpress RAP2.2 [75], although the latter belongs to another module (purple).
Chloroplasts are essential for photosynthesis. As cell organelles, they feature their own chromosome, and their genes are organized as operons or transcriptional units [56]. ATPA (ATP synthase subunit α), ATPI (ATPase, F0 complex, subunit A protein), and hub genes ATPH (ATP synthase subunit C family protein) and ATPF (ATPase, F0 complex, subunit B/B′, bacterial/chloroplast) are organized into the ATP synthase (atp) operon. ATPH transcripts are more abundant compared to all other atp operon transcripts because of the protection of their structure, as they possess both a hairpin structure at the 3′ end and RNA-binding proteins at the 3′ end and at the 5′ end [56].
PSBA (photosystem II reaction center protein A), PSBB (photosystem II reaction center protein B), PSBC (photosystem II reaction center protein C), and PSBD (photosystem II reaction center protein D), which code proteins of photosystem II protein complex [76], belong to this module, resulting in the enrichment of “photosystem II protein” Pfam domain (Table 3). PSBC and PSBD are encoded by the same polycistronic transcript [76].

3.2.3. White Module

The white module contains 102 genes, 99 of which are located in the same clade. They are overexpressed in “Stem”, “Root”, “Stalk”, “Hypocotyl”, “Basal tissue”, “Silique”, and “Replum” tissues and underexpressed in “Leaf”, “Rosette leaf”, “Seed”, “Seedling”, “Aerial tissue”, and “Rosette” tissues. The “plant-type secondary cell wall biogenesis” and “xylan metabolic process” GO terms and “xylem” PO term are significantly overrepresented (Table 4). IRX3 (cellulose synthase family protein) and GAUT12 (galacturonosyltransferase 12) are the two hub genes of this module.
Many genes contained in the white module, such as IRX3, IRX12 (Laccase/Diphenol oxidase family protein), IRX9 (Nucleotide-diphospho-sugar transferases superfamily protein), CESA4 (cellulose synthase A4), IRX6 (COBRA-like extracellular glycosyl-phosphatidyl inositol-anchored protein family), GXM3 (glucuronoxylan 4-O-methyltransferase-like protein, DUF579), CTL2 (chitinase-like protein), KNAT7 (homeobox knotted-like protein), PGSIP3 (plant glycogenin-like starch initiation protein 3), GAUT12, AT4G27435 (fiber, DUF1218), GUT2 (Exostosin family protein), IRX1 (cellulose synthase family protein), AT5G60720 (electron transporter, putative (Protein of unknown function, DUF547), AT1G72220 (RING/U-box superfamily protein), RWA1 (O-acetyltransferase family protein), RIC2 (ROP-interactive CRIB motif-containing protein 2), AT1G08340 (Rho GTPase activating protein with PAK-box/P21-Rho-binding domain-containing protein), PGSIP1 (plant glycogenin-like starch initiation protein 1), FLA11 (FASCICLIN-like arabinogalactan-protein 11), and LAC2 (laccase 2) were known to exhibit similar expression patterns [63].
IRX3 is highly expressed in the hypocotyl and in the base of the stem, while in samples from bigger height points of the stem, it shows lower expression levels [63]. In addition, ERF38, which belongs to a different module (blue), has been shown to be coexpressed with a series of white module genes, i.e., AT1G07120 (CHUP1-like protein), AT1G09440 (Protein kinase superfamily protein), AT1G22480 (Cupredoxin superfamily protein), RIC2, GUT2, ATMYB103 (myb domain protein 103), AT1G80170 (Pectin lyase-like superfamily protein), LAC2, AT2G31930 (hypothetical protein), AT2G40120 (Protein kinase superfamily protein), AT2G41610 (transmembrane protein), IQD10 (IQ-domain 10), CTL2, PGSIP1, IRX15 (IRREGULAR XYLEM protein (DUF579)), AT3G59845 (Zinc-binding dehydrogenase family protein), NAC073 (NAC domain-containing protein 73), FLA11, AT5G06930 (nucleolar-like protein), IRX6, CESA4, and FLA12 (FASCICLIN-like arabinogalactan-protein 12), which are related to secondary cell wall processing. The expression of ERF38 at floral stems and mature siliques is intensive and may participate in an alternative, non-typical with cellulose and lignin, process of secondary cell wall synthesis [77], possibly explaining its grouping in a different module.

3.2.4. Midnightblue Module

The midnightblue module contains 385 genes, 374 of which are located in the same clade. They are overexpressed in “Seed”, “Endosperm”, “Silique”, and “Embryo” tissues and underexpressed in “Leaf”, “Seedling”, and “Rosette” tissues. “Seed development”, “seed maturation”, and “seed oilbody biogenesis” are overrepresented GOBP terms (Table 5).
AT1G27990 (transmembrane protein) and AT1G72100 (late embryogenesis abundant domain-containing protein/LEA domain-containing protein) are the hub genes of this module. Late embryogenesis abundant (LEA) genes activate during various stresses and accumulate at late stages of seed development [67]. Tandem repeat genes AT3G22490 and ATECP31 (AT3G22500, LATE EMBRYOGENESIS ABUNDANT PROTEIN ECP31), which encode seed maturation proteins, have a positive correlation of their expression levels [78].
Gene pairs AtLEA4-1 (Late Embryogenesis Abundant 4-1) and LEA18 (Late Embryogenesis Abundant 18), LEA7 (LATE EMBRYOGENESIS ABUNDANT 7) and AT3G15670 (late embryogenesis abundant protein), AT1G72100 and AT1G22600 (late embryogenesis abundant protein), AT2G18340 (late embryogenesis abundant domain-containing protein) and AT4G36600 (late embryogenesis abundant protein), LEA (dehydrin LEA) and AT4G39130 (Dehydrin family protein), ECP63 (embryonic cell protein 63) and AT3G53040 (late embryogenesis abundant protein), and AT4G21020 (late embryogenesis abundant protein) and AT5G44310 (late embryogenesis abundant protein) were found to have similar expression patterns, with all of them being expressed in seeds [78], serving as an additional line of evidence for the grouping of those genes in the same module.
Multiple genes of the midnightblue module reach their maximum expression in the mid to late seed developmental stages, albeit having different points of initial expression [79]. AT1G03890 (RmlC-like cupins superfamily protein), PER1 (1-cysteine peroxiredoxin 1), AT1G65090 (nucleolin), LBD40 (LOB domain-containing protein 40), TIP3;1 (Aquaporin-like superfamily protein), CBSX4 (Cystathionine β-synthase family protein), AT3G01570 (oleosin family protein), OLEO4 (oleosin 4), AT3G63040 (hypothetical protein), OLEO1 (oleosin 1), SESA2 (seed storage albumin 2), ATS3 (embryo-specific protein 3), OLEO2 (oleosin 2), SESA5 (seed storage albumin 5), ATPXG2 (peroxygenase 2), CYP71B10 (cytochrome P450, family 71, subfamily B, polypeptide 10), AT3G54940 (Papain family cysteine protease), AT5G59170 (Proline-rich extensin-like family protein), and AT5G62800 (protein with RING/U-box and TRAF-like domain) start being expressed during the early stages of seed development (embryos at early heart to late torpedo stage). SOM (Zinc finger C-x8-C-x5-C-x3-H type family protein), AT1G14950 (Polyketide cyclase/dehydrase and lipid transport superfamily protein), AT1G48660 (Auxin-responsive GH3 family protein), GLYI8 (Lactoylglutathione lyase/glyoxalase I family protein), AT2G33520 (cysteine-rich/transmembrane domain protein A), SMP1 (seed maturation protein 1), XTH11 (xyloglucan endotransglucosylase/hydrolase 11), CYP76C7 (cytochrome P450, family 76, subfamily C, polypeptide 7), AT5G01670 (NAD(P)-linked oxidoreductase superfamily protein), AT5G04010 (F-box family protein), AT5G22470 (poly ADP-ribose polymerase 3), AT5G44310 (LEA protein), AT5G45690 (histone acetyltransferase, DUF1264), DOG1 (delay of germination 1), and HVA22B (HVA22 homolog B) begin their expression in mid seed development stages (embryos at late torpedo to early walking-stick stage). Only AT4G36700 (RmlC-like cupins superfamily protein) has an initial expression in a very early developmental stage (heart embryo stage) [79].

3.2.5. Cyan Module

The cyan module contains 1079 genes which are overexpressed in “Leaf”, “Protoplast”, “Root”, and “Trichome” tissues and underexpressed in “Flower”, “Seedling”, “Seed”, “Aerial tissue”, “Shoot apex”, “Inflorescence”, “Flower bud”, “Pollen”, “Petiole”, and “Developing leaf insertions” tissues. The enriched biological terms of this module are related to the defense response (Table 6).
AT5G18490 (vacuolar sorting-associated protein DUF946) is the hub gene of this module. Its methylation is increased in multi-mutant drm1/drm2/cmt3 plants [80]. ERF4 (ethylene responsive element binding factor 4), ERF11 (ERF domain protein 11), ERF5 (ethylene responsive element binding factor 5), CEJ1 (cooperatively regulated by ethylene and jasmonate 1), ERF6 (ethylene responsive element binding factor 6), AT5G51190 (Integrase-type DNA-binding superfamily protein), ACS6 (1-aminocyclopropane-1-carboxylic acid synthase 6), MAPKKK14 (mitogen-activated protein kinase kinase kinase 14), and MKK9 (MAP kinase kinase 9) control ethylene accumulation [81]. SIP4 (SOS3-interacting protein 4), STZ (salt tolerance zinc finger), and ZF2 (zinc-finger protein 2) are involved in salinity tolerance [81]. VPS28-1 (vacuolar protein sorting-associated protein 28 homolog 1), SRC2 (soybean gene regulated by cold-2), ELC (Ubiquitin-conjugating enzyme/RWD-like protein), VPS2.1 (SNF7 family protein), and VPS46.2 (SNF7 family protein) orchestrate trafficking from endosomes to central vacuole [81]. AT1G02660 (α/β-Hydrolases superfamily protein), IP5PII (myo-inositol polyphosphate 5-phosphatase 2), BAP1 (BON association protein 1), and FAB1D (FORMS APLOID AND BINUCLEATE CELLS 1A) participate in phospholipid signaling [81]. SRO5 (similar to RCD one 5) controls reactive oxygen species (ROS) in plants, while RHL41 (RESPONSIVE TO HIGH LIGHT 41) participates in signal transduction of ROS [81]. NHL3 (NDR1/HIN1-like 3) and PUB17 (plant U-box 17) function against Pseudomonas syringae and along with AT2G34930 (disease resistance family protein/LRR family protein) are involved in biotic and abiotic stress conditions [81].
CML38 (calmodulin-like 38), AT3G10300 (Calcium-binding EF-hand family protein), AT5G62570 (Calmodulin-binding protein-like protein), CPK28 (calcium-dependent protein kinase 28), CPK32 (calcium-dependent protein kinase 32), AT4G34150 (Calcium-dependent lipid-binding domain family protein), and AT4G27280 (Calcium-binding EF-hand family protein) are calcium-dependent genes [81]. RPK1 (receptor-like protein kinase 1) and CYP707A3 (cytochrome P450, family 707, subfamily A, polypeptide 3) are ABA-related genes [81]. Although the aforementioned genes are involved in specific defense responses, they all exhibit upregulation at any kind of environmental or biotic stress [81].
In addition, CAF1b (CCR4-associated factor 1b), AT5G54940 (Translation initiation factor SUI1 family protein), TEM1 (TEMPRANILLO 1), SCL13 (SCARECROW-like 13), MYBR1 (myb domain protein r1), MYB73 (myb domain protein 73), NAC102 (NAC domain-containing protein 102), NAC062 (NAC domain-containing protein 62), TIP (TCV-interacting protein), WRKY40 (WRKY DNA-binding protein 40), WRKY33 (WRKY DNA-binding protein 33), WRKY25 (WRKY DNA-binding protein 25), WRKY11 (WRKY DNA-binding protein 11), WRKY18 (WRKY DNA-binding protein 18), HSFB2A (heat shock transcription factor B2A), and HSFA4A (heat shock transcription factor A4A) are transcription factors upregulated in any stress condition [81].
DIC2 (dicarboxylate carrier 2), AT5G11650 (α/β-Hydrolases superfamily protein), BCS1 (cytochrome BC1 synthesis), AT2G46620 (P-loop containing nucleoside triphosphate hydrolases superfamily protein), AT4G33920 (Protein phosphatase 2C family protein), FC1 (ferrochelatase 1), UCP5 (uncoupling protein 5), and PNC2 (peroxisomal adenine nucleotide carrier 2) are involved in mitochondrial functions and are upregulated in any stress condition [81]. RSH2 (RELA/SPOT homolog 2) encodes a homolog protein of RelA/SpoT, bacterial enzymes that adapt bacteria to various environmental stresses [81]. Finally, 51 transcription factors and 15 ubiquitin-ligase genes that are responsive to chitooctaose treatment [82] belong to this module.

3.2.6. Blue Module

The blue module contains 3249 genes (22 of them are chloroplast ones), most of which are located in the same clade. They are overexpressed in “Leaf”, “Aerial tissue”, “Rosette”, “Seedling”, “Rosette leaf”, “Cotyledon”, and “Shoot” tissues and underexpressed in “Root”, “Cell culture”, “Root tip”, “Seed”, “Pollen”, “Flower”, “Lateral root”, “Pollen tube”, and “Phloem” tissues. The “photosynthesis”, “plastid organization”, “thylakoid membrane organization”, and “photosystem II assembly” GO terms and “chlorophyll A-B binding protein” and “PsbP” Pfam families of proteins are overrepresented (Table 7). The “chloroplast” Gene Ontology Cellular Component (GOCC) term is significantly overrepresented, indicating the localization of the proteins coded by the genes of this module. The proteome of chloroplasts is encoded mainly by the nuclear genome, although they require their own genome [83], supporting the grouping of a small number of chloroplast genes with numerous nuclear ones which are nevertheless enriched for chloroplast-specific biological terms.
AT1G76450 (Photosystem II reaction center PsbP family protein) is the hub gene of this module. AT1G76450, PSBP-1 (photosystem II subunit P-1), PSBP-2 (photosystem II subunit P-2), PPL1 (PsbP-like protein 1), PnsL1 (Photosynthetic NDH subcomplex L 1), PPD1 (PsbP-Domain Protein1), AT2G28605 (Photosystem II reaction center PsbP family protein), AT1G77090 (Mog1/PsbP/DUF1795-like photosystem II reaction center PsbP family protein), PPD5 (PsbP domain protein 5), and PPD6 (PsbP-domain protein 6) constitute the whole PsbP protein family. PsbP is a subcomplex of photosystem II which catalyses water splitting [72]. CBL2 (calcineurin B-like protein 2) and CBL10 (calcineurin B-like protein 10) encode calcium-signal sensors, and CIPK1 (CBL-interacting protein kinase 1), CIPK3 (CBL-interacting protein kinase 3), CIPK7 (CBL-interacting protein kinase 7), CIPK9 (CBL-interacting protein kinase 9), and CIPK20 (CBL-interacting protein kinase 20), which encode proteins capable of interacting with CBLs, belong to this module [84]. TOC33 (translocon at the outer envelope membrane of chloroplasts 33) regulates the expression of many genes that are involved in photosynthesis [85]. Tandem repeat genes COR15A (cold-regulated 15a) and COR15B (cold-regulated 15b), which are located in neighboring clades on the coexpression tree, have positively correlated expression [78].

3.3. Benchmarking of ACT2.6 Versus ACT2.0

For the coexpression clade produced in each of the 10 gene use cases tested, the top enriched GO:BP term of ACT2.0 and its corresponding adjP were recorded and compared to the adjP of the same term in the coexpression clade of ACT2.6 (Table 8).
In the case of AT4G13170, HSP101, CTL2, AMS, and LHY, the same biological terms were enriched, having very close FDR-adjusted p-values (adjP). Any small differences in adjP could be due to the different number of coexpressed genes that resulted in each webtool, rather than the difference in performance.
The gene coexpression subclades of multiple gene cases in ACT2.6 exhibited a “ladderization” effect, which is a characteristic of an unbalanced hierarchical tree. As a result, in the case of COR15A, PSB28, and PSBT, even when the internal node was set to 1, the resulting coexpression clade contained a large number of genes (far greater than the initial default number of 25). Consequently, the internal nodes in ACT2.0 were increased, as an attempt to match the number of genes of the clades of ACT2.6, as much as possible. In the case of COR15A, there were a lot of overlapping genes between the outcomes of ACT2.6 and ACT2.0, with the ACT2.0 coexpression clade exceeding the size of the ACT2.6 one. Nevertheless, the number of genes characterized by the most prominent GOBP term (“photosynthesis”) was similar between the two clades, resulting in ACT2.6 exhibiting smaller enrichment p-values than ACT2.0 for that term. In the case of PSB28, the expanded ACT2.0 clade of 262 genes exhibited almost identical adjP in the common top enriched “photosynthesis” GOBP term, with the ACT2.6 coexpression clade. In the case of PSBT, a comparable number of coexpressed genes between the tool versions could not be achieved. Thus, the smallest coexpression clade of PSBT that could be generated in ACT2.6, containing 995 coexpressed genes, was compared to the ACT2.0 coexpression clade of 4 internal nodes and 72 coexpressed genes which was originally tested as a use case. The resulting top enriched GOBP term was the same in each case (“photosynthesis”) with ACT2.6 having much lower adjP due to the ~14x higher gene number. Nevertheless, the coexpression clade of ACT2.0 contained all 72 chloroplast genes, while in the ACT2.6 coexpression clade of 995 genes, only 12 genes were chloroplast ones.
Finally, in the cases of CEV1 and emb1692, ACT2.0 exhibited lower p-values for the same overrepresented biological terms. Specifically, the CEV1 coexpression clade in ACT2.0 contained genes that were more relevant to the gene’s main function of primary cell wall formation [86]. On the other hand, the enriched terms of the emb1692 clade were similar for both versions, with the adjP difference also being explained by the larger number of ACT2.0 coexpressed genes.

4. Discussion

4.1. Life Cycle of A. thaliana Through Gene Coexpression Patterns

4.1.1. Seedling

The darkmagenta, blue, darkgreen, yellowgreen, darkolivegreen, and purple modules are overexpressed in “seedling” tissue in descending order of correlation. Out of these modules, the blue, yellowgreen, and darkmagenta modules are overexpressed in “aerial tissue”, while darkgreen, darkolivegreen, and purple exhibit their highest expression in “root” tissue. All aforementioned modules serve distinctly different biological purposes.

4.1.2. Underground Development

Darkgreen and saddlebrown are the most overexpressed modules in the “root” tissue, followed by royalblue, steelblue, darkolivegreen, and purple, in decreasing order. Darkgreen is the only module that shows significant overexpression in all three “root”, “lateral root”, and “root tip” tissues. Consequently, there is a significant overrepresentation of “root morphogenesis” and “root development” GOBP terms within this module. Saddlebrown is overexpressed in “root” and “lateral root”, with ~86% of its genes being characterized by the “root system” POPA term. The darkolivegreen, royalblue, and purple modules showcase their highest overexpression in “root” and “cell culture” tissues. Those three modules contain mitochondrial complex 1 subunit genes [87], and therefore, the “mitochondrial protein complex” GOCC term is overrepresented in all three modules. The royalblue and darkolivegreen modules are unique in sharing LSM-family genes [68], with LSM3B participating as a hub gene in the royalblue module. The darkolivegreen module contains AT3G60770 (Ribosomal protein S13/S15) as a hub gene which, as a ribosomal protein, is associated with the module’s overrepresented “translation” GOBP term. In addition, the royalblue and darkolivegreen modules share “Ribosome” as an overrepresented KEGG term, while purple shows overrepresentation of “Proteasome” KEGG term, supporting the identification as a hub gene of AT4G24820 which encodes a 26S proteasome regulatory subunit. This observation is additionally supported by the non-cell-autonomous nature of lateral root development, wherein various cell types assume distinct functions and trigger a multitude of genetic networks throughout this progression [88].
The genes of black and magenta modules are overexpressed in “root” tissue along with “meristem”, “apex”, and “shoot apex”. Genes involved in the cell cycle are essential to root development [88], and these three modules share “cell cycle” as an overrepresented term, coupled with other similar terms. HDA3 (histone deacetylase 3), HD2B (histone deacetylase 2B), and HDT4 (histone deacetylase-related/HD-like protein), which belong to the black module, rearrange the structure of chromatin, affecting root development [88].
ARF19 (auxin response factor 19) and NPH4 (NON-PHOTOTROPHIC HYPOCOTYL) regulate LBD16 (lateral organ boundaries-domain 16) and LBD29 (lateral organ boundaries-domain 29) transcription factors which are responsible for lateral root initiation by promoting cell division. ATARCA (Transducin/WD40 repeat-like superfamily protein), RACK1B_AT (receptor for activated C kinase 1B), and RACK1C_AT (receptor for activated C kinase 1C), which are involved in cell cycle functions and are involved in lateral root development, are also regulated by ARF19 and NPH4 [88]. NPH4, RACK1B_AT, and RACK1C_AT belong to the black module, ATARCA belongs to the darkolivegreen module, and ARF19 and LBD16 belong to the darkgreen module.
In the early stages of lateral root development, where asymmetric cell division takes place, magenta module genes CYCB;1 (CYCLIN B1;1), BRXL4 (BREVIS RADIX-like 4), and ATBRXL2 (Disease resistance/zinc finger/chromosome condensation-like region domain-containing protein) are induced. Also, magenta module genes MP (Transcriptional factor B3 family protein/auxin-responsive factor AUX/IAA-like protein) and TMO6 (TARGET OF MONOPTEROS 6), which are related to each other as gene regulator and target gene, respectively, are expressed in the lateral root, affecting its development, although they are also involved in embryonic development [88]. In the ACT2.6 coexpression tree, TMO6 and BRXL4 are located in adjacent leaves, owing to their regulatory connection. AT2G17500 (Auxin efflux carrier family protein) and WRKY75 (WRKY DNA-binding protein 75), which belong to the darkgreen and cyan modules, respectively, both independently influence the structure of the root. Cyan module exhibits overexpression in “root” and “protoplast” tissues and is associated with the GOBP term “stress response”, justifying the place of WRKY75 in this module, as it is mainly induced during phosphate starvation in roots [88].

4.1.3. Aerial Development of the Plant

The blue, yellowgreen, and darkmagenta modules are overexpressed in “Aerial tissue”. The shoot apex is the tissue from which the total of the above-ground plant originates, except for hypocotyl and cotyledons. The shoot apex includes distinct layers of cells serving different biological functions [9]. Cells responsible for establishment and maintenance of the shoot apex show overexpression of KNAT1 (homeobox knotted-like protein) and KNAT6 (homeobox protein knotted-1-like 6), which belong in magenta module and are also adjacent genes in the ACT2.6 coexpression tree, KNAT2 (homeobox knotted-like protein), which belongs in the turquoise module, and STM (SHOOT MERISTEMLESS) of the lightgreen module [9]. High-resolution single-cell RNA sequencing of shoot apex tissue exhibited a cluster of epidermal cell samples having a high accumulation of HIS4 (histone 4) and TSO2 (Ferritin/ribonucleotide reductase-like family protein) transcripts [9], which belong to the darkolivegreen module. Other epidermal cells of the shoot apex overexpress genes such as CDKB2;1 (cyclin-dependent kinase B2;1), CYCA1;1 (Cyclin A1;1), ENODL15 (early nodulin-like protein 15), MAD2 (MITOTIC ARREST-DEFICIENT 2), and PCNA2 (proliferating cell nuclear antigen 2) of the magenta module, CYCD3;2 (CYCLIN D3;2) and PDF1 (protodermal factor 1) of the darkmagenta module, RPL24 (plastid ribosomal protein L24), ATML1 (MERISTEM LAYER 1), and FDH (formate dehydrogenase) of the blue module, and RPS6 (RESISTANT TO P. SYRINGAE 6) of the turquoise module [9]. Finally, transcripts of HIS4, CDKB2;1, and CYCA1;1 have also been found in proliferating cells [9].

4.1.4. Photosynthetic Tissues

Blue and darkred are the only overexpressed modules in “rosette” tissue, where turquoise and paleturquoise are underexpressed, although their overexpression peaks in “rosette leaf” tissue. The blue, darkred, and yellowgreen modules are overexpressed in both “leaf” and “rosette leaf” tissues. The yellowgreen module contains chloroplast genes, exclusively, while the majority of blue module genes originate from the nuclear genome. They both share the “photosynthesis” GOBP term as the most significantly overrepresented one. Only the blue module is enriched with “chloroplast organization” GOBP term, confirming that although chloroplasts contain their own genome, most genes that are expressed in chloroplasts are encoded by the nuclear genome [83]. The darkred module is enriched with “defense response”, “response to other organism” and “response to stress” GOBP terms; as its hub genes, SIB1 and MEK1, participate in the aforementioned processes. The darkred module also includes genes responsible for induced systemic resistance [89]. In addition, the cyan module includes many genes that are upregulated during various stresses [81], and it is enriched for all aforementioned darkred module enriched GOBP terms, as well as “response to chitin”. The Cyan module attains the top of its expression in “leaf” tissue, the same as the darkred module, but it is also overexpressed in “root” tissue, where the darkred module is significantly underexpressed.
Stomata, the structures that permit carbon dioxide uptake through leaves, indicate high expression of SPCH (SPEECHLESS) and TMM (TOO MANY MOUTHS) of the blue module, AT4G31805 (WRKY family transcription factor) of the magenta module, and BASL of the darkmagenta module, in their early development [9].

4.1.5. Phloem and Stem Tissue Development

Only the white module is significantly overexpressed in “basal tissue”, and its expression level peaks in “stem” tissue, justifying the enrichment of “plant-type secondary cell wall biogenesis” and “xylan metabolic process” GOBP terms, where IRX3 and GAUT12 hub genes participate, respectively. The steelblue module is also overexpressed in “stem” tissue, as it includes NAC045 and NAC086 which are linked to the module’s overrepresented “sieve element enucleation” and “sieve element differentiation” GOBP terms. Within vascular bundles in the stem, xylem and phloem tissues consist of specialized cell types including xylem fibers, xylem vessel elements, phloem sieve elements, and phloem companion cells, facilitating the transport of water and nutrients. In contrast to these highly specialized cells, cambium stem cells retain the ability to produce secondary xylem and phloem cells, augmenting the transport capacity and structural support of the growing shoot system [90]. SEOR1 and SEOa of the steelblue module and XCP1 (xylem cysteine peptidase 1) and XCP2 (xylem cysteine peptidase 2) of the white module are expressed specifically in vascular tissues [9]. For xylem identification, the expression of PXY (Leucine-rich repeat protein kinase family protein) and BHLH32 (basic helix–loop–helix 32), which belong to the magenta and darkgreen modules, respectively, is preceded by XCP2 expression [9]. In addition, genes of the magenta and black modules, related to the auxin pathway, such as MP (MONOPTEROS), LAX2 (like AUXIN RESISTANT 2), PIN6 (Auxin efflux carrier family protein), and IAA12 (AUX/IAA transcriptional regulator family protein), are overexpressed in xylem [9]. On the other hand, AT5G57130 (Clp amino terminal domain-containing protein), APL (Homeodomain-like superfamily protein), HAC2 (histone acetyltransferase of the CBP family 2), and SEOR1, which belong to the magenta, blue, turquoise, and steelblue modules, respectively, are mainly expressed in phloem [9].
The primary inflorescence stem comprises a wide range of tissues, spanning from the undifferentiated cambium stem cells to the terminally differentiated cells within the vasculature [90]. ANT (AINTEGUMENTA) of the magenta module exhibits expression in the cambium of the inflorescence stem, although it is also found to be expressed in root cambium cells. Especially in the cambium of the stem, ANT is coexpressed with AT3G13980 (SKI/DACH domain protein) and AT1G56210 (Heavy metal transport/detoxification superfamily protein), which belong to blue and magenta modules, respectively [90].

4.1.6. Initiation and Development of Flower Tissue

The initial stages of flower growth are marked by significant alterations in shape and involve numerous transcriptional regulators overseeing crucial functions, such as establishing floral patterns and specifying floral organs [91]. In the early stages of flower primordium emergence, increased expression of AP1, which belongs to the lightgreen module, represses SVP (SHORT VEGETATIVE PHASE) of the blue module [91]. Also, AG and AP3 of the lightgreen module showcase a high level of expression, temporally from the early stages of flower development, maintaining this situation until the late stages [91]. Lightgreen module genes, such as AMS which affects pollen wall formation, SPL, and EMS1 (Leucine-rich repeat transmembrane protein kinase), are upregulated in the mid-stage of flower development [91]. The lightgreen and lightsteelblue1 modules show the highest expression in the “flower bud” and “flower” tissues, respectively. The Lightgreen module contains the EXL6 hub gene which participates in pollen coat synthesis [66], in accordance with the most overrepresented GOBP term of the lightgreen module being “pollen wall assembly”. Lighsteelblue1 hub gene MYB21 plays a crucial role in stamen development as a jasmonate regulator [55], explaining the most enriched GOBP term in this module being “stamen filament development”. The pink module is notably overexpressed in “pollen” and “pollen tube” tissues, exhibiting an overrepresentation in “pollen tube growth” and “pollen tube development” GOBP terms. In addition, the turquoise and paleturquoise modules also exhibit overexpression in “pollen” and “pollen tube” tissues, although their overexpression peaks in “rosette leaf” tissue. The lightgreen and sienna3 modules contain MADs-box transcription factors SEP3 and STK, respectively, which combine their action to enable the transcriptional regulation of ovule target genes [92]. Also, AT4G15750 (Plant invertase/pectin methylesterase inhibitor superfamily protein), a sienna3 hub gene, and many genes of the turquoise module are involved in the late stages of embryo sac development [57].

4.1.7. Flower to Seed

The sienna3 module is overexpressed in the “flower” tissue and specifically female organs, like “pistil”, “silique”, “suspensor”, “embryo”, and “seed” tissues. The orange module combines overexpression in tissues associated with male reproduction, like “microsporocyte”, “pollen”, “anther”, and “seed”, although it is underexpressed in “flower” tissue. The darkorange module is most overexpressed in “silique”, like the sienna3 module, and is, generally, upregulated in all other specific tissues which are contained in silique, like “seed”, “endosperm”, “replum”, “embryo”, and “suspensor”. Its hub gene, KCS18, is associated with the “lipid metabolic process” GOBP term, which is also a prevalent module term.
The suspensor is an essential structure for further seed development, while also linking the embryo and further seed with the whole plant. FUS3 of darkorange module affects the development of suspensors [93]. In “flower” tissue, the overexpression of the pink, turquoise, and lightgreen modules suggests a regulatory relationship where WRKY2 (WRKY DNA-binding protein 2) governs the expression of WOX8 (WUSCHEL related homeobox 8) and AT2G33880 (homeobox-3), affecting suspensor development [93]. The midnightblue and darkorange modules are overexpressed in “endosperm”, “silique”, “embryo”, and “seed” tissues, with the latter being the tissue in which the genes of the the midnightblue module are mostly overexpressed, justifying the significantly overrepresented “seed development” GOBP term.

4.2. Comparison of ACT2.6 and ACT2.0

The main rationale behind the usage of WGCNA in the new ACT version (ACT2.6) was to discover whether the signed weighted TOM-based coexpression tree produced by WGCNA would outperform the coexpression results of the previous PCC-based ACT2.0 version (thus, the use of the same datasets and gene-describing biological terms was compulsory). In general, TOM provides a robust and accurate centrality measure that outperforms standard metrics in predicting gene importance, emphasizing the stronger correlation of intramodular connectivity with gene significance. Its consistent performance across various network contexts, especially when combined with soft thresholding techniques, ensures reliable preservation of biological signals and meaningful insights into gene interactions [4].
A main addition of ACT2.6 to ACT2.0 is the grouping of genes into modules, as well as the modules’ association with tissue traits. In general, the tissues in which each module is overexpressed align with the top enriched POPA terms for that module and are biologically consistent with the top enriched GO terms associated with the module’s genes (i.e., the yellowgreen module, being overexpressed in leaves, exhibits leaf-related POPA and photosynthesis-related GOBP enriched terms). Additionally, the top enriched GOBP term and the top overexpressed tissue of each module coincided with the biological terms of the module’s hub gene, as well as its tissue-specific overexpression, in over half of the modules (Table 1). Since the best way to evaluate a coexpression tool is to study whether it is able to replicate known biology, these concordances served as positive controls for the ability of ACT2.6 to produce functionally relevant coexpression partners.
Furthermore, the identification of modules of coexpressed genes allowed for a better evaluation of the coexpression clades produced by ACT2.6, compared to those of the previous ACT version. For example, the lightsteelblue1 module links MYB21 and MYB24, which encode jasmonate regulators responsible for stamen and pollen development [55], with AT5G44630 and other terpene synthases which may be involved in pollinator attraction or the protection of reproductive organs against bacteria and fungi [73]. In contrast, these two genes are distant in the ACT2.0 tree. In ACT2.6, SUS2 and LEC1, which affects the former’s expression [94], belong to the same subtree of 81 genes that participate exclusively in the darkorange module, while in ACT2.0, those two genes belong in distinctly different clades. AT1G76640 (Calcium-binding EF-hand family protein) and AGD11 (ARF-GAP domain 11) are calmodulin-like proteins that are upregulated during pollen germination and pollen tube growth [95]. In ACT2.0, AT1G76640 and AGD11 belong to separate subtrees, while in ACT2.6, they both belong to a clade of 631 exclusively pink module genes. Finally, COW1 and RHD2 participate in hair root development [96] and are located in the same subtree of 159 genes of the darkgreen module in ACT2.6, while in ACT2.0, COW1 and RHD2 belong to separate clades.
In summary, ACT2.6 WGCNA-derived coexpression clades were more unbalanced compared to ACT2.0 ones. Nevertheless, the benchmarking for the creation of coexpression clades of the 10 use-case genes showed that ACT2.6 and ACT2.0 produced, in general, comparable results, regarding the pathway participation of genes and the discovery of potential functional partners. However, there were exceptions where one tool outperformed the other. Therefore, it is recommended that users consult both ACT2.6 and ACT2.0 when they focus on the topology of the coexpression clades.
ACT2.6 constitutes a major upgrade over ACT2.0 by producing coexpressed gene modules, identifying intramodular hub genes, and introducing module–tissue trait associations, all of which are features unique to this version. Additionally, ACT2.6 identified multiple gene partners that were not discovered by ACT2.0, located within the same module and coexpression clade. Therefore, ACT2.6 is a significant addition in the field of gene coexpression tools when discovering gene partners for a gene of interest or attributing biological roles to genes of unknown function.

4.3. Limitations

The limitation of the depiction of the coexpression tree produced by hierarchical clustering, as in ACT2.6, is the inability to portray negative gene correlations. In addition, through this approach, genes may only participate in a single coexpression clade, a limitation that is also present in the WGCNA module discovery analysis, as genes are grouped into non-overlapping modules. This is contrary to the fact that a gene may interact with different sets of genes, playing diverse biological roles, while also impeding the identification of inter-modular genes, i.e., genes that act as links between distinct functional modules.
Microarrays are constrained by the presence of probes for studying genes and potential distortions due to cross-hybridization, particularly when using the default CDF. These limitations are addressed by RNA-Seq, which is progressively replacing microarrays, as the amount of publicly available RNA-Seq data for A. thaliana now surpasses that of microarray data. While ACT2.6 does not include RNA-Seq data in its gene expression analysis, limiting the number of genes that can be studied, it uses the constantly updated Brainarray CDF, which incorporates the latest knowledge of the human genome and transcriptome and ensures that each probe set corresponds to one gene, and vice versa. In addition, even though RNA-Seq offers greater sensitivity, gene expression values between microarrays and RNA-Seq are largely comparable, especially in genes of average expression levels [97], while GCNs produced by microarrays and RNA-Seq produce similar coexpression values and enrichments [98,99]. Finally, RNA-Seq has not yet fully replaced microarrays, as the optimal normalization method for RNA-Seq-based gene coexpression analysis remains under debate, whereas microarray normalization algorithms have been extensively refined over time.

Author Contributions

Conceptualization, I.M.; methodology, V.L.Z., K.P., A.M. and I.M.; software, V.L.Z., K.P., A.M. and I.M.; validation, V.L.Z. and K.P.; formal analysis, V.L.Z., K.P. and I.M.; investigation, V.L.Z., K.P. and I.M.; resources, V.L.Z., A.M. and I.M.; data curation, V.L.Z. and K.P.; writing—original draft preparation, V.L.Z., K.P. and I.M.; writing—review and editing, V.L.Z. and I.M.; visualization, V.L.Z. and K.P.; supervision, V.A.I. and I.M.; project administration, I.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The list of microarray samples used in this study is available in https://www.michalopoulos.net/act2.6/sample_table.php (accessed on 8 January 2025).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Zogopoulos, V.L.; Saxami, G.; Malatras, A.; Papadopoulos, K.; Tsotra, I.; Iconomidou, V.A.; Michalopoulos, I. Approaches in Gene Coexpression Analysis in Eukaryotes. Biology 2022, 11, 1019. [Google Scholar] [CrossRef] [PubMed]
  2. Petereit, J.; Smith, S.; Harris, F.C., Jr.; Schlauch, K.A. petal: Co-expression network modelling in R. BMC Syst. Biol. 2016, 10 (Suppl. S2), 51. [Google Scholar] [CrossRef] [PubMed]
  3. Usadel, B.; Obayashi, T.; Mutwil, M.; Giorgi, F.M.; Bassel, G.W.; Tanimoto, M.; Chow, A.; Steinhauser, D.; Persson, S.; Provart, N.J. Co-expression tools for plant biology: Opportunities for hypothesis generation and caveats. Plant Cell Environ. 2009, 32, 1633–1651. [Google Scholar] [CrossRef] [PubMed]
  4. Zhang, B.; Horvath, S. A general framework for weighted gene co-expression network analysis. Stat. Appl. Genet. Mol. Biol. 2005, 4, 17. [Google Scholar] [CrossRef]
  5. Koutrouli, M.; Karatzas, E.; Paez-Espino, D.; Pavlopoulos, G.A. A Guide to Conquer the Biological Network Era Using Graph Theory. Front. Bioeng. Biotechnol. 2020, 8, 34. [Google Scholar] [CrossRef]
  6. Song, L.; Langfelder, P.; Horvath, S. Comparison of co-expression measures: Mutual information, correlation, and model based indices. BMC Bioinform. 2012, 13, 328. [Google Scholar] [CrossRef] [PubMed]
  7. Kramer, U. Planting molecular functions in an ecological context with Arabidopsis thaliana. Elife 2015, 4, e06100. [Google Scholar] [CrossRef] [PubMed]
  8. Koornneef, M.; Meinke, D. The development of Arabidopsis as a model plant. Plant J. 2010, 61, 909–921. [Google Scholar] [CrossRef]
  9. Zhang, T.Q.; Chen, Y.; Wang, J.W. A single-cell analysis of the Arabidopsis vegetative shoot apex. Dev. Cell 2021, 56, 1056–1074. [Google Scholar] [CrossRef]
  10. Smyth, D.R.; Bowman, J.L.; Meyerowitz, E.M. Early flower development in Arabidopsis. Plant Cell 1990, 2, 755–767. [Google Scholar] [CrossRef] [PubMed]
  11. Yamaoka, S.; Nakajima, M.; Fujimoto, M.; Tsutsumi, N. MIRO1 influences the morphology and intracellular distribution of mitochondria during embryonic cell division in Arabidopsis. Plant Cell Rep. 2011, 30, 239–244. [Google Scholar] [CrossRef] [PubMed]
  12. Becker, J.D.; Boavida, L.C.; Carneiro, J.; Haury, M.; Feijo, J.A. Transcriptional profiling of Arabidopsis tissues reveals the unique characteristics of the pollen transcriptome. Plant Physiol. 2003, 133, 713–725. [Google Scholar] [CrossRef]
  13. Obayashi, T.; Hibara, H.; Kagaya, Y.; Aoki, Y.; Kinoshita, K. ATTED-II v11: A Plant Gene Coexpression Database Using a Sample Balancing Technique by Subagging of Principal Components. Plant Cell Physiol. 2022, 63, 869–881. [Google Scholar] [CrossRef]
  14. Tseng, K.C.; Li, G.Z.; Hung, Y.C.; Chow, C.N.; Wu, N.Y.; Chien, Y.Y.; Zheng, H.Q.; Lee, T.Y.; Kuo, P.L.; Chang, S.B.; et al. EXPath 2.0: An Updated Database for Integrating High-Throughput Gene Expression Data with Biological Pathways. Plant Cell Physiol. 2020, 61, 1818–1827. [Google Scholar] [CrossRef] [PubMed]
  15. Zogopoulos, V.L.; Saxami, G.; Malatras, A.; Angelopoulou, A.; Jen, C.H.; Duddy, W.J.; Daras, G.; Hatzopoulos, P.; Westhead, D.R.; Michalopoulos, I. Arabidopsis Coexpression Tool: A tool for gene coexpression analysis in Arabidopsis thaliana. iScience 2021, 24, 102848. [Google Scholar] [CrossRef] [PubMed]
  16. Lee, T.; Yang, S.; Kim, E.; Ko, Y.; Hwang, S.; Shin, J.; Shim, J.E.; Shim, H.; Kim, H.; Kim, C.; et al. AraNet v2: An improved database of co-functional gene networks for the study of Arabidopsis thaliana and 27 other nonmodel plant species. Nucleic Acids Res. 2015, 43, D996–D1002. [Google Scholar] [CrossRef]
  17. De Bodt, S.; Hollunder, J.; Nelissen, H.; Meulemeester, N.; Inze, D. CORNET 2.0: Integrating plant coexpression, protein-protein interactions, regulatory interactions, gene associations and functional annotations. New Phytol. 2012, 195, 707–720. [Google Scholar] [CrossRef] [PubMed]
  18. Franz, M.; Rodriguez, H.; Lopes, C.; Zuberi, K.; Montojo, J.; Bader, G.D.; Morris, Q. GeneMANIA update 2018. Nucleic Acids Res. 2018, 46, W60–W64. [Google Scholar] [CrossRef]
  19. Toufighi, K.; Brady, S.M.; Austin, R.; Ly, E.; Provart, N.J. The Botany Array Resource: E-Northerns, Expression Angling, and promoter analyses. Plant J. 2005, 43, 153–163. [Google Scholar] [CrossRef] [PubMed]
  20. Proost, S.; Mutwil, M. CoNekT: An open-source framework for comparative genomic and transcriptomic network analyses. Nucleic Acids Res. 2018, 46, W133–W140. [Google Scholar] [CrossRef] [PubMed]
  21. Lee, J.; Shah, M.; Ballouz, S.; Crow, M.; Gillis, J. CoCoCoNet: Conserved and comparative co-expression across a diverse set of species. Nucleic Acids Res. 2020, 48, W566–W571. [Google Scholar] [CrossRef] [PubMed]
  22. Langfelder, P.; Horvath, S. WGCNA: An R package for weighted correlation network analysis. BMC Bioinform. 2008, 9, 559. [Google Scholar] [CrossRef] [PubMed]
  23. Tzfadia, O.; Diels, T.; De Meyer, S.; Vandepoele, K.; Aharoni, A.; Van de Peer, Y. CoExpNetViz: Comparative Co-Expression Networks Construction and Visualization Tool. Front. Plant Sci. 2015, 6, 1194. [Google Scholar] [CrossRef] [PubMed]
  24. Vivian Li, W.; Li, Y. scLink: Inferring Sparse Gene Co-expression Networks from Single-cell Expression Data. Genom. Proteom. Bioinform. 2021, 19, 475–492. [Google Scholar] [CrossRef]
  25. Proost, S.; Krawczyk, A.; Mutwil, M. LSTrAP: Efficiently combining RNA sequencing data into co-expression networks. BMC Bioinform. 2017, 18, 444. [Google Scholar] [CrossRef]
  26. Langfelder, P.; Mischel, P.S.; Horvath, S. When is hub gene selection better than standard meta-analysis? PLoS ONE 2013, 8, e61505. [Google Scholar] [CrossRef]
  27. Mason, M.J.; Fan, G.; Plath, K.; Zhou, Q.; Horvath, S. Signed weighted gene co-expression network analysis of transcriptional regulation in murine embryonic stem cells. BMC Genom. 2009, 10, 327. [Google Scholar] [CrossRef] [PubMed]
  28. Levine, A.J.; Panos, S.E.; Horvath, S. Genetic, transcriptomic, and epigenetic studies of HIV-associated neurocognitive disorder. J. Acquir. Immune Defic. Syndr. 2014, 65, 481–503. [Google Scholar] [CrossRef] [PubMed]
  29. Farber, C.R.; Mesner, L.D. A Systems-Level Understanding of Cardiovascular Disease Through Networks; Elsevier Inc.: Amsterdam, The Netherlands, 2016; pp. 59–81. [Google Scholar]
  30. Xu, X.; Lu, X.; Tang, Z.; Zhang, X.; Lei, F.; Hou, L.; Li, M. Combined analysis of carotenoid metabolites and the transcriptome to reveal the molecular mechanism underlying fruit colouration in zucchini (Cucurbita pepo L.). Food Chem. Mol. Sci. 2021, 2, 100021. [Google Scholar] [CrossRef]
  31. Zogopoulos, V.L.; Malatras, A.; Michalopoulos, I. Gene coexpression analysis in Arabidopsis thaliana based on public microarray data. STAR Protoc. 2022, 3, 101208. [Google Scholar] [CrossRef] [PubMed]
  32. Barrett, T.; Wilhite, S.E.; Ledoux, P.; Evangelista, C.; Kim, I.F.; Tomashevsky, M.; Marshall, K.A.; Phillippy, K.H.; Sherman, P.M.; Holko, M.; et al. NCBI GEO: Archive for functional genomics data sets--update. Nucleic Acids Res. 2013, 41, D991–D995. [Google Scholar] [CrossRef] [PubMed]
  33. Kolesnikov, N.; Hastings, E.; Keays, M.; Melnichuk, O.; Tang, Y.A.; Williams, E.; Dylag, M.; Kurbatova, N.; Brandizi, M.; Burdett, T.; et al. ArrayExpress update--simplifying data submissions. Nucleic Acids Res. 2015, 43, D1113–D1116. [Google Scholar] [CrossRef]
  34. Craigon, D.J.; James, N.; Okyere, J.; Higgins, J.; Jotham, J.; May, S. NASCArrays: A repository for microarray data generated by NASC’s transcriptomics service. Nucleic Acids Res. 2004, 32, D575–D577. [Google Scholar] [CrossRef] [PubMed]
  35. Piccolo, S.R.; Sun, Y.; Campbell, J.D.; Lenburg, M.E.; Bild, A.H.; Johnson, W.E. A single-sample microarray normalization method to facilitate personalized-medicine workflows. Genomics 2012, 100, 337–344. [Google Scholar] [CrossRef]
  36. Dai, M.; Wang, P.; Boyd, A.D.; Kostov, G.; Athey, B.; Jones, E.G.; Bunney, W.E.; Myers, R.M.; Speed, T.P.; Akil, H.; et al. Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data. Nucleic Acids Res. 2005, 33, e175. [Google Scholar] [CrossRef] [PubMed]
  37. Pasha, A.; Subramaniam, S.; Cleary, A.; Chen, X.; Berardini, T.; Farmer, A.; Town, C.; Provart, N. Araport Lives: An Updated Framework for Arabidopsis Bioinformatics. Plant Cell 2020, 32, 2683–2686. [Google Scholar] [CrossRef]
  38. Gene Ontology, C.; Aleksander, S.A.; Balhoff, J.; Carbon, S.; Cherry, J.M.; Drabkin, H.J.; Ebert, D.; Feuermann, M.; Gaudet, P.; Harris, N.L.; et al. The Gene Ontology knowledgebase in 2023. Genetics 2023, 224, iyad031. [Google Scholar] [CrossRef]
  39. Cooper, L.; Elser, J.; Laporte, M.A.; Arnaud, E.; Jaiswal, P. Planteome 2024 Update: Reference Ontologies and Knowledgebase for Plant Biology. Nucleic Acids Res. 2024, 52, D1548–D1555. [Google Scholar] [CrossRef] [PubMed]
  40. Kanehisa, M.; Furumichi, M.; Sato, Y.; Matsuura, Y.; Ishiguro-Watanabe, M. KEGG: Biological systems database as a model of the real world. Nucleic Acids Res. 2024, 53, D672–D677. [Google Scholar] [CrossRef]
  41. Schlapfer, P.; Zhang, P.; Wang, C.; Kim, T.; Banf, M.; Chae, L.; Dreher, K.; Chavali, A.K.; Nilo-Poyanco, R.; Bernard, T.; et al. Genome-Wide Prediction of Metabolic Enzymes, Pathways, and Gene Clusters in Plants. Plant Physiol. 2017, 173, 2041–2059. [Google Scholar] [CrossRef] [PubMed]
  42. Agrawal, A.; Balci, H.; Hanspers, K.; Coort, S.L.; Martens, M.; Slenter, D.N.; Ehrhart, F.; Digles, D.; Waagmeester, A.; Wassink, I.; et al. WikiPathways 2024: Next generation pathway database. Nucleic Acids Res. 2024, 52, D679–D689. [Google Scholar] [CrossRef] [PubMed]
  43. Yilmaz, A.; Mejia-Guerra, M.K.; Kurz, K.; Liang, X.; Welch, L.; Grotewold, E. AGRIS: The Arabidopsis Gene Regulatory Information Server, an update. Nucleic Acids Res. 2011, 39, D1118–D1122. [Google Scholar] [CrossRef]
  44. O’Malley, R.C.; Huang, S.C.; Song, L.; Lewsey, M.G.; Bartlett, A.; Nery, J.R.; Galli, M.; Gallavotti, A.; Ecker, J.R. Cistrome and Epicistrome Features Shape the Regulatory DNA Landscape. Cell 2016, 165, 1280–1292. [Google Scholar] [CrossRef] [PubMed]
  45. Paysan-Lafosse, T.; Andreeva, A.; Blum, M.; Chuguransky, S.R.; Grego, T.; Pinto, B.L.; Salazar, G.A.; Bileschi, M.L.; Llinares-Lopez, F.; Meng-Papaxanthos, L.; et al. The Pfam protein families database: Embracing AI/ML. Nucleic Acids Res. 2024, 53, D523–D534. [Google Scholar] [CrossRef]
  46. Archie, J.; Day, H.E.W.; Felsenstein, J.; Maddison, W.; Meacham, C.; Rohlf, F.J.; Swofford, D. The Newick Tree Format. Available online: http://evolution.genetics.washington.edu/phylip/newicktree.html (accessed on 14 November 2022).
  47. Langfelder, P.; Zhang, B.; Horvath, S. Defining clusters from a hierarchical cluster tree: The Dynamic Tree Cut package for R. Bioinformatics 2008, 24, 719–720. [Google Scholar] [CrossRef] [PubMed]
  48. Forbes, C.; Evans, M.; Hastings, N.; Peacock, B. Statistical Distributions, 4th ed.; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2011. [Google Scholar]
  49. Benjamini, Y.; Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. Royal Stat. Soc. Ser. B 1995, 57, 289–300. [Google Scholar] [CrossRef]
  50. Liao, Y.; Wang, J.; Jaehnig, E.J.; Shi, Z.; Zhang, B. WebGestalt 2019: Gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Res. 2019, 47, W199–W205. [Google Scholar] [CrossRef]
  51. Szklarczyk, D.; Nastou, K.; Koutrouli, M.; Kirsch, R.; Mehryary, F.; Hachilif, R.; Hu, D.; Peluso, M.E.; Huang, Q.; Fang, T.; et al. The STRING database in 2025: Protein networks with directionality of regulation. Nucleic Acids Res. 2024, 53, D730–D737. [Google Scholar] [CrossRef] [PubMed]
  52. Raudvere, U.; Kolberg, L.; Kuzmin, I.; Arak, T.; Adler, P.; Peterson, H.; Vilo, J. g:Profiler: A web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Res. 2019, 47, W191–W198. [Google Scholar] [CrossRef] [PubMed]
  53. Karatzas, E.; Baltoumas, F.A.; Aplakidou, E.; Kontou, P.I.; Stathopoulos, P.; Stefanis, L.; Bagos, P.G.; Pavlopoulos, G.A. Flame (v2.0): Advanced integration and interpretation of functional enrichment results from multiple sources. Bioinformatics 2023, 39, btad490. [Google Scholar] [CrossRef] [PubMed]
  54. Papadopoulos, K. Construction of a Weighted Gene Co-Expression Network from Transcriptomic Data. M.Sc. Dissertation, National and Kapodistrian University of Athens, Athens, Greece, 2024. [Google Scholar]
  55. Mandaokar, A.; Thines, B.; Shin, B.; Lange, B.M.; Choi, G.; Koo, Y.J.; Yoo, Y.J.; Choi, Y.D.; Choi, G.; Browse, J. Transcriptional regulators of stamen development in Arabidopsis identified by transcriptional profiling. Plant J. 2006, 46, 984–1008. [Google Scholar] [CrossRef]
  56. Malik Ghulam, M.; Courtois, F.; Lerbs-Mache, S.; Merendino, L. Complex processing patterns of mRNAs of the large ATP synthase operon in Arabidopsis chloroplasts. PLoS ONE 2013, 8, e78265. [Google Scholar] [CrossRef]
  57. Yu, H.J.; Hogan, P.; Sundaresan, V. Analysis of the female gametophyte transcriptome of Arabidopsis by comparative expression profiling. Plant Physiol. 2005, 139, 1853–1869. [Google Scholar] [CrossRef] [PubMed]
  58. Hu, W.; Wang, Y.; Bowers, C.; Ma, H. Isolation, sequence analysis, and expression studies of florally expressed cDNAs in Arabidopsis. Plant Mol. Biol. 2003, 53, 545–563. [Google Scholar] [CrossRef] [PubMed]
  59. Krolikowski, K.A.; Victor, J.L.; Wagler, T.N.; Lolle, S.J.; Pruitt, R.E. Isolation and characterization of the Arabidopsis organ fusion gene HOTHEAD. Plant J. 2003, 35, 501–511. [Google Scholar] [CrossRef] [PubMed]
  60. Queitsch, C.; Hong, S.W.; Vierling, E.; Lindquist, S. Heat shock protein 101 plays a crucial role in thermotolerance in Arabidopsis. Plant Cell 2000, 12, 479–492. [Google Scholar] [CrossRef] [PubMed]
  61. Chen, Z.; Hartmann, H.A.; Wu, M.J.; Friedman, E.J.; Chen, J.G.; Pulley, M.; Schulze-Lefert, P.; Panstruga, R.; Jones, A.M. Expression analysis of the AtMLO gene family encoding plant-specific seven-transmembrane domain proteins. Plant Mol. Biol. 2006, 60, 583–597. [Google Scholar] [CrossRef]
  62. Lee, J.; He, K.; Stolc, V.; Lee, H.; Figueroa, P.; Gao, Y.; Tongprasit, W.; Zhao, H.; Lee, I.; Deng, X.W. Analysis of transcription factor HY5 genomic binding sites revealed its hierarchical role in light regulation of development. Plant Cell 2007, 19, 731–749. [Google Scholar] [CrossRef] [PubMed]
  63. Brown, D.M.; Zeef, L.A.; Ellis, J.; Goodacre, R.; Turner, S.R. Identification of novel genes in Arabidopsis involved in secondary cell wall formation using expression profiling and reverse genetics. Plant Cell 2005, 17, 2281–2295. [Google Scholar] [CrossRef]
  64. Jasinski, S.; Lecureuil, A.; Miquel, M.; Loudet, O.; Raffaele, S.; Froissard, M.; Guerche, P. Natural variation in seed very long chain fatty acid content is controlled by a new isoform of KCS18 in Arabidopsis thaliana. PLoS ONE 2012, 7, e49261. [Google Scholar] [CrossRef]
  65. Li, M.; Lee, K.P.; Liu, T.; Dogra, V.; Duan, J.; Li, M.; Xing, W.; Kim, C. Antagonistic modules regulate photosynthesis-associated nuclear genes via GOLDEN2-LIKE transcription factors. Plant Physiol. 2022, 188, 2308–2324. [Google Scholar] [CrossRef]
  66. Wellmer, F.; Riechmann, J.L.; Alves-Ferreira, M.; Meyerowitz, E.M. Genome-wide analysis of spatial gene expression in Arabidopsis flowers. Plant Cell 2004, 16, 1314–1326. [Google Scholar] [CrossRef] [PubMed]
  67. Candat, A.; Paszkiewicz, G.; Neveu, M.; Gautier, R.; Logan, D.C.; Avelange-Macherel, M.H.; Macherel, D. The ubiquitous distribution of late embryogenesis abundant proteins across cell compartments in Arabidopsis offers tailored protection against abiotic stress. Plant Cell 2014, 26, 3148–3166. [Google Scholar] [CrossRef]
  68. Perea-Resa, C.; Hernandez-Verdeja, T.; Lopez-Cobollo, R.; del Mar Castellano, M.; Salinas, J. LSM proteins provide accurate splicing and decay of selected transcripts to ensure normal Arabidopsis development. Plant Cell 2012, 24, 4930–4947. [Google Scholar] [CrossRef]
  69. Takac, T.; Samajova, O.; Vadovic, P.; Pechan, T.; Kosutova, P.; Ovecka, M.; Husickova, A.; Komis, G.; Samaj, J. Proteomic and biochemical analyses show a functional network of proteins involved in antioxidant defense of the Arabidopsis anp2anp3 double mutant. J. Proteome Res. 2014, 13, 5347–5361. [Google Scholar] [CrossRef]
  70. Zhao, L.N.; Shen, L.K.; Zhang, W.Z.; Zhang, W.; Wang, Y.; Wu, W.H. Ca2+-dependent protein kinase11 and 24 modulate the activity of the inward rectifying K+ channels in Arabidopsis pollen tubes. Plant Cell 2013, 25, 649–661. [Google Scholar] [CrossRef] [PubMed]
  71. Ascencio-Ibanez, J.T.; Sozzani, R.; Lee, T.J.; Chu, T.M.; Wolfinger, R.D.; Cella, R.; Hanley-Bowdoin, L. Global analysis of Arabidopsis gene expression uncovers a complex array of changes impacting pathogen response and cell cycle during geminivirus infection. Plant Physiol. 2008, 148, 436–454. [Google Scholar] [CrossRef] [PubMed]
  72. Ishihara, S.; Takabayashi, A.; Ido, K.; Endo, T.; Ifuku, K.; Sato, F. Distinct functions for the two PsbP-like proteins PPL1 and PPL2 in the chloroplast thylakoid lumen of Arabidopsis. Plant Physiol. 2007, 145, 668–679. [Google Scholar] [CrossRef] [PubMed]
  73. Tholl, D.; Chen, F.; Petri, J.; Gershenzon, J.; Pichersky, E. Two sesquiterpene synthases are responsible for the complex mixture of sesquiterpenes emitted from Arabidopsis flowers. Plant J. 2005, 42, 757–771. [Google Scholar] [CrossRef]
  74. Ginglinger, J.F.; Boachon, B.; Hofer, R.; Paetz, C.; Kollner, T.G.; Miesch, L.; Lugan, R.; Baltenweck, R.; Mutterer, J.; Ullmann, P.; et al. Gene coexpression analysis reveals complex metabolism of the monoterpene alcohol linalool in Arabidopsis flowers. Plant Cell 2013, 25, 4640–4657. [Google Scholar] [CrossRef] [PubMed]
  75. Welsch, R.; Maass, D.; Voegel, T.; Dellapenna, D.; Beyer, P. Transcription factor RAP2.2 and its interacting partner SINAT2: Stable elements in the carotenogenesis of Arabidopsis leaves. Plant Physiol. 2007, 145, 1073–1085. [Google Scholar] [CrossRef] [PubMed]
  76. Armbruster, U.; Zuhlke, J.; Rengstl, B.; Kreller, R.; Makarenko, E.; Ruhle, T.; Schunemann, D.; Jahns, P.; Weisshaar, B.; Nickelsen, J.; et al. The Arabidopsis thylakoid protein PAM68 is required for efficient D1 biogenesis and photosystem II assembly. Plant Cell 2010, 22, 3439–3460. [Google Scholar] [CrossRef]
  77. Lasserre, E.; Jobet, E.; Llauro, C.; Delseny, M. AtERF38 (At2g35700), an AP2/ERF family transcription factor gene from Arabidopsis thaliana, is expressed in specific cell types of roots, stems and seeds that undergo suberization. Plant Physiol. Biochem. 2008, 46, 1051–1061. [Google Scholar] [CrossRef]
  78. Hundertmark, M.; Hincha, D.K. LEA (late embryogenesis abundant) proteins and their encoding genes in Arabidopsis thaliana. BMC Genom. 2008, 9, 118. [Google Scholar] [CrossRef]
  79. Becerra, C.; Puigdomenech, P.; Vicient, C.M. Computational and experimental analysis identifies Arabidopsis genes specifically expressed during early seed development. BMC Genom. 2006, 7, 38. [Google Scholar] [CrossRef] [PubMed]
  80. Tran, R.K.; Henikoff, J.G.; Zilberman, D.; Ditt, R.F.; Jacobsen, S.E.; Henikoff, S. DNA methylation profiling identifies CG methylation clusters in Arabidopsis genes. Curr. Biol. 2005, 15, 154–159. [Google Scholar] [CrossRef]
  81. Ma, S.; Bohnert, H.J. Integration of Arabidopsis thaliana stress-related transcript profiles, promoter structures, and cell-specific expression. Genome Biol. 2007, 8, R49. [Google Scholar] [CrossRef] [PubMed]
  82. Libault, M.; Wan, J.; Czechowski, T.; Udvardi, M.; Stacey, G. Identification of 118 Arabidopsis transcription factor and 30 ubiquitin-ligase genes responding to chitin, a plant-defense elicitor. Mol. Plant Microbe Interact. 2007, 20, 900–911. [Google Scholar] [CrossRef] [PubMed]
  83. Zybailov, B.; Rutschow, H.; Friso, G.; Rudella, A.; Emanuelsson, O.; Sun, Q.; van Wijk, K.J. Sorting signals, N-terminal modifications and abundance of the chloroplast proteome. PLoS ONE 2008, 3, e1994. [Google Scholar] [CrossRef] [PubMed]
  84. Albrecht, V.; Ritz, O.; Linder, S.; Harter, K.; Kudla, J. The NAF domain defines a novel protein-protein interaction module conserved in Ca2+-regulated kinases. EMBO J. 2001, 20, 1051–1063. [Google Scholar] [CrossRef] [PubMed]
  85. Kubis, S.; Baldwin, A.; Patel, R.; Razzaq, A.; Dupree, P.; Lilley, K.; Kurth, J.; Leister, D.; Jarvis, P. The Arabidopsis ppi1 mutant is specifically defective in the expression, chloroplast import, and accumulation of photosynthetic proteins. Plant Cell 2003, 15, 1859–1871. [Google Scholar] [CrossRef] [PubMed]
  86. Daras, G.; Rigas, S.; Penning, B.; Milioni, D.; McCann, M.C.; Carpita, N.C.; Fasseas, C.; Hatzopoulos, P. The thanatos mutation in Arabidopsis thaliana cellulose synthase 3 (AtCesA3) has a dominant-negative effect on cellulose synthesis and plant growth. New Phytol. 2009, 184, 114–126. [Google Scholar] [CrossRef] [PubMed]
  87. Klodmann, J.; Braun, H.P. Proteomic approach to characterize mitochondrial complex I from plants. Phytochemistry 2011, 72, 1071–1080. [Google Scholar] [CrossRef] [PubMed]
  88. Gala, H.P.; Lanctot, A.; Jean-Baptiste, K.; Guiziou, S.; Chu, J.C.; Zemke, J.E.; George, W.; Queitsch, C.; Cuperus, J.T.; Nemhauser, J.L. A single-cell view of the transcriptome during lateral root initiation in Arabidopsis thaliana. Plant Cell 2021, 33, 2197–2220. [Google Scholar] [CrossRef] [PubMed]
  89. Meier, S.; Bastian, R.; Donaldson, L.; Murray, S.; Bajic, V.; Gehring, C. Co-expression and promoter content analyses assign a role in biotic and abiotic stress responses to plant natriuretic peptides. BMC Plant Biol. 2008, 8, 24. [Google Scholar] [CrossRef] [PubMed]
  90. Shi, D.; Jouannet, V.; Agusti, J.; Kaul, V.; Levitsky, V.; Sanchez, P.; Mironova, V.V.; Greb, T. Tissue-specific transcriptome profiling of the Arabidopsis inflorescence stem reveals local cellular signatures. Plant Cell 2021, 33, 200–223. [Google Scholar] [CrossRef]
  91. Ryan, P.T.; O’Maoileidigh, D.S.; Drost, H.G.; Kwasniewska, K.; Gabel, A.; Grosse, I.; Graciet, E.; Quint, M.; Wellmer, F. Patterns of gene expression during Arabidopsis flower development from the time of initiation to maturation. BMC Genom. 2015, 16, 488. [Google Scholar] [CrossRef]
  92. Battaglia, R.; Brambilla, V.; Colombo, L.; Stuitje, A.R.; Kater, M.M. Functional analysis of MADS-box genes controlling ovule development in Arabidopsis using the ethanol-inducible alc gene-expression system. Mech. Dev. 2006, 123, 267–276. [Google Scholar] [CrossRef] [PubMed]
  93. Kao, P.; Schon, M.A.; Mosiolek, M.; Enugutti, B.; Nodine, M.D. Gene expression variation in Arabidopsis embryos at single-nucleus resolution. Development 2021, 148, dev199589. [Google Scholar] [CrossRef] [PubMed]
  94. Yamamoto, A.; Kagaya, Y.; Toyoshima, R.; Kagaya, M.; Takeda, S.; Hattori, T. Arabidopsis NF-YB subunits LEC1 and LEC1-LIKE activate transcription by interacting with seed-specific ABRE-binding factors. Plant J. 2009, 58, 843–856. [Google Scholar] [CrossRef]
  95. Wang, Y.; Zhang, W.Z.; Song, L.F.; Zou, J.J.; Su, Z.; Wu, W.H. Transcriptome analyses show changes in gene expression to accompany pollen germination and tube growth in Arabidopsis. Plant Physiol. 2008, 148, 1201–1211. [Google Scholar] [CrossRef] [PubMed]
  96. Bruex, A.; Kainkaryam, R.M.; Wieckowski, Y.; Kang, Y.H.; Bernhardt, C.; Xia, Y.; Zheng, X.; Wang, J.Y.; Lee, M.M.; Benfey, P.; et al. A gene regulatory network for root epidermis cell differentiation in Arabidopsis. PLoS Genet. 2012, 8, e1002446. [Google Scholar] [CrossRef] [PubMed]
  97. Chen, L.; Sun, F.; Yang, X.; Jin, Y.; Shi, M.; Wang, L.; Shi, Y.; Zhan, C.; Wang, Q. Correlation between RNA-Seq and microarrays results using TCGA data. Gene 2017, 628, 200–204. [Google Scholar] [CrossRef] [PubMed]
  98. Malatras, A.; Michalopoulos, I.; Duguez, S.; Butler-Browne, G.; Spuler, S.; Duddy, W.J. MyoMiner: Explore gene co-expression in normal and pathological muscle. BMC Med. Genom. 2020, 13, 67. [Google Scholar] [CrossRef]
  99. Obayashi, T.; Aoki, Y.; Tadaka, S.; Kagaya, Y.; Kinoshita, K. ATTED-II in 2018: A Plant Coexpression Database Based on Investigation of the Statistical Property of the Mutual Rank Index. Plant Cell Physiol. 2018, 59, e3. [Google Scholar] [CrossRef] [PubMed]
Figure 1. WGCNA-constructed gene coexpression tree and generated modules: (a) Gene coexpression tree created using the TOM-based distances through average linkage. Each leaf corresponds to a different gene. (b) Original 42 modules produced by dynamicTreeCut (top row) and 28 modules produced after the merging of the original modules (bottom row). Each module is depicted with a different color.
Figure 1. WGCNA-constructed gene coexpression tree and generated modules: (a) Gene coexpression tree created using the TOM-based distances through average linkage. Each leaf corresponds to a different gene. (b) Original 42 modules produced by dynamicTreeCut (top row) and 28 modules produced after the merging of the original modules (bottom row). Each module is depicted with a different color.
Genes 16 00258 g001
Figure 2. Heatmap depicting the Pearson correlation coefficient-based associations between merged gene modules and tissue traits.
Figure 2. Heatmap depicting the Pearson correlation coefficient-based associations between merged gene modules and tissue traits.
Genes 16 00258 g002
Figure 3. ACT2.6 MYB21 11 internal node coexpression clade. The driver gene (MYB21) is highlighted in yellow.
Figure 3. ACT2.6 MYB21 11 internal node coexpression clade. The driver gene (MYB21) is highlighted in yellow.
Genes 16 00258 g003
Table 1. The 27 merged functional modules determined by WGCNA. The top enriched GOBP term of each module and its corresponding FDR-adjusted p-value using ACT2.6 built-in enrichment analysis are displayed in each case. Underlined modules are presented in the Results section. The top overexpressed tissues and hub genes of each module are displayed. The last two columns signify whether the hub gene is described by its module’s top enriched term and whether the overexpression of the hub gene is bibliographically supported in the module’s top overexpressed tissue.
Table 1. The 27 merged functional modules determined by WGCNA. The top enriched GOBP term of each module and its corresponding FDR-adjusted p-value using ACT2.6 built-in enrichment analysis are displayed in each case. Underlined modules are presented in the Results section. The top overexpressed tissues and hub genes of each module are displayed. The last two columns signify whether the hub gene is described by its module’s top enriched term and whether the overexpression of the hub gene is bibliographically supported in the module’s top overexpressed tissue.
ModuleNumber of GenesTop Enriched GO Biological Process TermAdj. p-ValueTop Overexpressed TissueTop Hub GeneConcordance of Hub Gene and Top Enriched BP TermEvidence of Hub Gene Overexpression in Top Tissue
lightsteelblue127stamen filament development4.0 × 10−5FlowerMYB21Yes[55]
plum139N/AN/ARosette LeafAT5G41250N/AN/A
yellowgreen45photosynthesis6.6 × 10−21Rosette LeafATPHNo[56]
sienna347protein depalmitoylation/negative regulation of molecular function4.4 × 10−3SiliqueAT4G15750Yes[57]
darkmagenta48external encapsulating structure organization/cell wall organization2.2 × 10−2Flower Bud/Shoot ApexSBT1.3/HTHNo[58,59]
violet57response to heat2.5 × 10−42SeedHSP101Yes[60]
paleturquoise67N/AN/ARosette LeafCID13N/ANo
steelblue87callose deposition in phloem sieve plate4.4 × 10−6RootMLO13No[61]
saddlebrown92cell surface receptor signaling pathway6.1 × 10−3RootAT5G59930Yes[62]
white102plant-type secondary cell wall biogenesis8.4 × 10−38StemIRX3Yes[63]
darkorange107secondary metabolic process1.5 × 10−2SiliqueKCS18Yes[64]
orange116N/AN/AMicrosporocyteAT5G20370N/AN/A
darkred167defense response7.2 × 10−33LeafSIB1Yes[65]
lightgreen329pollen wall assembly/cellular component assembly involved in morphogenesis/extracellular matrix assembly4.6 × 10−15Flower BudEXL6Yes[66]
lightcyan356DNA metabolic process1.1 × 10−12FlowerAT5G02520N/ANo
midnightblue385seed development2.3 × 10−17SeedAT1G72100Yes[67]
royalblue401autophagy/protein localization9.3 × 10−6RootLSM3BNo[68]
greenyellow702signaling/signal transduction3.5 × 10−2LeafAT1G32040N/AN/A
magenta832cell cycle8.1 × 10−59RootNP3No[69]
darkolivegreen836translational elongation1.0 × 10−112RootAT3G60770YesNo
pink836pollen tube growth1.1 × 10−29PollenCPK24Yes[12,70]
cyan1079response to chitin5.7 × 10−54LeafAT5G18490N/A[71]
darkgreen1930secondary metabolic process4.8 × 10−14RootAT4G28890YesNo
purple1933vesicle-mediated transport5.3 × 10−29RootAT4G24820NoNo
blue3249photosynthesis6.2 × 10−86LeafAT1G76450Yes[72]
turquoise3593regulation of transcription, DNA-templated/regulation of RNA biosynthetic process4.9 × 10−15Rosette LeafAT3G58390YesNo
black3809RNA processing1.7 × 10−70SeedMIRO1NoNo
N/A: not applicable. Underlines denote the specific modules that are presented in the following sections in results.
Table 2. Lightsteelblue1 module enriched biological terms.
Table 2. Lightsteelblue1 module enriched biological terms.
Categoryp-ValueTerm IDDescription
Gene Ontology: Biological Process4.0 × 10−5GO:0080086stamen filament development
1.3 × 10−4GO:0046246terpene biosynthetic process
2.7 × 10−4GO:0042214terpene metabolic process
3.8 × 10−4GO:0071836nectar secretion
Gene Ontology: Molecular Function1.6 × 10−6GO:0010333terpene synthase activity
1.6 × 10−6GO:0016838carbon–oxygen lyase activity, acting on phosphates
1.2 × 10−4GO:0050551myrcene synthase activity
Plant Ontology: Plant Anatomy3.3 × 10−8PO:0009056flower nectary
3.3 × 10−8PO:0009035Nectary
1.0 × 10−7PO:0005656portion of secretory tissue
KEGG9.8 × 10−8ath00902monoterpenoid biosynthesis
AraCyc2.3 × 10−5PWY-3041monoterpene biosynthesis
Pfam1.3 × 10−7Terpene_synth_Cterpene synthase family, metal binding domain
1.3 × 10−7Terpene_synthterpene synthase, N-terminal domain
Table 3. Yellowgreen module enriched biological terms.
Table 3. Yellowgreen module enriched biological terms.
Categoryp-ValueTerm IDDescription
Gene Ontology: Biological Process6.6 × 10−21GO:0015979photosynthesis
4.3 × 10−17GO:0019684photosynthesis, light reaction
4.0 × 10−14GO:0006091generation of precursor metabolites and energy
9.0 × 10−12GO:0022900electron transport chain
Gene Ontology: Molecular Function2.3 × 10−8GO:0048038quinone binding
4.5 × 10−8GO:0008137NADH dehydrogenase (ubiquinone) activity
4.5 × 10−8GO:0003735structural constituent of ribosome
6.4 × 10−8GO:0045156electron transporter, transferring electrons within the cyclic electron transport pathway of photosynthesis activity
Gene Ontology: Cellular Component1.0 × 10−37GO:0044435plastid part
8.1 × 10−33GO:0009507chloroplast
5.3 × 10−31GO:0009534chloroplast thylakoid
7.0 × 10−29GO:0009535chloroplast thylakoid membrane
Plant Ontology: Plant Anatomy3.7 × 10−8PO:0020030cotyledon
3.7 × 10−8PO:0025099embryo plant structure
3.7 × 10−8PO:0025233portion of embryo plant tissue
Plant Ontology: Plant Structure Development Stage1.6 × 10−7PO:0007095LP.08 eight leaves visible stage
3.1 × 10−7PO:0001050leaf development stage
AraCyc1.3 × 10−8PWY-101photosynthesis light reactions
Pfam1.3 × 10−5Photo_RCphotosynthetic reaction center protein
1.3 × 10−5PSIIphotosystem II protein
Table 4. White module enriched biological terms.
Table 4. White module enriched biological terms.
Categoryp-ValueTerm IDDescription
Gene Ontology: Biological Process8.4 × 10−38GO:0009834plant-type secondary cell wall biogenesis
4.2 × 10−34GO:0042546cell wall biogenesis
1.9 × 10−32GO:0009832plant-type cell wall biogenesis
2.6 × 10−19GO:0045491xylan metabolic process
Gene Ontology: Molecular Function5.1 × 10−9GO:0052716hydroquinone:oxygen oxidoreductase activity
8.3 × 10−8GO:0016682oxidoreductase activity, acting on diphenols and related substances as donors, oxygen as acceptor
Gene Ontology: Cellular Component8.9 × 10−5GO:0000139Golgi membrane
Plant Ontology: Plant Anatomy7.0 × 10−25PO:0005352xylem
2.0 × 10−14PO:0005849primary xylem
2.3 × 10−14PO:0005598vascular cambium
4.9 × 10−13PO:0005848secondary xylem
Plant Ontology: Plant Structure Development Stage1.1 × 10−8PO:0001083inflorescence development stage
KEGG2.6 × 10−2ath00520amino sugar and nucleotide sugar metabolism
AraCyc2.9 × 10−4PWY-1001cellulose biosynthesis
AtRegNet1.1 × 10−17AT2G44730alcohol dehydrogenase transcription factor Myb/SANT-like family protein
5.3 × 10−16AT1G61730DNA-binding storekeeper protein-related transcriptional regulator
9.2 × 10−11AT2G21230basic-leucine zipper (bZIP) transcription factor family protein
Plant Cistrome Database2.0 × 10−2AT5G47660homeodomain-like superfamily protein
Pfam4.6 × 10−7Cu-oxidase_3multicopper oxidase
9.5 × 10−6Glyco_hydro_10glycosyl hydrolase family 10
Table 5. Midnightblue module enriched biological terms.
Table 5. Midnightblue module enriched biological terms.
Categoryp-ValueTerm IDDescription
Gene Ontology: Biological Process2.3 × 10−17GO:0048316seed development
7.4 × 10−12GO:0010431seed maturation
5.6 × 10−7GO:0010344seed oilbody biogenesis
Gene Ontology: Molecular Function5.7 × 10−10GO:0045735nutrient reservoir activity
Gene Ontology: Cellular Component3.2 × 10−11GO:0005811lipid droplet
Plant Ontology: Plant Anatomy2.0 × 10−2PO:0009089endosperm
Plant Ontology: Plant Structure Development Stage1.0 × 10−23PO:0007632seed maturation stage
KEGG3.1 × 10−4ath04075plant hormone signal transduction
AraCyc2.3 × 10−2PWY-5060luteolin biosynthesis
AtRegNet3.9 × 10−4AT5G23930mitochondrial transcription termination factor family protein
Plant Cistrome Database1.3 × 10−4AT5G07310integrase-type DNA-binding superfamily protein
Pfam3.7 × 10−8Oleosinoleosin
Table 6. Cyan module enriched biological terms.
Table 6. Cyan module enriched biological terms.
Categoryp-ValueTerm IDDescription
Gene Ontology: Biological Process5.7 × 10−54GO:0010200response to chitin
2.0 × 10−53GO:0010243response to organonitrogen compound
2.0 × 10−53GO:0006952defense response
Gene Ontology: Molecular Function4.0 × 10−16GO:0004672protein kinase activity
Gene Ontology: Cellular Component9.1 × 10−16GO:0005886plasma membrane
5.7 × 10−14GO:0071944cell periphery
4.0 × 10−11GO:0016020membrane
1.2 × 10−4GO:0036452ESCRT complex
Plant Ontology: Plant Anatomy3.7 × 10−62PO:0002000stomatal complex
3.7 × 10−62PO:0000293guard cell
7.0 × 10−62PO:0025165shoot epidermal cell
Plant Ontology: Plant Structure Development Stage1.8 × 10−53PO:0007123LP.06 six leaves visible stage
KEGG1.5 × 10−16ath04626plant–pathogen interaction
AtRegNet1.8 × 10−47AT3G42860zinc knuckle (CCHC-type) family protein
1.5 × 10−23AT4G16150calmodulin-binding transcription activator 5
Plant Cistrome Database1.3 × 10−47AT3G42860zinc knuckle (CCHC-type) family protein
Pfam9.0 × 10−15AP2AP2 domain
2.2 × 10−13Pkinase_Tyrprotein tyrosine kinase
Table 7. Blue module enriched biological terms.
Table 7. Blue module enriched biological terms.
Categoryp-ValueTerm IDDescription
Gene Ontology: Biological Process6.2 × 10−86GO:0015979photosynthesis
Gene Ontology: Molecular Function4.8 × 10−11GO:0016491oxidoreductase activity
Gene Ontology: Cellular Component0.0 × 10+0GO:0009507chloroplast
Plant Ontology: Plant Anatomy0.0 × 10+0PO:0000013cauline leaf
Plant Ontology: Plant Structure Development Stage0.0 × 10+0PO:0001050leaf development stage
WikiPathways3.8 × 10−4WP2622_r85067starch metabolism
KEGG1.5 × 10−24ath00195photosynthesis
AraCyc1.5 × 10−12PWY-101photosynthesis light reactions
AtRegNet1.1 × 10−18AT1G71450integrase-type DNA-binding superfamily protein
Plant Cistrome Database2.2 × 10−18AT1G71450integrase-type DNA-binding superfamily protein
Pfam1.2 × 10−8Chloroa_b-bindchlorophyll A-B binding protein
Table 8. Comparison of the coexpression results of the gene use cases of ACT2.0 versus the new 2.6 version of the webtool. The adjP of the top enriched Gene Ontology Biological Process term of ACT2.0 is compared with the corresponding adjP of this term in ACT2.6.
Table 8. Comparison of the coexpression results of the gene use cases of ACT2.0 versus the new 2.6 version of the webtool. The adjP of the top enriched Gene Ontology Biological Process term of ACT2.0 is compared with the corresponding adjP of this term in ACT2.6.
Input GeneACT2.0 Coexpressed Gene NumberACT2.0 Tree Internal NodesACT2.6 Coexpressed Gene NumberACT2.6 Tree Internal NodesTop ACT2.0 Enriched GOBP TermACT2.0 Adj. p-ValueACT2.6 Adj. p-ValueCommon Coexpressed Genes
AT4G1317013561601translational elongation1.3 × 10−1751.0 × 10−174130
HSP1012682613response to heat1.5 × 10−372.3 × 10−3823
COR15A18341214261photosynthesis6.4 × 10−941.1 × 10−1211321
CEV1318664polysaccharide biosynthetic process1.2 × 10−144.7 × 10−310
CTL225182519plant-type secondary cell wall biogenesis5.5 × 10−276.4 × 10−2420
PSB28265142252photosynthesis1.0 × 10−987.9 × 10−99207
LHY249148rhythmic process3.2 × 10−119.8 × 10−812
PSBT7249951photosynthesis5.8 × 10−443.1 × 10−13412
AMS928911pollen exine formation5.4 × 10−182.9 × 10−1783
emb16926884217embryo development1.1 × 10−42.6 × 10−215
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zogopoulos, V.L.; Papadopoulos, K.; Malatras, A.; Iconomidou, V.A.; Michalopoulos, I. ACT2.6: Global Gene Coexpression Network in Arabidopsis thaliana Using WGCNA. Genes 2025, 16, 258. https://doi.org/10.3390/genes16030258

AMA Style

Zogopoulos VL, Papadopoulos K, Malatras A, Iconomidou VA, Michalopoulos I. ACT2.6: Global Gene Coexpression Network in Arabidopsis thaliana Using WGCNA. Genes. 2025; 16(3):258. https://doi.org/10.3390/genes16030258

Chicago/Turabian Style

Zogopoulos, Vasileios L., Konstantinos Papadopoulos, Apostolos Malatras, Vassiliki A. Iconomidou, and Ioannis Michalopoulos. 2025. "ACT2.6: Global Gene Coexpression Network in Arabidopsis thaliana Using WGCNA" Genes 16, no. 3: 258. https://doi.org/10.3390/genes16030258

APA Style

Zogopoulos, V. L., Papadopoulos, K., Malatras, A., Iconomidou, V. A., & Michalopoulos, I. (2025). ACT2.6: Global Gene Coexpression Network in Arabidopsis thaliana Using WGCNA. Genes, 16(3), 258. https://doi.org/10.3390/genes16030258

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop