Next Article in Journal
Ascochlorin Attenuates the Early Stage of Adipogenesis via the Wnt/β-Catenin Pathway and Inhibits High-Fat-Diet-Induced Obesity in Mice
Previous Article in Journal
Novel Insights into Diabetic Kidney Disease
Previous Article in Special Issue
Metabolomics-Assisted Breeding in Oil Palm: Potential and Current Perspectives
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bioinformatics Identification and Expression Analysis of Acetyl-CoA Carboxylase Reveal Its Role in Isoflavone Accumulation during Soybean Seed Development

Key Laboratory of Soybean Biology in Chinese Education Ministry, Northeast Agricultural University, Harbin 150030, China
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2024, 25(18), 10221; https://doi.org/10.3390/ijms251810221
Submission received: 7 August 2024 / Revised: 5 September 2024 / Accepted: 20 September 2024 / Published: 23 September 2024

Abstract

:
Isoflavones belong to the class of flavonoid compounds, which are important secondary metabolites that play a crucial role in plant development and defense. Acetyl-CoA carboxylase (ACCase) is a biotin-dependent enzyme that catalyzes the conversion of Acetyl-CoA into Malonyl-CoA in plants. It is a key enzyme in fatty acid synthesis and also catalyzes the production of various secondary metabolites. However, information on the ACC gene family in the soybean (Glycine max L. Merr.) genome and the specific members involved in isoflavone biosynthesis is still lacking. In this study, we identified 20 ACC family genes (GmACCs) from the soybean genome and further characterized their evolutionary relationships and expression patterns. Phylogenetic analysis showed that the GmACCs could be divided into five groups, and the gene structures within the same groups were highly conserved, indicating that they had similar functions. The GmACCs were randomly distributed across 12 chromosomes, and collinearity analysis suggested that many GmACCs originated from tandem and segmental duplications, with these genes being under purifying selection. In addition, gene expression pattern analysis indicated that there was functional divergence among GmACCs in different tissues. The GmACCs reached their peak expression levels during the early or middle stages of seed development. Based on the transcriptome and isoflavone content data, a weighted gene co-expression network was constructed, and three candidate genes (Glyma.06G105900, Glyma.13G363500, and Glyma.13G057400) that may positively regulate isoflavone content were identified. These results provide valuable information for the further functional characterization and application of GmACCs in isoflavone biosynthesis in soybean.

1. Introduction

Soybean (Glycine max L. Merr.), anciently known as “shu”, is an annual leguminous crop native to Asia with a cultivation history of thousands of years in China. As an important crop for both food and feed, soybeans play a significant role in agricultural production and the diet of residents [1]. Soybeans are a good source of bioactive components such as vegetable oil, protein, and isoflavones [2]. Isoflavones are significant secondary metabolites in the phenylpropanoid biosynthesis pathway [3], predominantly found in legumes, and are richly present in soybean seeds, constituting from 0.1% to 0.6% of the seed mass [4]. Soybean isoflavones include daidzein (DZ), genistein (GT), glycitein (GC), daidzin, genistin, glycitin, 6-o-acetyldaidzin, 6-o-acetylgenistin, 6-o-acetylglycitin, 6-o-malonyldaidzin, 6-o-malonylgenistin, and 6-o-malonylglycitin [5]. Among them, DZ, GT, and GC are the major bioactive components in human nutrition [6]. Soybeans are the only effective source of isoflavones for humans [7]. Isoflavones are precursors in the biosynthesis of phytoalexins [8] and play a crucial role in the defensive responses of plants to various biological and abiotic stresses [9]. Additionally, isoflavones possess biological activity and serve as natural plant estrogens, playing a significant role in healthcare, wellness, and beauty applications [10]. The pathway of isoflavone biosynthesis is relatively complex and is regulated by a variety of genes and metabolic networks [11]. This pathway primarily involves enzymes such as phenylalanine ammonia lyase (PAL), chalcone synthase (CHS), chalcone isomerase (CHI), and isoflavone synthase (IFS). Among them, chalcone synthase (CHS) and isoflavone synthase (IFS) are key enzymes in the production of isoflavones. To date, a large number of genes involved in the isoflavone biosynthetic pathway have been elucidated. According to reports, a total of 12 chalcone isomerase (CHI) family members [12,13], 21 chalcone synthase (CHS) family members [14], and 14 chalcone reductase (CHR) family members have been identified in soybeans [15]. Therefore, studying the different gene families in soybeans will contribute to our understanding of their evolution and functions.
Acetyl-CoA carboxylase (ACCase) is a biotin-dependent enzyme that catalyzes the conversion of Acetyl-CoA into Malonyl-CoA, playing a crucial role in the fatty acid synthesis pathway [16,17]. Acetyl-CoA carboxylase consists of the three following domains: biotin carboxylase (BC), carboxyl transferase (CT), and biotin carboxyl carrier protein (BCCP) [18]. In plants, ACCase is primarily classified into the two following types: homomeric and heteromeric. The heteromeric form of ACCase, also known as the prokaryotic type, is mainly found in the plastids of plants [19]. The heteromeric form of Acetyl-CoA carboxylase is a multi-subunit complex consisting of the four following subunits: BC, BCCP, α-CT, and β-CT [20]. In its active state, the BC and BCCP subunits form homodimers, while the α-CT and β-CT subunits exist as a heterodimer. The α-CT and β-CT subunits are connected by a covalent bond and, together, they constitute the CT catalytic domain [21]. The BC subunit, BCCP subunit, and α-CT subunit are encoded by the nuclear genes accC, accB, and accA, respectively, while the β-CT subunit is encoded by the chloroplast accD gene [22]. The BC, BCCP, and α-CT subunits possess N-terminal signal peptide sequences. Their protein precursors are synthesized in the cytoplasm and then transported into the chloroplasts. After the signal peptide is removed, the protein matures. This mature protein then associates with the β-CT subunit to form a functional, complete protein [23]. The heteromeric form of ACCase is structurally unstable and prone to dissociation. In contrast, the homomeric form of ACCase is a single polypeptide chain that forms a stable homodimer within cells, which is difficult to dissociate [24]. The homomeric form of ACCase, also known as the eukaryotic type, is primarily found in the cytoplasm of algae and higher plants [25]. In ACCase, the three functional domains of the homomeric form are homologous in sequence to the four subunits of the heteromeric form [26]. So far, ACC genes have been identified in various species, and the number of ACC genes varies greatly among different species. Generally, in most plants, the genes encoding the BCCP subunit of ACC are the most numerous [27]. In five Brassica species, 43 BCCP subunits were identified, and some of these genes may play important roles in their responses to biotic and abiotic stresses [28]. In four cotton varieties, 24 BCCP genes were identified. In each variety, the homologs exhibited relative conservation in their gene structure and motifs, and they were induced by cold or salt stress signals, showing specific expressions in different tissue locations [29].
In addition to being involved in de novo fatty acid synthesis, some studies have reported that ACCase is also important for secondary metabolism [30]. From a metabolic pathway perspective, ACCase catalyzes the conversion of Acetyl-CoA into Malonyl-CoA. Malonyl-CoA, serving as a necessary substrate in the flavonoid metabolic pathway, and is combined with p-coumaroyl CoA and catalyzed by chalcone synthase (CHS) and chalcone reductase (CHR), ultimately resulting in the production of isoflavone synthesis precursors such as naringenin and liquiritigenin [31]. Research has reported that ACCase can influence the synthesis of flavonoid compounds. In Arabidopsis thaliana, overexpression of the BCCP gene led to the suppression of plastidial ACCase and increased the production of cytoplasmic ACCase to promote the up-regulation of flavonoid biosynthesis genes (At1g53520, At3g51240, and At1g06000) [32]. In rapeseed leaves, the activity of homomeric ACCase was reduced by antisense RNA technology, which could decrease the accumulation of flavonoids under UV-B treatment [33]. In Escherichia coli, the co-overexpression of the four ACCase subunits led to an increase in the production of flavonoid compounds [34].
GmACCs in the soybean genome and their regulatory roles, especially in the flavonoid and isoflavone biosynthesis pathway, have not been widely studied. In the present study, we identified 20 GmACCs and constructed a comprehensive phylogenetic tree. The gene structures, conserved motifs, promoter cis-acting elements, collinearity, and expression patterns of the GmACCs were analyzed. To further elucidate the function of the GmACCs, we constructed a weighted gene co-expression network using transcriptomic and isoflavone content data from soybean seeds. These results provide a basic understanding of the roles and complexity of GmACCs in soybean and lay the foundation for understanding the breeding utilization of GmACCs in improving soybean isoflavone content and stress resistance.

2. Results

2.1. Identification of ACC Genes from Soybean Genome

A total of 17 ACC genes and 3 putative ACC genes were identified from the soybean genome, named GmACC1 to GmACC20 based on their chromosomal locations and subunit compositions (Table 1). The 20 GmACCs were randomly distributed across 12 chromosomes of the soybean genome. Six genes (30%) were distributed on chromosome 18, which had the largest number of GmACCs, while three genes (15%) were located on chromosome 15. The length of the soybean ACCase proteins ranged from 34 to 2260 amino acids and the molecular weight ranged from 3847.76 kDa to 252,370.45 kDa, with the predicted isoelectric point ranging from 4.51 to 10.30. Subcellular localization predictions suggested that GmACC1, GmACC2, and GmACC12 were primarily located in the cytoplasm, GmACC4 and GmACC15 were mainly localized in the nucleus, and GmACC3 was primarily located in the plasma membrane. Interestingly, the subcellular localization of the majority of the remaining ACCase proteins was primarily in the semi-autonomous organelles’ mitochondria and chloroplasts (Table 1 and Table S1).

2.2. Phylogenetic Analysis of GmACCs

To evaluate the evolutionary relationships among the ACCase proteins, we constructed a phylogenetic tree based on the amino acid sequences of twenty ACC genes from soybean, eight from Arabidopsis thaliana, five from rice, seven from maize, and twenty-nine from peanuts (File S1 and Table S2). In plants, ACCase is primarily composed of BC, CT, and BCCP. CT further comprises α-CT and β-CT subunits. Phylogenetic analysis revealed that the 20 GmACCs identified in soybean belonged five different groups, respectively (Group I–Group V). Group VI and VII did not cover any GmACCs. The GmACCs encoding the BC subunit proteins were predominantly found in Group I, genes encoding the BCCP subunit proteins mainly resided in Group III, genes encoding the β-CT subunit proteins were predominantly found in both Group II and Group IV, and genes encoding the α-CT subunit proteins were mainly found in Group V (Figure 1). Overall, ACCase genes from soybean, Arabidopsis thaliana, maize, rice, and peanuts were simultaneously identified in Group I. The GmACCs were highly similar to those in peanuts. Additionally, seven ACCase members from maize were only found in Group I, and five ACCase members from rice were only found in Group I and Group V, suggesting that the differentiation of ACC genes may happen between monocotyledon and dicotyledon in the process of evolution (Figure 1).

2.3. Conserved Motifs and Gene Structure Analysis of GmACCs

To explore the sequence structures of the soybean ACCase family, we predicted the conserved motifs using the MEME tool and also analyzed the intron/exon structure of each member of the GmACCs. The results showed that 15 highly conserved motifs were found in the GmACCs. Only motif 2 was present in all 20 GmACCs, while other motifs differed in type and number among the ACCase family members. Interestingly, the chloroplast genes GmACC13 and GmACC14, which encode the β-CT subunit, contained only one motif 2. There were differences in the conserved motifs among the members of the GmACCs, which may have also led to functional differences among the members of the gene family. Gene structure analysis revealed that almost all members of the GmACCs family contained a certain number of introns, except for GmACC13, GmACC14, and GmACC15, which lacked introns. Additionally, GmACC8 had no untranslated regions (UTRs) at the 5′ end and 3′ end (Figure 2A–C).

2.4. Cis-Acting Element Analysis of GmACC Gene Promoters

Due to the important role of cis-elements in gene expression regulation, we analyzed the cis-elements in the promoters of GmACCs using the Plant CARE online tool. The results showed that the cis-regulatory elements in the promoter region of the GmACC genes were involved in light response, various hormone responses, growth regulation, seed development, responses to abiotic stress, and secondary metabolism processes in addition to the fundamental TATA-box and CAAT-box (Figure 2D and Table S3). In the study, MYB binding sites were present in the promoter regions of 10 GmACC genes (GmACC3, GmACC7, GmACC8, GmACC11, GmACC13, GmACC15, GmACC17, GmACC18, GmACC19, and GmACC20). Some research reports that certain MYB binding sites can serve as gene regulatory elements involved in flavonoid biosynthesis [35,36]. It has been hypothesized that these genes might synthesize isoflavones by regulating the upstream substances of the isoflavone biosynthesis pathway. Methyl jasmonate (MeJA) and salicylic acid (SA) also affect isoflavone biosynthesis [37,38]. The study found that 11 GmACC genes possessed cis-acting regulatory elements involved in MeJA responsiveness, while 12 genes had cis-acting regulatory elements involved in salicylic acid responsiveness. Apart from GmACC5, GmACC9, and GmACC20, these cis-acting elements could be identified in the GmACCs.

2.5. Collinearity and Amino Acid Substitution Selection Pressure Analyses of GmACCs

We utilized soybean genome annotation information and TBtools to conduct a chromosomal distribution analysis of the GmACCs [39,40]. The results showed that the GmACCs were randomly distributed across 12 chromosomes (Figure 3A), with the majority of genes being located in regions with a high gene density (Figure 3B). Notably, chromosome 18 contained six GmACCs and chromosome 15 contained three GmACCs, with most genes being located at both ends of the soybean chromosomes (Figure 3A).
Tandem duplication and segmental duplication facilitate the expansion of new gene family members and the emergence of new functions in plant genome evolution [41,42]. In order to identify duplication events between the GmACCs, we conducted a collinearity analysis on the GmACCs using the Multiple Collinearity Scan toolkit (MCScanX) within TBtools [43]. We found eight pairs of segmental duplications, which comprised 13 genes (Figure 3B). In addition to these segmental duplications, we also identified three pairs of tandem duplications, which consisted of three genes located on chromosome 18, belonging to Group V (Figure 3A and Table 2). These genes were preserved through tandem duplication events during evolution, which might be related to specific biological functions in soybean. In total, there were 13 GmACCs resulting from segmental duplications, constituting 65% (13/20) of all GmACCs, and 3 GmACCs resulting from tandem duplications, accounting for 15% (3/20) of the total. The overall duplication rate of the GmACCs was 80%, higher than the average duplication rate of the soybean genome (75%) [44], indicating that both segmental and tandem duplication events may play important roles in the expansion of the GmACCs gene family.
To determine the selective evolutionary pressures on the GmACCs after duplication, the KaKs Calculator was used to compute the synonymous substitution rate (Ks) and nonsynonymous substitution rate (Ka) values for the duplicated GmACC pairs [45]. The Ka/Ks ratio for 11 pairs of GmACC duplication events was less than one (Table 2). Since the Ka/Ks ratio indicates selection on the gene, the duplicated GmACCs were under purifying selection.
Additionally, we investigated the collinearity relationships of the GmACCs with ACC genes from four representative species, including two dicots (Arabidopsis thaliana and peanut) and two monocots (rice and maize), to identify homologous genes (Figure 3C and Table S4). A total of 12 soybean genes exhibited collinearity with 7 Arabidopsis thaliana genes, and 14 soybean genes showed collinearity with 21 peanut genes. The number of homologous gene pairs between soybean and Arabidopsis thaliana was 14 pairs, and the number of homologous gene pairs between soybean and peanut was 44 pairs. However, no such gene pairs were identified between soybean and rice or between soybean and maize. This discrepancy may be attributed to the closer phylogenetic relationship among dicots compared to monocots. Notably, the collinearity between soybean genes and peanut genes was stronger than that observed with the other three species. This may be related to the fact that both soybean and peanut belong to legumes. Additionally, we found that a large number of GmACCs exhibited collinear relationships with three to four peanut genes, suggesting that these genes may play important roles in the evolution of the ACCase gene family (Table S4).

2.6. Expression Pattern Analysis of GmACCs

To investigate the expression patterns of the GmACCs in different tissues and explore the relationships between isoflavones and the GmACCs, we utilized the multi-omics database SoyMD online tool to predict the expression levels of the 20 GmACCs in various tissues during soybean seed development [46]. The results revealed that GmACC8 and GmACC15 exhibited relatively low overall expression levels in seeds (Figure 4A). Moreover, most other GmACC genes in seeds showed an overall decreasing trend or a pattern of initially increasing and then decreasing. Members of Group II were found at higher levels in seeds and pods, while most genes in Group III showed higher expression levels in seeds, suggesting a potential association with the accumulation of fatty acids and isoflavones. Genes in Group IV exhibited significantly higher expression levels in pods than in seeds, implying that CT proteins might primarily function in pods.
Transcriptome sequencing is an effective method for analyzing gene expression patterns. Utilizing the existing soybean germplasm resources, we selected three high-isoflavone varieties and three low-isoflavone varieties. Subsequently, their seeds were subjected to transcriptome sequencing analysis at the beginning seed stage (R5), the full seed stage (R6), and the beginning maturity stage (R7). Following this, the expression levels of the 20 GmACCs were identified (Figure S1). The results indicated that, during the three stages of seed development, four genes (GmACC8, GmACC11, GmACC12, and GmACC14) showed no significant differences in expression between the high-isoflavone and low-isoflavone soybean varieties, suggesting that these genes are not key regulators of isoflavone synthesis. Out of the 20 GmACCs analyzed, 12 genes (GmACC1, GmACC2, GmACC3, GmACC4, GmACC6, GmACC9, GmACC16, GmACC17, GmACC18, GmACC19, and GmACC20) exhibited higher overall expression levels in the high-isoflavone varieties than in the low-isoflavone varieties. When considering the consecutive stages of seed development, the expression trends of eight genes (GmACC2, GmACC3, GmACC4, GmACC6, GmACC7, GmACC9, GmACC17, and GmACC20) showed a pattern of first increasing and then decreasing, while the expression trends of six genes exhibited a continuous decline.
To further validate the accuracy of the transcriptome data, we selected soybean varieties with significant differences in their isoflavone content through preliminary screening. We then used RT-qPCR technology to verify the expression of 16 GmACC genes during seed development at the R5, R6, and R7 stages (Figure 4B). The results showed that the expression levels of four genes exhibited a decreasing trend as seed development progressed. The expression levels of eight genes showed a trend of first increasing and then decreasing with seed development. Notably, the expression patterns of GmACC2, GmACC4, and GmACC17 were essentially consistent with the transcriptomic results. Most genes showed expression trends consistent with those observed in the transcriptomic data, and the expression differences in GmACCs between high-isoflavone and low-isoflavone varieties were particularly evident at the R5 and R6 stages.
To explore the correlation between isoflavone content and the expression of GmACCs, we randomly selected seeds from high-isoflavone varieties and collected samples at four stages over a period of 0–35 days after the R5 stage. We conducted a correlation analysis between the changes in isoflavone content and transcriptomic data at different stages (Figure 5 and Table S6). The results showed that the expression levels of most GmACC genes increased with the number of development days. Notably, the expression levels of five genes (GmACC2, GmACC3, GmACC15, GmACC17, and GmACC20) showed a significant positive correlation with the number of development days, indicating that these genes may be involved in the accumulation of isoflavones.

2.7. Construction and Analysis of Weighted Gene Co-Expression Networks

To gain a more comprehensive understanding of the mechanism by which GmACCs regulate isoflavone biosynthesis, we selected 10 varieties with a high isoflavone content (3549.27–4688.14 µg g−1) and 10 varieties with a low isoflavone content (1396.33–2575.78 µg g−1) (Table S5). Subsequently, we conducted a transcriptome sequencing analysis on their seeds at the R6 stage. Within the established KEGG pathways, we screened 423 key genes involved in the isoflavone biosynthesis pathway and its upstream pathways, and constructed a weighted gene co-expression network based on the content data of various isoflavone components. The results showed that a total of four color modules were clustered (Figure 6), with the genes within the turquoise module showing a positive correlation with isoflavone content, especially significantly positive correlations between the genes within the module and both GT and TI. Notably, GmACC1, GmACC2, GmACC3, GmACC10, GmACC16, and GmACC17 were identified in the turquoise module (Table S7). However, when integrating the transcriptomic data from different stages with the RT-qPCR results, it was found that GmACC1, GmACC10, and GmACC16 did not show significant correlations with isoflavone accumulation. Therefore, the experiment identified the genes GmACC2, GmACC3, and GmACC17, which showed increased expressions from the R5 to R6 stage and were found to be synergistic with isoflavone accumulation, as candidate genes affecting isoflavone content.

3. Discussion

In recent years, the characterization of gene families has been instrumental in unraveling their functions. Acetyl-CoA carboxylase was first discovered in the 1960s through the study of fatty acid biosynthesis in plants [47], where the identification and characterization of this enzyme were pivotal in understanding the process of fatty acid synthesis within plants [48]. Most research on Acetyl-CoA carboxylase has primarily focused on its regulation of the de novo synthesis of fatty acids [49], while studies on its regulation of isoflavone synthesis have not yet been reported. In this study, we identified 20 GmACC genes that were randomly distributed across 12 chromosomes. A subcellular localization prediction analysis indicated that most of these proteins were predicted to be located in the cytoplasm, chloroplasts, and mitochondria, suggesting that this gene family could play roles in various biological processes, participating in the syntheses of certain substances and playing important roles in energy metabolism.
Phylogenetic analysis indicated that the ACCase genes in soybean, Arabidopsis thaliana, maize, rice, and peanut could be divided into seven groups, with the 20 GmACC genes in soybean being revealed within five of these groups. The ACC genes from soybean, Arabidopsis thaliana, maize, rice, and peanut could all be found in Group I. In this group, the identified gene family members primarily encoded homomeric ACCase and biotin carboxylase, implying that biotin carboxylase was a shared component among most plant ACCases. Furthermore, the ACC genes from maize and rice were exclusively identified in Group I and Group V, suggesting a possible divergence in the evolution of ACC genes between monocotyledonous and dicotyledonous plants. In Group IV, only two genes encoding the β-CT subunit protein were identified specifically in soybean, suggesting that this protein may have a distinct structure in soybean.
In soybean, ACCase exists in the two following forms: homomeric and heteromeric. The homomeric ACCase has the three following functional domains: the BC domain, the BCCP domain, and the CT domain. The heteromeric ACCase’s BC and BCCP domains are composed of separate BC and BCCP subunits, while the CT domain is composed of two subunits, α-CT and β-CT [50]. Studies have reported that Arabidopsis thaliana has three ACCase isozymes, one of which is a hetero-oligomer composed of four different subunits located in the plastids (AT5G16390, AT5G35360, AT2G38040, and ATCG00500), while the other two are homodimers, ACC1 (AT1G36160) and ACC2 (AT1G36180), located in the cytoplasm and chloroplasts [22]. In this study, homomeric ACCase was identified in Group I in both soybean and Arabidopsis thaliana, and all four subunits of the heteromeric ACCase could be identified within the same group in both species. Therefore, members of the same group had similar gene structures. In Group I, the identified gene family members primarily encoded homomeric ACCase and biotin carboxylase, in Group III, they primarily encoded biotin carboxyl carrier proteins, and in Groups II, IV, and V, they primarily encoded biotin carboxyltransferase. An important aspect of further structural exploration was to reveal the functions of the genes within each group and the functional differences among genes within the same group.
Previous studies have shown that, during the evolutionary process, gene families typically undergo tandem duplication or large-scale segmental duplication to maintain their gene integrity [51]. Segmental duplication events refer to the process in which a segment of a DNA sequence is repeated in the genome, usually distributed at distant locations, while homologous genes that are continuously repeated in the same genomic region are considered as tandem duplication events [52,53]. Soybean (Glycine max) is a typical paleopolyploid, whose ancestor diverged from the ancestor of green bean (Phaseolus vulgaris) and then experienced a whole-genome polyploidization event accompanied by a slow diploidization process, resulting in a whole-genome duplication rate of approximately 75% [44]. The analysis results indicated that the rates of segmental and tandem duplications in soybean (80%) were higher than the whole-genome duplication rate, suggesting that segmental duplication events and tandem duplication events play significant roles in the amplification of these GmACCs. We also found that the calculated Ka/Ks ratios for all gene pairs were less than one, indicating that these genes may experience strong purifying selection pressure during the evolutionary process. It has been reported that soybean is closely related to green bean and peanut during the evolutionary process of legumes [54]. Meanwhile, the experimental results showed that there were significantly more collinear gene pairs between soybean and peanut than between soybean and Arabidopsis thaliana. However, soybean did not exhibit collinear relationships with monocots like maize and rice, suggesting that this linear relationship may only exist among dicots.
ACCase catalyzes the conversion of Acetyl-CoA into Malonyl-CoA. There have been research reports where, in some soybean varieties, the total isoflavone content was negatively correlated with the oil content [55]. This study suggested that homomeric and heteromeric ACCases might play different roles in maintaining metabolic balance and responding to environmental changes. It has been reported that overexpressing the heteromeric ACC in E. coli and then introducing it into rapeseed results in an increased oil content in seeds [56]. The overexpression of α-CT from pea in Arabidopsis thaliana was found to enhance fatty acid synthesis [57,58]. The overexpression of the four subunits of the heteromeric ACCase can enhance lipid accumulation in cotton seeds [59]. In Arabidopsis thaliana, “BCCP-like” proteins act as negative regulators of htACCase. The silencing of this gene by RNA interference led to an increase in seed oil content [60]. In tobacco, it was discovered that the level of the accD subunit was a determinant of ACCase levels, and that enzyme levels were, in part, controlled post-transcriptionally at the level of subunit assembly [61]. However, once Acetyl-CoA is produced in the cytoplasm, the Malonyl-CoA produced by the homomeric ACCase enters various secondary metabolic pathways, participating in multiple plant responses [31]. It is inferred that the Malonyl-CoA produced under the catalysis of the heteromeric ACCase can promote fatty acid synthesis, while the homomeric ACCase may have a certain influence on the accumulation of isoflavones. Studies have found that the homomeric ACCase is primarily responsible for producing cytoplasmic Malonyl-CoA, which is used for flavonoid biosynthesis in Arabidopsis thaliana [33]. Since Arabidopsis thaliana lacks the key enzyme in the isoflavone biosynthesis process, isoflavone synthase, it cannot synthesize isoflavones [62]. However, isoflavone synthase is widely present in legumes and is capable of catalyzing the isoflavone biosynthesis reaction. Therefore, further verification of the functions of GmACCs will greatly enhance our understanding of the soybean isoflavone synthesis mechanism.
Gene expression patterns serve as vital clues for elucidating gene functions. In this study, we analyzed the expression profiles of 20 GmACC genes during seed development. The research indicated that most genes in Group I were highly expressed in both seeds and pods, genes in Group II, Group IV, and Group V were more highly expressed in pods, and most genes in Group III were more highly expressed in seeds. It is inferred that BC genes and homomeric ACC may play significant roles in the accumulation of specific substances during seed development, carboxyl transferase primarily functions in pods, and BCCP genes perform key functions during seed development. The accumulation levels of isoflavone-related compounds vary during seed development. Reports indicate that, after flowering for 25 days, the expression levels of genes related to daidzein and glycitein contents significantly change. Additionally, in the early stages of seed development, the expression levels of genes associated with genistein content also undergo significant changes [63,64]. To further validate the GmACCs associated with isoflavone accumulation, we conducted transcriptome sequencing of the seeds at the R5, R6, and R7 stages, followed by RT-qPCR verification. The results revealed that the expression patterns of most genes were consistent with the expression profiles, and their expression levels were lower during the R7 stage of soybean development. The R7 stage marks the beginning of maturity, when the seeds are about to enter a dormant state and the expression levels of most genes within the plant decrease. Studies have reported that isoflavones continuously accumulate during seed development, and genes related to isoflavone synthesis exhibit higher expression levels in the later stages of seed development [63]. It has been reported that genes involved in isoflavone synthesis, such as CHS7, CHS8, and IFS2, show a gradual increase in expression quantity throughout the seed development process, reaching their peak expression levels 60 days after flowering [65]. Therefore, this study selected four stages following the R5 period to analyze the correlation between the expression levels of the GmACC genes and isoflavone accumulation. Ultimately, five genes were found to have a significant positive correlation with isoflavone accumulation. Based on the literature and our experimental results, we preliminarily inferred that the expression of isoflavone-synthesis-related genes was most active during the R6 stage.
WGCNA analysis is an effective technique for classifying transcriptome data into co-expression modules, in order to reduce the number of potential candidate genes. Azam et al. [66] used a combination of GWAS and WGCNA to identify three modules highly correlated with total isoflavone content, ultimately determining that Glyma.11G108100 affected the accumulation of isoflavone in soybean seeds. Azam et al. [67], based on BSA-seq and differentially expressed genes from WGCNA, identified four hub genes, of which the allele variation in Glyma.06G290400 (GmIE3-1) significantly affected isoflavone accumulation. The isoflavone biosynthesis pathway involves multiple structural genes encoding enzymes, which interact through coordination or competition to promote isoflavone accumulation [68]. In this study, we constructed a weighted gene co-expression network using genes related to the isoflavone synthesis pathway and phenotypic data. Within the network, we found that the turquoise module was significantly positively correlated with the various components of isoflavones, showing an extremely significant positive correlation with GT. It has been reported that DZ and GT are key components affecting the total isoflavone content [2]. Therefore, identifying genes within the turquoise module could more accurately uncover the genes involved in regulating isoflavone accumulation.
By integrating the RT-qPCR and transcriptome sequencing results, we preliminarily identified GmACC2, GmACC3, and GmACC17 as hub genes regulating isoflavone synthesis. GmACC2 encodes homomeric ACCase. The homogeneous ACCase carboxylase plays a significant role in plant secondary metabolism. Studies have suggested that the homomeric ACCase in plants can catalyze the production of flavonoid metabolites [21]. Research has found that ACC missense mutants lack flavonoid accumulation during cold acclimation in leaves [69]. In Arabidopsis thaliana, under UV-B stress, the expression levels of cytosolic ACC and flavonoid-biosynthesis-related genes can significantly increase [70]. GmACC3 was found to encode the biotin carboxyl carrier protein (BCCP), and GmACC17 was found to encode the biotin carboxyl carrier protein subtype 2. Previous studies have shown that overexpressing BCCP increases fatty acid elongation and the production of flavonoids, terpenes, and phytosterols [32]. Both the antisense and sense expression of the biotin carboxyl carrier protein subtype 2 in Arabidopsis thaliana inactivated plastidial Acetyl-CoA carboxylase, reducing the seed oil content [71]. However, the role of this protein subtype in secondary metabolism remains unexplored. Therefore, it was speculated that GmACC2 can directly participate in isoflavone biosynthesis, and that GmACC3 and GmACC17 can indirectly regulate the accumulation of isoflavones. In summary, this study employed a bioinformatics analysis and integrated transcriptomics with WGCNA to associate phenotypes, delving into the regulatory mechanism of isoflavone biosynthesis during soybean seed development. We found that the GmACC2, GmACC3, and GmACC17 genes play significant roles in isoflavone biosynthesis. These results provide important insights into the molecular mechanisms underlying isoflavone biosynthesis in soybeans and offer theoretical support for the improvement of soybean varieties and the enhancement of isoflavone content.

4. Materials and Methods

4.1. Genome-Wide Identification of ACC Genes in Soybean

To identify the ACC gene family, we obtained the Arabidopsis thaliana ACCase protein sequences from TAIR (https://www.arabidopsis.org/) (accessed on 21 May 2024) and downloaded the complete set of soybean protein sequences from the Phytozome database [72]. Using these sequences as a reference, we searched the soybean protein database using the BLASTP program with an e-value threshold of 1 × 10−5. A local search based on a hidden Markov model (HMM) was conducted (http://hmmer.org/) (accessed on 22 May 2024) [73], using all known ACCase protein sequences from Arabidopsis thaliana to identify the GmACC genes in soybean. To more accurately identify the predicted GmACC genes from the genomic sequences, we analyzed the GmACC-like gene sequences based on the typical structural features of plant ACCase proteins reported previously in the NCBI online database (https://www.ncbi.nlm.nih.gov/) (accessed on 22 May 2024) [74]. As a result, we obtained 20 soybean ACC genes, which were renamed as GmACC1GmACC20. (Table 1).

4.2. Prediction of Physicochemical Properties of GmACC Protein

We utilized Expasy-ProtParam (https://web.expasy.org/protparam/) (accessed on 23 May 2024) for the prediction of the protein physicochemical properties through pI/MW calculation [75]. Subcellular localization information on the soybean GmACC genes was obtained from the CELLO (http://cello.life.nctu.edu.tw/) (accessed on 24 May 2024) online web resource [76].

4.3. Construction of a Phylogenetic Tree

Based on the identified ACCase functional domains, we downloaded the ACCase protein sequences for peanut, rice, and maize from the Phytozome database [39]. Subsequently, we aligned the ACCase protein sequences from soybean, Arabidopsis thaliana, peanut, rice, and maize using the ClustalW algorithm in MEGA XI v11.0.13. All sequences were renamed according to their genetic characteristics as AtACC1-AtACC8, AhACC1-AhACC29, OsACC1-OsACC5, and ZmACC1-ZmACC7. A phylogenetic tree was constructed using the Maximum Parsimony method in MEGA XI, and bootstrap values were obtained based on 1000 replicates [77]. The tree was visualized using the online software Evolview v3 (https://www.evolgenius.info/evolview/#/treeview) (accessed on 24 May 2024) [78].

4.4. Gene Structure, Conserved Motif Analysis, and Promoter Cis-Regulatory Element Characterization of the GmACC Genes

We downloaded soybean genomic data, protein files, CDS files, and gff3 files from the Phytozome online website. We then used the online tool MEME motif analysis (https://meme-suite.org/meme/tools/meme) (accessed on 27 May 2024) to identify the conserved motifs in the soybean ACCase proteins [79]. In the MEME version 5.5.7 program, we set the number of conserved motifs to 15. Finally, we imported the generated files into TBtools (version 2.057) for visualization [40].
We extracted the 2000 bp sequence upstream of the start codon of the GmACC genes and submitted it to the PlantCare version 1 (https://bioinformatics.psb.ugent.be/webtools/plantcare/html/) (accessed on 27 May 2024) website for cis-acting element prediction [80]. After removing ambiguous elements and some basic elements, we identified 19 cis-elements. The results were visualized using the TBtools software.

4.5. Collinearity Analysis and Selective Pressure for Duplicated Genes

Tandem duplication events and segmental duplication events are considered to be the main drivers of gene family expansion within genomes [81]. Tandem duplications are defined as clusters of genes with sequence similarity and functional similarity that form within a 200 kb segment in the same chromosomal region [41]. The Simple Ka/Ks Calculator tool in TBtools software was used to estimate the synonymous substitution rate (Ks) and the non-synonymous substitution rate (Ka) to assess the impact sof selection pressure and divergence time on the ACC genes (parameter settings: CPU = 4) [45]. Subsequently, we analyzed tandem duplication events in the GmACC gene family using TBtools and MCScanX version git-97e74f40. Similarly, we investigated segmental duplication events and the collinearity relationships between gene pairs in different species using TBtools in conjunction with MCScanX and the BLASTP method.

4.6. Plant Materials and Gene Expression Analysis

The plant materials used in this study were grown in the potting field of Northeast Agricultural University in 2022 and included the low-isoflavone varieties L-79, Hong Feng No. 11, and Feng Shou No. 6, as well as the high-isoflavone varieties Sui 03-3952, Small-seeded Fodder Bean, and Zhong Dou 32 (Table S5). The plants were in good health with no disease or pest infestations. Using the SoyMD online platform (https://yanglab.hzau.edu.cn/SoyMD/#/) (accessed on 27 May 2024), we predicted the expression levels of the GmACC genes in various tissues during seed development [46]. Seeds at the beginning seed stage (R5), the full seed stage (R6), and the beginning maturity stage (R7) were selected for transcriptomic and RT-qPCR detection.

4.7. Total RNA Isolation and RT-qPCR Expression Analysis

To validate the expression profiles of the GmACCs in the multi-omics database SoyMD, we followed the methods described in the literature to extract the total RNA from the samples [82], and synthesized cDNA from mRNA using the ReverTra Ace® qPCR RT Master Mix (TOYOBO, Life Science Department, Tokyo, Japan). The cDNA samples were diluted to 150 ng/μL with sterile double-distilled water and stored at −20 °C for later use. Three biological replicates for each sample were conducted for transcriptomic and RT-qPCR detection. We used actin as the internal reference gene [83], with primers designed using Primer Premier 5 (Table S8).

4.8. Transcriptome Sequencing Analysis and Correlation Analysis

To clarify the correlation between the expressions of the GmACCs and isoflavone accumulation, we selected seeds from the high-isoflavone variety Zhongdou 27 (4220.61 µg g−1) at the four following stages of grain development: 0 days, 7 days, 21 days, and 35 days after the R5 stage. Three biological replicates were conducted for each sample for transcriptomic analysis. Subsequently, the isoflavone content in the seeds was measured using high-performance liquid chromatography (HPLC), with the measurement method referenced from that of Wu et al. [84]. A network heatmap was generated using https://www.omicshare.com/ (accessed on 21 May 2024) [85].
Using Tbtools (version 2.057), we analyzed the mapping of clean reads and utilized them for HISAT2 version 2.2.1 [86]. Transcripts per million (TPM) values were used for gene/transcript-level quantification, and the genes obtained were selected based on the KEGG pathway [87]. We focused on genes involved in the isoflavone biosynthesis pathway, the flavonoid biosynthesis pathway, and the phenylpropanoid biosynthesis pathway as the key genes affecting isoflavone accumulation for further study.

4.9. Analysis of Weighted Gene Co-Expression Networks

Based on the definitions of high and low isoflavone content by Cai and Kim et al. [88,89], we chose varieties with a total isoflavone content in their seeds of less than 3000 µg g−1 as the low isoflavone varieties and those with a total isoflavone content greater than 3500 µg g−1 as the high isoflavone varieties (Table S5). Utilizing the contents of various isoflavone components in conjunction with transcriptomic data, a weighted gene co-expression network analysis was conducted using the WGCNA package in R software version R-4.2.2 [90]. We determined the soft thresholding power based on the scale-free network principle, selecting a soft threshold with a scale-free network fit index greater than 0.8 [91]. We identified co-expression patterns using dynamic tree cutting, constructed a gene clustering tree based on the correlation of gene expression levels, set the minimum number of genes per module to two, and then merged modules with similar expression patterns based on a module eigengene similarity threshold of 0.8 (Table S7).

5. Conclusions

In conclusion, this study systematically explored soybean GmACC genes across the entire genome by integrating bioinformatics analysis and WGCNA. It was determined that the GmACC2, GmACC3, and GmACC17 genes play significant roles in the biosynthesis of isoflavones. These findings provide important evidence for further elucidating the molecular mechanisms of isoflavone biosynthesis in soybeans and offer theoretical support for the improvement of soybean varieties and the enhancement of isoflavone content.

Supplementary Materials

The supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms251810221/s1.

Author Contributions

Conceptualization, X.W. and Z.Y.; methodology, Z.Y.; software, X.W.; validation, Y.Z. (Yina Zhu), Y.Z. (Yuhang Zhan), and Y.L.; formal analysis, Y.Z. (Yuhang Zhan); investigation, Y.Z. (Yina Zhu); resources, W.T.; data curation, X.W.; writing—original draft preparation, X.W.; writing—review and editing, X.Z.; visualization, X.W.; supervision, Y.H.; project administration, Y.H.; funding acquisition, Y.H. All authors have read and agreed to the published version of the manuscript.

Funding

This study was conducted in the Key Laboratory of Soybean Biology of the Chinese Education Ministry, Soybean Research & Development Center (CARS), and the Key Laboratory of Northeastern Soybean Biology and Breeding/Genetics of the Chinese Agriculture Ministry and was financially supported by the Natural Science Foundation of Heilongjiang Province (ZD2022C002, JD22A015), Key Laboratory of Soybean Mechanized Production, Ministry of Agriculture and Rural Affairs, China (Grant No. SMP202206), National Key Research & Development Project (2021YFD1201604, 2021YFF1001204), the Chinese National Natural Science Foundation (31971967, U22A20473), the Youth Leading Talent Project of the Ministry of Science and Technology in China (2015RA228), The National Ten-thousand Talents Program, Postdoctoral Fund in Heilongjiang Province (LBH-Q20004), The national project (CARS-04-PS06).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The RNA_Seq data (ID: PRJNA1139955) is accessible: https://www.ncbi.nlm.nih.gov/bioproject/PRJNA1139955 (accessed on 2 June 2024). All figures and data are included in the manuscript and Supplementary Materials.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Mateos-Aparicio, I.; Redondo Cuenca, A.; Villanueva-Suárez, M.J.; Zapata-Revilla, M.A. Soybean, a Promising Health Source. Nutr. Hosp. 2008, 23, 305–312. [Google Scholar] [PubMed]
  2. Azam, M.; Zhang, S.; Abdelghany, A.M.; Shaibu, A.S.; Feng, Y.; Li, Y.; Tian, Y.; Hong, H.; Li, B.; Sun, J. Seed Isoflavone Profiling of 1168 Soybean Accessions from Major Growing Ecoregions in China. Food Res. Int. 2020, 130, 108957. [Google Scholar] [CrossRef] [PubMed]
  3. Wang, X.; Zhao, W.; Wei, X.; Sun, Y.; Dong, S. Molecular Mechanism of Drought Resistance in Soybean Roots Revealed Using Physiological and Multi-Omics Analyses. Plant Physiol. Biochem. 2024, 208, 108451. [Google Scholar] [CrossRef] [PubMed]
  4. Horitani, M.; Yamada, R.; Taroura, K.; Maeda, A.; Anai, T.; Watanabe, S. Identification of Genes Responsible for the Synthesis of Glycitein Isoflavones in Soybean Seeds. Plants 2024, 13, 156. [Google Scholar] [CrossRef]
  5. Zeng, G.; Li, D.; Han, Y.; Teng, W.; Wang, J.; Qiu, L.; Li, W. Identification of QTL Underlying Isoflavone Contents in Soybean Seeds among Multiple Environments. Theor. Appl. Genet. 2009, 118, 1455–1463. [Google Scholar] [CrossRef]
  6. Wang, Y.; Han, Y.; Zhao, X.; Li, Y.; Teng, W.; Li, D.; Zhan, Y.; Li, W. Mapping Isoflavone QTL with Main, Epistatic and QTL × Environment Effects in Recombinant Inbred Lines of Soybean. PLoS ONE 2015, 10, e0118447. [Google Scholar] [CrossRef] [PubMed]
  7. Yoshiara, L.Y.; Madeira, T.B.; Delaroza, F.; da Silva, J.B.; Ida, E.I. Optimization of Soy Isoflavone Extraction with Different Solvents Using the Simplex-Centroid Mixture Design. Int. J. Food Sci. Nutr. 2012, 63, 978–986. [Google Scholar] [CrossRef] [PubMed]
  8. Anguraj Vadivel, A.K.; McDowell, T.; Renaud, J.B.; Dhaubhadel, S. A Combinatorial Action of GmMYB176 and GmbZIP5 Controls Isoflavonoid Biosynthesis in Soybean (Glycine Max). Commun. Biol. 2021, 4, 356. [Google Scholar] [CrossRef]
  9. Graham, T.L. Flavonoid and Isoflavonoid Distribution in Developing Soybean Seedling Tissues and in Seed and Root Exudates. Plant Physiol. 1991, 95, 594–603. [Google Scholar] [CrossRef]
  10. Dwivedi, S.; Singh, V.; Sharma, K.; Sliti, A.; Baunthiyal, M.; Shin, J.-H. Significance of Soy-Based Fermented Food and Their Bioactive Compounds Against Obesity, Diabetes, and Cardiovascular Diseases. Plant Foods Hum. Nutr. 2024, 79, 1–11. [Google Scholar] [CrossRef]
  11. Gutierrez-Gonzalez, J.J.; Wu, X.; Gillman, J.D.; Lee, J.-D.; Zhong, R.; Yu, O.; Shannon, G.; Ellersieck, M.; Nguyen, H.T.; Sleper, D.A. Intricate Environment-Modulated Genetic Networks Control Isoflavone Accumulation in Soybean Seeds. BMC Plant Biol. 2010, 10, 105. [Google Scholar] [CrossRef] [PubMed]
  12. Chao, N.; Wang, R.-F.; Hou, C.; Yu, T.; Miao, K.; Cao, F.-Y.; Fang, R.-J.; Liu, L. Functional Characterization of Two Chalcone Isomerase (CHI) Revealing Their Responsibility for Anthocyanins Accumulation in Mulberry. Plant Physiol. Biochem. 2021, 161, 65–73. [Google Scholar] [CrossRef] [PubMed]
  13. Wang, J.; Jiang, Y.; Sun, T.; Zhang, C.; Liu, X.; Li, Y. Genome-Wide Classification and Evolutionary Analysis Reveal Diverged Patterns of Chalcone Isomerase in Plants. Biomolecules 2022, 12, 961. [Google Scholar] [CrossRef]
  14. Anguraj Vadivel, A.K.; Krysiak, K.; Tian, G.; Dhaubhadel, S. Genome-Wide Identification and Localization of Chalcone Synthase Family in Soybean (Glycine Max [L.] Merr) . BMC Plant Biol. 2018, 18, 325. [Google Scholar] [CrossRef]
  15. Sepiol, C.J.; Yu, J.; Dhaubhadel, S. Genome-Wide Identification of Chalcone Reductase Gene Family in Soybean: Insight into Root-Specific GmCHRs and Phytophthora Sojae Resistance. Front. Plant Sci. 2017, 8, 2073. [Google Scholar] [CrossRef]
  16. Post-Beittenmiller, D.; Jaworski, J.G.; Ohlrogge, J.B. In Vivo Pools of Free and Acylated Acyl Carrier Proteins in Spinach. Evidence for Sites of Regulation of Fatty Acid Biosynthesis. J. Biol. Chem. 1991, 266, 1858–1865. [Google Scholar] [CrossRef] [PubMed]
  17. Konishi, T.; Shinohara, K.; Yamada, K.; Sasaki, Y. Acetyl-CoA Carboxylase in Higher Plants: Most Plants Other than Gramineae Have Both the Prokaryotic and the Eukaryotic Forms of This Enzyme. Plant Cell Physiol. 1996, 37, 117–122. [Google Scholar] [CrossRef] [PubMed]
  18. Sasaki, Y.; Nagano, Y. Plant Acetyl-CoA Carboxylase: Structure, Biosynthesis, Regulation, and Gene Manipulation for Plant Breeding. Biosci. Biotechnol. Biochem. 2004, 68, 1175–1184. [Google Scholar] [CrossRef]
  19. Salie, M.J.; Thelen, J.J. Regulation and Structure of the Heteromeric Acetyl-CoA Carboxylase. Biochim. Biophys. Acta 2016, 1861, 1207–1213. [Google Scholar] [CrossRef]
  20. Kozaki, A.; Mayumi, K.; Sasaki, Y. Thiol-Disulfide Exchange between Nuclear-Encoded and Chloroplast-Encoded Subunits of Pea Acetyl-CoA Carboxylase. J. Biol. Chem. 2001, 276, 39919–39925. [Google Scholar] [CrossRef]
  21. Nikolau, B.J.; Ohlrogge, J.B.; Wurtele, E.S. Plant Biotin-Containing Carboxylases. Arch. Biochem. Biophys. 2003, 414, 211–222. [Google Scholar] [CrossRef] [PubMed]
  22. Li, X.; Ilarslan, H.; Brachova, L.; Qian, H.-R.; Li, L.; Che, P.; Wurtele, E.S.; Nikolau, B.J. Reverse-Genetic Analysis of the Two Biotin-Containing Subunit Genes of the Heteromeric Acetyl-Coenzyme A Carboxylase in Arabidopsis Indicates a Unidirectional Functional Redundancy. Plant Physiol. 2011, 155, 293–314. [Google Scholar] [CrossRef] [PubMed]
  23. Zhang, Z.; Dunwell, J.M.; Zhang, Y.-M. An Integrated Omics Analysis Reveals Molecular Mechanisms That Are Associated with Differences in Seed Oil Content between Glycine Max and Brassica Napus. BMC Plant Biol. 2018, 18, 328. [Google Scholar] [CrossRef] [PubMed]
  24. Elborough, K.M.; Winz, R.; Deka, R.K.; Markham, J.E.; White, A.J.; Rawsthorne, S.; Slabas, A.R. Biotin Carboxyl Carrier Protein and Carboxyltransferase Subunits of the Multi-Subunit Form of Acetyl-CoA Carboxylase from Brassica Napus: Cloning and Analysis of Expression during Oilseed Rape Embryogenesis. Biochem. J. 1996, 315, 103–112. [Google Scholar] [CrossRef]
  25. Yanai, Y.; Kawasaki, T.; Shimada, H.; Wurtele, E.S.; Nikolau, B.J.; Ichikawa, N. Genomic Organization of 251 kDa Acetyl-CoA Carboxylase Genes in Arabidopsis: Tandem Gene Duplication Has Made Two Differentially Expressed Isozymes. Plant Cell Physiol. 1995, 36, 779–787. [Google Scholar] [CrossRef]
  26. Incledon, B.J.; Hall, J.C. Acetyl-Coenzyme A Carboxylase: Quaternary Structure and Inhibition by Graminicidal Herbicides. Pestic. Biochem. Physiol. 1997, 57, 255–271. [Google Scholar] [CrossRef]
  27. Chen, Y.; Elizondo-Noriega, A.; Cantu, D.C.; Reilly, P.J. Structural Classification of Biotin Carboxyl Carrier Proteins. Biotechnol. Lett. 2012, 34, 1869–1875. [Google Scholar] [CrossRef]
  28. Megha, S.; Wang, Z.; Kav, N.N.V.; Rahman, H. Genome-Wide Identification of Biotin Carboxyl Carrier Subunits of Acetyl-CoA Carboxylase in Brassica and Their Role in Stress Tolerance in Oilseed Brassica Napus. BMC Genom. 2022, 23, 707. [Google Scholar] [CrossRef]
  29. Cui, Y.; Zhao, Y.; Wang, Y.; Liu, Z.; Ijaz, B.; Huang, Y.; Hua, J. Genome-Wide Identification and Expression Analysis of the Biotin Carboxyl Carrier Subunits of Heteromeric Acetyl-CoA Carboxylase in Gossypium. Front. Plant Sci. 2017, 8, 624. [Google Scholar] [CrossRef]
  30. Alban, C.; Job, D.; Douce, R. Biotin Metabolism in Plants. Annu. Rev. Plant Physiol. Plant Mol. Biol. 2000, 51, 17–47. [Google Scholar] [CrossRef]
  31. Shui, L.; Huo, K.; Chen, Y.; Zhang, Z.; Li, Y.; Niu, J. Integrated Metabolome and Transcriptome Revealed the Flavonoid Biosynthetic Pathway in Developing Vernonia Amygdalina Leaves. PeerJ 2021, 9, e11239. [Google Scholar] [CrossRef] [PubMed]
  32. Chen, M.; Mooney, B.P.; Hajduch, M.; Joshi, T.; Zhou, M.; Xu, D.; Thelen, J.J. System Analysis of an Arabidopsis Mutant Altered in de Novo Fatty Acid Synthesis Reveals Diverse Changes in Seed Composition and Metabolism. Plant Physiol. 2009, 150, 27–41. [Google Scholar] [CrossRef] [PubMed]
  33. Saito, K.; Yonekura-Sakakibara, K.; Nakabayashi, R.; Higashi, Y.; Yamazaki, M.; Tohge, T.; Fernie, A.R. The Flavonoid Biosynthetic Pathway in Arabidopsis: Structural and Genetic Diversity. Plant Physiol. Biochem. 2013, 72, 21–34. [Google Scholar] [CrossRef] [PubMed]
  34. Leonard, E.; Lim, K.-H.; Saw, P.-N.; Koffas, M.A.G. Engineering Central Metabolic Pathways for High-Level Flavonoid Production in Escherichia Coli. Appl. Environ. Microbiol. 2007, 73, 3877–3886. [Google Scholar] [CrossRef]
  35. Zong, Y.; Li, S.; Xi, X.; Cao, D.; Wang, Z.; Wang, R.; Liu, B. Comprehensive Influences of Overexpression of a MYB Transcriptor Regulating Anthocyanin Biosynthesis on Transcriptome and Metabolome of Tobacco Leaves. Int. J. Mol. Sci. 2019, 20, 5123. [Google Scholar] [CrossRef]
  36. Yan, F.; Githiri, S.M.; Liu, Y.; Sang, Y.; Wang, Q.; Takahashi, R. Loss-of-Function Mutation of Soybean R2R3 MYB Transcription Factor Dilutes Tawny Pubescence Color. Front. Plant Sci. 2019, 10, 1809. [Google Scholar] [CrossRef]
  37. Amato, A.; Cardone, M.F.; Ocarez, N.; Alagna, F.; Ruperti, B.; Fattorini, C.; Velasco, R.; Mejía, N.; Zenoni, S.; Bergamini, C. VviAGL11 Self-Regulates and Targets Hormone- and Secondary Metabolism-Related Genes during Seed Development. Hortic. Res. 2022, 9, uhac133. [Google Scholar] [CrossRef]
  38. Cooper, B.; Campbell, K.B.; Garrett, W.M. Salicylic Acid and Phytoalexin Induction by a Bacterium That Causes Halo Blight in Beans. Phytopathology® 2022, 112, 1766–1775. [Google Scholar] [CrossRef]
  39. Sreedasyam, A.; Plott, C.; Hossain, M.S.; Lovell, J.T.; Grimwood, J.; Jenkins, J.W.; Daum, C.; Barry, K.; Carlson, J.; Shu, S.; et al. JGI Plant Gene Atlas: An Updateable Transcriptome Resource to Improve Functional Gene Descriptions across the Plant Kingdom. Nucleic Acids Res. 2023, 51, 8383–8401. [Google Scholar] [CrossRef]
  40. Chen, C.; Wu, Y.; Li, J.; Wang, X.; Zeng, Z.; Xu, J.; Liu, Y.; Feng, J.; Chen, H.; He, Y.; et al. TBtools-II: A “One for All, All for One” Bioinformatics Platform for Biological Big-Data Mining. Mol. Plant 2023, 16, 1733–1742. [Google Scholar] [CrossRef]
  41. Holub, E.B. The Arms Race Is Ancient History in Arabidopsis, the Wildflower. Nat. Rev. Genet. 2001, 2, 516–527. [Google Scholar] [CrossRef] [PubMed]
  42. Li, X.; Clarke, J.D.; Zhang, Y.; Dong, X. Activation of an EDS1-Mediated R-Gene Pathway in the Snc1 Mutant Leads to Constitutive, NPR1-Independent Pathogen Resistance. Mol. Plant Microbe Interact. 2001, 14, 1131–1139. [Google Scholar] [CrossRef]
  43. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef] [PubMed]
  44. Wang, L.; Jia, G.; Jiang, X.; Cao, S.; Chen, Z.J.; Song, Q. Altered Chromatin Architecture and Gene Expression during Polyploidization and Domestication of Soybean. Plant Cell 2021, 33, 1430–1446. [Google Scholar] [CrossRef]
  45. Sun, S.; Wang, B.; Jiang, Q.; Li, Z.; Jia, S.; Wang, Y.; Guo, H. Genome-Wide Analysis of BpDof Genes and the Tolerance to Drought Stress in Birch (Betula Platyphylla). PeerJ 2021, 9, e11938. [Google Scholar] [CrossRef] [PubMed]
  46. Yang, Z.; Luo, C.; Pei, X.; Wang, S.; Huang, Y.; Li, J.; Liu, B.; Kong, F.; Yang, Q.-Y.; Fang, C. SoyMD: A Platform Combining Multi-Omics Data with Various Tools for Soybean Research and Breeding. Nucleic Acids Res. 2024, 52, D1639–D1650. [Google Scholar] [CrossRef]
  47. Wakil, S.J. Enzymatic Synthesis of Fatty Acids. Comp. Biochem. Physiol. 1962, 4, 123–158. [Google Scholar] [CrossRef]
  48. Katagiri, T.; Amao, Y. Recent Advances in Light-Driven C–H Bond Activation and Building C–C Bonds with CO2 as a Feedstock for Carbon Capture and Utilization Technology. Green Chem. 2020, 22, 6682–6713. [Google Scholar] [CrossRef]
  49. Li, S.J.; Cronan, J.E. The Genes Encoding the Two Carboxyltransferase Subunits of Escherichia Coli Acetyl-CoA Carboxylase. J. Biol. Chem. 1992, 267, 16841–16847. [Google Scholar] [CrossRef]
  50. Belkebir, A.; Paepe, R.D.; Tremolieres, A.; Aid, F.; Benhassaine-Kesri, G. Sethoxydim Affects Lipid Synthesis and Acetyl-CoA Carboxylase Activity in Soybean. J. Exp. Bot. 2006, 57, 3553–3562. [Google Scholar] [CrossRef]
  51. Xu, Z.; Pu, X.; Gao, R.; Demurtas, O.C.; Fleck, S.J.; Richter, M.; He, C.; Ji, A.; Sun, W.; Kong, J.; et al. Tandem Gene Duplications Drive Divergent Evolution of Caffeine and Crocin Biosynthetic Pathways in Plants. BMC Biol. 2020, 18, 63. [Google Scholar] [CrossRef] [PubMed]
  52. Blanc, G.; Barakat, A.; Guyot, R.; Cooke, R.; Delseny, M. Extensive Duplication and Reshuffling in the Arabidopsis Genome. Plant Cell 2000, 12, 1093–1102. [Google Scholar] [CrossRef]
  53. Lu, L.; Hou, Q.; Wang, L.; Zhang, T.; Zhao, W.; Yan, T.; Zhao, L.; Li, J.; Wan, X. Genome-Wide Identification and Characterization of Polygalacturonase Gene Family in Maize (Zea Mays L.). Int. J. Mol. Sci. 2021, 22, 10722. [Google Scholar] [CrossRef] [PubMed]
  54. Zhao, Y.; Zhang, R.; Jiang, K.-W.; Qi, J.; Hu, Y.; Guo, J.; Zhu, R.; Zhang, T.; Egan, A.N.; Yi, T.-S.; et al. Nuclear Phylotranscriptomics and Phylogenomics Support Numerous Polyploidization Events and Hypotheses for the Evolution of Rhizobial Nitrogen-Fixing Symbiosis in Fabaceae. Mol. Plant 2021, 14, 748–773. [Google Scholar] [CrossRef] [PubMed]
  55. Charron, C.S.; Allen, F.L.; Johnson, R.D.; Pantalone, V.R.; Sams, C.E. Correlations of Oil and Protein with Isoflavone Concentration in Soybean [Glycine Max (L.) Merr.]. J. Agric. Food Chem. 2005, 53, 7128–7135. [Google Scholar] [CrossRef]
  56. Wang, M.; Liu, M.; Li, D.; Wu, J.; Li, X.; Yang, Y. Overexpression of FAD2 Promotes Seed Germination and Hypocotyl Elongation in Brassica Napus. Plant Cell Tiss. Organ Cult. 2010, 102, 205–211. [Google Scholar] [CrossRef]
  57. Shivaiah, K.-K.; Ding, G.; Upton, B.; Nikolau, B.J. Non-Catalytic Subunits Facilitate Quaternary Organization of Plastidic Acetyl-CoA Carboxylase1[OPEN]. Plant Physiol. 2020, 182, 756–775. [Google Scholar] [CrossRef]
  58. Wang, M.; Garneau, M.G.; Poudel, A.N.; Lamm, D.; Koo, A.J.; Bates, P.D.; Thelen, J.J. Overexpression of Pea α-Carboxyltransferase in Arabidopsis and Camelina Increases Fatty Acid Synthesis Leading to Improved Seed Oil Content. Plant J. 2022, 110, 1035–1046. [Google Scholar] [CrossRef] [PubMed]
  59. Cui, Y.; Liu, Z.; Zhao, Y. Overexpression of Heteromeric GhACCase Subunits Enhanced Oil Accumulation in Upland Cotton | Plant Molecular Biology Reporter. Plant Mol. Biol. Rep. 2017, 35, 287–297. [Google Scholar] [CrossRef]
  60. Salie, M.J.; Zhang, N.; Lancikova, V.; Xu, D.; Thelen, J.J. A Family of Negative Regulators Targets the Committed Step of de Novo Fatty Acid Biosynthesis. Plant Cell 2016, 28, 2312–2325. [Google Scholar] [CrossRef]
  61. Madoka, Y.; Tomizawa, K.-I.; Mizoi, J.; Nishida, I.; Nagano, Y.; Sasaki, Y. Chloroplast Transformation with Modified accD Operon Increases Acetyl-CoA Carboxylase and Causes Extension of Leaf Longevity and Increase in Seed Yield in Tobacco. Plant Cell Physiol. 2002, 43, 1518–1525. [Google Scholar] [CrossRef] [PubMed]
  62. Pandey, A.; Misra, P.; Khan, M.P.; Swarnkar, G.; Tewari, M.C.; Bhambhani, S.; Trivedi, R.; Chattopadhyay, N.; Trivedi, P.K. Co-Expression of Arabidopsis Transcription Factor, AtMYB12, and Soybean Isoflavone Synthase, GmIFS1, Genes in Tobacco Leads to Enhanced Biosynthesis of Isoflavones and Flavonols Resulting in Osteoprotective Activity. Plant Biotechnol. J. 2014, 12, 69–80. [Google Scholar] [CrossRef]
  63. Chen, H.; Liu, C.; Li, Y.; Wang, X.; Pan, X.; Wang, F.; Zhang, Q. Developmental Dynamic Transcriptome and Systematic Analysis Reveal the Major Genes Underlying Isoflavone Accumulation in Soybean. Front. Plant Sci. 2023, 14, 1014349. [Google Scholar] [CrossRef] [PubMed]
  64. D’Agostina, A.; Boschin, G.; Resta, D.; Annicchiarico, P.; Arnoldi, A. Changes of Isoflavones during the Growth Cycle of Lupinus Albus. J. Agric. Food Chem. 2008, 56, 4450–4456. [Google Scholar] [CrossRef] [PubMed]
  65. Dhaubhadel, S.; Gijzen, M.; Moy, P.; Farhangkhoee, M. Transcriptome Analysis Reveals a Critical Role of CHS7 and CHS8 Genes for Isoflavonoid Synthesis in Soybean Seeds. Plant Physiol. 2007, 143, 326–338. [Google Scholar] [CrossRef]
  66. Azam, M.; Zhang, S.; Li, J.; Ahsan, M.; Agyenim-Boateng, K.G.; Qi, J.; Feng, Y.; Liu, Y.; Li, B.; Qiu, L.; et al. Identification of Hub Genes Regulating Isoflavone Accumulation in Soybean Seeds via GWAS and WGCNA Approaches. Front. Plant Sci. 2023, 14, 1120498. [Google Scholar] [CrossRef]
  67. Azam, M.; Zhang, S.; Huai, Y.; Abdelghany, A.M.; Shaibu, A.S.; Qi, J.; Feng, Y.; Liu, Y.; Li, J.; Qiu, L.; et al. Identification of Genes for Seed Isoflavones Based on Bulk Segregant Analysis Sequencing in Soybean Natural Population. Theor. Appl. Genet. 2023, 136, 13. [Google Scholar] [CrossRef]
  68. Feng, Y.; Zhang, S.; Li, J.; Pei, R.; Tian, L.; Qi, J.; Azam, M.; Agyenim-Boateng, K.G.; Shaibu, A.S.; Liu, Y.; et al. Dual-Function C2H2-Type Zinc-Finger Transcription Factor GmZFP7 Contributes to Isoflavone Accumulation in Soybean. New Phytol. 2023, 237, 1794–1809. [Google Scholar] [CrossRef]
  69. Greco, M.; Chiappetta, A.; Bruno, L.; Bitonti, M.B. In Posidonia Oceanica Cadmium Induces Changes in DNA Methylation and Chromatin Patterning. J. Exp. Bot. 2012, 63, 695–709. [Google Scholar] [CrossRef]
  70. Kusano, M.; Tohge, T.; Fukushima, A.; Kobayashi, M.; Hayashi, N.; Otsuki, H.; Kondou, Y.; Goto, H.; Kawashima, M.; Matsuda, F.; et al. Metabolomics Reveals Comprehensive Reprogramming Involving Two Independent Metabolic Responses of Arabidopsis to UV-B Light. Plant J. 2011, 67, 354–369. [Google Scholar] [CrossRef]
  71. Thelen, J.J.; Ohlrogge, J.B. Both Antisense and Sense Expression of Biotin Carboxyl Carrier Protein Isoform 2 Inactivates the Plastid Acetyl-Coenzyme A Carboxylase in Arabidopsis Thaliana. Plant J. 2002, 32, 419–431. [Google Scholar] [CrossRef]
  72. Zhao, K.; Chen, S.; Yao, W.; Cheng, Z.; Zhou, B.; Jiang, T. Genome-Wide Analysis and Expression Profile of the bZIP Gene Family in Poplar. BMC Plant Biol. 2021, 21, 122. [Google Scholar] [CrossRef] [PubMed]
  73. Mor, B.; Garhwal, S.; Kumar, A. A Systematic Review of Hidden Markov Models and Their Applications. Arch. Computat. Methods Eng. 2021, 28, 1429–1448. [Google Scholar] [CrossRef]
  74. Ye, Y.; Nikovics, K.; To, A.; Lepiniec, L.; Fedosejevs, E.T.; Van Doren, S.R.; Baud, S.; Thelen, J.J. Docking of Acetyl-CoA Carboxylase to the Plastid Envelope Membrane Attenuates Fatty Acid Production in Plants. Nat. Commun. 2020, 11, 6191. [Google Scholar] [CrossRef] [PubMed]
  75. Zhao, D.; Gao, F.; Guan, P.; Gao, J.; Guo, Z.; Guo, J.; Cui, H.; Li, Y.; Zhang, G.; Li, Z.; et al. Identification and Analysis of Differentially Expressed Trihelix Genes in Maize (Zea Mays) under Abiotic Stresses. PeerJ 2023, 11, e15312. [Google Scholar] [CrossRef] [PubMed]
  76. Chou, K.-C.; Shen, H.-B. Cell-PLoc: A Package of Web Servers for Predicting Subcellular Localization of Proteins in Various Organisms. Nat. Protoc. 2008, 3, 153–162. [Google Scholar] [CrossRef]
  77. Wang, G.; Wang, T.; Jia, Z.-H.; Xuan, J.-P.; Pan, D.-L.; Guo, Z.-R.; Zhang, J.-Y. Genome-Wide Bioinformatics Analysis of MAPK Gene Family in Kiwifruit (Actinidia Chinensis). Int. J. Mol. Sci. 2018, 19, 2510. [Google Scholar] [CrossRef]
  78. Subramanian, B.; Gao, S.; Lercher, M.J.; Hu, S.; Chen, W.-H. Evolview v3: A Webserver for Visualization, Annotation, and Management of Phylogenetic Trees. Nucleic Acids Res. 2019, 47, W270–W275. [Google Scholar] [CrossRef]
  79. Bailey, T.L.; Boden, M.; Buske, F.A.; Frith, M.; Grant, C.E.; Clementi, L.; Ren, J.; Li, W.W.; Noble, W.S. MEME SUITE: Tools for Motif Discovery and Searching. Nucleic Acids Res. 2009, 37, W202–W208. [Google Scholar] [CrossRef]
  80. Rombauts, S.; Dehais, P.; Van Montagu, M.; Rouze, P. PlantCARE, a Plant Cis-Acting Regulatory Element Database. Nucleic Acids Res. 1999, 27, 295–296. [Google Scholar] [CrossRef]
  81. Xu, F.; He, L.; Gao, S.; Su, Y.; Li, F.; Xu, L. Comparative Analysis of Two Sugarcane Ancestors Saccharum Officinarum and S. Spontaneum Based on Complete Chloroplast Genome Sequences and Photosynthetic Ability in Cold Stress. IJMS 2019, 20, 3828. [Google Scholar] [CrossRef] [PubMed]
  82. Breitler, J.-C.; Campa, C.; Georget, F.; Bertrand, B.; Etienne, H. A Single-Step Method for RNA Isolation from Tropical Crops in the Field. Sci. Rep. 2016, 6, 38368. [Google Scholar] [CrossRef] [PubMed]
  83. He, P.; Wang, H.; Yan, Y.; Zhu, G.; Chen, Z. Development and Application of a Multiplex Fluorescent PCR for Shigella Detection and Species Identification. J. Fluoresc. 2022, 32, 707–713. [Google Scholar] [CrossRef]
  84. Wu, D.; Li, D.; Zhao, X.; Zhan, Y.; Teng, W.; Qiu, L.; Zheng, H.; Li, W.; Han, Y. Identification of a Candidate Gene Associated with Isoflavone Content in Soybean Seeds Using Genome-Wide Association and Linkage Mapping. Plant J. 2020, 104, 950–963. [Google Scholar] [CrossRef] [PubMed]
  85. Li, H.; Wang, X.; Peng, S.; Lai, Z.; Mai, Y. Seasonal Variation of Temperature Affects HMW-PAH Accumulation in Fishery Species by Bacterially Mediated LMW-PAH Degradation. Sci. Total Environ. 2022, 853, 158617. [Google Scholar] [CrossRef]
  86. Kim, D.; Paggi, J.M.; Park, C.; Bennett, C.; Salzberg, S.L. Graph-Based Genome Alignment and Genotyping with HISAT2 and HISAT-Genotype. Nat. Biotechnol. 2019, 37, 907–915. [Google Scholar] [CrossRef] [PubMed]
  87. Kanehisa, M.; Furumichi, M.; Sato, Y.; Kawashima, M.; Ishiguro-Watanabe, M. KEGG for Taxonomy-Based Analysis of Pathways and Genomes. Nucleic Acids Res. 2023, 51, D587–D592. [Google Scholar] [CrossRef]
  88. Cai, Z.; Cheng, Y.; Ma, Z.; Liu, X.; Ma, Q.; Xia, Q.; Zhang, G.; Mu, Y.; Nian, H. Fine-Mapping of QTLs for Individual and Total Isoflavone Content in Soybean (Glycine Max L.) Using a High-Density Genetic Map. Theor. Appl. Genet. 2018, 131, 555–568. [Google Scholar] [CrossRef]
  89. Kim, J.M.; Seo, J.S.; Lee, J.W.; Lyu, J.I.; Ryu, J.; Eom, S.H.; Ha, B.-K.; Kwon, S.-J. QTL Mapping Reveals Key Factors Related to the Isoflavone Contents and Agronomic Traits of Soybean (Glycine Max). BMC Plant Biol. 2023, 23, 517. [Google Scholar] [CrossRef]
  90. Hou, Y.; Zeng, W.; Ao, C.; Huang, J. Integrative Analysis of the Transcriptome and Metabolome Reveals Bacillus Atrophaeus WZYH01-Mediated Salt Stress Mechanism in Maize (Zea Mays L.). J. Biotechnol. 2024, 383, 39–54. [Google Scholar] [CrossRef]
  91. Zhang, W.; Bai, B.; Du, H.; Hao, Q.; Zhang, L.; Chen, Z.; Mao, J.; Zhu, C.; Yan, M.; Qin, H.; et al. Co-Expression of Metabolites and Sensory Attributes through Weighted Correlation Network Analysis to Explore Flavor-Contributing Factors in Various Pyrus Spp. Cultivars. Food Chem. X 2024, 21, 101189. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Phylogenetic analysis of ACC genes from soybean, Arabidopsis thaliana, rice, maize, and peanut. A phylogenetic tree constructed using full-length protein sequences. Different shades of color were used to distinguish different branches of Group I–VII indicating the classification of the ACCase gene family.
Figure 1. Phylogenetic analysis of ACC genes from soybean, Arabidopsis thaliana, rice, maize, and peanut. A phylogenetic tree constructed using full-length protein sequences. Different shades of color were used to distinguish different branches of Group I–VII indicating the classification of the ACCase gene family.
Ijms 25 10221 g001
Figure 2. Conserved motifs, gene structure maps of 20 GmACCs, and cis-elements in the promoter sequences of GmACCs. (A) Phylogenetic tree of 20 GmACCs. (B) Motif composition of 20 GmACCs; the black line indicates the protein length. (C) Gene structures of 20 GmACCs. Black lines indicate introns, green boxes indicate UTR, and blue boxes indicate CDS. (D) Schematic model of 19 cis-elements in the promoter sequences of GmACCs.
Figure 2. Conserved motifs, gene structure maps of 20 GmACCs, and cis-elements in the promoter sequences of GmACCs. (A) Phylogenetic tree of 20 GmACCs. (B) Motif composition of 20 GmACCs; the black line indicates the protein length. (C) Gene structures of 20 GmACCs. Black lines indicate introns, green boxes indicate UTR, and blue boxes indicate CDS. (D) Schematic model of 19 cis-elements in the promoter sequences of GmACCs.
Ijms 25 10221 g002
Figure 3. Chromosomal locations and collinearity analysis of the ACC genes. (A) Chromosomal locations the ACC genes. The scale is in megabases (Mbs). (B) The intrachromosomal segmental duplication map of the GmACCs. Chromosomes 01–20 are represented by yellow rectangles. Along these rectangles, blue lines and a heatmap indicate the gene density on the chromosomes. The red lines represent the segmental duplication pairs between the GmACCs and the gray lines represent the segmental duplication pairs in the whole soybean genome. Different colors on the chromosomes represent gene density, with red indicating high-density regions and blue indicating low-density regions. (C) Collinearity analysis of the ACC genes between soybean and four other plant species. Red lines represent the syntenic ACC gene pairs. Gray lines indicate collinear blocks.
Figure 3. Chromosomal locations and collinearity analysis of the ACC genes. (A) Chromosomal locations the ACC genes. The scale is in megabases (Mbs). (B) The intrachromosomal segmental duplication map of the GmACCs. Chromosomes 01–20 are represented by yellow rectangles. Along these rectangles, blue lines and a heatmap indicate the gene density on the chromosomes. The red lines represent the segmental duplication pairs between the GmACCs and the gray lines represent the segmental duplication pairs in the whole soybean genome. Different colors on the chromosomes represent gene density, with red indicating high-density regions and blue indicating low-density regions. (C) Collinearity analysis of the ACC genes between soybean and four other plant species. Red lines represent the syntenic ACC gene pairs. Gray lines indicate collinear blocks.
Ijms 25 10221 g003
Figure 4. The expression profiles of GmACCs in soybean seeds. (A) The expression profilez of GmACCs during seed development. The figure illustrates the hierarchical clustering and heatmap of the dynamic expression levels of 20 GmACCs in soybean tissues. The vertical bars on the right side of the figure represent the five groups. (B) Relative expression of 16 GmACCs from seed development stages (R5–R7) detected by RT-qPCR. The blue line represents the low isoflavone variety, and the red line represents the high isoflavone variety. Each bar indicates the mean of three repeats. Similar results were obtained from three independent biological experiments.
Figure 4. The expression profiles of GmACCs in soybean seeds. (A) The expression profilez of GmACCs during seed development. The figure illustrates the hierarchical clustering and heatmap of the dynamic expression levels of 20 GmACCs in soybean tissues. The vertical bars on the right side of the figure represent the five groups. (B) Relative expression of 16 GmACCs from seed development stages (R5–R7) detected by RT-qPCR. The blue line represents the low isoflavone variety, and the red line represents the high isoflavone variety. Each bar indicates the mean of three repeats. Similar results were obtained from three independent biological experiments.
Ijms 25 10221 g004
Figure 5. Correlation between GmACCs and isoflavone accumulation. The color of lines refers to Mantel’s r for statistics of corresponding distance correlations, and Edge width of lines represents the statistical significances. The size of the circles indicates the statistical significance, while the color of the circles represents the Spearman’s correlation coefficients for pairwise comparisons of GmACC expression levels.
Figure 5. Correlation between GmACCs and isoflavone accumulation. The color of lines refers to Mantel’s r for statistics of corresponding distance correlations, and Edge width of lines represents the statistical significances. The size of the circles indicates the statistical significance, while the color of the circles represents the Spearman’s correlation coefficients for pairwise comparisons of GmACC expression levels.
Ijms 25 10221 g005
Figure 6. WGCNA reveals modules associated with isoflavone content. (A) Dendrogram of average network adjacency for identifying gene co-expression modules. (B) Analysis of the correlation between gene modules and traits.
Figure 6. WGCNA reveals modules associated with isoflavone content. (A) Dendrogram of average network adjacency for identifying gene co-expression modules. (B) Analysis of the correlation between gene modules and traits.
Ijms 25 10221 g006
Table 1. The Acetyl-CoA carboxylase (ACCase) gene family in soybean.
Table 1. The Acetyl-CoA carboxylase (ACCase) gene family in soybean.
Gene NameGene IDChromosomeLength (aa)MWpISubcellular Localization PredictionGroup
GmACC1Glyma.04G104900.42260252,370.456.00CytoplasmicI
GmACC2Glyma.06G10590062260252,169.995.92CytoplasmicI
GmACC3Glyma.13G3635001329832,437.378.49Plasma Membrane, ChloroplastIII
GmACC4Glyma.15G0103001526929,161.578.81Nuclear, Chloroplast, MitochondrialIII
GmACC5Glyma.18G2435001826227,657.079.47ChloroplastIII
GmACC6Glyma.18G2653001828429,764.278.45ChloroplastIII
GmACC7Glyma.05G221100553958,888.727.22MitochondrialI
GmACC8Glyma.07G137400710711,530.569.03MitochondrialI
GmACC9Glyma.08G027600853958,807.607.22MitochondrialI
GmACC10Glyma.18G1957001870978,666.607.64Mitochondrial, CytoplasmicV
GmACC11Glyma.18G1959001869076,918.928.63Mitochondrial, CytoplasmicV
GmACC12Glyma.18G1960001868375,989.918.88Cytoplasmic, MitochondrialV
GmACC13Glyma.10G20890010343865.8310.30Mitochondrial, Chloroplast, NuclearIV
GmACC14Glyma.15G00380015343847.7610.00Mitochondrial, ChloroplastIV
GmACC15Glyma.15G2485001512614,897.904.51NuclearII
GmACC16Glyma.09G248900926127,536.949.37ChloroplastIII
GmACC17Glyma.13G0574001327628,870.488.69ChloroplastIII
GmACC18Glyma.11G2337001129732,064.759.01MitochondrialIII
GmACC19Glyma.18G0233001829131,540.099.10MitochondrialIII
GmACC20Glyma.19G0288001928029,326.888.16ChloroplastIII
Table 2. The duplication events of GmACCs identified in soybean.
Table 2. The duplication events of GmACCs identified in soybean.
No.SequenceDuplication
Type
KaKsKa/KsDivergence Time (MYA)
1GmACC1 and GmACC2Segmental0.0130.0930.1393.116
2GmACC3 and GmACC4Segmental0.0340.0640.5282.134
3GmACC6 and GmACC20Segmental0.2000.5430.36818.085
4GmACC7 and GmACC9Segmental0.0080.1240.0664.141
5GmACC16 and GmACC5Segmental0.0170.0900.1933.002
6GmACC17 and GmACC6Segmental0.1830.5520.33118.416
7GmACC18 and GmACC19Segmental0.0490.1040.4663.476
8GmACC17 and GmACC20Segmental0.0460.0770.6012.569
9GmACC10 and GmACC11Tandem0.0370.0720.5172.393
10GmACC11 and GmACC12Tandem0.0140.0540.2591.798
11GmACC10 and GmACC12Tandem0.0350.0750.4732.501
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wu, X.; Yang, Z.; Zhu, Y.; Zhan, Y.; Li, Y.; Teng, W.; Han, Y.; Zhao, X. Bioinformatics Identification and Expression Analysis of Acetyl-CoA Carboxylase Reveal Its Role in Isoflavone Accumulation during Soybean Seed Development. Int. J. Mol. Sci. 2024, 25, 10221. https://doi.org/10.3390/ijms251810221

AMA Style

Wu X, Yang Z, Zhu Y, Zhan Y, Li Y, Teng W, Han Y, Zhao X. Bioinformatics Identification and Expression Analysis of Acetyl-CoA Carboxylase Reveal Its Role in Isoflavone Accumulation during Soybean Seed Development. International Journal of Molecular Sciences. 2024; 25(18):10221. https://doi.org/10.3390/ijms251810221

Chicago/Turabian Style

Wu, Xu, Zhenhong Yang, Yina Zhu, Yuhang Zhan, Yongguang Li, Weili Teng, Yingpeng Han, and Xue Zhao. 2024. "Bioinformatics Identification and Expression Analysis of Acetyl-CoA Carboxylase Reveal Its Role in Isoflavone Accumulation during Soybean Seed Development" International Journal of Molecular Sciences 25, no. 18: 10221. https://doi.org/10.3390/ijms251810221

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop