Next Article in Journal
Thermodynamic Factors Controlling Electron Transfer among the Terminal Electron Acceptors of Photosystem I: Insights from Kinetic Modelling
Previous Article in Journal
The Serine Acetyltransferase (SAT) Gene Family in Tea Plant (Camellia sinensis): Identification, Classification and Expression Analysis under Salt Stress
Previous Article in Special Issue
TXNDC5 Plays a Crucial Role in Regulating Endoplasmic Reticulum Activity through Different ER Stress Signaling Pathways in Hepatic Cells
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

NF-κB Transcription Factors: Their Distribution, Family Expansion, Structural Conservation, and Evolution in Animals

by
Siphesihle Msweli
1,
Suresh B. Pakala
1,2 and
Khajamohiddin Syed
1,*
1
Department of Biochemistry and Microbiology, Faculty of Science, Agriculture and Engineering, University of Zululand, KwaDlangezwa 3886, South Africa
2
Department of Biochemistry, School of Life Sciences, University of Hyderabad, Hyderabad 500-046, India
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2024, 25(18), 9793; https://doi.org/10.3390/ijms25189793
Submission received: 29 July 2024 / Revised: 5 September 2024 / Accepted: 8 September 2024 / Published: 10 September 2024
(This article belongs to the Special Issue Advances in Endoplasmic Reticulum Stress and Apoptosis)

Abstract

:
The Nuclear Factor Kappa B (NF-κB) transcription factor family consists of five members: RelA (p65), RelB, c-Rel, p50 (p105/NF-κB1), and p52 (p100/NF-κB2). This family is considered a master regulator of classical biochemical pathways such as inflammation, immunity, cell proliferation, and cell death. The proteins in this family have a conserved Rel homology domain (RHD) with the following subdomains: DNA binding domain (RHD-DBD) and dimerization domain (RHD-DD). Despite the importance of the NF-κB family in biology, there is a lack of information with respect to their distribution patterns, evolution, and structural conservation concerning domains and subdomains in animals. This study aims to address this critical gap regarding NF-κB proteins. A comprehensive analysis of NF-κB family proteins revealed their distinct distribution in animals, with differences in protein sizes, conserved domains, and subdomains (RHD-DBD and RHD-DD). For the first time, NF-κB proteins with multiple RHD-DBDs and RHD-DDs have been identified, and in some cases, this is due to subdomain duplication. The presence of RelA/p65 exclusively in vertebrates shows that innate immunity originated in fishes, followed by amphibians, reptiles, aves, and mammals. Phylogenetic analysis showed that NF-κB family proteins grouped according to animal groups, signifying structural conservation after speciation. The evolutionary analysis of RHDs suggests that NF-κB family members p50/p105 and c-Rel may have been the first to emerge in arthropod ancestors, followed by RelB, RelA, and p52/p100.

1. Introduction

NF-κB is a family of transcription factors first discovered four decades ago as a regulator of immunoglobin kappa light-chain expression in mature B cells and plasma cells [1]. It is expressed in almost all cell types and tissues and is crucial in maintaining cellular homeostasis. NF-κB is inducible in most cells and is also found as a constitutively active protein in mature B cells, macrophages, neurons, vascular smooth muscle cells, and tumor cells [2]. In resting cells, NF-κB dimer is sequestered in the cytoplasm by inhibitors called IκBs, which mask its nuclear localization signal. However, when stimulated with various signals such as inflammatory cytokines, growth factors, and infectious microbes, IκBs are phosphorylated and ubiquitinated, allowing NF-κB dimers to localize to the nucleus to regulate the transcription of target genes [3]. NF-κB regulates many biological processes, including immune and inflammatory responses, developmental processes, cell proliferation, and apoptosis [4] (Table 1). Dysregulation of NF-κB can lead to cancer, autoimmune diseases, and chronic inflammatory disorders [5,6,7,8]. NF-κB activation affects cancer and inflammatory diseases by regulating the transcription of target genes involved in cell proliferation, angiogenesis, inflammation, tumor promotion, and metastasis [9] (Table 1). Additionally, NF-κB plays a role in cancer progression by interacting with other transcription factors such as STAT3 (signal transducer and activator of transcription 3), p53 (tumor protein p53), ATF3 (activating transcription factor 3), and SMAD3/4 (Suppressor of Mothers against Decapentaplegic 3/4), which then bind to the promoter regions of their target genes, regulating their expression patterns [10,11]. Furthermore, GSK-3β has been found to modulate NF-κB activity in glioma, prostate, and colon cancer to trigger cell proliferation [12,13,14,15,16].
There are five NF-κB family proteins: RelA (p65), RelB, c-Rel, p50 (p105/NF-κB1), and p52 (p100/NF-κB2) [25] (Figure 1). The p105 and p100 undergo post-translational modification to generate p50 and p52, respectively [2] (Figure 1). These family members regulate the transcription of target genes involved in different physiological and pathological processes by recruiting onto their promoter sequences, i.e., κB consensus DNA sequence, as various homo or hetero dimers [26] (Table 1).
The NF-κB signaling pathway can be activated through two major pathways: canonical and noncanonical. The canonical pathway is a key regulator of the inflammatory response, triggered by Toll-like receptors (TLRs), tumor necrosis factor α (TNF-α), interleukin 1 (IL-1), and various pathogens. This pathway results in the activation of RelA, which governs the expression of proinflammatory and cell survival genes. Canonical NF-κB signaling provides a rapid response to acute inflammatory cues. It mediates the innate immune response under conditions such as inflammatory bowel disease (IBD), rheumatoid arthritis (RA), and chronic obstructive pulmonary disease (COPD) [2,3,27]. On the other hand, the noncanonical NF-κB pathway is activated by lymphotoxin β (LTβ), CD40 ligand (CD40L), B-cell activating factor (BAFF), and receptor activator of nuclear factor kappa-B ligand (RANKL), resulting in the activation of RelB/p52 complexes. Noncanonical signaling responses are typically slow and sustained, which is characteristic of immune dysregulation. Activation of the noncanonical pathway is associated with the adaptive immune response, which regulates genes essential for lymph-organogenesis and B-cell activation. Dysregulation of this pathway results in lymphoid malignancies and other immune disorders [28,29]. These pathways are characterized by their requirement for IκB kinase (IKK) subunits. IKKβ regulates the activation of the canonical pathway through the phosphorylation of IkBs, requiring an IKKγ subunit, while IKKα is necessary for the activation of the noncanonical pathway [30]. Despite their different signaling mechanisms, both pathways are crucial for controlling immune and inflammatory responses [3], contributing to innate and acquired immunity [31] (Table 1).
The NF-κB proteins can be divided into two subclasses: NF-κB proteins (p52/p100 and p50/p105) and Rel proteins (p65/RelA, c-Rel, and RelB) [25] (Figure 1). They all have a conserved N-terminal region, known as the Rel homology domain (RHD), which contains the subdomains DNA binding domain (RHD-DBD) and dimerization domain (RHD-DD). This RHD is responsible for the homodimerization and heterodimerization of NF-κB proteins [25] (Figure 1). Additionally, members of the NF-κB subclass (p50 and p52) have a C-terminal inhibitory domain made up mainly of Ankyrin (ANK) repeats [32,33], while the C-terminus of RelA, RelB, and c-Rel contains a transcription activation domain (TAD) [34,35] (Figure 1). The TAD of NF-κB plays a crucial role in recruiting co-activators such as p300 and CBP, involved in histone deacetylation and chromatin remodeling, to facilitate gene transcription [36,37]. Phosphorylation of TAD at Ser276 and Ser536 enhances the recruitment of elongation factors like Brd4 and P-TEFb, which are involved in mRNA processing and transcriptional elongation [37,38]. These modifications induce conformational changes in the TAD, allowing it to interact more effectively with other proteins involved in the transcription of target genes. By facilitating the interaction of NF-κB with specific cofactors, the TAD influences the transcription of genes involved in inflammation, immune responses, and cell survival [37,38]. Further, its interaction with SMAD3/4 proteins also regulates signaling pathways affecting bone development and differentiation [39]. The proteins p50 and p52 lack TAD and hence act as transcriptional repressors as homodimers but activate transcription when heterodimerized with transactivating Rel subunit [25,34,35]. Finally, the N terminus of RelB contains a leucine zipper (LZ) motif, which is critical for transcriptional regulation [40]. Whether the LZ gives RelB any extra functional specificity, such as the ability to heterodimerize with NF-κB2 or other potential partners, remains unknown.
It has been suggested that NF-κB proteins were formed through the fusion of ANK repeat protein and RHD protein, followed by gene splitting and duplication events, which eventually led to the present-day NF-κB/Rel proteins [41]. The evolution of NF-κB proteins appears to have occurred after the split between mammals and fungi [42]. In a study aiming to identify the origin of NF-κB/Rel transcription factors in animals, researchers analyzed the presence of NF-κB/Rel protein domains in a handful of metazoan species and non-metazoans [42]. Other than these two studies, a comprehensive analysis of NF-κB proteins, their distribution patterns, and how these families have evolved, particularly in terms of the RHD and its subdomains RHD-DBD and RHD-DD, has not been previously reported. Therefore, our aim was to address this research gap, which also provides insights into the origins of innate immunity in different taxa.

2. Results and Discussion

2.1. p50/p105 Proteins Are Present in Most of the Animals

The analysis of genome data and annotation of NF-κB family proteins revealed that there are a total of 5860 NF-κB proteins present in 2819 animals (refer to Table S1). Among the NF-κB family, c-Rel has the highest number of members with 2463, followed by p50/p105 with 1401, p52/p100 with 1088, RelA/p65 with 522, and RelB with the least number of members at 386 (refer to Table S1). Overall, the Rel subfamily has more proteins compared to the NF-κB subfamily. On average, there are approximately two proteins for each NF-κB family member in every species (refer to Table S1). A total of 1716 duplicates were identified, with the majority being c-Rel proteins at 618 duplicates, followed by p50/p105 proteins at 462 duplicates, p52/p100 at 344 duplicates, RelA/p65 at 150 duplicates, and RelB at 142 duplicates. Detailed information regarding the number of species and NF-κB proteins is presented in Table S1. The NF-κB protein sequences, identification numbers, and respective domains are presented in Supplementary Dataset S1.
It is well-known that the ancestral vertebrate genome underwent two rounds of whole genome duplication, and consequently, vertebrates have more genes than invertebrates [43]. In concurring with this fact, we observed multiple identical copies of NF-κB family genes/proteins within the same species in many animals (Supplementary Dataset S1). For further analysis, we selected one of the duplicates as a representative, as the duplicates did not affect the outcome. After removing duplicates, the total number of NF-κB family proteins was 4144 in 2819 animal species (Figure 2A and Table S1). Comparative analysis of NF-κB proteins showed that c-Rel was the dominant family with 1845 members, followed by p50/p105 with 939 members, p52/p100 with 744 members, RelA/p65 with 372 members, and RelB with 244 members (Figure 2A). The average number of proteins in each NF-κB family confirmed that the c-Rel family is more prevalent in animals, with an average of two c-Rel proteins present in each species compared to a single protein for the other four NF-κB families (Figure 2A and Table S1). Even after removing the duplicates, there was still a high number of NF-κB family proteins, indicating diversification of these protein family members.
Genome-wide analysis revealed the presence of c-Rel proteins in five different classes of the phylum Chordata: Fishes, Amphibia, Reptilia, Mammalia, and Aves, as well as in three other phyla: Mollusca, Annelida, and Arthropoda (Figure 2B). Arthropoda has the highest number of c-Rel proteins, with 1056, while Annelida has the least, with just six. Among the different animal groups, Arthropoda has the highest average c-Rel proteins per species (three proteins), followed by Fishes and Amphibia with two proteins, and Reptilia, Aves, Mammalia, Mollusca, and Annelida each having a single c-Rel protein (Figure 2C). This suggests that chordates such as Fishes and Amphibia have more c-Rel proteins in their genomes than Aves, Reptilia, and Mammalia.
The function of c-Rel includes regulating immune responses through innate and acquired immunity [44]. It promotes innate immune response by producing cytokines like IL-12 and IL-23 and activating macrophages and dendritic cells against microbial infections [44]. Additionally, c-Rel promotes T cell activation and proliferation, allowing T cells to differentiate into Th1 (T helper 1) cells, which combat intracellular pathogens. c-Rel also plays a role in coordinating the adaptive immune response by controlling the expression of different cytokines and chemokines [44]. Studies have shown that c-Rel and RelA/p65 differ functionally, particularly in their affinity for binding DNA. This difference may have enabled c-Rel’s neofunctionalization and allowed it to assume specialized roles in immune regulation [45]. Furthermore, because c-Rel can bind to non-consensus DNA motifs, it can more effectively activate the transcription of immune-related genes, which is crucial for controlling adaptive immunity in vertebrates [45].
Although most research on c-Rel has focused on vertebrates, insights from model organisms such as Caenorhabditis elegans indicate that immune signaling pathways have ancient origins. Despite lacking TLRs, C. elegans has evolved other immune pathways, demonstrating a distinct evolutionary strategy for pathogen defense. This suggests that the evolutionary mechanisms of the immune response are diverse throughout the tree of life despite the critical role that c-Rel and its counterparts play in vertebrates [45]. Research comparing various species, such as bony fish and cartilaginous fish, suggests that c-Rel has remained essential throughout evolution, adapting to meet the unique immunological demands of various organisms [44,46,47]. The identification of c-Rel in both vertebrate and invertebrate species (Figure 2C) strongly supports its critical role in these organisms, as discussed above.
The p50/p105 proteins were present in all five classes of Chordata and six other phyla of animals (Figure 2B). However, the phyla Platyhelminthes and Nematoda did not have p50/p105 proteins (Figure 2B). Among the different groups, Aves had the highest number of p50/p105 proteins, with 411, followed by Mammalia, with 334 (Figure 2D). Arthropoda had the lowest number of p50/p105 proteins, with only four. On average, fishes had three p50/p105 proteins in each species, while Porifera, Cnidaria, Mammalia, and Echinodermata had an average of two proteins (Figure 2D). The remaining animal groups had an average of one p50/p105 protein in each species (Figure 2D).
Genome-wide analysis revealed the presence of RelA/p65 proteins in five classes of Chordata: Fishes, Amphibia, Reptilia, Aves, and Mammalia (Figure 2B). On average, each of the five Chordata classes had one RelA/p65 protein (Figure 2E). RelA/p65 proteins were more abundant in Mammalia, with 287 proteins, while Fishes had the lowest number of proteins (one protein) (Figure 2E). Notably, RelA/p65 proteins are only present in Chordata classes and are not found in other animal species (Figure 2B). RelA/p65 is an important protein that is activated when pathogens are recognized by cellular pattern recognition receptors (PRRs) like TLRs and nucleotide-binding oligomerization domain-like receptors (NOD-like receptors). This activation leads to the expression of several downstream cytokines and effector molecules that are involved in innate immune response [48,49,50]. It is interesting to note that NF-κB signaling pathways integrate information from various PRRs to tailor inflammatory responses based on the identity and virulence of the pathogen [48,49,50]. The regulation of RelA/p65s is dynamic, and it controls the transcription of pro-inflammatory cytokines and other genes involved in cellular processes, including cell survival, apoptosis, and pathogen clearance in response to infections to offer innate immunity [3,11,48]. Furthermore, the expression levels of RelA/p65 maintain the balance between pro- and anti-inflammatory responses, thereby preventing chronic inflammation and autoimmune diseases [3]. These findings suggest that innate immunity might have originated in vertebrates, initially in Fishes, followed by Amphibians, Reptiles, Aves, and Mammals.
The p52/p100 proteins were found in only four classes of Chordata: Amphibia, Reptilia, Mammalia, and Aves, and were not present in other animal species (see Figure 2B). Among these classes, p52/p100 proteins were not found in Fishes (see Figure 2B). On average, only one p52/p100 protein was found per species in all animal groups, except for Amphibia, which had an average of two proteins (see Figure 2F). Aves had the highest number of p52/p100 proteins, with 417 proteins, while Amphibia had the least, with 25 members (see Figure 2F).
In the study, it was found that RelB proteins were only present in mammals among all the animal groups analyzed (Figure 2B). A total of 244 RelB proteins were identified in 183 species, indicating an average of one protein in each species (Figure 2A and Table S1). RelB is an important transcription factor in the NF-κB family, playing a crucial role in regulating adaptive immunity by modulating the development and activation of dendritic cells (DCs) and T-cell responses. It is essential for the differentiation and function of dendritic cells, which are antigen-presenting cells that link innate and adaptive immunity [3,51,52]. Without RelB, the ability of DCs’ to present antigens to T cells is compromised, affecting the adaptive immune response. Additionally, RelB is necessary for the development of the thymus, where T cells mature. The absence of RelB leads to defects in T-cell activation, cytokine production, and overall immune regulation [3,51,52]. This analysis determined that RelB is exclusively present in mammals, suggesting that RelB-modulated adaptive immunity is limited to mammals.

2.2. Some NF-κB Proteins Have Multiple RHD Subdomains

All NF-κB protein families have members with multiple RHD-DBDs and RHD-DDs (see Figure 1 and Table 2). There are 31 c-Rel proteins, five p50/p105 proteins, four RelA/p65 proteins, three p52/p100 proteins, and four RelB proteins with more than one RHD-DBD and/or RHD-DD (Table 2). The highest number of c-Rel proteins with multiple domains were found in Arthropoda (24 proteins), followed by Mammalia (three proteins) and Aves (two proteins), and a single protein was found in Reptilia and Mollusca. Two p50/p105 proteins with multiple RHD subdomains were found in Mammalia, and a single protein was found in Cnidaria, Reptilia, and Mollusca. Two RelA/p65 proteins with multiple RHD subdomains were identified in each Mammalia and Aves. Two and one p52/p100 proteins with multiple RHD subdomains were identified in Mammalia and Aves, respectively, and four RelB proteins with multiple RHD subdomains were identified in Mammalia (Table 2).
The percentage identity among these multiple RHD subdomains reveals that most of these domains share less percentage identity, except for some that are duplicated (two RHD-DBD and one RHD-DD have 100% identity, and four RHD-DBD and three RHD-DD have more than 85% identity) (Table 2). This result indicates that these NF-κB proteins can recognize/bind more than one DNA sequence (in case of multiple RHD-DBD) or may have binding affinity to more than one NF-κB protein (in case of multiple RHD-DD). However, experimental evidence is needed to unravel the role of these multiple RHD subdomains in these proteins.

2.3. NF-κB Proteins Conserved Domains and Subdomains Differ in Their Size

The NF-κB family members share many conserved domains in their structures [53]. This raises the question of whether there are any differences in the proteins and domain sizes. To address this, a comprehensive analysis of the five different NF-κB family members was conducted to identify variations in complete protein lengths and the RHD and its subdomains, RHD-DBD and RHD-DD (refer to Figure 3 and Tables S2 and S3). The analysis revealed significant differences in protein sizes, RHD domain sizes, and sizes of its subdomains (RHD-DBD and RHD-DD) among all NF-κB family members. Notably, there was no significant difference in the RHD-DD size between the c-Rel and p52/p100 members (refer to Figure 3 and Table S2). Among the five NF-κB families, p50/p105 proteins displayed the largest protein lengths and RHD sizes (RHDs, RHD-DBDs, and RHD-DDs). In contrast, the RelA/p65 proteins exhibited smaller sizes in proteins, RHDs, RHD-DBDs, and RHD-DDs (refer to Figure 3 and Tables S2 and S3). The amino acid sequences of NF-κB family proteins, RHDs, RHD-DBDs, and RHD-DDs are presented in Supplementary Dataset S1.

2.4. RelB and c-Rel Have the Most and Least Conserved Amino Acids in the RHDs

An interesting pattern of amino acid conservation was observed for NF-κB family members in relation to RHDs (Table 3). The conservation of amino acids was not dependent on sample size (number of proteins) or the presence of the NF-κB family proteins in the different animal groups. This indicates that amino acid conservation is characteristic of each NF-κB family behavior.
Upon analyzing the amino acid conservation in the NF-κB RHDs, it was found that the RelB family had the most conserved amino acids, followed by p52/p100, RelA/p65, p50/p105, and c-Rel (Table 3). These findings suggest that the RelB family proteins underwent fewer amino acid changes, while c-Rel family proteins underwent more changes. Interestingly, despite p52/p100 and RelA/p65 being identified in vertebrates, p52/p100 ranked second regarding amino acid conservation. This suggests that the observed changes are real and not dependent on population size, as 741 p52/p100 proteins were analyzed compared to only 368 RelA/p65 proteins (Table 3).
However, a different pattern was observed for RHD-DBD and RHD-DD (Table 3). For RHD-DBD, the RelB family had the highest conserved amino acids, followed by p52/p100, RelA/p65, c-Rel, and p50/p105 (Table 3). On the other hand, for RHD-DD, the RelB family had the highest conserved amino acids, followed by RelA/p65, p52/p100, p50/p105, and c-Rel (Table 3). These differences in RHD-DBD and RHD-DD suggest that these proteins have specific regions for binding to DNA, which leads to the regulation of different genes.
In conclusion, these results indicate that c-Rel proteins mutate or evolve faster than other NF-κB family members.

2.5. p50/p105 and c-Rel Might Have Been the Earliest NF-κB Family Members to Emerge

The evolutionary analysis of Rel subfamily proteins showed that proteins from the same taxonomic groups clustered together on the phylogenetic tree except for c-Rel (Figure 4). This indicates the conservation of primary amino acid sequences after speciation. The phylogenetic tree analysis revealed that c-Rel proteins were grouped in three places (c-Rel 1–3) (Figure 4), and it is evident that the c-Rel proteins evolved and then diverged into two branches (Figure 4). Branch 1 contains various c-Rel members, mainly from arthropods, two from mollusks, one from annelids, and a single vertebrate member from a mammal, while branch 2 further evolved into RelB and RelA/p65 proteins and two more groups of c-Rel (c-Rel 2 and 3). The second c-Rel group (c-Rel 2) includes members of mollusks, annelids, mammals, and fishes, and the third group (c-Rel 3) includes only vertebrate members of fishes, amphibians, reptiles, mammals, and aves. One interesting observation was one RelA/p65 protein from the Aves at the base of branch 1, indicating its evolutionary linkage with c-Rel proteins (Figure 4).
As observed for Rel proteins, the p50/p105 and p52/p100 proteins from the same taxonomic groups clustered on the phylogenetic tree with a few exceptions, indicating the conservation of primary amino acid sequences after speciation (Figure 5). The tree has two major branches (Figure 5). The first branch mainly consists of p50/p105 invertebrate members (echinoderms, poriferans, cnidarians, mollusks, annelids, and arthropods) and fishes (p50/p105 1), while the second branch includes p52/p100 and p50/p105 members from vertebrates (p50/p105 2) (Figure 5). This indicates that p50/p105s appeared first and then evolved into p52/p100. The first branch consists of various p50/p105 invertebrate members and fishes, while the second branch evolved into different vertebrate taxonomic groups from p52/p100 and p50/p105 (Figure 5).
All NF-κB family members have a common RHD, indicating that they originated from a single RHD [41]. Then, the RHD can combine with different domains, leading to the diversification of NF-κB family members [41]. Thus, the RHD may retain evolutionary traces of NF-κB family members. To understand this better, we analyzed the phylogenetic relationship of RHDs from different NF-κB members (see Figure 6). Analysis revealed that RHDs from the same NF-κB family members tend to gather together on the tree, suggesting their evolutionary conservation after diversification (Figure 6). Notably, the RHDs of p50/p105 are clustered into five groups on the tree, named p50/p105 1–5 (Figure 6). Members of the first group (p50/p105 1) are from echinoderms, poriferans, cnidarians, and arthropods. The second group (p50/p105 2) consisted of members of annelids, and the third group (p50/p105 3) included members from annelids and mollusks. The fourth group (p50/p105 4) included echinoderms, cnidarians and fishes members. The fifth group (p50/p105 5) included only vertebrate members (fishes, amphibians, reptiles, mammals, and aves) (Figure 6). RHDs of c-Rel were clustered into two large groups (c-Rel 1–2). Members of the first c-Rel group (c-Rel 1) were from arthropods, mollusks, and annelids (invertebrates), while members of the second group (c-Rel 2) were mostly from vertebrates (mammals, amphibians, reptiles, aves and fishes) (Figure 6). Based on the evolutionary analysis of their RHDs, we propose a possible evolutionary scenario for NF-κB family members, suggesting that p50/p105 and c-Rel may have been the earliest NF-κB family members to emerge in an arthropod ancestor, followed by the emergence of RelB, RelA, and p52/p100 at a later stage.

3. Materials and Methods

3.1. Reference NF-κB Family Members Used in This Study

The NF-κB proteins, which belong to five NF-κB family members (see Table 4), were used as reference proteins for genome data mining of NF-κB proteins in animals. Initially, many protein sequences were collected from the Dr. Thomas Gilmore website (https://www.bu.edu/nf-kb/the-gilmore-lab/, last accessed on 26 May 2024) for each of the NF-κB family members. Multiple sequence alignments were then carried out using Clustal Omega [54] analysis to determine the percentage identity among the collected sequences. A representative for each of the NF-κB family members was selected for each of the five families, whereas for NF-κB family members, c-Rel and p50/p105, two reference sequences were selected as representatives for vertebrates and invertebrates (Table 4). NF-κB family members from Homo sapiens showed more than 50% identity to the analyzed sequences and were therefore chosen as reference proteins for data mining for homolog proteins. However, for c-Rel and p50/p105, H. sapiens members have less than 50% identity to the invertebrates. As a result, one of the invertebrate NF-κB members was chosen as an additional reference for data mining. The rationale behind this decision is that these invertebrate NF-κB family members, which are closely related to their group, can potentially aid in the discovery of homolog NF-κB members.

3.2. Selection of Suitable Method for Analyzing NF-κB Family Characteristic Domains

Although the characteristic domains of the NF-κB family were well-defined [53], for easy identification and screening of homologous proteins on a large scale, we conducted conserved domain analysis of NF-κB reference protein sequences (Table 4) using the National Center for Biotechnology Information (NCBI) Batch Web CD-Search Tool [55] and the Hidden Markov Model Scan (HMMSCAN) [56,57] at the HMMER website [58]. This analysis helped us identify the type and pattern of domains or domain protein families (Figure 7). The Batch Web CD-Search analysis clearly distinguished between NF-κB families as the characteristic domains for each of the NF-κB families are unique, as shown in Figure 7. However, the HMMSCAN was unable to do this. As a result, we used the NCBI Batch Web CD-Search analysis to identify NF-κB family members in this study.

3.3. Genome Data Mining and Annotation of NF-κB Proteins in Animals

A protein BLAST (Basic Local Alignment Search Tool) [59] was performed at NCBI using the reference protein sequences from Table 4 against each of the taxonomic identification numbers (taxon ID) from Table S4 for each animal group. The reason for using NCBI taxon ID is to ensure that the resulting hit proteins are only from the specific animal group, making it easier to sort and analyze NF-κB family statistics for that group. Information on species used for genome data mining of homolog NF-κB proteins in this study, including their respective NCBI taxonomy identification numbers (Taxa ID), number of species, and the protein hits, are shown in Table S4. Protein BLAST was performed with a maximum option of 5000 hits. The hit sequences were then checked for the presence of NF-κB family characteristics domains (Figure 7) using the NCBI Batch Web CD-Search Tool [55] as mentioned in Section 3.2. Hit sequences with NF-κB family characteristics domains (Figure 7) were selected for further analysis and removed those without these domains. The proteins that had NF-κB family characteristics domains were subjected to duplicate analysis using Clustal Omega [54]. Sequences that shared 100% identity were considered duplicates, and only one of each duplicate sequence was included in the study for further analysis.
In our study, we also analyzed the NF-κB/Rel proteins found in non-metazoans from the literature [42] in order to classify them into their respective NF-κB protein families using the method detailed in Section 3.2. However, we were unable to identify the characteristic domains of the NF-κB family, as discussed in Section 3.2. Therefore, we could not assign these proteins to their respective NF-κB families. As a result, these proteins were not included in the study.

3.4. Generation of NF-κB Profile Heat-Map

The heat map profile illustrating the presence and absence of NF-κB protein families across different animal groups was created using a method described previously [60]. A tab-delimited file was imported into Mev (Multi-experiment viewer) [61], and the data was clustered using a hierarchical clustering algorithm with an Euclidean distance metric. NF-κB family members were represented on the vertical axis, while animal groups were displayed on the horizontal axis.

3.5. Phylogenetic Analysis of NF-κB Proteins

Phylogenetic analysis of the NF-κB family members and their RHDs was conducted using a method described in a published article [60]. First, the protein sequences were aligned with the MAFFT v6.864 program [62], which is available on the T-REX web server [63]. The alignments were then automatically subjected to interpret the best tree using the maximum likelihood method available on the T-rex web server [63]. Finally, the best-inferred tree was visualized, colored, and generated by the Interactive Tree of Life (iTOL) [64].

3.6. Identification of RHD, RHD-DBD and RHD-DD in NF-κB Proteins

The NF-κB RHDs, RHD-DBDs, and RHD-DDs were identified using the MOTIF search tool (https://www.genome.jp/tools/motif/, last accessed on 26 August 2024). Detailed information on NF-κB protein sizes and the sizes of RHDs, RHD-DBDs, and RHD-DDs can be found in Table S2, and the respective sequences are provided in Supplementary Dataset S1.

3.7. Analysis of Amino Acid Conservation

The number of amino acids conserved in the RHDs, RHD-DBDs, and RHD-DDs was analyzed following the method previously described [65]. The number of amino acids conserved was analyzed using PROMALS3D (Profile Multiple Alignment with predicted Local Structures and 3D constraints) [66]. The NF-κB protein sequences that had more than one RHD-DBD and RHD-DD (Table S2) were not included for analysis of amino acid conservation. PROMALS3D is an alignment tool that aligns different protein sequences using homologous crystal structures based on secondary structure prediction. The output alignment assigns numbers as conservation indices ranging from 4 to 9, where 9 is the most consistently conserved amino acid among the protein sequences analyzed [66]. The ranking was based on the number of conserved amino acids, with the NF-κB family member with the highest number of conserved amino acids given first rank, indicating that this family member is subjected to fewer mutations or high conservation during evolution.

3.8. Statistical Analysis of the NF-κB Proteins

Statistical analysis was performed on the sizes (number of amino acids) of NF-κB family proteins and sizes of RHDs, RHD-DBDs, and RHD-DDs using Welch’s t-test (https://www.statskingdom.com/150MeanT2uneq.html, last accessed on 15 July 2024) following a method described elsewhere [67]. This analysis aimed to determine if the differences between NF-κB family proteins are statistically significant. The analysis used the average number of amino acids, standard deviation, and sample size (number of NF-κB proteins) as input data (refer to Table S3). The results of the statistical analysis are presented in Table S3.

4. Conclusions

This study analyzes the five NF-κB family members involved in different biological processes. It shows that these proteins have unique distribution patterns across different animal groups, with significant differences in sizes, domains, subdomains, and structural conservation. The presence of RelA/p65 only in vertebrates suggests that innate immunity might have evolved primarily in fishes, followed by amphibians, reptiles, birds, and mammals. NF-κB family proteins group according to animal groups, indicating structural conservation after speciation. The study suggests that p50/p105 and c-Rel might be the earliest NF-κB family members in arthropod ancestors, with RelB, RelA/p65, and p52/p100 emerging later. Additionally, the study identified NF-κB family proteins with multiple RHD-DBDs and RHD-DDs in various species, but further research is needed to understand their functional significance.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/ijms25189793/s1; refs in Table S4 are cited as [68,69,70,71].

Author Contributions

Conceptualization, K.S.; methodology, S.M., S.B.P. and K.S.; software, S.M., S.B.P. and K.S.; validation, S.M., S.B.P. and K.S.; formal analysis, S.M., S.B.P. and K.S.; investigation, S.M., S.B.P. and K.S.; resources, S.M., S.B.P. and K.S.; data curation, S.M., S.B.P. and K.S.; writing—original draft preparation, S.M., S.B.P. and K.S.; writing—review and editing, S.M., S.B.P. and K.S.; visualization, S.M., S.B.P. and K.S.; supervision, K.S.; project administration, K.S.; funding acquisition, K.S. All authors have read and agreed to the published version of the manuscript.

Funding

Khajamohiddin Syed sincerely thanks the University of Zululand, South Africa, for funding his research (Grant number P419). Professor Suresh Babu Pakala thanks the University of Zululand, South Africa, for appointing him as a visiting fellow. Siphesihle Msweli thanks the National Research Foundation (NRF), South Africa, for a postgraduate bursary (Grant number: PMDS22051812140).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data generated in the study is provided in the manuscript and Supplementary Materials.

Acknowledgments

We thank Wanping Chen from the Department of Molecular Microbiology and Genetics, University of Göttingen, 37077 Göttingen, Germany, for assisting with phylogenetic tree analysis.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the study’s design, data collection, analysis, interpretation, manuscript writing, or decision to publish the results.

References

  1. Sen, R.; Baltimore, D. Multiple nuclear factors interact with the immunoglobulin enhancer sequences. Cell 1986, 46, 705–716. [Google Scholar] [CrossRef] [PubMed]
  2. Oeckinghaus, A.; Ghosh, S. The NF-kappaB family of transcription factors and its regulation. Cold Spring Harb. Perspect. Biol. 2009, 1, a000034. [Google Scholar] [CrossRef] [PubMed]
  3. Liu, T.; Zhang, L.; Joo, D.; Sun, S.C. NF-κB signaling in inflammation. Signal Transduct. Target. Ther. 2017, 2, 17023. [Google Scholar] [CrossRef] [PubMed]
  4. Gilmore, T.D.; Wolenski, F.S. NF-κB: Where did it come from and why? Immunol. Rev. 2012, 246, 14–35. [Google Scholar] [CrossRef] [PubMed]
  5. Wong, E.T.; Tergaonkar, V. Roles of NF-κB in health and disease: Mechanisms and therapeutic potential. Clin. Sci. 2009, 116, 451–465. [Google Scholar] [CrossRef]
  6. Ben-Neriah, Y.; Karin, M. Inflammation meets cancer, with NF-κB as the matchmaker. Nat. Immunol. 2011, 12, 715–723. [Google Scholar] [CrossRef]
  7. Tieri, P.; Termanini, A.; Bellavista, E.; Salvioli, S.; Capri, M.; Franceschi, C. Charting the NF-κB pathway interactome map. PLoS ONE 2012, 7, e32678. [Google Scholar] [CrossRef]
  8. Yu, M.; Chen, J.; Bao, Y.; Li, J. Genomic analysis of NF-κB signaling pathway reveals its complexity in Crassostrea gigas. Fish Shellfish. Immunol. 2018, 72, 510–518. [Google Scholar] [CrossRef]
  9. Park, M.H.; Hong, J.T. Roles of NF-κB in Cancer and Inflammatory Diseases and Their Therapeutic Approaches. Cells 2016, 5, 15. [Google Scholar] [CrossRef]
  10. Hoesel, B.; Schmid, J.A. The complexity of NF-κB signaling in inflammation and cancer. Mol. Cancer 2013, 12, 86. [Google Scholar] [CrossRef]
  11. Dorrington, M.G.; Fraser, I.D.C. NF-κB Signaling in Macrophages: Dynamics, Crosstalk, and Signal Integration. Front. Immunol. 2019, 10, 705. [Google Scholar] [CrossRef] [PubMed]
  12. Mazor, M.; Kawano, Y.; Zhu, H.; Waxman, J.; Kypta, R.M. Inhibition of glycogen synthase kinase-3 represses androgen receptor activity and prostate cancer cell growth. Oncogene 2004, 23, 7882–7892. [Google Scholar] [CrossRef] [PubMed]
  13. Shakoori, A.; Ougolkov, A.; Yu, Z.W.; Zhang, B.; Modarressi, M.H.; Billadeau, D.D.; Mai, M.; Takahashi, Y.; Minamoto, T. Deregulated GSK3beta activity in colorectal cancer: Its association with tumor cell survival and proliferation. Biochem. Biophys. Res. Commun. 2005, 334, 1365–1373. [Google Scholar] [CrossRef] [PubMed]
  14. Ougolkov, A.V.; Bone, N.D.; Fernandez-Zapico, M.E.; Kay, N.E.; Billadeau, D.D. Inhibition of glycogen synthase kinase-3 activity leads to epigenetic silencing of nuclear factor kappaB target genes and induction of apoptosis in chronic lymphocytic leukemia B cells. Blood 2007, 110, 735–742. [Google Scholar] [CrossRef]
  15. Domoto, T.; Pyko, I.V.; Furuta, T.; Miyashita, K.; Uehara, M.; Shimasaki, T.; Nakada, M.; Minamoto, T. Glycogen synthase kinase-3β is a pivotal mediator of cancer invasion and resistance to therapy. Cancer Sci. 2016, 107, 1363–1372. [Google Scholar] [CrossRef]
  16. Das, A.; Bhattacharya, B.; Roy, S. Decrypting a path based approach for identifying the interplay between PI3K and GSK3 signaling cascade from the perspective of cancer. Genes Dis. 2022, 9, 868–888. [Google Scholar] [CrossRef]
  17. Vallabhapurapu, S.; Karin, M. Regulation and function of NF-κB transcription factors in the immune system. Annu. Rev. Immunol. 2009, 27, 693–733. [Google Scholar] [CrossRef]
  18. Karin, M. Nuclear factor-κB in cancer development and progression. Nature 2006, 441, 431–436. [Google Scholar] [CrossRef]
  19. Yu, H.; Lin, L.; Zhang, Z.; Zhang, H.; Hu, H. Targeting NF-κB pathway for the therapy of diseases: Mechanism and clinical study. Signal Transduct. Target. Ther. 2020, 5, 209. [Google Scholar] [CrossRef]
  20. Ngo, K.A.; Kishimoto, K.; Davis-Turak, J.; Pimplaskar, A.; Cheng, Z.; Spreafico, R.; Chen, E.Y.; Tam, A.; Ghosh, G.; Mitchell, S. Dissecting the regulatory strategies of NF-κB RelA target genes in the inflammatory response reveals differential transactivation logics. Cell Rep. 2020, 30, 2758–2775.e6. [Google Scholar] [CrossRef]
  21. Xia, Y.; Shen, S.; Verma, I.M. NF-κB, an active player in human cancers. Cancer Immunol. Res. 2014, 2, 823–830. [Google Scholar] [CrossRef] [PubMed]
  22. Verzella, D.; Pescatore, A.; Capece, D.; Vecchiotti, D.; Ursini, M.V.; Franzoso, G.; Alesse, E.; Zazzeroni, F. Life, death, and autophagy in cancer: NF-κB turns up everywhere. Cell Death Dis. 2020, 11, 210. [Google Scholar] [CrossRef] [PubMed]
  23. Gilmore, T.D.; Kalaitzidis, D.; Liang, M.-C.; Starczynowski, D.T. The c-Rel transcription factor and B-cell proliferation: A deal with the devil. Oncogene 2004, 23, 2275–2286. [Google Scholar] [CrossRef] [PubMed]
  24. Schreiber, J.; Jenner, R.G.; Murray, H.L.; Gerber, G.K.; Gifford, D.K.; Young, R.A. Coordinated binding of NF-κB family members in the response of human cells to lipopolysaccharide. Proc. Natl. Acad. Sci. USA 2006, 103, 5899–5904. [Google Scholar] [CrossRef] [PubMed]
  25. Gilmore, T.D. Introduction to NF-κB: Players, pathways, perspectives. Oncogene 2006, 25, 6680–6684. [Google Scholar] [CrossRef]
  26. Sun, S.C.; Chang, J.H.; Jin, J. Regulation of nuclear factor-κB in autoimmunity. Trends Immunol. 2013, 34, 282–289. [Google Scholar] [CrossRef]
  27. Pahl, H.L. Activators and target genes of Rel/NF-kappaB transcription factors. Oncogene 1999, 18, 6853–6866. [Google Scholar] [CrossRef]
  28. Mogensen, T.H. Pathogen recognition and inflammatory signaling in innate immune defenses. Clin. Microbiol. Rev. 2009, 22, 240–273. [Google Scholar] [CrossRef]
  29. Mattson, M.P.; Goodman, Y.; Luo, H.; Fu, W.; Furukawa, K. Activation of NF-kappaB protects hippocampal neurons against oxidative stress-induced apoptosis: Evidence for induction of manganese superoxide dismutase and suppression of peroxynitrite production and protein tyrosine nitration. J. Neurosci. Res. 1997, 49, 681–697. [Google Scholar] [CrossRef]
  30. Lawrence, T. The nuclear factor NF-kappaB pathway in inflammation. Cold Spring Harb. Perspect. Biol. 2009, 1, a001651. [Google Scholar] [CrossRef]
  31. Caamaño, J.; Hunter, C.A. NF-kappaB family of transcription factors: Central regulators of innate and adaptive immune functions. Clin. Microbiol. Rev. 2002, 15, 414–429. [Google Scholar] [CrossRef] [PubMed]
  32. Hayden, M.S.; Ghosh, S. Signaling to NF-κB. Genes Dev. 2004, 18, 2195–2224. [Google Scholar] [CrossRef] [PubMed]
  33. Sun, S.-C. Non-canonical NF-κB signaling pathway. Cell Res. 2011, 21, 71–85. [Google Scholar] [CrossRef] [PubMed]
  34. Bonizzi, G.; Karin, M. The two NF-κB activation pathways and their role in innate and adaptive immunity. Trends Immunol. 2004, 25, 280–288. [Google Scholar] [CrossRef] [PubMed]
  35. Hoffmann, A.; Baltimore, D. Circuitry of nuclear factor κB signaling. Immunol. Rev. 2006, 210, 171–186. [Google Scholar] [CrossRef]
  36. Sheppard, K.A.; Rose, D.W.; Haque, Z.K.; Kurokawa, R.; McInerney, E.; Westin, S.; Thanos, D.; Rosenfeld, M.G.; Glass, C.K.; Collins, T. Transcriptional activation by NF-kappaB requires multiple coactivators. Mol. Cell. Biol. 1999, 19, 6367–6378. [Google Scholar] [CrossRef]
  37. Guo, Q.; Jin, Y.; Chen, X.; Ye, X.; Shen, X.; Lin, M.; Zeng, C.; Zhou, T.; Zhang, J. NF-κB in biology and targeted therapy: New insights and translational implications. Signal Transduct. Target. Ther. 2024, 9, 53. [Google Scholar] [CrossRef]
  38. Hayden, M.S.; Ghosh, S. NF-κB, the first quarter-century: Remarkable progress and outstanding questions. Genes Dev. 2012, 26, 203–234. [Google Scholar] [CrossRef]
  39. Jimi, E.; Takakura, N.; Hiura, F.; Nakamura, I.; Hirata-Tsuchiya, S. The Role of NF-κB in Physiological Bone Development and Inflammatory Bone Diseases: Is NF-κB Inhibition “Killing Two Birds with One Stone”? Cells 2019, 8, 1636. [Google Scholar] [CrossRef]
  40. Dobrzanski, P.; Ryseck, R.P.; Bravo, R. Both N- and C-terminal domains of RelB are required for full transactivation: Role of the N-terminal leucine zipper-like motif. Mol. Cell. Biol. 1993, 13, 1572–1582. [Google Scholar] [CrossRef]
  41. Williams, L.M.; Gilmore, T.D. Looking down on NF-κB. Mol. Cell. Biol. 2020, 40, e00104-20. [Google Scholar] [CrossRef] [PubMed]
  42. Leger, M.M.; Ros-Rocher, N.; Najle, S.R.; Ruiz-Trillo, I. Rel/NF-κB transcription factors emerged at the onset of opisthokonts. Genome Biol. Evol. 2022, 14, evab289. [Google Scholar] [CrossRef] [PubMed]
  43. Dehal, P.; Boore, J.L. Two Rounds of Whole Genome Duplication in the Ancestral Vertebrate. PLoS Biol. 2005, 3, e314. [Google Scholar] [CrossRef] [PubMed]
  44. Gilmore, T.D.; Gerondakis, S. The c-Rel Transcription Factor in Development and Disease. Genes Cancer 2011, 2, 695–711. [Google Scholar] [CrossRef] [PubMed]
  45. Daly, A.E.; Yeh, G.; Soltero, S.; Smale, S.T. Selective regulation of a defined subset of inflammatory and immunoregulatory genes by an NF-κB p50-IκBζ pathway. Genes Dev. 2024, 38, 536–553. [Google Scholar] [CrossRef]
  46. Mitchell, C.D.; Criscitiello, M.F. Comparative study of cartilaginous fish divulges insights into the early evolution of primary, secondary and mucosal lymphoid tissue architecture. Fish Shellfish Immunol. 2020, 107, 435–443. [Google Scholar] [CrossRef]
  47. Smith, N.C.; Rise, M.L.; Christian, S.L. A Comparison of the Innate and Adaptive Immune Systems in Cartilaginous Fish, Ray-Finned Fish, and Lobe-Finned Fish. Front. Immunol. 2019, 10, 2292. [Google Scholar] [CrossRef]
  48. Hayden, M.S.; Ghosh, S. NF-κB in immunobiology. Cell Res. 2011, 21, 223–244. [Google Scholar] [CrossRef]
  49. Dev, A.; Iyer, S.; Razani, B.; Cheng, G. NF-κB and innate immunity. Curr. Top. Microbiol. Immunol. 2011, 349, 115–143. [Google Scholar] [CrossRef]
  50. Buchmann, K. Evolution of Innate Immunity: Clues from Invertebrates via Fish to Mammals. Front. Immunol. 2014, 5, 459. [Google Scholar] [CrossRef]
  51. Zanetti, M.; Castiglioni, P.; Schoenberger, S.; Gerloni, M. The role of relB in regulating the adaptive immune response. Ann. N. Y. Acad. Sci. 2003, 987, 249–257. [Google Scholar] [CrossRef] [PubMed]
  52. Millet, P.; McCall, C.; Yoza, B. RelB: An outlier in leukocyte biology. J. Leukoc. Biol. 2013, 94, 941–951. [Google Scholar] [CrossRef] [PubMed]
  53. Finnerty, J.R.; Gilmore, T.D. Methods for analyzing the evolutionary relationship of NF-κB proteins using free, web-driven bioinformatics and phylogenetic tools. In NF-kappa B: Methods and Protocols; Humana Press: New York, NY, USA, 2015; pp. 631–646. [Google Scholar]
  54. Sievers, F.; Wilm, A.; Dineen, D.; Gibson, T.J.; Karplus, K.; Li, W.; Lopez, R.; McWilliam, H.; Remmert, M.; Söding, J. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 2011, 7, 539. [Google Scholar] [CrossRef] [PubMed]
  55. Wang, J.; Chitsaz, F.; Derbyshire, M.K.; Gonzales, N.R.; Gwadz, M.; Lu, S.; Marchler, G.H.; Song, J.S.; Thanki, N.; Yamashita, R.A. The conserved domain database in 2023. Nucleic Acids Res. 2023, 51, D384–D388. [Google Scholar] [CrossRef] [PubMed]
  56. Söding, J. Protein homology detection by HMM–HMM comparison. Bioinformatics 2005, 21, 951–960. [Google Scholar] [CrossRef]
  57. Eddy, S.R. Profile hidden Markov models. Bioinformatics 1998, 14, 755–763. [Google Scholar] [CrossRef]
  58. Potter, S.C.; Luciani, A.; Eddy, S.R.; Park, Y.; Lopez, R.; Finn, R.D. HMMER web server: 2018 update. Nucleic Acids Res. 2018, 46, W200–W204. [Google Scholar] [CrossRef]
  59. Altschul, S.F.; Madden, T.L.; Schaffer, A.A.; Zhang, J.; Zhang, Z.; Miller, W.; Lipman, D.J. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 1997, 25, 3389–3402. [Google Scholar] [CrossRef]
  60. Chen, W.; Lee, M.-K.; Jefcoate, C.; Kim, S.-C.; Chen, F.; Yu, J.-H. Fungal cytochrome p450 monooxygenases: Their distribution, structure, functions, family expansion, and evolutionary origin. Genome Biol. Evol. 2014, 6, 1620–1634. [Google Scholar] [CrossRef]
  61. Howe, E.; Holton, K.; Nair, S.; Schlauch, D.; Sinha, R.; Quackenbush, J. Mev: Multiexperiment viewer. In Biomedical Informatics for Cancer Research; Springer: Boston, MA, USA, 2010; pp. 267–277. [Google Scholar]
  62. Katoh, K.; Kuma, K.-i.; Toh, H.; Miyata, T. MAFFT version 5: Improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 2005, 33, 511–518. [Google Scholar] [CrossRef]
  63. Boc, A.; Diallo, A.B.; Makarenkov, V. T-REX: A web server for inferring, validating and visualizing phylogenetic trees and networks. Nucleic Acids Res. 2012, 40, W573–W579. [Google Scholar] [CrossRef] [PubMed]
  64. Letunic, I.; Bork, P. Interactive Tree of Life (iTOL) v6: Recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res. 2024, 52, W78–W82. [Google Scholar] [CrossRef] [PubMed]
  65. Parvez, M.; Qhanya, L.B.; Mthakathi, N.T.; Kgosiemang, I.K.; Bamal, H.D.; Pagadala, N.S.; Xie, T.; Yang, H.; Chen, H.; Theron, C.W.; et al. Molecular evolutionary dynamics of cytochrome P450 monooxygenases across kingdoms: Special focus on mycobacterial P450s. Sci. Rep. 2016, 6, 33099. [Google Scholar] [CrossRef] [PubMed]
  66. Pei, J.; Kim, B.-H.; Grishin, N.V. PROMALS3D: A tool for multiple protein sequence and structure alignments. Nucleic Acids Res. 2008, 36, 2295–2300. [Google Scholar] [CrossRef] [PubMed]
  67. Zondo, N.M.; Padayachee, T.; Nelson, D.R.; Syed, K. Saprophytic to pathogenic mycobacteria: Loss of cytochrome P450s vis a vis their prominent involvement in natural metabolite biosynthesis. Int. J. Mol. Sci. 2022, 24, 149. [Google Scholar] [CrossRef]
  68. Dohrmann, M.; Wörheide, G. Dating early animal evolution using phylogenomic data. Sci. Rep. 2017, 7, 3599. [Google Scholar] [CrossRef]
  69. Gilbert, P.U.; Porter, S.M.; Sun, C.-Y.; Xiao, S.; Gibson, B.M.; Shenkar, N.; Knoll, A.H. Biomineralization by particle attachment in early animals. Proc. Natl. Acad. Sci. USA 2019, 116, 17659–17665. [Google Scholar] [CrossRef]
  70. Hommel, B.; Chapman, C.S.; Cisek, P.; Neyedli, H.F.; Song, J.-H.; Welsh, T.N. No one knows what attention is. Atten. Percept. Psychophys. 2019, 81, 2288–2303. [Google Scholar] [CrossRef]
  71. Cole, D.B.; Mills, D.B.; Erwin, D.H.; Sperling, E.A.; Porter, S.M.; Reinhard, C.T.; Planavsky, N.J. On the co-evolution of surface oxygen levels and animals. Geobiology 2020, 18, 260–281. [Google Scholar] [CrossRef]
Figure 1. Schematic diagram illustrating the classification and characteristic domains of NF-κB proteins, along with additional DNA binding and dimerization domains identified in this study.
Figure 1. Schematic diagram illustrating the classification and characteristic domains of NF-κB proteins, along with additional DNA binding and dimerization domains identified in this study.
Ijms 25 09793 g001
Figure 2. Genome-wide data mining, identification, and classification of animal NF-κB family proteins. (A) Comparative analysis of five NF-κB family members in animals. (B) Heat-map representation of NF-κB family members conservation in animals. The animal groups are depicted vertically, while NF-κB families are represented horizontally. Comparative analysis of c-Rel (C), p50/p105 (D), RelA/p65 (E), and p52/p100 (F) NF-κB family members in different animal groups. Detailed information on the number of species and NF-κB proteins in animals is presented in Table S1. The complete NF-κB protein sequences without duplicates identified and annotated in this study are presented in Supplementary Dataset S1.
Figure 2. Genome-wide data mining, identification, and classification of animal NF-κB family proteins. (A) Comparative analysis of five NF-κB family members in animals. (B) Heat-map representation of NF-κB family members conservation in animals. The animal groups are depicted vertically, while NF-κB families are represented horizontally. Comparative analysis of c-Rel (C), p50/p105 (D), RelA/p65 (E), and p52/p100 (F) NF-κB family members in different animal groups. Detailed information on the number of species and NF-κB proteins in animals is presented in Table S1. The complete NF-κB protein sequences without duplicates identified and annotated in this study are presented in Supplementary Dataset S1.
Ijms 25 09793 g002
Figure 3. Statistical analysis of NF-κB family members protein sizes and the sizes of Rel homology domains (RHDs) and its subdomains, DNA binding domains (RHD-DBDs), and Dimerization domains (RHD-DDs). Statistical analysis was carried out as mentioned in the methods Section 3.8, and the corresponding p-values comparing the different NF-κB family members are presented in the figure. Detailed information on protein and domain sizes, as well as the results of Welch’s t-test, can be found in Tables S2 and S3. The statistical representation uses the following symbols: ns for p > 0.05; * for p ≤ 0.05; **** for p ≤ 0.0001.
Figure 3. Statistical analysis of NF-κB family members protein sizes and the sizes of Rel homology domains (RHDs) and its subdomains, DNA binding domains (RHD-DBDs), and Dimerization domains (RHD-DDs). Statistical analysis was carried out as mentioned in the methods Section 3.8, and the corresponding p-values comparing the different NF-κB family members are presented in the figure. Detailed information on protein and domain sizes, as well as the results of Welch’s t-test, can be found in Tables S2 and S3. The statistical representation uses the following symbols: ns for p > 0.05; * for p ≤ 0.05; **** for p ≤ 0.0001.
Ijms 25 09793 g003
Figure 4. Evolutionary analysis of Rel proteins (c-Rel, RelA/p65, and RelB). The phylogenetic tree was constructed using 2461 Rel protein sequences (Supplementary Dataset S1). A high-resolution phylogenetic tree with individual node information is provided in Supplementary Figure S1.
Figure 4. Evolutionary analysis of Rel proteins (c-Rel, RelA/p65, and RelB). The phylogenetic tree was constructed using 2461 Rel protein sequences (Supplementary Dataset S1). A high-resolution phylogenetic tree with individual node information is provided in Supplementary Figure S1.
Ijms 25 09793 g004
Figure 5. Evolutionary analysis of p50/p105 and p52/p100 proteins. The phylogenetic tree was constructed with 1683 protein sequences (Supplementary Dataset S1). A high-resolution phylogenetic tree is provided in Supplementary Figure S2, where one can see individual node information.
Figure 5. Evolutionary analysis of p50/p105 and p52/p100 proteins. The phylogenetic tree was constructed with 1683 protein sequences (Supplementary Dataset S1). A high-resolution phylogenetic tree is provided in Supplementary Figure S2, where one can see individual node information.
Ijms 25 09793 g005
Figure 6. Phylogenetic analysis of Rel Homology Domain (RHD) of different NF-κB family members. 4097 NF-κB family members RHDs (Supplementary Dataset S1) were used to construct the tree. A high-resolution phylogenetic tree is provided in Supplementary Figure S3, where one can see information for each tree node.
Figure 6. Phylogenetic analysis of Rel Homology Domain (RHD) of different NF-κB family members. 4097 NF-κB family members RHDs (Supplementary Dataset S1) were used to construct the tree. A high-resolution phylogenetic tree is provided in Supplementary Figure S3, where one can see information for each tree node.
Ijms 25 09793 g006
Figure 7. Analysis of the type and pattern of domains in protein families of NF-κB family members using NCBI Batch Web CD-search [55] and HMMSCAN [58]. An asterisk denotes the domain in invertebrate NF-κB family members for families c-Rel and p50/p105. For a detailed analysis of domains, including their names, please refer to Table 4.
Figure 7. Analysis of the type and pattern of domains in protein families of NF-κB family members using NCBI Batch Web CD-search [55] and HMMSCAN [58]. An asterisk denotes the domain in invertebrate NF-κB family members for families c-Rel and p50/p105. For a detailed analysis of domains, including their names, please refer to Table 4.
Ijms 25 09793 g007
Table 1. Human NF-κB family members and their associated regulatory functions.
Table 1. Human NF-κB family members and their associated regulatory functions.
NF-κB Family MemberGene TargetsCellular FunctionsCancer-Related FunctionsReference
RelA/p65IL-6, TNF-α,
Bcl-2, IL-1β, COX-2
An important player in the canonical NF-κB pathway, it regulates gene expression related to inflammation, immune response, and cell survival.Promotes the proliferation and survival of cancer cells by inducing anti-apoptotic genes; its overexpression is linked to aggressive cancer types.[17,18,19,20]
RelBIL-10, Bcl-2, CD40, MMP9, VEGFA crucial player in the non-canonical NF-κB pathway, it regulates the development of lymphoid organs and the movement of lymphocytes.Promotes immune evasion. [17,19,21]
c-RelIL-2, IL-4, CDOL, Bcl-XLIt regulates T-cell activation and its differentiation, and influences B cell development and its function.Associated with lymphoid malignancies, it promotes cancer cell survival and contributes to tumor formation.[17,22,23]
p105/p50
(NF-κB1)
IL-1, TNF-α, GM-CSF, MMP9Acts as a transcriptional repressor or activator depending on its dimerization partners. It is involved in regulating genes related to inflammation and immune responses.It dimerizes with RelA or c-Rel to promote oncogenesis. It also promotes tumor growth and metastasis in a signal-dependent manner.[17,21,24]
p100/p52
(NF-κB2)
IL-6, IL-10, lymphotoxin-α, Involved in the regulation of genes associated with immune responses and lymphocyte development.Associated with the development of certain lymphomas.[17,21,24]
Table 2. NF-κB proteins with multiple Rel homology domain (RHD) subdomains, including the percentage identity between the subdomains.
Table 2. NF-κB proteins with multiple Rel homology domain (RHD) subdomains, including the percentage identity between the subdomains.
NF-κB Family MemberOrganism GroupSpecies NamesProtein IDProtein SizeRHD-DBD Position (% Identity)RHD-DD Position (% Identity)
c-RelReptiliaNaja najaKAG8123041.1 *4473–4655–83 and
79–130
(17.24%)
MammaliaMicrotus ochrogasterKAH0516101.156683–135 and
138–192 (21.62%)
192–261
MammaliaOrycteropus afer aferXP_042636495.155821–62 and
76–154 (19.51%)
163–258
MammaliaTupaia chinensisELW63349.14581–77 and
86–133 (26.92%)
133–202
AvesColumbina picuiNWQ80171.12897–50 and
77–159 (22.50%)
168–263
AvesLamprotornis superbusKAI1240354.11426115–259 and
453–500 (25.00%)
651–746
MolluscaMya arenariaWAR19259.141289–193 and
196–233 (23.68%)
241–341
ArthropodaDermatophagoides pteronyssinusXP_027194418.179427–150 and
152–244 (100.00%)
252–355
ArthropodaDrosophila kikkawaiKAH8308414.1102451–221229–308 and
328–352 (24.00%)
ArthropodaDrosophila sulfurigasterKAH8391813.1105663–233241–321 and
331–362 (17.86%)
ArthropodaLeptinotarsa decemlineataXP_023017353.1534122–2923–68 and
299–397 (45.45%)
ArthropodaDrosophila pandoraKAH8323486.199453–223231–317 and
320–350 (23.33%)
ArthropodaDrosophila bipectinataKAH8278094.1100055–225233–313 and
325–350 (20.00%)
ArthropodaDrosophila pseudoananassaeKAH8324519.199853–223231–313 and
323–348 (22.22%)
ArthropodaPseudolycoriella hygidaKAJ6649769.1101174–244252–348 and
447–469 (88.89%)
ArthropodaGonioctena quinquepunctataKAG5873706.181876–201 and
207–288 (43.24%)
296–394
ArthropodaAphidius gifuensisKAF7994522.169475–245253–352 and
431–533 (41.00%)
ArthropodaPhthorimaea operculellaKAI5633865.180467–193 and
198–274 (94.29%)
281–389
ArthropodaSpodoptera exiguaCAH0702166.1137317–140 and
467–625 (35.40%)
633–731
ArthropodaTemnothorax longispinosusTGZ37724.1 *1121133–255 and
254–280 (29.17%)
288–388
ArthropodaTimema shepardiCAD7261384.17222–89 and
122–165 (25.00%)
173–273
ArthropodaApis floreaXP_031771860.146012–91 and
95–134 (17.50%)
142–242
ArthropodaApis dorsataXP_006618954.1 *46610–91 and
88–134 (22.86%)
142–242
ArthropodaDiaphorina citriKAI5744658.148048–146 and
151–188 (15.79%)
197–293
ArthropodaDiaphorina citriKAI5710357.151248–146 and
151–188 (15.79%)
197–293
ArthropodaDrosophila birchiiKAH8245463.1101580–250257–336 and
354–380 (25.93%)
ArthropodaRhagoletis zephyriaXP_017461252.130721–144 and
147–218 (85.71%)
256–307
ArthropodaChoristoneura fumiferanaKAI8422849.1639327–382 and
432–593 (22.64%)
600–624
ArthropodaTimema bartmaniCAD7444423.1102268–123 and
126–170 (30.56%)
178–278
ArthropodaAphidius gifuensisXP_044005943.134020–93 and
104–164 (34.38%)
170–268
ArthropodaCopidosoma floridanumXP_023248498.1128925–199 and
348–522 (29.76%)
209–308 and
533–629 (32.61%)
p50/p105CnidariaOrbicella faveolataXP_020612557.1112145–260269–295 and
307–410 (100.00%)
ReptiliaThamnophis sirtalisXP_013917902.158932–230 and
265–424 (98.11%)
239–268 and
433–534 (93.33%)
MammaliaMicrotus ochrogasterKAH0500796.1100011–49 and
64–226 (20.51%)
235–336
MammaliaHeterocephalus glaberEHB13915.11164125–166 and
198–340 (26.19%)
349–450
MolluscaPlakobranchus ocellatusGFN89059.1105667–254263–290 and
324–377 (14.29%)
RelA/p65MammaliaGalemys pyrenaicusKAG8522598.1756204–369378–404 and
424–494 (20.00%)
MammaliaSousa chinensisTEA11684.159117–62 and
102–226 (18.60%)
235–331
AvesMelopsittacus undulatusXP_033927754.140513–54 and
72–221 (96.15%)
230–326
AvesOnychostruthus taczanowskiiXP_041269398.14431–32 and
40–115 (12.90%)
124–220
p52/p100MammaliaGalemys pyrenaicusKAG8523592.195570–114 and
116–274 (100.00%)
283–381
AvesPygoscelis adeliaeKFW65979.165730–122 and
172–220 (17.39%)
229–279
AvesPygoscelis adeliaeXP_009320748.15689–101 and
116–163 (17.78%)
172–224
RelBMammaliaPteropus alectoXP_024902475.1637154–322331–357 and
401–456 (13.04%)
MammaliaTupaia chinensisELW71096.1 *54772–240249–322 and
318–369 (82.76%)
MammaliaPropithecus coquereliXP_012508693.1559112–202 and
232–275 (26.32%)
284–380
MammaliaNeotoma lepidaOBS57525.145544–132 and
138–166 (21.43%)
175–276
Note: proteins with overlapping subdomains are indicated with an asterisk. The percentage identity between the domains is indicated in brackets.
Table 3. Comparative analysis of amino acid conservation in NF-κB RHDs and subdomains. Each NF-κB family member was ranked based on the number of conserved amino acids (indicated by the number 9). The multiple sequence alignment of domains is provided in Supplementary Dataset S2.
Table 3. Comparative analysis of amino acid conservation in NF-κB RHDs and subdomains. Each NF-κB family member was ranked based on the number of conserved amino acids (indicated by the number 9). The multiple sequence alignment of domains is provided in Supplementary Dataset S2.
NF-κB FamilyNumber of NF-κB Protein SequencesPROMALS3D Conservation IndexRanking
56789
Rel homology domain (RHD)
c-Rel18141810912V
p50/p105934281225117IV
RelA/p6536830082092III
p52/p1007411533692093II
RelB240239200126I
Rel homology domain-DNA binding domain (RHD-DBD)
c-Rel1814810502IV
p50/p1059341681262IV
RelA/p6536819047051III
p52/p100741102037859II
RelB2400590072I
Rel homology domain-dimerization domain (RHD-DD)
c-Rel181433512V
p50/p1059341411767IV
RelA/p6536810522040II
p52/p100741262021032III
RelB2400160072I
Table 4. Information on the reference proteins utilized for genome data mining of homologous NF-κB family members. The table also presents protein characteristics, such as protein size and domains, as per the NCBI Batch Web CD-Search database [55].
Table 4. Information on the reference proteins utilized for genome data mining of homologous NF-κB family members. The table also presents protein characteristics, such as protein size and domains, as per the NCBI Batch Web CD-Search database [55].
NF-κB FamilyNCBI Protein IDOrganismProtein SizeNCBI Batch Web CD-Search Domain
Accession Short name
c-RelCAA52954.1Homo sapiens619c07933RHD-n_c-Rel
pfam00554RHD_DNA_bind
pfam16179RHD_dimer
c-RelAAK72690.1Crassostrea gigas615cd07887RHD-n_Dorsal_Dif
pfam00554RHD_DNA_bind
pfam16179RHD_dimer
RelA/p65AAA36408.1Homo sapiens551cd07885RHD-n_RelA
pfam00554RHD_DNA_bind
pfam16179RHD_dimer
RelBNP_006500.2Homo sapiens579cd07886RHD-n_RelB
pfam16181RelB_transactiv
pfam00554RHD_DNA_bind
pfam16179RHD_dimer
pfam16180RelB_leu_zip
p50/p105AAA36361.1Homo sapiens969cd07935RHD-n_NFkB1
pfam00554RHD_DNA_bind
pfam16179RHD_dimer
p50/p105NP_999819.1Strongylocentrotus purpuratus1125cd07883RHD-n_NFkB
pfam00554RHD_DNA_bind
pfam16179RHD_dimer
p52/p100NP_002493.3Homo sapiens899cd07934RHD-n_NFkB2
pfam00554RHD_DNA_bind
pfam16179RHD_dimer
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Msweli, S.; Pakala, S.B.; Syed, K. NF-κB Transcription Factors: Their Distribution, Family Expansion, Structural Conservation, and Evolution in Animals. Int. J. Mol. Sci. 2024, 25, 9793. https://doi.org/10.3390/ijms25189793

AMA Style

Msweli S, Pakala SB, Syed K. NF-κB Transcription Factors: Their Distribution, Family Expansion, Structural Conservation, and Evolution in Animals. International Journal of Molecular Sciences. 2024; 25(18):9793. https://doi.org/10.3390/ijms25189793

Chicago/Turabian Style

Msweli, Siphesihle, Suresh B. Pakala, and Khajamohiddin Syed. 2024. "NF-κB Transcription Factors: Their Distribution, Family Expansion, Structural Conservation, and Evolution in Animals" International Journal of Molecular Sciences 25, no. 18: 9793. https://doi.org/10.3390/ijms25189793

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop