Next Article in Journal
Sports-Related Genomic Predictors Are Associated with Athlete Status in Chinese Sprint/Power Athletes
Previous Article in Journal
Analysis of Modular Hub Genes and Therapeutic Targets across Stages of Non-Small Cell Lung Cancer Transcriptome
Previous Article in Special Issue
Impact of Vanadium–Titanium–Magnetite Mining Activities on Endophytic Bacterial Communities and Functions in the Root Systems of Local Plants
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Diversity, Distribution and Structural Prediction of the Pathogenic Bacterial Effectors EspN and EspS

1
State Key Laboratory of Pathogens and Biosecurity, Academy of Military Medical Sciences, Beijing 100071, China
2
School of Basic Medical Science, Anhui Medical University, Hefei 230032, China
*
Authors to whom correspondence should be addressed.
Genes 2024, 15(10), 1250; https://doi.org/10.3390/genes15101250
Submission received: 9 August 2024 / Revised: 20 September 2024 / Accepted: 23 September 2024 / Published: 26 September 2024
(This article belongs to the Special Issue Genomics of Microbial Diversity, Evolution and Function)

Abstract

:
Background: Many Gram-negative enterobacteria translocate virulence proteins (effectors) into intestinal epithelial cells using a type III secretion system (T3SS) to subvert the activity of various cell functions possess. Many T3SS effectors have been extensively characterized, but there are still some effector proteins whose functional information is completely unknown. Methods: In this study, two predicted effectors of unknown function, EspN and EspS (Escherichia coli secreted protein N and S), were selected for analysis of translocation, distribution and structure prediction. Results: The TEM1 (β-lactamase) translocation assay was performed, which showed that EspN and EspS are translocated into host cells in a T3SS-dependent manner during bacterial infection. A phylogenetic tree analysis revealed that homologs of EspN and EspS are widely distributed in pathogenic bacteria. Multiple sequence alignment revealed that EspN and its homologs share a conserved C-terminal region (673–1133 a.a.). Furthermore, the structure of EspN (673–1133 a.a.) was also predicted and well-defined, which showed that it has three subdomains connected by a loop region. EspS and its homologs share a sequence-conserved C-terminal (146–291 a.a.). The predicted structure of EspS (146–291 a.a.) is composed of a β-sheet consisting of four β-strands and several short helices, which has a TM score of 0.5014 with the structure of the Vibrio cholerae RTX cysteine protease domain (PDBID: 3eeb). Conclusions: These results suggest that EspN and EspS may represent two important classes of T3SS effectors associated with pathogen virulence, and our findings provide important clues to understanding the potential functions of EspN and EspS.

1. Introduction

Many Gram-negative enterobacteria, such as Escherichia, Shigella, Salmonella, and Citrobacter, can cause many serious infections. Two of the enteric E. coli pathotypes, enteropathogenic E. coli (EPEC) and enterohaemorrhagic E. coli (EHEC) are important foodborne pathogens that can cause serious complications, including hemorrhagic colitis and hemolytic uremic syndrome and contribute significantly to morbidity and mortality worldwide [1,2,3]. Their murine counterpart, Citrobacter rodentium, is an extracellular enteric mouse-specific pathogen used to model infections with human pathogenic E. coli and inflammatory bowel disease [4]. Salmonella and Shigella are known to cause a well-characterized spectrum of diseases in humans, ranging from asymptomatic carriage to hemorrhagic colitis and fatal typhoidal fever [5,6]. These pathogens have acquired virulence effectors that enable them to colonize, multiply, and evade host defenses. The best-characterized system for delivering effector proteins into target eukaryotic cells is the type III secretion system (T3SS), which is expressed by Gram-negative bacteria [7]. The structural component of the T3SS is the needle apparatus, which is assembled from over 20 different proteins and consists of a base, an extracellular needle, a tip, and a translocon [8]. The T3SS is essential for colonization and pathogenesis, and mutants lacking this apparatus fail to cause disease [9].
The T3SS is used to translocate dozens to hundreds of bacterial effector proteins directly into the host cytosol, where they manipulate a variety of host cell processes to establish a successful infection [10]. Many of these translocated proteins have been described in pathogenic E. coli strains (EHEC, EPEC, etc.). A number of T3SS effectors, such as non-LEE effectors (Nle) [11], are important mediators of virulence. NleH and NleD are inhibitors of cell apoptosis [12]. While, NleB, NleC, and NleE are involved in the interference with host inflammatory signaling pathways [3]. NleB, a bacterial effector with glycosyltransferase activity, targets GAPDH function to inhibit NF-κB activation [13]. NleC, a type III secretion protease, compromises NF-κB activation by targeting p65/RelA [14], and NleE inhibits NF-κB activation [15]. The E. coli secreted proteins (Esp) [16] also play an important role in infection. For example, EspH inactivates multiple host Rho GEFs through binding to their Dbl-homology and pleckstrin homology (DH-PH) domain, preventing Rho GTPase activation [7]. EspF coordinates the spatiotemporal activation of two eukaryotic signaling pathways, thereby regulating membrane trafficking [17].
In addition to the effectors described above, numerous other T3SS effectors have been identified by genome sequencing for which no functional information is currently available, such as EspN and EspS. The espN gene was initially identified in E. coli O157:H7 strain EDL933, which is a potential T3SS effector with an unconfirmed function [18]. The espN gene is located on the top strand from 1582242 to 1585643. The EspN protein is composed of 1133 amino acids. Additionally, the espS gene was identified in E. coli O145:H28 RM13514 as a T3SS effector with an unknown function [19]. The espS gene is located on the top strand from 3360604 to 3361479. The espS gene encodes a protein of 291 amino acids.
In this study, to confirm whether EspN or EspS is a T3SS effector, we performed a TEM1 (β-lactamase) transport assay, which indicated that EspN or EspS is transported into host cells in a T3SS-dependent manner. To further clarify the diversity and distribution of these two T3SS effectors, we performed an evolutionary tree analysis, which showed that homologous proteins of EspN and EspS are widely distributed in multiple pathogens. Then, to reveal the functional information of EspN and EspS, we predicted the 3D structures of EspN and EspS and compared them with those of known functional proteins, providing important clues to the functions of the two effectors.

2. Materials and Methods

2.1. T3SS Translocation Assay

The genes map-tem1, espN-tem1, espS-tem1, or tem1 were synthesized and inserted into expression vector pET-28a to construct TEM1 fusion expression plasmid, pMap-TEM1, pEspN-TEM1, pEspS-TEM1, or pTEM1 [20]. The expression plasmids pMap-TEM1, pEspN-TEM1, pEspS-TEM1, or pTEM1 were electrotransformed into EHEC (E. coli O157:H7 or E. coli O145:H28) wild-type or ΔescN strains whose expression was induced by IPTG, and prepared for a fluorescence-based reporter (TEM-1 β-lactamase) [21]. Human Colorectal Adenocarcinoma Cell Line HT-29 cells (ATCC HTB-38) were maintained in Dulbecco’s modified Eagle’s medium without antibiotics containing 10% fetal bovine serum at 37 °C in an atmosphere containing 5% CO2 [22]. HT-29 cells were infected with EHEC wild-type or ΔescN strains expressing different TEM-1 fusion proteins (Map-TEM1, TEM1, EspN-TEM1, or EspS-TEM1) at a MOI of 10 bacteria per cell. After 1 h of infection, IPTG was added to a final concentration of 1 mM, and the infection was allowed to proceed for an additional 3 h. Cell monolayers were then washed twice with Hanks Balanced Salt Solution (HBSS) and loaded with the fluorescent β-lactamase substrate CCF2/AM [23] according to the manufacturer’s instructions using a LiveBLAzer FRET-B/G loading kit (Thermo). The absence of translocated TEM1 resulted in the development of green fluorescence at 520 nm, which was attributed to CCF2/AM that had been loaded into the cells. Conversely, the blue fluorescence (at 450 nm), which was generated by β-lactamase-mediated cleavage of CCF2/AM, served as an indicator of translocated TEM1 fusion proteins. EspN or EspS translocation was examined on a confocal fluorescence microscope to detect the blue fluorescence (~450 nm) and green fluorescence emission (~520 nm) at 410 nm excitation (Olympus FV1000).

2.2. Bioinformatics Analyses

We first used the blastp (BLAST+ 2.15.0) tool [24] to perform sequence similarity comparison on the UniProt library (https://www.uniprot.org/, Uniprot release 2024_04, accessed on 31 July 2024) [25] using EspN and EspS sequences as query sequences. We used the default parameters of the blastp program, except that the evalue threshold was adjusted to 0.0001 to ensure sequence homology, and max_target_seqs was set to 10,000 to obtain as many sequences as possible. We further filtered the search results and retained sequences with the percentage of identical matches greater than 30 and query coverage per subject greater than 90. We found 441 sequences for EspN and 248 sequences for EspS. We then manually filtered the results and deleted highly duplicated sequences and those that had been deleted from the uniprot database. Finally, we retained 114 sequences for EspN and 69 sequences for EspS, respectively, for phylogenetic tree analysis.

2.3. Phylogenetic Analyses

To construct the phylogenetic tree, we used a dynamic programming algorithm to find the best global alignment between the two sequences. The number of matches divided by the length is the similarity score between the two sequences. We performed this calculation on the filtered sequences mentioned above to obtain the similarity matrix for EspN and EspS, respectively. One-minus the similarity matrix is the distance matrix. Then, we used the Unweighted Pair Group Method with the Arithmetic Mean (UPGMA) method [26,27] to calculate the phylogenetic tree. UPGMA is a hierarchical clustering analysis method that finds the two nearest samples or sample groups based on the distance matrix and merges them into a new group, then updates the average distance between this new group and other samples or groups, and then repeatedly finds the two nearest samples or sample groups and merges them until all samples are merged into one group. We used the ETE3 [28] python package to draw the phylogenetic tree. Sample groups with a distance greater than 0.1 are marked with different colors.

2.4. Multiple Alignment of EspN and Its Homologs, as Well as EspS and Its Homologs

The amino acid sequences of EspN and its homologs, as well as EspS and its homologs, were obtained from the Uniprot database (https://www.uniprot.org/, Uniprot release 2024_04, accessed on 31 July 2024) in FASTA format. Multiple alignments of these effectors were compared and aligned using BioEdit 7.7 [29] and ClustalW 7.7 software [30].

2.5. Protein Structure Prediction and Analysis

The 3D structures of EspN and EspS were generated using Alphafold2 [31] based on the protein sequences. For AlphaFold2, the default process and parameters are utilized for protein structure prediction, which involves the initial execution of Multiple Sequence Alignment (MSA), followed by End-to-End (E2E) inference, and finally, the application of molecular force fields [32] to relax the predicted structure. The per-residue distance difference tests (pLDDT) metrics are extracted from the b-factor column of the predicted structure PDB file, and only CA atoms are used for the calculation of the average pLDDT. The structure of EspN was derived from AlphaFold2 modeling. Based on the per-residue confidence score (pLDDT) (ranked 0: 84.0051, ranked 1: 83.8495, ranked 2: 82.5245, ranked 3: 81.8049, ranked 4: 81.5267) [33], we chose the ranked 0 model. The structure of EspS was derived from AlphaFold2 modeling. Based on the per-residue confidence score (pLDDT) (ranked 0: 86.3553, ranked 1: 85.9500, ranked 2: 85.3318, ranked 3: 85.3035, ranked 4: 84.9392), we chose the ranked 0 model.
We used the foldseek [34] server to perform a structural similarity search on the conserved regions of EspN and EspS. For FoldSeek, the “AlphaFold/Swiss-Prot v4” and “PDB100 20240101” databases are selected as the target [35]. “TM-align” [36] is used as the alignment mode, which gives higher precision than 3Di/AA. For EspN, we selected the structure corresponding to the sequence 673–1133 a.a. as the query. As a result, only one hit was found in PDB100. For EspS, we used the 146–291 a.a. sequence structure as the query and found 23 hits in UniProt–Swiss and 42 hits in PDB100. TM-score [37] is calculated using the similarity between protein structures. Finally, we found that PDBID: 6yhn [38] has a TM-score of 0.5157 with the predicted structure of EspN (673–1133 a.a.), and PDBID: 3eeb [39] has a TM-score of 0.5014 with the predicted structure of EspS (146–291 a.a.). Analyzing the section on structural comparisons revealed that 6yhn (530–700 a.a.) has a TM-score of 0.8601 with the predicted structure of EspN (673–843 a.a.), and PDBID: 3eeb (37–184 a.a.) has a TM-score of 0.7052 with the predicted structure of EspS (179–291 a.a.).

3. Results

3.1. EspN or EspS Is Translocated into Host Cells by T3SS

The TEM-1-β-lactamase reporter system has been proven robust for assaying effector translocation into host cells [23]. In order to determine whether EspN or EspS was translocated into host cells by T3SS, we constructed fusion proteins of EspN or EspS and TEM-1 (β-lactamase). These constructs were introduced into both EHEC WT and ΔescN, a strain deficient in T3SS function. These strains were used to infect HT-29 cells loaded with the CCF2/AM substrate. HT-29 cells with no translocated TEM1, such as in the case of infection with EHEC-expressing TEM1, emitted green fluorescence at 520 nm (Figure 1A,B). In contrast, evident emission of a blue fluorescence at 450 nm was observed in cells infected for 4 h by WT EHEC expressing the EspN-TEM1 or EspS-TEM1 fusion protein. The blue fluorescence, generated by the β-lactamase-mediated cleavage of CCF2/AM and the consequent disruption of FRET, was indicative of translocated EspN-TEM1 (Figure 1A) or EspS-TEM1 (Figure 1B) fusion proteins. When the EspN-TEM1 or EspS-TEM1 fusion protein was expressed in the EscN mutant strain (ΔescN), infected HT-29 cells emitted only green fluorescence similar to that observed in control cells. Another known T3SS substrate, Map [40], was included in this assay as a positive control. All of these results indicate that EspN or EspS is translocated into host cells by T3SS.

3.2. Evolutionary Tree Analysis of EspN Revealed That Homologous Proteins of EspN Widely Distributed in Many Pathogens

To determine the diversity and distribution of the bacterial effector EspN (1133 amino acids), a phylogenetic tree was constructed using the blastp algorithm within the Uniprot database. A total of 114 homologous proteins of EspN have been discovered, with an identity range of 35.57% to 100% (Figure 2 and Table S1). All homologs of EspN have been identified or predicted as T3SS-secreted effectors. More importantly, all homologs of EspN were with a conserved domain of unknown function (DUF4765) in the NCBI database, which consists of approximately 1100 amino acids. On the basis of evolutionary relationships, the homologous proteins of EspN were classified into six clades. There are six homologs of EspN in clade 1. All six homologs were distributed in pathogenic E. coli strains, including E. coli O26, E. coli O157, E. coli O145:NM, and E. coli 97.0246, and the sequences were substantially identical (99.74% to 100%). There are five homologous proteins in clade 2. Four homologs were distributed in E. coli strains, and one homolog, ROD_47811 (D2TQZ7), was in the C. rodentium strain, with an identity range of 36.36% to 39.03%. There are 15 homologous proteins in clade 3. All 15 homologs were distributed in Salmonella strains, including Salmonella Newport, Salmonella enterica subsp. enterica serovar Saintpaul, S. enterica subsp. enterica serovar Javiana, S. enterica I, Salmonella anatum, Salmonella infantis, and Salmonella muenchen, with an identity range of 35.89% to 36.70%. Clade 4 has the largest number of homologs, including 71 proteins. All 71 homologs were distributed in Salmonella strains, including Salmonella senftenberg, S. enterica subsp. enterica serovar, S. enterica I, Salmonella agona, Salmonella derby, Salmonella Rubislaw, and Salmonella Montevideo, with an identity range of 35.57% to 36.69%. There are eight homologous proteins in clade 5. All eight homologs were distributed in Salmonella strains, including S. enterica subsp. diarizonae serovar, Salmonella diarizonae, and S. enterica I, with an identity range of 37.75% to 37.84%. There are seven homologous proteins in clade 6. All seven homologs were distributed in Salmonella strains, including S. enterica subsp. salamae serovar, Salmonella bongori, with an identity range of 36.54% to 36.92%. In conclusion, the homologous proteins of EspN are widely distributed across multiple pathogenic bacteria, including Salmonella, E. coli, and C. rodentium, and the homologs from the same genus are generally in the same clade. These results suggest that EspN and its homologs represent an important class of T3SS effectors that serve as key virulence factors underlying the infection strategy of these pathogens.

3.3. EspN and Its Homologues Share Conserved C-Terminal Sequences

Further, one representative homologous protein from each clade was selected for multiple sequence comparisons in Figure 3. The selected homologous proteins included EspN in the EHEC O157:H7, R545_13515 in S. enterica subsp. diarizonae serovar (identity: 37.84%), SBG_1491 in S. bongori (identity: 37.59%), A2I20_05820 in Salmonella choleraesuis (identity: 36.20%), BEU62_20775 in Salmonella newport (identity: 36.59%), ROD_47811 in C. rodentium (identity: 39.03%). The C-terminal sequences of EspN and its homologous proteins (673–1133 a.a., 50.70 to 54.15%) have a significantly higher degree of identity than the N-terminal sequences (1–672 a.a., 28.03 to 30.27%). The results show that EspN and its homologous proteins share a sequence-conserved C-terminal region.

3.4. The Predicted 3D Structure of the Subsequences of EspN Shows SOME Similarity to the D4 Domain of CNF(Y)

Based on the protein sequences, we used Alphafold2 to predict the 3D structure of EspN. The predicted 3D structure showed high confidence scores (pLDDT value 83.6–85.3 on average) in three regions of EspN (19–206 a.a., 372–608 a.a., and 673–1133 a.a.) in Figure 4A. Interestingly, EspN (673–1133 a.a.) is the conserved C-terminal region of EspN. The well-defined 3D structure of EspN (673–1133 a.a., pLDDT value: 85.3) includes three subdomains connected by a loop region (Figure 4B). Consequently, we used foldseek to find similarly resolved experimental structures of EspN (673–1133 a.a.) from the PDB100 database [35]. The results showed that the structure of the bacterial cytotoxic necrotizing factor CNF(Y) (PDBID: 6yhn) has the highest TM score (0.5157) with the predicted structure of EspN (673–1133 a.a.) in Figure 4B. Structural comparisons show structural similarity between CNF(Y) and EspN for only a small portion of the sequence. The structure of CNF(Y) (530–700 a.a.) has a TM score of 0.8601, with the predicted structure of EspN (673–843 a.a.). CNFs are bacterial single-chain exotoxins that modulate cytokinetic/oncogenic and inflammatory processes through activation of host cell Rho GTPases [41,42]. To achieve this, they are secreted, bind surface receptors to induce endocytosis, and translocate a catalytic unit into the cytosol to intoxicate host cells. CNF(Y) consists of five domains (D1–D5). Analysis with PiSQRD [43] assigns domain boundaries to residues 1–22/135–424 (D1), 23–134 (D2), 425–529 (D3), 530–700 (D4), and 718–1014 (D5, deamidase domain). D1–3 act as export and translocation modules for the catalytic unit (D4–5). The role of the D5 domain is the targeting and modification of Rho GTPases. D4 interacts extensively with D5 in the crystal structure of the free D4–5 fragment [38]. EspN (673–843 a.a.) possesses structural similarity to the D4 domain (530–700 a.a.) of CNF(Y). Therefore, we hypothesize that the function of EspN (673–843 a.a.) has a similar function to the D4 domain of CNF(Y). The structure of EspN (844–1133 a.a.) is different from that of any other known structure, and the function of this region remains unknown. All results show that EspN has multiple structural domains and probably several unknown functional activities that need to be further investigated.

3.5. Evolutionary Tree Analysis of EspS Revealed That Homologous Proteins of EspS Widely Distributed in Many Pathogens

To determine the distribution of the bacterial effector EspS (291 amino acids), a phylogenetic tree was constructed using the blastp algorithm within the Uniprot database. Sixty-nine homologs of EspS have been discovered, with an identity range of 34.13% to 100% in Figure 5 and Table S2. On the basis of evolutionary relationships, homologous proteins were classified into five clades. There are 11 homologues of EspS in clade 1. Eight homologs were distributed in E. coli strains, including E. coli O145, E. coli 97.0246, E. coli O26, and E. coli O157:H7, with an identity range of 92.04% to 100%. Moreover, the other three homologs were found to be distributed in Escherichia albertii and C. rodentium, with an identity range of 63.92% to 71.48%. In comparison to E. coli, E. albertii and C. rodentium are closely related. There are 14 homologous proteins in clade 2. All 14 homologs were distributed in Salmonella strains, including S. enterica, S. enterica I, and S. montevideo, with an identity range of 36.05% to 38.08%. There are 29 homologous proteins in clade 3. All 29 homologs were distributed in Salmonella strains, including S. enterica subsp. enterica serovar, S. enterica I, and Salmonella oranienberg, with an identity range of 38.96% to 39.76%. There are four homologs in clade 4. Two homologs were distributed in Vibrio strains, with an identity range of 38.40%. The other two homologs were distributed in Grimontia hollisae strains, with an identity range of 37.18%. There are 12 homologous proteins in clade 5. Ten homologs were distributed in Shigella strains, including S. flexneri and Shigella boydii, with an identity range of 34.13%. Two homologs were distributed in E. coli strains, also with an identity range of 34.13%. In conclusion, the homologous proteins of EspS are widely distributed across multiple pathogenic bacteria, including Salmonella, E. coli, Shigella, Vibrio, Grimontia, and C. rodentium, and the homologs from the same genus are generally in the same clade. Additionally, all homologs of EspS have been identified or predicted as T3SS-secreted effectors. Given its wide distribution among pathogenic bacteria and its role as a T3SS effector, further studies on the potential contribution of EspS to bacterial virulence are warranted.

3.6. EspS and Its Homologues Share Conserved C-Terminal Sequences

Further, we selected one representative homolog from each clade to go for multiple sequence comparisons in Figure 6. The selected proteins included EspS in the E. coli O145:H28 strain, G8B04_001499 in S. enterica (identity: 38.077), S773_15875 in Salmonella rubislaw (identity: 39.357), OspB in Vibrio parahaemolyticus (identity: 38.4), NCTC8524_04980 in S. flexneri (identity: 34.127). The results show that EspS and its homologous proteins share a sequence-conserved C-terminal region from amino acids 146 to 291, with an identity range of 44.16 to 44.90%.

3.7. The Predicted 3D Structure of the Subsequences of EspS Shows Some Similarity to That of the Vibrio cholerae RTX Cysteine Protease Domain

Based on the protein sequences, we used Alphafold2 to predict the 3D structure of EspS. The predicted 3D structure showed low confidence scores (pLDDT value 56.2 on average) in the N-terminal of EspS (1–40 a.a.) that is composed of a loop, and high confidence scores (pLDDT value 81.5–84.0 on average) in EspS (41–291 a.a.) (Figure 7A). The conserved region of EspS (146–291 a.a.) is composed of a β-sheet consisting of four β-strands and several short helices. We also used foldseek to find similarly resolved experimental structures of the conserved C-terminal region of EspS from PDB100, which showed that the structure of the V. cholerae RTX cysteine protease domain (CPD) (PDBID: 3eeb) has the highest TM score (0.5014) in Figure 7B. Structural comparisons show that the structure of the RTX CPD (37–184 a.a.) has a TM score of 0.7052 with the predicted structure of EspS (179–291 a.a.). V. cholerae RTX (repeats in toxin) is an actin-disrupting toxin that is autoprocessed by the internal CPD [39]. These results provide important clues to the possible function of EspS and its potential role in bacterial infections.

4. Discussion

Type III secretion system (T3SS) effectors are key virulence factors that play a pivotal role in the infection strategy of numerous clinically important Gram-negative pathogens, including S. enterica, Shigella spp., enteropathogenic and enterohemorrhagic E. coli (EHEC, EPEC), and their murine counterpart, C. rodentium [44]. In recent years, numerous studies have elucidated the structural and functional characteristics of a multitude of effector molecules, including Map, Tir, the Nle family (NleH, NleD, NleB, NleC, and NleE), and the Esp family (EspT, EspM, EspG, EspH, EspW, and EspF) [10]. There are still some predicted effector proteins whose functional information is completely unknown, such as EspN and EspS. In this study, we initially performed the TEM1 (β-lactamase) translocation assay, which confirmed that the two predicted effectors, EspN and EspS, were translocated into host cells in a T3SS-dependent manner. To elucidate the diversity and function of EspN and EspS, we analyzed their sequence characteristics and distribution in pathogenic bacteria. The results of the phylogenetic analysis have shown that homologs of EspN and EspS are widely distributed among pathogenic bacteria, including E. coli, Salmonella, and other pathogens that possess type III secretion systems [45]. These findings suggest that EspN and EspS represent two important classes of effectors involved in the infection of numerous pathogens. Sequence analysis revealed that the homologous proteins of EspN and EspS from the same genus of bacteria had a high degree of similarity, and the majority of them belonged to the same clade. The results suggest that EspN and EspS were acquired during the early stages of bacterial evolution. Concurrently, we discovered that both EspN and EspS have highly conserved C-terminal sequences. These results suggest that the core domains encoded by their C-terminal sequences deserve further investigation.
Alphafold2 is widely used for protein structure prediction [31,46,47]. AlphaFold2 has been utilized to predict the 3D structure of almost the entire human proteome (98.5% of human proteins). The resulting dataset encompasses 58% of residues with a confident prediction, with a subset (36% of all residues) exhibiting exceptionally high confidence [48,49,50,51]. However, few studies have been conducted on structure prediction and functional analysis of pathogen effectors using Alphafold2 and other similar software. In this study, Alphafold2 was used to predict the structures of two effectors, EspN and EspS, which are widely distributed in E. coli, Salmonella, and other pathogenic enterobacteria. The results showed that the predicted structures of the C-terminal conserved regions of EspN (673–1133 a.a.) and EspS (146–291 a.a.) have some similarity to the structures of currently known functional proteins. EspN (673–843 a.a.) has a high degree of structural similarity to the D4 (530–700 a.a.) domain of CNF(Y) (PDBID: 6yhn) [38]. The D4 domain interacts extensively with the D5 domain (718–1014 a.a.), which targets and modifies Rho GTPases. However, the exact function of the D4 domain is not yet known. Further investigation into the significance of the EspN (673–843 a.a.) or D4 domain of CNF(Y) is warranted. More importantly, EspN (844–1133 a.a.) has unique structural features that are inconsistent with all currently known protein structures and warrant further investigation. The structure of the C-terminal conserved region of EspS (146–291 a.a.) shows some similarity to the structure of the V. cholerae RTX cysteine protease domain (PDBID: 3eeb). The V. cholerae RTX is an actin-disrupting toxin that is autoprocessed by the internal cysteine protease domain [39]. This structural information about EspN and EspS provides important clues for parsing their functions. The T3SS is critical for the virulence of Gram-negative pathogens, and the effectors delivered by the T3SS target many fundamental processes within the infected host cell [10]. The structural similarity to the important virulence factors CNF(Y) and RTX suggests that EspN and EspS may be two classes of undiscovered virulence factors that may play important roles in the pathogenesis of bacterial infections.

5. Conclusions

In conclusion, we have systematically analyzed the diversity, distribution, and structure of two T3SS effectors of unknown function, EspN and EspS. Phylogenetic tree analysis revealed that homologs of EspN and EspS are widely distributed in Gram-negative pathogens, including Escherichia, Salmonella, and Citrobacter. Multiple sequence alignment revealed that both EspN and EspS families share conserved C-terminal regions of unknown function. The results of the structure prediction indicated that partial sequences of the conserved C-terminal regions of EspN and EspS show some similarity to the currently known functional proteins CNF(Y) and RTX, respectively. All results suggest that EspN and EspS, delivered by the T3SS, target a multitude of fundamental processes within the infected host cell, and our findings provide important clues to the possible function of EspN and EspS. With increasing knowledge of the distribution and structure prediction of bacterial effectors, a future challenge will be to investigate how these effectors interact with host cells and contribute to the course of an infection, which will be important in elucidating bacterial pathogenesis.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes15101250/s1, Table S1: Bioinformatics analyses of EspN, Table S2: Bioinformatics analyses of EspS.

Author Contributions

Conceptualization, Z.L. and T.L.; Methodology, Z.L. and T.L.; Validation, Z.L.; Formal analysis, Z.L., Y.H., Y.S., D.L., X.Y., L.Z. and T.L.; Investigation, Z.L., T.L. and H.W.; Data curation, Z.L., Y.H., Y.S., T.L. and H.W.; Writing—original draft, Z.L. and T.L.; Writing—review and editing, Y.H., Y.S., D.L., X.Y., L.Z. and T.L.; Visualization, Z.L.; Project administration, T.L. and H.W.; Funding acquisition, T.L. and H.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (82072251) and the Beijing Nova Program (Z181100006218111).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data presented are available in the manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Serapio-Palacios, A.; Finlay, B.B. Dynamics of expression, secretion and translocation of type III effectors during enteropathogenic Escherichia coli infection. Curr. Opin. Microbiol. 2020, 54, 67–76. [Google Scholar] [CrossRef]
  2. Dean, P.; Kenny, B. The effector repertoire of enteropathogenic E. coli: Ganging up on the host cell. Curr. Opin. Microbiol. 2009, 12, 101–109. [Google Scholar] [CrossRef]
  3. Wong, A.R.; Pearson, J.S.; Bright, M.D.; Munera, D.; Robinson, K.S.; Lee, S.F.; Frankel, G.; Hartland, E.L. Enteropathogenic and enterohaemorrhagic Escherichia coli: Even more subversive elements. Mol. Microbiol. 2011, 80, 1420–1438. [Google Scholar] [CrossRef] [PubMed]
  4. Mullineaux-Sanders, C.; Sanchez-Garrido, J.; Hopkins, E.G.D.; Shenoy, A.R.; Barry, R.; Frankel, G. Citrobacter rodentium-host-microbiota interactions: Immunity, bioenergetics and metabolism. Nat. Rev. Microbiol. 2019, 17, 701–715. [Google Scholar] [CrossRef] [PubMed]
  5. Dekker, J.P.; Frank, K.M. Salmonella, Shigella, and yersinia. Clin. Lab. Med. 2015, 35, 225–246. [Google Scholar] [CrossRef] [PubMed]
  6. Leting, S.K.; Musyoki, S.K.; Maiyoh, G.K. Characterization and drug susceptibility pattern of Salmonella and Shigella in children below five years: A cross-sectional study conducted in Lodwar, Turkana County, in Northern Kenya. Pan Afr. Med. J. 2022, 42, 13. [Google Scholar] [CrossRef]
  7. Pinaud, L.; Sansonetti, P.J.; Phalipon, A. Host Cell Targeting by Enteropathogenic Bacteria T3SS Effectors. Trends Microbiol. 2018, 26, 266–283. [Google Scholar] [CrossRef] [PubMed]
  8. Dey, S.; Chakravarty, A.; Guha Biswas, P.; De Guzman, R.N. The type III secretion system needle, tip, and translocon. Protein Sci. A Publ. Protein Soc. 2019, 28, 1582–1593. [Google Scholar] [CrossRef]
  9. Harishankar, A.; Viswanathan, V.K. Attaching and effacing pathogens modulate host mitochondrial structure and function. Int. Rev. Cell Mol. Biol. 2023, 377, 65–86. [Google Scholar] [CrossRef]
  10. Shenoy, A.R.; Furniss, R.C.D.; Goddard, P.J.; Clements, A. Modulation of Host Cell Processes by T3SS Effectors. Curr. Top. Microbiol. Immunol. 2018, 416, 73–115. [Google Scholar] [CrossRef]
  11. Cepeda-Molero, M.; Berger, C.N.; Walsham, A.D.S.; Ellis, S.J.; Wemyss-Holden, S.; Schüller, S.; Frankel, G.; Fernández, L. Attaching and effacing (A/E) lesion formation by enteropathogenic E. coli on human intestinal mucosa is dependent on non-LEE effectors. PLoS Pathog. 2017, 13, e1006706. [Google Scholar] [CrossRef] [PubMed]
  12. Blasche, S.; Mörtl, M.; Steuber, H.; Siszler, G.; Nisa, S.; Schwarz, F.; Lavrik, I.; Gronewold, T.M.; Maskos, K.; Donnenberg, M.S.; et al. The E. coli effector protein NleF is a caspase inhibitor. PLoS ONE 2013, 8, e58937. [Google Scholar] [CrossRef]
  13. Gao, X.; Wang, X.; Pham, T.H.; Feuerbacher, L.A.; Lubos, M.L.; Huang, M.; Olsen, R.; Mushegian, A.; Slawson, C.; Hardwidge, P.R. NleB, a bacterial effector with glycosyltransferase activity, targets GAPDH function to inhibit NF-κB activation. Cell Host Microbe 2013, 13, 87–99. [Google Scholar] [CrossRef] [PubMed]
  14. Yen, H.; Ooka, T.; Iguchi, A.; Hayashi, T.; Sugimoto, N.; Tobe, T. NleC, a type III secretion protease, compromises NF-κB activation by targeting p65/RelA. PLoS Pathog. 2010, 6, e1001231. [Google Scholar] [CrossRef] [PubMed]
  15. Nadler, C.; Baruch, K.; Kobi, S.; Mills, E.; Haviv, G.; Farago, M.; Alkalay, I.; Bartfeld, S.; Meyer, T.F.; Ben-Neriah, Y.; et al. The type III secretion effector NleE inhibits NF-kappaB activation. PLoS Pathog. 2010, 6, e1000743. [Google Scholar] [CrossRef] [PubMed]
  16. Joshi, P.; Mishra, P.K.K. Functional Diversity of the Excretory/Secretory Proteins of Nematode Parasites. Acta Parasitol. 2022, 67, 619–627. [Google Scholar] [CrossRef]
  17. Kassa, E.G.; Zlotkin-Rivkin, E.; Friedman, G.; Ramachandran, R.P.; Melamed-Book, N.; Weiss, A.M.; Belenky, M.; Reichmann, D.; Breuer, W.; Pal, R.R.; et al. Enteropathogenic Escherichia coli remodels host endosomes to promote endocytic turnover and breakdown of surface polarity. PLoS Pathog. 2019, 15, e1007851. [Google Scholar] [CrossRef]
  18. Ogura, Y.; Ooka, T.; Iguchi, A.; Toh, H.; Asadulghani, M.; Oshima, K.; Kodama, T.; Abe, H.; Nakayama, K.; Kurokawa, K.; et al. Comparative genomics reveal the mechanism of the parallel evolution of O157 and non-O157 enterohemorrhagic Escherichia coli. Proc. Natl. Acad. Sci. USA 2009, 106, 17939–17944. [Google Scholar] [CrossRef]
  19. Petty, N.K.; Bulgin, R.; Crepin, V.F.; Cerdeño-Tárraga, A.M.; Schroeder, G.N.; Quail, M.A.; Lennard, N.; Corton, C.; Barron, A.; Clark, L.; et al. The Citrobacter rodentium genome sequence reveals convergent evolution with human pathogenic Escherichia coli. J. Bacteriol. 2010, 192, 525–538. [Google Scholar] [CrossRef]
  20. Li, T.; Li, Z.; Chen, F.; Liu, X.; Ning, N.; Huang, J.; Wang, H. Eukaryotic-like Kinase Expression in Enterohemorrhagic Escherichia coli: Potential for Enhancing Host Aggressive Inflammatory Response. J. Infect. Dis. 2017, 216, 1150–1158. [Google Scholar] [CrossRef]
  21. Allombert, J.; Vianney, A.; Charpentier, X. Monitoring Effector Translocation using the TEM-1 β-Lactamase Reporter System. Methods Mol. Biol. 2017, 1615, 489–499. [Google Scholar] [CrossRef] [PubMed]
  22. Sanchez-Villamil, J.; Tapia-Pastrana, G.; Navarro-Garcia, F. Pathogenic Lifestyles of E. coli Pathotypes in a Standardized Epithelial Cell Model Influence Inflammatory Signaling Pathways and Cytokines Secretion. Front. Cell. Infect. Microbiol. 2016, 6, 120. [Google Scholar] [CrossRef] [PubMed]
  23. Charpentier, X.; Oswald, E. Identification of the secretion and translocation domain of the enteropathogenic and enterohemorrhagic Escherichia coli effector Cif, using TEM-1 β-lactamase as a new fluorescence-based reporter. J. Bacteriol. 2004, 186, 5486–5495. [Google Scholar] [CrossRef] [PubMed]
  24. Mount, D.W. Using the Basic Local Alignment Search Tool (BLAST). CSH Protoc. 2007, 2007, pdb.top17. [Google Scholar] [CrossRef] [PubMed]
  25. UniProt Consortium, T. UniProt: The universal protein knowledgebase. Nucleic Acids Res. 2018, 46, 2699. [Google Scholar] [CrossRef]
  26. Baier, A.L.; Kline, A.C.; Feeny, N.C. Therapeutic alliance as a mediator of change: A systematic review and evaluation of research. Clin. Psychol. Rev. 2020, 82, 101921. [Google Scholar] [CrossRef]
  27. Smith, M.L.; Hahn, M.W. Phylogenetic inference using generative adversarial networks. Bioinformatics 2023, 39, btad543. [Google Scholar] [CrossRef]
  28. Huerta-Cepas, J.; Serra, F.; Bork, P. ETE 3: Reconstruction, Analysis, and Visualization of Phylogenomic Data. Mol. Biol. Evol. 2016, 33, 1635–1638. [Google Scholar] [CrossRef]
  29. Alzohairy, A.M. BioEdit: An important software for molecular biology. Gerf Bull. Biosci. 2011, 2, 60–61. [Google Scholar]
  30. Larkin, M.A.; Blackshields, G.; Brown, N.P.; Chenna, R.; McGettigan, P.A.; McWilliam, H.; Valentin, F.; Wallace, I.M.; Wilm, A.; Lopez, R.; et al. Clustal W and Clustal X version 2.0. Bioinformatics 2007, 23, 2947–2948. [Google Scholar] [CrossRef]
  31. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef] [PubMed]
  32. Eastman, P.; Swails, J.; Chodera, J.D.; McGibbon, R.T.; Zhao, Y.; Beauchamp, K.A.; Wang, L.P.; Simmonett, A.C.; Harrigan, M.P.; Stern, C.D.; et al. OpenMM 7: Rapid development of high performance algorithms for molecular dynamics. PLoS Comput. Biol. 2017, 13, e1005659. [Google Scholar] [CrossRef] [PubMed]
  33. Shao, C.; Bittrich, S.; Wang, S.; Burley, S.K. Assessing PDB macromolecular crystal structure confidence at the individual amino acid residue level. Structure 2022, 30, 1385–1394.e1383. [Google Scholar] [CrossRef] [PubMed]
  34. van Kempen, M.; Kim, S.S.; Tumescheit, C.; Mirdita, M.; Lee, J.; Gilchrist, C.L.M.; Söding, J.; Steinegger, M. Fast and accurate protein structure search with Foldseek. Nat. Biotechnol. 2024, 42, 243–246. [Google Scholar] [CrossRef]
  35. Porter, K.A.; Padhorny, D.; Desta, I.; Ignatov, M.; Beglov, D.; Kotelnikov, S.; Sun, Z.; Alekseenko, A.; Anishchenko, I.; Cong, Q.; et al. Template-based modeling by ClusPro in CASP13 and the potential for using co-evolutionary information in docking. Proteins 2019, 87, 1241–1248. [Google Scholar] [CrossRef]
  36. Zhang, Y.; Skolnick, J. TM-align: A protein structure alignment algorithm based on the TM-score. Nucleic Acids Res. 2005, 33, 2302–2309. [Google Scholar] [CrossRef]
  37. Zhang, Y.; Skolnick, J. Scoring function for automated assessment of protein structure template quality. Proteins 2004, 57, 702–710. [Google Scholar] [CrossRef]
  38. Chaoprasid, P.; Lukat, P.; Mühlen, S.; Heidler, T.; Gazdag, E.M.; Dong, S.; Bi, W.; Rüter, C.; Kirchenwitz, M.; Steffen, A.; et al. Crystal structure of bacterial cytotoxic necrotizing factor CNF(Y) reveals molecular building blocks for intoxication. EMBO J. 2021, 40, e105202. [Google Scholar] [CrossRef]
  39. Lupardus, P.J.; Shen, A.; Bogyo, M.; Garcia, K.C. Small molecule-induced allosteric activation of the Vibrio cholerae RTX cysteine protease domain. Science 2008, 322, 265–268. [Google Scholar] [CrossRef]
  40. Kenny, B.; Jepson, M. Targeting of an enteropathogenic Escherichia coli (EPEC) effector protein to host mitochondria. Cell. Microbiol. 2000, 2, 579–590. [Google Scholar] [CrossRef]
  41. Lemichez, E.; Aktories, K. Hijacking of Rho GTPases during bacterial infection. Exp. Cell Res. 2013, 319, 2329–2336. [Google Scholar] [CrossRef] [PubMed]
  42. Munro, P.; Lemichez, E. Bacterial toxins activating Rho GTPases. Curr. Top. Microbiol. Immunol. 2005, 291, 177–190. [Google Scholar] [CrossRef] [PubMed]
  43. Aleksiev, T.; Potestio, R.; Pontiggia, F.; Cozzini, S.; Micheletti, C. PiSQRD: A web server for decomposing proteins into quasi-rigid dynamical domains. Bioinformatics 2009, 25, 2743–2744. [Google Scholar] [CrossRef]
  44. Sanchez-Garrido, J.; Ruano-Gallego, D.; Choudhary, J.S.; Frankel, G. The type III secretion system effector network hypothesis. Trends Microbiol. 2022, 30, 524–533. [Google Scholar] [CrossRef]
  45. Deane, J.E.; Abrusci, P.; Johnson, S.; Lea, S.M. Timing is everything: The regulation of type III secretion. Cell. Mol. Life Sci. 2010, 67, 1065–1075. [Google Scholar] [CrossRef]
  46. Agard, D.A.; Bowman, G.R.; DeGrado, W.; Dokholyan, N.V.; Zhou, H.X. Solution of the protein structure prediction problem at last: Crucial innovations and next frontiers. Fac. Rev. 2022, 11, 38. [Google Scholar] [CrossRef] [PubMed]
  47. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Applying and improving AlphaFold at CASP14. Proteins 2021, 89, 1711–1721. [Google Scholar] [CrossRef] [PubMed]
  48. Tunyasuvunakool, K.; Adler, J.; Wu, Z.; Green, T.; Zielinski, M.; Žídek, A.; Bridgland, A.; Cowie, A.; Meyer, C.; Laydon, A.; et al. Highly accurate protein structure prediction for the human proteome. Nature 2021, 596, 590–596. [Google Scholar] [CrossRef]
  49. David, A.; Islam, S.; Tankhilevich, E.; Sternberg, M.J.E. The AlphaFold Database of Protein Structures: A Biologist’s Guide. J. Mol. Biol. 2022, 434, 167336. [Google Scholar] [CrossRef]
  50. Ruff, K.M.; Pappu, R.V. AlphaFold and Implications for Intrinsically Disordered Proteins. J. Mol. Biol. 2021, 433, 167208. [Google Scholar] [CrossRef]
  51. Hetmann, M.; Parigger, L.; Sirelkhatim, H.; Stern, A.; Krassnigg, A.; Gruber, K.; Steinkellner, G.; Ruau, D.; Gruber, C.C. Folding the human proteome using BioNeMo: A fused dataset of structural models for machine learning purposes. Sci. Data 2024, 11, 591. [Google Scholar] [CrossRef] [PubMed]
Figure 1. TEM1 (β-lactamase) translocation assay of EspN or EspS in HT-29 cells. (A) EspN-TEM1, Map-TEM1 fusion proteins, or TEM1 were expressed in E. coli O157:H7 WT or ΔescN strains. (B) EspS-TEM1, Map-TEM1, or TEM1 were expressed in E. coli O145:H28 WT or ΔescN strains. After infection with WT or ΔescN strains, HT-29 cells were loaded with CCF2/AM1 dye, and translocation was determined by a comparison of cleaved to uncleaved CCF2/AM1, which emits blue and green fluorescence, respectively. White bars indicate 10 μm.
Figure 1. TEM1 (β-lactamase) translocation assay of EspN or EspS in HT-29 cells. (A) EspN-TEM1, Map-TEM1 fusion proteins, or TEM1 were expressed in E. coli O157:H7 WT or ΔescN strains. (B) EspS-TEM1, Map-TEM1, or TEM1 were expressed in E. coli O145:H28 WT or ΔescN strains. After infection with WT or ΔescN strains, HT-29 cells were loaded with CCF2/AM1 dye, and translocation was determined by a comparison of cleaved to uncleaved CCF2/AM1, which emits blue and green fluorescence, respectively. White bars indicate 10 μm.
Genes 15 01250 g001
Figure 2. Evolutionary tree diagram of EspN and its homologs. Phylogenetic relationships of EspN and its homologs in the Uniprot-kb database. The species origin of the proteins was identified and indicated with different colors.
Figure 2. Evolutionary tree diagram of EspN and its homologs. Phylogenetic relationships of EspN and its homologs in the Uniprot-kb database. The species origin of the proteins was identified and indicated with different colors.
Genes 15 01250 g002
Figure 3. Multiple alignments of EspN and its homologs. Type III secretion system effector protein—EspN (from E. coli O157:H7 strain) was found to be homologous to five other proteins, including R545_13515 (from S. enterica subsp. diarizonae serovar strain), SBG_1491 (from S. bongori NCTC 12419 strain), A2I20_05820 (from S. choleraesuis strain), BEU62_20775 (from S. newport strain), ROD_47811 (from C. rodentium strain). The EspN family sequences were obtained from Uniprot in FASTA format and compared with multiple sequences using BioEdit software. The invariant bases are indicated with the same color. ‘.’ indicates that one of the following ‘weaker’ groups is fully conserved. ‘:’ indicates that one of the following ‘strong’ groups is fully conserved. ‘*’ indicates positions which have a single, fully conserved residue. Highly conserved regions of the sequence are boxed.
Figure 3. Multiple alignments of EspN and its homologs. Type III secretion system effector protein—EspN (from E. coli O157:H7 strain) was found to be homologous to five other proteins, including R545_13515 (from S. enterica subsp. diarizonae serovar strain), SBG_1491 (from S. bongori NCTC 12419 strain), A2I20_05820 (from S. choleraesuis strain), BEU62_20775 (from S. newport strain), ROD_47811 (from C. rodentium strain). The EspN family sequences were obtained from Uniprot in FASTA format and compared with multiple sequences using BioEdit software. The invariant bases are indicated with the same color. ‘.’ indicates that one of the following ‘weaker’ groups is fully conserved. ‘:’ indicates that one of the following ‘strong’ groups is fully conserved. ‘*’ indicates positions which have a single, fully conserved residue. Highly conserved regions of the sequence are boxed.
Genes 15 01250 g003
Figure 4. Protein structure prediction and analysis of EspN. (A) The 3D structure of EspN was generated using Alphafold2. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation. (B) The foldseek server was used to perform a structural similarity search on the conserved C-terminal (673–1133 a.a.) of EspN. The predicted structures were analyzed and compared using PyMOL v2.5 software to identify the structural differences. The TM-score is calculated from the similarity between the protein structures. The structure of 6yhn has the highest TM score (0.5157).
Figure 4. Protein structure prediction and analysis of EspN. (A) The 3D structure of EspN was generated using Alphafold2. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation. (B) The foldseek server was used to perform a structural similarity search on the conserved C-terminal (673–1133 a.a.) of EspN. The predicted structures were analyzed and compared using PyMOL v2.5 software to identify the structural differences. The TM-score is calculated from the similarity between the protein structures. The structure of 6yhn has the highest TM score (0.5157).
Genes 15 01250 g004
Figure 5. Evolutionary tree diagram of EspS and its homologs. Phylogenetic relationships of EspS and its homologs in the Uniprot-kb database. The species origin of the proteins was identified and indicated with different colors.
Figure 5. Evolutionary tree diagram of EspS and its homologs. Phylogenetic relationships of EspS and its homologs in the Uniprot-kb database. The species origin of the proteins was identified and indicated with different colors.
Genes 15 01250 g005
Figure 6. Multiple alignments of EspS and its homologs. Type III secretion system effector protein—EspS (from E. coli O145:H28 strain) was found to be homologous to four other proteins, including protein G8B04_001499 (from S. enterica strain), S773_15875 (from S. rubislaw strain), OspB (from V. parahaemolyticus strain), NCTC8524_04980 (from S. flexneri strain). The EspN family sequences were obtained from Uniprot in FASTA format and compared with multiple sequences using BioEdit software. The invariant bases are indicated with the same color. ‘.’ indicates that one of the following ‘weaker’ groups is fully conserved. ‘:’ indicates that one of the following ‘strong’ groups is fully conserved. ‘*’ indicates positions which have a single, fully conserved residue. Highly conserved regions of the sequence are boxed.
Figure 6. Multiple alignments of EspS and its homologs. Type III secretion system effector protein—EspS (from E. coli O145:H28 strain) was found to be homologous to four other proteins, including protein G8B04_001499 (from S. enterica strain), S773_15875 (from S. rubislaw strain), OspB (from V. parahaemolyticus strain), NCTC8524_04980 (from S. flexneri strain). The EspN family sequences were obtained from Uniprot in FASTA format and compared with multiple sequences using BioEdit software. The invariant bases are indicated with the same color. ‘.’ indicates that one of the following ‘weaker’ groups is fully conserved. ‘:’ indicates that one of the following ‘strong’ groups is fully conserved. ‘*’ indicates positions which have a single, fully conserved residue. Highly conserved regions of the sequence are boxed.
Genes 15 01250 g006
Figure 7. Protein structure prediction and analysis of EspS. (A) The 3D structure of EspS was generated using Alphafold2. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation. (B) The foldseek server was used to perform a structural similarity search on the conserved C-terminal (146–291 a.a.) of EspS. The predicted structures were analyzed and compared using PyMOL software to identify the structural differences. The TM-score is calculated from the similarity between the protein structures. The structure of 3eeb has the highest TM score (0.5014).
Figure 7. Protein structure prediction and analysis of EspS. (A) The 3D structure of EspS was generated using Alphafold2. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation. (B) The foldseek server was used to perform a structural similarity search on the conserved C-terminal (146–291 a.a.) of EspS. The predicted structures were analyzed and compared using PyMOL software to identify the structural differences. The TM-score is calculated from the similarity between the protein structures. The structure of 3eeb has the highest TM score (0.5014).
Genes 15 01250 g007
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, Z.; Hu, Y.; Song, Y.; Li, D.; Yang, X.; Zhang, L.; Li, T.; Wang, H. Diversity, Distribution and Structural Prediction of the Pathogenic Bacterial Effectors EspN and EspS. Genes 2024, 15, 1250. https://doi.org/10.3390/genes15101250

AMA Style

Li Z, Hu Y, Song Y, Li D, Yang X, Zhang L, Li T, Wang H. Diversity, Distribution and Structural Prediction of the Pathogenic Bacterial Effectors EspN and EspS. Genes. 2024; 15(10):1250. https://doi.org/10.3390/genes15101250

Chicago/Turabian Style

Li, Zhan, Yuru Hu, Yuan Song, Deyu Li, Xiaolan Yang, Liangyan Zhang, Tao Li, and Hui Wang. 2024. "Diversity, Distribution and Structural Prediction of the Pathogenic Bacterial Effectors EspN and EspS" Genes 15, no. 10: 1250. https://doi.org/10.3390/genes15101250

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop