Functional Diversity and Engineering of the Adenylation Domains in Nonribosomal Peptide Synthetases

Zhang, Mengli; Peng, Zijing; Huang, Zhenkuai; Fang, Jiaqi; Li, Xinhai; Qiu, Xiaoting

doi:10.3390/md22080349

Open AccessReview

Functional Diversity and Engineering of the Adenylation Domains in Nonribosomal Peptide Synthetases

by

Mengli Zhang

,

Zijing Peng

,

Zhenkuai Huang

,

Jiaqi Fang

,

Xinhai Li

and

Xiaoting Qiu

^*

College of Food Science and Engineering, Ningbo University, Ningbo 315800, China

^*

Author to whom correspondence should be addressed.

Mar. Drugs 2024, 22(8), 349; https://doi.org/10.3390/md22080349

Submission received: 4 July 2024 / Revised: 23 July 2024 / Accepted: 27 July 2024 / Published: 29 July 2024

(This article belongs to the Section Synthesis and Medicinal Chemistry of Marine Natural Products)

Download

Browse Figures

Versions Notes

Abstract

:

Nonribosomal peptides (NRPs) are biosynthesized by nonribosomal peptide synthetases (NRPSs) and are widely distributed in both terrestrial and marine organisms. Many NRPs and their analogs are biologically active and serve as therapeutic agents. The adenylation (A) domain is a key catalytic domain that primarily controls the sequence of a product during the assembling of NRPs and thus plays a predominant role in the structural diversity of NRPs. Engineering of the A domain to alter substrate specificity is a potential strategy for obtaining novel NRPs for pharmaceutical studies. On the basis of introducing the catalytic mechanism and multiple functions of the A domains, this article systematically describes several representative NRPS engineering strategies targeting the A domain, including mutagenesis of substrate-specificity codes, substitution of condensation-adenylation bidomains, the entire A domain or its subdomains, domain insertion, and whole-module rearrangements.

Keywords:

adenylation domain in nonribosomal peptide synthetase; substrate-specificity codes of the adenylation domain; interrupted adenylation domain with methylase activity; nonribosomal peptide synthetase engineering targeting the adenylation domain

1. Introduction

In the natural environment, a peptide biosynthesis mechanism exists that does not rely on ribosomes. This process is frequently observed in bacteria and fungi and is capable of catalyzing the assembly of diverse natural peptides [1]. There are more than 500 types of monomers that have been discovered to be incorporated in such peptides [2], including amino acids (proteinogenic and nonproteinogenic amino acids), α-hydroxy acids, α-keto acids, and other types of acyl monomers [3,4,5]. This pathway is commonly referred to as the nonribosomal peptide synthetase (NRPS) pathway, which is a peptide assembly line composed of a sequence of monomer-specific mega-enzyme units, distinct from other types of ribosome-independent peptide synthesis pathways, such as those utilizing tRNA-independent acyl-AMP-ligases, ATP-grasp-ligases, tRNA-dependent cyclodipeptide synthases, and Fem-like ligases, as well as hybrid pathways for producing peptide bond containing secondary metabolites [6].

Marine microorganisms represent a significant reservoir of biological diversity on Earth, characterized by extensive genetic variability and the capacity to produce biologically active natural products. Research has demonstrated the presence of NRPS pathways in a variety of marine organisms [7,8], highlighting the potential of marine microorganisms as an important source of novel nonribosomal peptides (NRPs) metabolites [7,9].

Differences in substrates, the number of domains, and order in NRPS have led to the discovery of various peptide products synthesized through these pathways. Although certain NRPs can be toxic [10], the majority are bioactive secondary metabolites. Additionally, some NRPs offer unique advantages for human health. Many existing drugs are derived from NRPs or compounds biosynthesized by hybrid NRPS-polyketide synthase (PKS) pathways. In 2000, prepatellamide A, a cyclic peptide with cytotoxicity against P388 murine leukemia cell lines was isolated from the cytotoxic extracts of the ascidian Lissoclinum patella [11]. Additionally, two linear tetrapeptides, padanamides A and B, were identified in the crude extract of a Streptomyces species obtained from marine sediment, both exhibiting cytotoxic activity against Jurkat T lymphocyte cells (ATCC TIB-152), with padanamide B being more potent [12]. More recently, three hybrid NRPS-PKS metabolites, referred to as Flavipesides A−C, were discovered to be produced by a particular strain of filamentous fungus, Aspergillus flavipes 164013, which was isolated from the sponge Dysidea species found on Yongxing Island in the South China Sea. These metabolites can inhibit pancreatic lipase, potentially preventing hyperlipidemia and obesity without showing toxicity to normal cells [13]. A significant number of antiviral, antifungal, and antitumor drugs, as well as immunosuppressants, are NRPs, with 70% of these NRPs discovered in marine microorganisms [14].

NRPS is typically composed of multiple modules, the units that conduct the assembly of building blocks for generating NRP [15] (Figure 1). The first synthesis module in the assembly line is referred to as the initiation module, usually consisting of an adenylation (A) domain and a peptidyl carrier protein (PCP) domain, also known as the thiolation (T) domain. A domain specifically activates the acyl monomer substrate and subsequently loads the activated substrate to the 4′-phosphopantetheine (Ppant) arm in the T domain, which enables the monomer substrate to be loaded for connection with another monomer bound to the T domain in the downstream module, which is referred to as the elongation module. The elongation modules, located downstream of the initiation module, sequentially add monomers to the growing peptide chain and typically contain three core domains: A domain, T domain, and condensation (C) domain that catalyzes the formation of peptide bonds between T domain-bound monomers and peptides. In elongation modules, the core domains are typically arranged in a C-A-T order for the sequential incorporation of amino acids or other acyl monomers in order (Figure 1). As the peptide chain elongates, substrates move from the initial module to the termination module as the peptide chain grows. At the end of NRP synthesis, the release of the final product is typically catalyzed by a thioesterase (TE) domain located at the C-terminus of the termination module through hydrolysis or cyclization [16,17,18,19,20,21,22]. In addition to the core domains mentioned above, some NRPSs also possess additional optional domains such as the epimerization (E) domains in tyrocidine synthetases TycA and TycB from Bacillus brevi [23], the reduction (R) domain in NRPSs involved in soramycin synthesis, and the oxidation (Ox) domain involved in bleomycin synthesis [24]. These domains also play essential roles in NRP synthesis and release, contributing to the structural diversity observed in NRPs.

The organization of NRPS into multiple modules typically follows the principle of colinearity, where the specificity of substrates for a series of modules corresponds to the sequence of the final product. Within the NRPS assembly line, the A domain serves as the first domain required to load substrates for generating peptide chains, acting as the primary gatekeeper of substrate specificity in NRPS. Each A domain generally exhibits specificity towards a particular substrate [25], although some studies have indicated that certain types of A domains can specifically recognize multiple substrates [26]. The substrate specificity of various A domains has been biochemically characterized, demonstrating the dominant influence of the sequence of the A domain on the final peptide product sequence [27,28]. Advances in genomic sequencing and functional analysis of adenosylase have gradually improved the accuracy of bioinformatics tools for predicting substrate specificity of the A domain [29]. Various prediction methods, such as the AdenPredictor extra tree machine learning model, have been developed to improve the accuracy of substrate specificity prediction for A domains, facilitating research in this area. [30,31,32]. Although bioinformatics prediction tools are not always accurate, especially for rare substrates such as nonproteinogenic amino acids and several unique monomers derived from marine organisms [33], they can serve as auxiliary tools in guiding the discovery of novel NRPs [34].

The NRPS A domains have been identified in more than 90,000 protein sequences, predominantly found in actinobacteria, proteobacteria, bacilli, fungi, and cyanobacteria, as reported by the InterPro database [34]. This database serves as a valuable resource for researchers in screening target strains and expediting NRPS engineering studies to generate a wider array of novel NRPs with varied functionalities. Historically, the research on NRP engineering efforts has concentrated on the A domain, offering numerous avenues for NRP customization. In this article, based on a brief overview of the functions of A domains in terms of the mechanism of substrate adenylation and other auxiliary functions, several representative types of engineering strategies of A domains, including mutagenesis of substrate-specificity codes, the substitution of the C-A bidomain, the entire A domain or its subdomains, domain insertion, and rearrangements at the level of the whole module, are discussed in detail, providing insights for further comprehensive research in this field.

2. Major Function and Functional Diversity of A Domains

2.1. Substrate-Activation Function of A Domain

Studies on the A domain-catalyzed incorporation of monomer substrates into peptide chains can be categorized into two primary areas: one is the substrate recognition mechanism of the A domain, while the other is the mechanism of substrate adenylation catalyzed by the A domain.

2.1.1. Substrate Recognition Mechanism of A Domain

The characteristics of specific recognition of substrate in the A domain can be identified by the substrate-specificity code consisting of a series of key residues. Two decades ago, the relationship between the active site of the A domain and substrate specificity was revealed based on the crystal structure of the first A domain of Gramicidin S synthetase 1 (GrsA-A) and its complex with a substrate, leading to the introduction of the concept of substrate-specificity codes, also known as the substrate binding pocket [27,35]. Table 1 lists the 10 substrate-specificity codes of several A domains that correspond to L-α-amino acids. It demonstrates that A domains recognizing the same substrate exhibit highly similar substrate-specificity codes [35], which can exhibit variability within a certain range, indicating the diverse recognition modes of L-α-amino acids.

Amino Acid Recognition Mechanism of A Domain

Considerable research efforts have been dedicated to the production of non-natural NRPs via biosynthesis, involving the incorporation of substrates tailored to meet the requirements for the synthesis of final products. Noteworthy observations have emerged from these investigations. For example, within the α-amino acid substrate-selective A domain, the Asp235, and C-terminal Lys residue are highly conserved, as they interact specifically with the α-amino and α-carboxylate groups of the α-amino acid, respectively, to define its positioning and orientation during catalysis. The remaining eight specificity-coding residues are involved in the recognition of side chains of α-amino acid substrate. A comprehensive database has been established by gradually accumulating data on the relationship between enzyme sequences and substrate specificity, which aids in enhancing the accuracy of predicting substrate specificity for additional A domains [36,37,38].

Table 1. Several typical substrate-specificity codes of A domains correspond to L-α-amino acids.

	Substrate	235	236	239	278	299	301	322	330	331	517	Reference
GrsA	Phe	D	A	W	T	I	A	A	I	C	K	[39]
TycA	Phe	D	A	W	T	I	A	A	I	C	K	[39]
SrfA-B	Val	D	A	F	W	I	G	G	T	F	K	[35]
CcsA-M9	Val	D	A	W	M	F	A	A	V	L	K	[39]
GrsB	Val	D	A	F	W	I	G	G	T	F	K	[39]
CepA	Tyr	D	A	S	T	V	A	A	V	C	K	[39]
TycC	Tyr	D	A	L	T	T	G	E	V	V	K	[39]

Keto Acid Recognition Mechanism of A Domain

Previous studies have predominantly focused on amino acid-selective A domains, but various types of A domains have been identified to recognize other classes of acyl substrates. For instance, the A domain of StsA (StsA-A) specifically recognizes α-keto acid. StsA-A employs a unique mechanism to differentiate between α-keto acids, α-amino acids, and α-hydroxy acids, favoring the adenylation of α-keto acid and transferring the adenylated intermediate to T domain. The T domain then transports the α-keto acid to the ketoreduction (KR) domain to generate an α-hydroxy acid monomer through stereoselective reduction of the α-keto group. Subsequently, the α-hydroxy acid monomer-loaded T domain moves to the downstream C domain within the module for condensation, leading to the formation of ester bonds (or directly enters the C domain of the downstream module in the case of the initial module A-KR-T) [40,41].

2.1.2. Catalytic Mechanism of the A Domain

The A domain catalyzes the activation and loading of the acyl monomer in an ATP-dependent manner. Taking amino acid substrate as an example, A domain catalyzes two sequential reactions involving the activation of amino acid to generate an aminoacyl-adenosine monophosphate (AMP) intermediate, followed by the transfer of this intermediate to the 4’phosphopantetheine (Ppant) group in the downstream T domain [42,43] (Figure 2).

The structure of the A domain can be divided into two parts: the larger N-terminal core domain A_core (with a size of approximately 50 kDa) and the smaller C-terminal domain A_sub (with a size of approximately 10 kDa). During the catalytic reaction process, the A domain undergoes a conformational change. Initially, the A domain adopts an open conformation, where both the A_core domain and A_sub domain are in an open state to accommodate substrate and ATP. Subsequently, the A domain transitions to the catalytic conformation, leading to a shrinkage of the cavity between the A_core and A_sub domains to generate aminoacyl-AMP. Finally, the A_sub domain is displaced, resulting in the binding of aminoacyl-AMP to the T domain, and the A domain reverts to the open conformation [44].

Certain A domains require interaction with small-sized protein activators, such as MbtH-like proteins (MLPs), to fulfill their catalytic function effectively. MLPs, named after the MbtH protein in the mycobactin operon from Mycobacterium tuberculosis [45], are a class of proteins consisting of 70 amino acids. Previous research has investigated the substrate preference of the A domain of Hrm, an NRPS module of hormaomycin. When attempts to express heterologous proteins failed, it was discovered that some A domains exhibited activity only in the presence of co-expressed MLPs, highlighting the A domain’s reliance on MLPs [46,47]. Subsequently, Herbst et al. expressed and purified SlgN1, a 3-methylaspartate-adenylating enzyme with an N-terminal MLP. They designed an expression construct for SlgN1-ΔA_sub, a fusion protein containing an MLPs and an A_core domain, by removing the C-terminal A_sub domain to obtain the crystal structure of this adenylation enzyme. Crystal structure analysis demonstrates that MLP is far from the active center and does not directly interact with the substrate. Mutation of Ala433 to Glu in MLP abolished the activity, and mutation of Ser23 located on the MLP interaction surface to Tyr decreased the activity. However, the activity was restored upon the addition of a complete MLP, providing the first direct evidence and functional characterization for this binding mode and supporting the significant impact of MLP on A domain activity [48]. Additionally, other studies have also shown that MLPs can enhance the activity of the A domain [45], so MLPs are necessary for the activity of some particular NRPS. This makes them promising drug targets for developing antibacterial compounds that disrupt the iron carrier of pathogens [49]. Acyl CoA synthetase and luciferase, members of the ANL adenylase superfamily like A domain, exhibit similar catalytic functions. Despite differences in the overall catalysis process of these two types of reactions, they generally follow a two-step mechanism involving ATP to form an adenylate intermediate and release pyrophosphate (PPi) to activate the substrate in the first step, followed by the release of AMP in the second step [42].

2.2. Auxiliary Functions of A Domains

The substrate-specificity codes of the A domain are composed of 10 elements (a1–a10), which are split into N-terminal (a1–a7) and C-terminal (a8–a10) codes. Some A domains exhibit special structures, with additional catalytic regions such as the methylation (M) domain, Ox domain, and KR domain inserted into the domain. Typically, these insertions occur between codes a8 and a9 and sometimes between a2 and a3. A domain with such interruptions is referred to as an interrupted A domain [50,51,52]. The interrupted A domain exhibits two types of functions: its intrinsic adenylation function and the function corresponding to the insertion domain. In the case of an interrupted A domain with M domain insertion, methylation modification is performed using S-adenosyl-L-methionine (SAM) as the methyl donor, and the inserted M domains do not accept free substrate. Adenylation occurs first to activate the substrate, and methylation takes place only after the linkage of the activated substrate to the T domain [53]. M domains in the interrupted A domains are generally divided into two main categories: those responsible for main-chain N-methylation and the other corresponding to side-chain O- or S-methylation. The type of methylation is generally denoted by subscript notation, with “b” in M_b referring to main-chain methylation and “s” in M_s referring to side-chain methylation [54].

Several studies demonstrate that the most common form of interrupted domains is the insertion between a8 and a9 codes, as the case in the NRPS module of Kutznerides. Kutznerides are cyclic hexadepsipeptides produced by the soil actinomycete Kutzneria sp. 744 through the action of three NRPS modules, namely KtzE, KtzG, and KtzH. The A domain of KtzH contains an interruption between a8 and a9 codes by an M domain, which is responsible for catalyzing the activation and O-methylation of L-Ser [55] (Figure 3). Studies of TioS, synthetase of the anticancer peptide thiocoraline derived from marine actinomycetes, have shown that interruptions occur in the A domains in both the third and fourth modules of TioS. The N-methylation domains are also generally inserted between a8 and a9 codes. Unlike the typical fusion of engineered proteins through flexible connecting regions, the connection region between the M and A domains is structured, thereby maintaining the normal folding of the A domain [53,56]. An example of this interruption is seen in TioN, a component of the thiocoraline biosynthetase, where the A domain is interrupted between a2 and a3 codes by an M domain, specifically denoted as TioN (A-M-A) [50]. This interrupted A domain exhibits both adenylation and methylation activities for L-Cys (Figure 3). A similar pattern of domain insertion is observed in MarQ, the maremycin biosynthetic module from a marine bacterium Streptomyces sp. B9173, where the M domain is also inserted between a2 and a3 codes of the A domain [57] (Figure 3). Studies have revealed that some interrupted A domains play an active role in biosynthesis, leading to the production of methylated analogs with enhanced oral bioavailability compared to the original compounds [58]. To provide a comprehensive overview of interrupted domains, a partial list of known interrupted domains has been compiled in Table 2, providing details on methylation products, interruption sites as well as methylation types. This information can serve as a reference for future research on the modification of NRP-derived drugs generated by interrupted A domains.

3. Engineering of A Domains

Investigating the substrate specificity of the A domain is particularly crucial because of its primary role in controlling substrate specificity during the process of NRP biosynthesis. In addition to altering substrate-specificity codes, methods such as splicing or domain substituting of NRPS modules are utilized to change the diversity of NRPS product peptides [63]. Not only individual domains but recognition subdomains and even entire modules can also be the target for substitution. These methods are demonstrated below and summarized in Table 3. Engineering modifications are mostly performed using Escherichia coli as the expression host, which has proved to be efficient for genetic engineering manipulations of biosynthetic gene clusters and the recombinant expression of their encoded proteins.

3.1. Site-Specific Mutation of Substrate Specificity Codes

Studies on site-directed mutagenesis of residues at substrate recognition sites and insertion of partial peptide segments have been carried out for a long time, leading to significant advancements in the field. For example, researchers performed mutations in the substrate-specificity codes that recognize Glu in the A domain of the initiation module responsible for lipopeptide surfactin biosynthesis from Bacillus subtilis. The introduction of the K239E mutation resulted in enhanced selectivity towards L-Gln, resulting in the generation of surfactin variants [64]. In another study, researchers selected eight key residues associated with substrate specificity in the A domain that specifically recognizes phenylalanine (PheA) in gramicidin S synthetase as mutation targets. Among these mutants, the mutation of W239S resulted in approximately a threefold higher preference for L-Tyr over L-Phe [65]. These achievements have laid the foundation for further studies on the substrate-specificity code mutations within the A domain. Similarly, during the study on how to integrate synthetically produced non-natural amino acids into calcium-dependent antibiotics (CDA), Thirlway et al. employed site-directed mutations to alter the substrate specificity of the A domain in the first module of CDA PS3. The substitution of Lys at position 278 with Gln (K278Q) successfully yielded a CDA variant containing glutamine. This study serves as a foundational example for future research endeavors aiming to introduce non-natural amino acids by manipulating the substrate specificity of the A domain in NRPS [66].

For A domains with dual or multiple substrate specificities, site-directed mutagenesis of substrate recognition residues can transform them into single-specificity A domains to generate the desired products [26]. An illustration of this process is demonstrated in the production of ohmyungsamycins (OMSs), a group of macrocyclic peptides with anticancer activity produced by a marine bacterial strain belonging to the Streptomyces genus. Within the biosynthesis pathway of OMS, the A domain in the second module can specifically recognize two amino acids, L-Val and L-Ile, respectively, corresponding to the formation of OMS-A and OMS-B, with OMS-A exhibiting stronger activity against cancer cells. By introducing double mutations (G299W and A322G), the yield of OMS-A was effectively increased [67]. Similarly, lyngbyatoxin (LTX) produced by marine cyanobacteria is known for its capacity to activate protein kinase C (PKC), a trait with therapeutic implications for PKC-related pathological conditions. Studies using the heterologously expressed protein of the double module NRPS LtxA in E. coli illustrated that the first A domain in the binary module can activate a variety of substrates, such as Val, Leu, and Ile. After site-specific mutations of Y239 and W299, substrate-specificity changes in this A domain were observed: the W299L mutation notably increased activity towards Val and Leu, while the Y239M mutation resulted in a higher specific preference for Leu [68].

3.2. Substitution of the A Domain

Because the A domain plays a primary role in the substrate recognition process within NRPS, this domain is usually a priority target for NRPS domain substitution. This concept was proposed and experimentally studied in the 1990s, with subsequent experiments gradually confirming the significant role of substituting either the entire A domain or a portion of it in the diversification of NRPSs [69]. Marahiel et al. substituted the A domain that recognizes Leu in SrfA-C from Bacillus subtilis with an A domain recognizing Cys (Figure 4A). Through this substitution, they successfully constructed a hybrid peptide synthetase with altered amino acid substrate specificity. This innovative approach led to the recombination of SrfA, enabling the synthesis of surfactin analogs with hemolytic activity in Bacillus subtilis for the first time [70].

Substitution of the A domain may not always yield the desired outcomes, as it can result in compromised activity, reduced peptide product yield, and abnormal expression of recombinant proteins. Over the years, researchers have been actively exploring effective strategies for the substitution of A domains. PvdD, a dual-module NRPS in Pseudomonas aeruginosa PAO1 responsible for incorporating the last two Thr residues into pyoverdine, has been a subject of study in this regard. By substituting the A domain in the second module of PvdD with Thr-selective A domains from different bacterial strains, researchers were able to synthesize pyoverdine successfully. Conversely, A domains that were not selective for Thr only produced minimal amounts of pyoverdine compounds, indicating a lack of expected functionality under the new environment [71,72]. Subsequent experiments involved substituting the A domain in the second module of the PvdD module, along with the linker region (an approximately 36-residue sequence extending from the C-terminus of the last helix in the C domain to the first helix in the A domain), resulting in increased yield of pyoverdine products. Furthermore, nine A domains from Pseudomonas species that activate other substrates were randomly selected and replaced into the second module of PvdD (Figure 4B), which led to a significant enhancement in the yields of six of these A domain-exchanged hybrids [69]. More studies are continuously attempting to optimize domain exchange methods, with progress being made in addressing some of the associated challenges.

3.3. Substitution of the Recognition Subdomain of the A Domain

Previous studies suggest that substrate-specificity codes are located within a subdomain of the A domain, referred to as the recognition subdomain (RS) (Figure 5) [73]. To modify the substrate specificity of the A domain, one can employ a technique involving the substitution of the substrate-specificity code by replacing the RS segment of the A domain. Kries et al. pinpointed the RS within the A domains of gramicidin S synthetases, GrsA and GrsB, and exchanged these subdomains into GrsA to investigate their amino acid substrate recognition patterns. Among the nine different RS substitutions, the subdomain from GrsB that recognizes Val exhibited the same amino acid-specific recognition mode in GrsA. Furthermore, through the construction of a module containing RS-substituted GrsA and GrsB1, they successfully obtained the desired product, demonstrating the feasibility of altering the substrate specificity of the A domain through subdomain substitution [74].

The exchange of subdomains has a negligible impact on the overall structure of the A domain, allowing for the preservation of crucial interactions with other domains within the assembly line. Notably, the effects of subdomain exchange are more pronounced when dealing with homologous A domains that share high sequence similarities. For instance, by substituting the flavodoxin-like subdomain (FSD, containing key active site residues within the A domain) from the second A domain of EndA, which recognizes L-Thr in endracidin biosynthesis, with the FSD from EndC that recognizes L-Ser (with 88% identity and 89% similarity) (Figure 6), an endracidin variant with L-Ser replacing L-Thr was successfully obtained [75].

3.4. Domain Insertion

Interrupted A domains, as described in 2.2, are a distinct form of A domains that can be generated by removing the insertion domains, exchanging insertion domains, or artificially disrupting uninterrupted A domains to achieve diversity in the resultant peptide products. Shrestha et al. removed a portion of the M_S domain (The domain previously referred to as the M_H domain) between a8 and a9 codes of the interrupted A domain in KtzH, thereby designing a new uninterrupted A domain (Figure 7A). In a separate experiment, they replaced part of the interrupted M_S domain with the M domain from TioN (another interrupted A domain) (Figure 7B). The observations of these experiments showed that the complete uninterrupted A domain, with the M_S domain removed, did not exhibit methylase activity. Moreover, by exchanging the M_S with a segment of the methylase domain lacking significant sequence similarity in a naturally interrupted A domain, a new dual-functional A domain was created, retaining its adenylation and methylation activities [76]. Lundy et al. employed a different experimental strategy and observed that when a naturally uninterrupted A domain (Ecm6) was inserted with non-homologous M domains (from KtzH and TioS, respectively), the artificially interrupted A domain also exhibited dual functionality [77] (Figure 7B). Additionally, the linker region between the M domain and the A domain exhibited structured folding without impacting the normal folding of the A domain [56]. Subsequently, researchers achieved A domains with dual interruptions by inserting a main-chain N-methylase domain between a8 and a9 codes and inserting a side-chain S-methylase domain between a2 and a3 codes, enabling the A domain to acquire the functions of adenylation, N-methylation, and S-methylation concurrently [78]. However, an alternative approach involving the simultaneous insertion of two M domains between the interrupted a8 and a9 codes did not show detectable methyltransferase activity, although the adenylation activity and substrate specificity of the A domain remained unaffected [79]. These findings open up new possibilities for exploring the use of auxiliary domains such as MOx, Ox, and KR domains present in NRPS modules to generate interrupted A domains with novel functions. They provide valuable insights into the current research aimed at developing new bifunctional and trifunctional enzymes with adenylation and other activities derived from single-functional A domains in various NRPS systems.

3.5. Substitution of C-A Bidomain

Studies have shown that the C domain, which catalyzes the formation of peptide bonds between substrates, has a certain impact on the substrate selection specificity of the A domain. For example, in the A-T double domain of McyB from Microcystis aeruginosa PCC 7806, various amino acids such as Leu, Val, Ile, and Tyr can be activated. However, the presence of the C domain on the receptor side biases the substrate preference towards Leu. Similarly, with the C domain present, the McyC module also exhibits a strong preference for the substrate Arg. These observations indicate that the C domain can significantly influence the substrate specificity of the A domain [80].

Several studies have investigated the domain exchange at the level of the C-A bidomain. Researchers successfully enhanced the production of pyoverdine, a nonribosomal peptide siderophore from Pseudomonas aeruginosa, by substituting the C-A bidomain of the second module in PvdD [71]. However, some researchers were unable to create active hybrid enzymes by exchanging heterologous C-A bidomains, suggesting that the substituted C-A bidomains probably are unable to dock properly with the upstream module [81]. Further research on the domain arrangement of the SrfA-C module revealed a linker region containing thirty-two residues between the C and A domains that tightly connects these two domains; as demonstrated in the crystal structure of SrfA-C, the C domain is tightly bound to the A domain to form a stable platform for T domain and TE domain (Figure 8) [82]. Therefore, the influence of the C domain on substrate specificity should also be considered during domain substitution. For example, the substitution of the C4-A10 bidomain in the second module of PvdD involved in pyoverdine biosynthesis from P. aeruginosa led to the production of a desired pyoverdine analog, indicating that maintaining the integrity of C-A bidomain may be beneficial [83]. This is also shown by the work of Helge Bode’s group, who introduced a concept called the eXchange unit (XU). The primary forms of XU are A-T-C and A-T-C/E (epimerization domain), allowing exchanges of the C and A domains. The XU can only be applied for exchange when the downstream domains recognize the identical substrate due to the impact of the C domain on substrate specificity [84]. Subsequently, the eXchange unit condensation domain (XUC) was proposed, focusing on overcoming the substrate recognition restriction of the C domain. The form of XUC is C_Asub-A-T-C_Dsub, where C_Asub represents the acceptor site (approximately the latter half), and C_Dsub represents the donor site (approximately the former half) [85]. Recently, another concept, the eXchange unit between T domains (XUT), was established in the form T_1/2-C-A-T_1/2 [86]. The discovery of these new exchange sites has expanded the flexibility and diversity of recombining NRPSs.

3.6. NRPS Engineering by Whole-Module Rearrangements

In addition to the methods of engineering mentioned above, there is another form of engineering at the module level. Engineering of NRPS at the module level to produce novel peptides is mainly achieved through methods such as module swapping, module deletion, and module addition. For instance, within the daptomycin synthetase, module 8 and module 11 are responsible for recognizing D-Ala and D-Ser, respectively. Researchers interchanged these two modules, observing a decrease in the yield of daptomycin analogs compared to the original strain, yet demonstrating the feasibility of module swapping [72]. Additionally, Mootz et al. explored the deletion of an entire module in the NRPS assembly line to generate cyclic peptides with reduced sizes. By removing the SrfA-A2 module that recognizes Leu in the surfactin NRPS and directly connecting module 1 and module 3, the amino acid recognized by the deleted module was removed, and the product was transformed from the original heptapeptide to a hexapeptide variant referred to as ∆2-surfactin (the product of the second module deletion version of SrfA-A) [87] (Figure 9A).

These methods were also successfully applied in the engineering of NRPS-PKS hybrid modules. Awakawa et al. deleted the coding sequence of the NatD module (domains arranged as C-A-KR-T-TE) from the neoantimycin (tetra-lactone) biosynthetic gene cluster Nat, resulting in a contraction of cycle size and the production of a tri-lactone compound [88]. Additionally, they conducted module extensions to increase the macrocycle size of JBIR-06 (tri-lactone) by inserting the coding gene of the NatD module into the JBIR-06 macrolide biosynthetic gene cluster (Sml cluster). To enable proper expression of the inserted NatD module in SmlC, the C-terminal docking domain of NatC was substituted with the linker and TE domain in the original position of the SmlC module. This substitution maintained module-module interactions, resulting in the production of tetra-lactone products [88]. The NRPS responsible for synthesizing the vancomycin-type glycopeptide antibiotic balhimycin consists of seven modules. Among them, BspA and BspB each contain three modules, while BspC contains only one module. By inserting a hydroxyphenylglycine (Hpg)-selective composite module composed of the T-E bidomain from module 4 and the C-A bidomain from module 5 between modules 4 and 5 of BspB (Figure 9B), researchers detected an octapeptide and a heptapeptide both containing three Hpg residues through product analysis by using HPLC-ESI-MS/MS [89]. Taken together, the successful outcomes of these experiments described above provide valuable examples for further comprehensive investigations into whole-module rearrangements.

Table 3. Several typical engineering methods targeting the A domains.

Engineering Method	Details	Reference
Site-directed mutation	Site-specific mutation of substrate binding site	[26,64,65,66,67,68]
Substitution of A domain	The A domain is replaced by an A domain with alternative substrate-specificity	[69,70,71,72]
Substitution of the recognition subdomain of A domain	Only partial domain sequence associated with substrate recognition is substituted	[74,75]
Domain insertion	Substitution of the domain inserted in interrupted A domain	[76]
	Removal of the inserted domain from the interrupted A domain	[76]
	Inserting domains into non-interrupted A domain	[77]
	Inserting domains at different positions in A domain	[78]
Substitution of C-A bidomain	Substitution of the region, including both C and A domains	[71]
Whole-module rearrangements	Module Replacement	[72]
	Module deletion	[87,88]
	Module extension	[88,89]

4. Conclusions

Natural products derived from plants, fungi, and bacteria have long been utilized for treating human diseases [90], playing a crucial role in public health maintenance. A variety of emergency events, such as antibiotic resistance and increased cancer rates, pose a growing threat to human health, making the development of new drugs urgent. Consequently, research on the most diverse and widely distributed natural products, such as NRPs, becomes essential. While most NRPS research has focused on terrestrial organisms, corresponding studies relevant to marine organisms have only been established in recent decades. Marine organisms, owing to their unique ecological habitats and adaptive mechanisms, generate NRPs with unique structures and biological activities. Due to difficulties in marine sample collection, challenges in mimicking the growth environments of marine organisms, and highly variable genomes of these organisms, the exploration of marine organism-derived NRPs as well as the engineering of the corresponding NRPSs are currently insufficient, implying that the field of NRPS research focused on marine organisms holds promising potential for future advancements.

The structural diversity of NRPs is primarily determined by the substrate specificity of the A domain. Therefore, there is a growing emphasis on exploring the modification and engineering of the A domain for producing novel NRPs. This article introduces the substrate recognition and catalytic mechanisms of the A domain, revealing how the A domain adds monomers to the elongating peptide chain. Furthermore, the engineering of A domains using strategies such as mutagenesis of substrate-specificity codes, substitution of domain, domain insertion, and whole-module rearrangements are discussed in detail, presenting advances in these research areas.

After more than two decades of development, both the basic research and engineering of the NRPS system have achieved significant progress. These research observations fully demonstrate the potential applications of NRPSs in the combinatorial biosynthesis of NRPs, further guiding the rational design and engineering of NRPSs and even the de novo design of NRPS assembly lines. Based on the linear catalytic mechanism of NRPS, directed recombination of catalytic modules can theoretically yield peptides with arbitrary sequence combinations. Because A domain is dominant in controlling the sequence of NRP product as illustrated above, the key focus in NRPS assembly line development lies in the engineering of the A domain.

Due to the complex nature of NRPSs, which possess multiple modules and dynamic function modes, there remains an insufficient comprehension of their internal operational processes and working mechanisms. Enhancing our knowledge of NRPS mechanisms and developing suitable engineering approaches while amalgamating insights from studies on NRPSs from both terrestrial and marine organisms, particularly through the engineering of A domains, holds the potential to empower NRPSs to identify and synthesize novel compounds that have not been naturally discovered. This endeavor is crucial for expanding the diversity within the NRP family and advancing the development of novel pharmaceutical agents.

Author Contributions

Conceptualization, X.Q.; writing—original draft preparation, M.Z.; writing—review and editing, M.Z., X.Q., Z.P., Z.H., J.F. and X.L.; supervision, X.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Research and Development Program of China (grant number: 2018YFA0903004), Natural Science Foundation of Ningbo City (grant number: 2023J121), National Natural Science Foundation of China (grant number: 31400683), Natural Science Foundation of Zhejiang Province (grant number: LQ14C050001) and Subject Project of Ningbo University (grant No. XYL20019). This work was also sponsored by the K.C. Wong Magna Fund at Ningbo University.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Izore, T.; Cryle, M.J. The many faces and important roles of protein-protein interactions during non-ribosomal peptide synthesis. Nat. Prod. Rep. 2018, 35, 1120–1139. [Google Scholar] [CrossRef] [PubMed]
Payne, J.A.E.; Schoppet, M.; Hansen, M.H.; Cryle, M.J. Diversity of nature’s assembly lines recent discoveries in non-ribosomal peptide synthesis. Mol. Biosyst. 2017, 13, 9–22. [Google Scholar] [CrossRef] [PubMed]
Miller, B.R.; Gulick, A.M. Structural Biology of Nonribosomal Peptide Synthetases. Methods Mol. Biol. 2016, 1401, 3–29. [Google Scholar] [CrossRef]
Walsh, C.T.; Brien, R.V.O.; Khosla, C. Nonproteinogenic Amino Acid Building Blocks for Nonribosomal Peptide and Hybrid Polyketide Scaffolds. Angew. Chem.-Int. Ed. 2013, 52, 7098–7124. [Google Scholar] [CrossRef]
Duckworth, B.P.; Wilson, D.J.; Aldrich, C.C. Measurement of Nonribosomal Peptide Synthetase Adenylation Domain Activity Using a Continuous Hydroxylamine Release Assay. Methods Mol. Biol. 2016, 1401, 53–61. [Google Scholar] [CrossRef] [PubMed]
Giessen, T.W.; Marahiel, M.A. Ribosome-independent biosynthesis of biologically active peptides: Application of synthetic biology to generate structural diversity. FEBS Lett. 2012, 586, 2065–2075. [Google Scholar] [CrossRef] [PubMed]
Benny, A.M.; Gopakumar, S.T.; Janardhanan, R.K.; Nair, A.V.; Raj, N.B.; Vakkachan, A.P.; Raveendran, R.K.; Balakrishnan, S.K.; Karayi, S.N. Analysis of nonribosomal peptide synthetase genes in haemolymph microbes of marine crabs. Arch. Microbiol. 2021, 203, 1251–1258. [Google Scholar] [CrossRef]
Schorn, M.A.; Jordan, P.A.; Podell, S.; Blanton, J.M.; Agarwal, V.; Biggs, J.S.; Allen, E.E.; Moore, B.S. Comparative genomics of cyanobacterial symbionts reveals distinct, specialized metabolism in tropical Dysideidae sponges. mBio 2019, 10, 10–1128. [Google Scholar] [CrossRef]
Tambadou, F.; Lanneluc, I.; Sablé, S.; Klein, G.L.; Doghri, I.; Sopéna, V.; Didelot, S.; Barthélémy, C.; Thiéry, V.; Chevrot, R. Novel nonribosomal peptide synthetase (NRPS) genes sequenced from intertidal mudflat bacteria. FEMS Microbiol. Lett. 2014, 357, 123–130. [Google Scholar]
Kahlert, L.; Lichstrahl, M.S.; Townsend, C.A. Colorimetric Determination of Adenylation Domain Activity in Nonribosomal Peptide Synthetases by Using Chrome Azurol S. Chembiochem 2023, 24, e202200668. [Google Scholar] [CrossRef]
Fu, X.; Su, J.; Zeng, L. Prepatellamide A, a new cyclic peptide from the ascidian Lissoclinum patella. Sci. China Ser. B Chem. 2000, 43, 643–648. [Google Scholar] [CrossRef]
Williams, D.E.; Dalisay, D.S.; Patrick, B.O.; Matainaho, T.; Andrusiak, K.; Deshpande, R.; Myers, C.L.; Piotrowski, J.S.; Boone, C.; Yoshida, M. Padanamides A and B, highly modified linear tetrapeptides produced in culture by a Streptomyces sp. isolated from a marine sediment. Org. Lett. 2011, 13, 3936–3939. [Google Scholar] [CrossRef]
Jiao, W.-H.; Xu, Q.-H.; Ge, G.-B.; Shang, R.-Y.; Zhu, H.-R.; Liu, H.-Y.; Cui, J.; Sun, F.; Lin, H.-W. Flavipesides A–C, PKS-NRPS hybrids as pancreatic lipase inhibitors from a marine sponge symbiotic fungus Aspergillus flavipes 164013. Org. Lett. 2020, 22, 1825–1829. [Google Scholar] [CrossRef] [PubMed]
Agrawal, S.; Acharya, D.; Adholeya, A.; Barrow, C.J.; Deshmukh, S.K. Nonribosomal peptides from marine microbes and their antimicrobial and anticancer potential. Front. Pharmacol. 2017, 8, 828. [Google Scholar] [CrossRef] [PubMed]
Reimer, J.M.; Hague, A.S.; Tarry, M.J.; Schmeing, T.M. Piecing together nonribosomal peptide synthesis. Curr. Opin. Struc. Biol. 2018, 49, 104–113. [Google Scholar] [CrossRef] [PubMed]
Chen, M.; Xu, C.Y.; Wang, X.; Wu, Y.A.; Li, L. Nonribosomal peptide synthetases and nonribosomal cyanopeptides synthesis in Microcystis: A comparative genomics study. Algal Res. 2021, 59, 102432. [Google Scholar] [CrossRef]
Drake, E.J.; Miller, B.R.; Shi, C.; Tarrasch, J.T.; Sundlov, J.A.; Allen, C.L.; Skiniotis, G.; Aldrich, C.C.; Gulick, A.M. Structures of two distinct conformations of holo-non-ribosomal peptide synthetases. Nature 2016, 529, U235–U289. [Google Scholar] [CrossRef] [PubMed]
Abbood, N.; Prave, L.; Bozhueyuek, K.A.J.; Bode, H.B. A Practical Guideline to Engineering Nonribosomal Peptide Synthetases. Methods Mol. Biol. 2023, 2670, 219–234. [Google Scholar] [CrossRef] [PubMed]
Condurso, H.L.; Bruner, S.D. Structure and noncanonical chemistry of nonribosomal peptide biosynthetic machinery. Nat. Prod. Rep. 2012, 29, 1099–1110. [Google Scholar] [CrossRef]
Lu, Z.; Liu, X.H.; Yuan, X.; Liu, F.; Wang, T. Engineered Biosynthesis through the Adenylation Domains from Nonribosomal Peptide Synthetases. Curr. Top. Med. Chem. 2023, 23, 1973–1984. [Google Scholar] [CrossRef]
Marahiel, M.A. A structural model for multimodular NRPS assembly lines. Nat. Prod. Rep. 2016, 33, 136–140. [Google Scholar] [CrossRef] [PubMed]
Hur, G.H.; Vickery, C.R.; Burkart, M.D. Explorations of catalytic domains in non-ribosomal peptide synthetase enzymology. Nat. Prod. Rep. 2012, 29, 1074–1098. [Google Scholar] [CrossRef] [PubMed]
Samel, S.A.; Czodrowski, P.; Essen, L.O. Structure of the epimerization domain of tyrocidine synthetase A. Acta Crystallogr. D 2014, 70, 1442–1452. [Google Scholar] [CrossRef] [PubMed]
Sussmuth, R.D.; Mainz, A. Nonribosomal Peptide Synthesis-Principles and Prospects. Angew. Chem.-Int. Ed. 2017, 56, 3770–3821. [Google Scholar] [CrossRef] [PubMed]
Konno, S.; Ishikawa, F.; Suzuki, T.; Dohmae, N.; Burkart, M.D.; Kakeya, H. Active site-directed proteomic probes for adenylation domains in nonribosomal peptide synthetases. Chem. Commun. 2015, 51, 2262–2265. [Google Scholar] [CrossRef] [PubMed]
Kaljunen, H.; Schiefelbein, S.H.H.; Stummer, D.; Kozak, S.; Meijers, R.; Christiansen, G.; Rentmeister, A. Structural Elucidation of the Bispecificity of A Domains as a Basis for Activating Non-natural Amino Acids. Angew. Chem.-Int. Ed. 2015, 54, 8833–8836. [Google Scholar] [CrossRef] [PubMed]
Rottig, M.; Medema, M.H.; Blin, K.; Weber, T.; Rausch, C.; Kohlbacher, O. NRPSpredictor2-a web server for predicting NRPS adenylation domain specificity. Nucleic Acids Res. 2011, 39, W362–W367. [Google Scholar] [CrossRef] [PubMed]
Kittila, T.; Schoppet, M.; Cryle, M.J. Online Pyrophosphate Assay for Analyzing Adenylation Domains of Nonribosomal Peptide Synthetases. Chembiochem 2016, 17, 576–584. [Google Scholar] [CrossRef]
Kudo, F.; Miyanaga, A.; Eguchi, T. Structural basis of the nonribosomal codes for nonproteinogenic amino acid selective adenylation enzymes in the biosynthesis of natural products. J. Ind. Microbiol. Biot. 2019, 46, 515–536. [Google Scholar] [CrossRef]
Baranasic, D.; Zucko, J.; Diminic, J.; Gacesa, R.; Long, P.F.; Cullum, J.; Hranueli, D.; Starcevic, A. Predicting substrate specificity of adenylation domains of nonribosomal peptide synthetases and other protein properties by latent semantic indexing. J. Ind. Microbiol. Biot. 2014, 41, 461–467. [Google Scholar] [CrossRef]
Khayatt, B.I.; Overmars, L.; Siezen, R.J.; Francke, C. Classification of the Adenylation and Acyl-Transferase Activity of NRPS and PKS Systems Using Ensembles of Substrate Specific Hidden Markov Models. PLoS ONE 2013, 8, e62136. [Google Scholar] [CrossRef] [PubMed]
Mongia, M.; Baral, R.; Adduri, A.; Yan, D.H.; Liu, Y.D.; Bian, Y.Y.; Kim, P.; Behsaz, B.; Mohimani, H. AdenPredictor: Accurate prediction of the adenylation domain specificity of nonribosomal peptide biosynthetic gene clusters in microbial genomes. Bioinformatics 2023, 39, i40–i46. [Google Scholar] [CrossRef] [PubMed]
Lei, M.; Zhao, Z.; Liu, K.; Liu, Y.; Zhang, H.; Wu, X.; Gong, S.; Ma, Y.; Zhao, H.; Liu, J. Screening the specific substrates of adenylation domain from marine actinomycetes by fluorescence quenching and isothermal titration calorimetry. Acta Pol. Pharm.-Drug Res. 2018, 75, 1287–1292. [Google Scholar] [CrossRef] [PubMed]
Xu, D.; Zhang, Z.; Yao, L.; Wu, L.; Zhu, Y.; Zhao, M.; Xu, H. Advances in the adenylation domain: Discovery of diverse non-ribosomal peptides. Appl. Microbiol. Biot. 2023, 107, 4187–4197. [Google Scholar] [CrossRef]
Stachelhaus, T.; Mootz, H.D.; Marahiel, M.A. The specificity-conferring code of adenylation domains in nonribosomal peptide synthetases. Chem. Biol. 1999, 6, 493–505. [Google Scholar] [CrossRef] [PubMed]
Miyanaga, A.; Cieslak, J.; Shinohara, Y.; Kudo, F.; Eguchi, T. The crystal structure of the adenylation enzyme VinN reveals a unique beta-amino acid recognition mechanism. J. Biol. Chem. 2014, 289, 31448–31457. [Google Scholar] [CrossRef] [PubMed]
Challis, G.L.; Ravel, J.; Townsend, C.A. Predictive, structure-based model of amino acid recognition by nonribosomal peptide synthetase adenylation domains. Chem. Biol. 2000, 7, 211–224. [Google Scholar] [CrossRef] [PubMed]
Heard, S.C.; Winter, J.M. Structural, Biochemical and Bioinformatic Analyses of Nonribosomal Peptide Synthetase Adenylation Domains. Nat. Prod. Rep. 2024, 41, 1180–1205. [Google Scholar] [CrossRef]
Bian, X.; Plaza, A.; Yan, F.; Zhang, Y.; Müller, R. Rational and efficient site-directed mutagenesis of adenylation domain alters relative yields of luminmide derivatives in vivo. Biotechnol. Bioeng. 2015, 112, 1343–1353. [Google Scholar] [CrossRef]
Alonzo, D.A.; Chiche-Lapierre, C.; Tarry, M.J.; Wang, J.; Schmeing, T.M. Structural basis of keto acid utilization in nonribosomal depsipeptide synthesis. Nat. Chem. Biol. 2020, 16, 493–496. [Google Scholar] [CrossRef]
Miyanaga, A.; Kudo, F.; Eguchi, T. Recent advances in the structural analysis of adenylation domains in natural product biosynthesis. Curr. Opin. Chem. Biol. 2022, 71, 102212. [Google Scholar] [CrossRef] [PubMed]
Gulick, A.M. Conformational Dynamics in the Acyl-CoA Synthetases, Adenylation Domains of Non-ribosomal Peptide Synthetases, and Firefly Luciferase. ACS Chem. Biol. 2009, 4, 811–827. [Google Scholar] [CrossRef] [PubMed]
Kasai, S.; Konno, S.; Ishikawa, F.; Kakeya, H. Functional profiling of adenylation domains in nonribosomal peptide synthetases by competitive activity-based protein profiling. Chem. Commun. 2015, 51, 15764–15767. [Google Scholar] [CrossRef]
Stanisic, A.; Kries, H. Adenylation Domains in Nonribosomal Peptide Engineering. Chembiochem 2019, 20, 1347–1356. [Google Scholar] [CrossRef] [PubMed]
Miller, B.R.; Drake, E.J.; Shi, C.; Aldrich, C.C.; Gulick, A.M. Structures of a Nonribosomal Peptide Synthetase Module Bound to MbtH-like Proteins Support a Highly Dynamic Domain Architecture. J. Biol. Chem. 2016, 291, 22559–22571. [Google Scholar] [CrossRef] [PubMed]
Crusemann, M.; Kohlhaas, C.; Piel, J. Evolution-guided engineering of nonribosomal peptide synthetase adenylation domains. Chem. Sci. 2013, 4, 1041–1045. [Google Scholar] [CrossRef]
Boll, B.; Taubitz, T.; Heide, L. Role of MbtH-like Proteins in the Adenylation of Tyrosine during Aminocoumarin and Vancomycin Biosynthesis. J. Biol. Chem. 2011, 286, 36281–36290. [Google Scholar] [CrossRef] [PubMed]
Herbst, D.A.; Boll, B.; Zocher, G.; Stehle, T.; Heide, L. Structural Basis of the Interaction of MbtH-like Proteins, Putative Regulators of Nonribosomal Peptide Biosynthesis, with Adenylating Enzymes. J. Biol. Chem. 2013, 288, 1991–2003. [Google Scholar] [CrossRef] [PubMed]
Felnagle, E.A.; Barkei, J.J.; Park, H.; Podevels, A.M.; McMahon, M.D.; Drott, D.W.; Thomas, M.G. MbtH-Like Proteins as Integral Components of Bacterial Nonribosomal Peptide Synthetases. Biochemistry 2010, 49, 8815–8817. [Google Scholar] [CrossRef]
Al-Mestarihi, A.H.; Villamizar, G.; Fernandez, J.; Zolova, O.E.; Lombo, F.; Garneau-Tsodikova, S. Adenylation and S-Methylation of Cysteine by the Bifunctional Enzyme TioN in Thiocoraline Biosynthesis. J. Am. Chem. Soc. 2014, 136, 17350–17354. [Google Scholar] [CrossRef]
Labby, K.J.; Watsula, S.G.; Garneau-Tsodikova, S. Interrupted adenylation domains: Unique bifunctional enzymes involved in nonribosomal peptide biosynthesis. Nat. Prod. Rep. 2015, 32, 641–653. [Google Scholar] [CrossRef] [PubMed]
Baunach, M.; Chowdhury, S.; Stallforth, P.; Dittmann, E. The Landscape of Recombination Events That Create Nonribosomal Peptide Diversity. Mol. Biol. Evol. 2021, 38, 2116–2130. [Google Scholar] [CrossRef] [PubMed]
Mori, S.; Garzan, A.; Tsodikov, O.V.; Gameau-Tsodikovae, S. Deciphering Nature’s Intricate Way of N,S-Dimethylating L-Cysteine: Sequential Action of Two Bifunctional Adenylation Domains. Biochemistry 2017, 56, 6087–6097. [Google Scholar] [CrossRef]
Lundy, T.A.; Mori, S.; Garneau-Tsodikova, S. A thorough analysis and categorization of bacterial interrupted adenylation domains, including previously unidentified families. RSC Chem. Biol. 2020, 1, 233–250. [Google Scholar] [CrossRef] [PubMed]
Zolova, O.E.; Garneau-Tsodikova, S. KtzJ-dependent serine activation and O-methylation by KtzH for kutznerides biosynthesis. J. Antibiot. 2014, 67, 59–64. [Google Scholar] [CrossRef]
Mori, S.; Pang, A.H.; Lundy, T.A.; Garzan, A.; Tsodikov, O.V.; Garneau-Tsodikova, S. Structural basis for backbone N-methylation by an interrupted adenylation domain. Nat. Chem. Biol. 2018, 14, 428–430. [Google Scholar] [CrossRef] [PubMed]
Huang, T.; Duan, Y.; Zou, Y.; Deng, Z.; Lin, S. NRPS Protein MarQ Catalyzes Flexible Adenylation and Specific S-Methylation. ACS Chem. Biol. 2018, 13, 2387–2391. [Google Scholar] [CrossRef]
Biron, E.; Chatterjee, J.; Ovadia, O.; Langenegger, D.; Brueggen, J.; Hoyer, D.; Schmid, H.A.; Jelinek, R.; Gilon, C.; Hoffman, A.; et al. Improving oral bioavailability of peptides by multiple N-methylation: Somatostatin analogues. Angew. Chem. Int. Ed. Engl. 2008, 47, 2595–2599. [Google Scholar] [CrossRef]
Tillett, D.; Dittmann, E.; Erhard, M.; von Dohren, H.; Borner, T.; Neilan, B.A. Structural organization of microcystin biosynthesis in Microcystis aeruginosa PCC7806: An integrated peptide-polyketide synthetase system. Chem. Biol. 2000, 7, 753–764. [Google Scholar] [CrossRef]
Lundy, T.A.; Mori, S.; Thamban Chandrika, N.; Garneau-Tsodikova, S. Characterization of a Unique Interrupted Adenylation Domain That Can Catalyze Three Reactions. ACS Chem. Biol. 2020, 15, 282–289. [Google Scholar] [CrossRef]
Nishizawa, T.; Ueda, A.; Nakano, T.; Nishizawa, A.; Miura, T.; Asayama, M.; Fujii, K.; Harada, K.; Shirai, M. Characterization of the locus of genes encoding enzymes producing heptadepsipeptide micropeptin in the unicellular cyanobacterium Microcystis. J. Biochem. 2011, 149, 475–485. [Google Scholar] [CrossRef]
Zhang, J.J.; Tang, X.; Huan, T.; Ross, A.C.; Moore, B.S. Pass-back chain extension expands multimodular assembly line biosynthesis. Nat. Chem. Biol. 2020, 16, 42–49. [Google Scholar] [CrossRef] [PubMed]
Sun, H.H.; Liu, Z.H.; Zhao, H.M.; Ang, E.L. Recent advances in combinatorial biosynthesis for drug discovery. Drug Des. Dev. Ther. 2015, 9, 823–833. [Google Scholar] [CrossRef]
Eppelmann, K.; Stachelhaus, T.; Marahiel, M.A. Exploitation of the selectivity-conferring code of nonribosomal peptide synthetases for the rational design of novel peptide antibiotics. Biochemistry 2002, 41, 9718–9726. [Google Scholar] [CrossRef] [PubMed]
Kries, H.; Wachtel, R.; Pabst, A.; Wanner, B.; Niquille, D.; Hilvert, D. Reprogramming nonribosomal peptide synthetases for “clickable” amino acids. Angew. Chem. Int. Ed. Engl. 2014, 53, 10105–10108. [Google Scholar] [CrossRef]
Thirlway, J.; Lewis, R.; Nunns, L.; Al Nakeeb, M.; Styles, M.; Struck, A.W.; Smith, C.P.; Micklefield, J. Introduction of a Non-Natural Amino Acid into a Nonribosomal Peptide Antibiotic by Modification of Adenylation Domain Specificity. Angew. Chem.-Int. Ed. 2012, 51, 7181–7184. [Google Scholar] [CrossRef]
Kim, E.; Du, Y.E.; Ban, Y.H.; Shin, Y.-H.; Oh, D.-C.; Yoon, Y.J. Enhanced ohmyungsamycin a production via adenylation domain engineering and optimization of culture conditions. Front. Microbiol. 2021, 12, 626881. [Google Scholar] [CrossRef] [PubMed]
Soeriyadi, A.H.; Ongley, S.E.; Kehr, J.C.; Pickford, R.; Dittmann, E.; Neilan, B.A. Tailoring enzyme stringency masks the multispecificity of a lyngbyatoxin (indolactam alkaloid) nonribosomal peptide synthetase. Chembiochem 2022, 23, e202100574. [Google Scholar] [CrossRef]
Calcott, M.J.; Owen, J.G.; Ackerley, D.F. Efficient rational modification of non-ribosomal peptides by adenylation domain substitution. Nat. Commun. 2020, 11, 4554. [Google Scholar] [CrossRef]
Stachelhaus, T.; Schneider, A.; Marahiel, M.A. Rational Design of Peptide Antibiotics by Targeted Replacement of Bacterial and Fungal Domains. Science 1995, 269, 69–72. [Google Scholar] [CrossRef]
Calcott, M.J.; Owen, J.G.; Lamont, I.L.; Ackerley, D.F. Biosynthesis of Novel Pyoverdines by Domain Substitution in a Nonribosomal Peptide Synthetase of Pseudomonas aeruginosa. Appl. Env. Microb. 2014, 80, 5723–5731. [Google Scholar] [CrossRef] [PubMed]
Winn, M.; Fyans, J.K.; Zhuo, Y.; Micklefield, J. Recent advances in engineering nonribosomal peptide assembly lines. Nat. Prod. Rep. 2016, 33, 317–347. [Google Scholar] [CrossRef] [PubMed]
Throckmorton, K.; Vinnik, V.; Chowdhury, R.; Cook, T.; Cheyrette, M.G.; Maranas, C.; Pfleger, B.; Thomas, M.G. Directed Evolution Reveals the Functional Sequence Space of an Adenylation Domain Specificity Code. ACS Chem. Biol. 2019, 14, 2044–2054. [Google Scholar] [CrossRef] [PubMed]
Kries, H.; Niquille, D.L.; Hilvert, D. A Subdomain Swap Strategy for Reengineering Nonribosomal Peptides. Chem. Biol. 2015, 22, 640–648. [Google Scholar] [CrossRef] [PubMed]
Thong, W.L.; Zhang, Y.; Zhuo, Y.; Robins, K.J.; Fyans, J.K.; Herbert, A.J.; Law, B.J.C.; Micklefield, J. Gene editing enables rapid engineering of complex antibiotic assembly lines. Nat. Commun. 2021, 12, 6872. [Google Scholar] [CrossRef] [PubMed]
Shrestha, S.K.; Garneau-Tsodikova, S. Expanding Substrate Promiscuity by Engineering a Novel Adenylating-Methylating NRPS Bifunctional Enzyme. Chembiochem 2016, 17, 1328–1332. [Google Scholar] [CrossRef] [PubMed]
Lundy, T.A.; Mori, S.; Garneau-Tsodikova, S. Engineering Bifunctional Enzymes Capable of Adenylating and Selectively Methylating the Side Chain or Core of Amino Acids. ACS Synth. Biol. 2018, 7, 399–404. [Google Scholar] [CrossRef] [PubMed]
Lundy, T.A.; Mori, S.; Garneau-Tsodikova, S. Probing the limits of interrupted adenylation domains by engineering a trifunctional enzyme capable of adenylation, N-, and S-methylation. Org. Biomol. Chem. 2019, 17, 1169–1175. [Google Scholar] [CrossRef] [PubMed]
Lundy, T.A.; Mori, S.; Garneau-Tsodikova, S. Lessons learned in engineering interrupted adenylation domains when attempting to create trifunctional enzymes from three independent monofunctional ones. RSC Adv. 2020, 10, 34299–34307. [Google Scholar] [CrossRef]
Meyer, S.; Kehr, J.C.; Mainz, A.; Dehm, D.; Petras, D.; Sussmuth, R.D.; Dittmann, E. Biochemical Dissection of the Natural Diversification of Microcystin Provides Lessons for Synthetic Biology of NRPS. Cell Chem. Biol. 2016, 23, 462–471. [Google Scholar] [CrossRef]
Ackerley, D.F.; Lamont, I.L. Characterization and genetic manipulation of peptide synthetases in Pseudomonas aeruginosa PAO1 in order to generate novel pyoverdines. Chem. Biol. 2004, 11, 971–980. [Google Scholar] [CrossRef] [PubMed]
Tanovic, A.; Samel, S.A.; Essen, L.O.; Marahiel, M.A. Crystal structure of the termination module of a nonribosomal peptide synthetase. Science 2008, 321, 659–663. [Google Scholar] [CrossRef] [PubMed]
Messenger, S.R.; McGuinniety, E.M.; Stevenson, L.J.; Owen, J.G.; Challis, G.L.; Ackerley, D.F.; Calcott, M.J. Metagenomic domain substitution for the high-throughput modification of nonribosomal peptides. Nat. Chem. Biol. 2024, 20, 251–260. [Google Scholar] [CrossRef] [PubMed]
Bozhüyük, K.A.; Fleischhacker, F.; Linck, A.; Wesche, F.; Tietze, A.; Niesert, C.-P.; Bode, H.B. De novo design and engineering of non-ribosomal peptide synthetases. Nat. Chem. 2018, 10, 275–281. [Google Scholar] [CrossRef] [PubMed]
Bozhüyük, K.A.; Linck, A.; Tietze, A.; Kranz, J.; Wesche, F.; Nowak, S.; Fleischhacker, F.; Shi, Y.-N.; Grün, P.; Bode, H.B. Modification and de novo design of non-ribosomal peptide synthetases using specific assembly points within condensation domains. Nat. Chem. 2019, 11, 653–661. [Google Scholar] [CrossRef] [PubMed]
Bozhüyük, K.A.; Präve, L.; Kegler, C.; Schenk, L.; Kaiser, S.; Schelhas, C.; Shi, Y.-N.; Kuttenlochner, W.; Schreiber, M.; Kandler, J. Evolution-inspired engineering of nonribosomal peptide synthetases. Science 2024, 383, eadg4320. [Google Scholar] [CrossRef] [PubMed]
Mootz, H.D.; Kessler, N.; Linne, U.; Eppelmann, K.; Schwarzer, D.; Marahiel, M.A. Decreasing the ring size of a cyclic nonribosomal peptide antibiotic by in-frame module deletion in the biosynthetic genes. J. Am. Chem. Soc. 2002, 124, 10980–10981. [Google Scholar] [CrossRef]
Awakawa, T.; Fujioka, T.; Zhang, L.; Hoshino, S.; Hu, Z.; Hashimoto, J.; Kozone, I.; Ikeda, H.; Shin-Ya, K.; Liu, W.; et al. Reprogramming of the antimycin NRPS-PKS assembly lines inspired by gene evolution. Nat. Commun. 2018, 9, 3534. [Google Scholar] [CrossRef]
Butz, D.; Schmiederer, T.; Hadatsch, B.; Wohlleben, W.; Weber, T.; Sussmuth, R.D. Module extension of a non-ribosomal peptide synthetase of the glycopeptide antibiotic balhimycin produced by Amycolatopsis balhimycina. Chembiochem 2008, 9, 1195–1200. [Google Scholar] [CrossRef]
Pham, J.V.; Yilma, M.A.; Feliz, A.; Majid, M.T.; Maffetone, N.; Walker, J.R.; Kim, E.; Cho, H.J.; Reynolds, J.M.; Song, M.C.; et al. A Review of the Microbial Production of Bioactive Natural Products and Biologics. Front. Microbiol. 2019, 10, 1404. [Google Scholar] [CrossRef]

Figure 1. The typical domain arrangement of NRPS. Adenylation (A) domain, thiolation (T) domain, condensation (C) domain, and thioesterase (TE) domain are represented in orange, blue, green, and pink circles, respectively. Note: The module illustrated in the figure refers to a unit that comprises the three core domains C, A, and T (the initiation module, shown on the left, consists of A and T domains only, while the termination module, illustrated on the right, contains C, A, T and TE domains).

Figure 2. Activation and loading of substrate catalyzed by the A domain. Initially, the A domain activates the acyl monomer: substrate reacts with adenosine 5′-triphosphate (ATP) to generate the acyl-adenosine monophosphate (AMP) intermediate and inorganic pyrophosphate (PPi). Subsequently, the aminoacyl-AMP undergoes nucleophilic attack by the thiol group located at the terminus of the 4′-phosphopantetheine arm of the downstream thiolation (T) domain, leading to the formation of a thioester-bound aminoacyl-S-T domain by linking to the thiol of the phosphopantetheine arm of T domain, followed by the release of AMP.

Figure 3. Composition of some interrupted A domains: In KtzH, an interruption occurs between a8 and a9 codes of the A domain in module 4, where an O-methylase domain is inserted [55]. In TioS, interruptions occur between a8 and a9 codes of the A domain in both module 3 and module 4, with the insertion of an N-methylase domain [53]. In TioN, an interruption occurs between a2 and a3 codes of the A domain, with the insertion of an S-methylase domain [50]. In MarQ, an interruption occurs between a2 and a3 codes of the A domain, with the insertion of an S-methylase domain [57]. M_b: main-chain methylase domain (yellow); M_s: side-chain methylase domain (light green).

Figure 4. Two typical cases of A domain substitution. (A) The A domain in Bacillus subtilis SrfA-C, which recognizes Leu, is replaced with an A domain that recognizes Cys. (B) The A domain of the second module of PvdD that recognizes L-Thr along with the linker region is replaced with a randomly selected one from nine different types of A domains that recognize other substrates and their corresponding linker regions. Among these, substitutions of six exchanged A domains labeled with dashed circles achieved higher yields. Abbreviations: aad: δ-L-α-aminoadipyl residue, fhOrn: N5-formyl-N5-hydroxyornithine residue.

Figure 5. The cartoon representation of the overall structure of GrsA (PDB ID: 1AMU), phenylalanine-selective A domain of gramicidin synthetase 1, showing the subdomain and core domain of the A domain, which are represented in yellow and cyan, respectively.

Figure 6. The second A domain of EndA that recognizes L-Thr. The FSD responsible for substrate recognition in this domain is replaced with the FSD from EndC that recognizes Ser, enabling the second A domain of EndA to recognize Ser. Abbreviation: Hpg: hydroxyphenylglycine.

Figure 7. Several typical cases of sequence removal and insertion in A domains. (A) The wild-type A domain of KtzH is naturally interrupted. The methylase activity of this domain is abolished after its interrupted Ms domain is removed. (B) Replacing the interrupted M_s domain in KtzH with the interrupted M domain from TioN yields an A domain exhibiting both methylation and adenylation functions; inserting the interrupted M domains from KtzH and TioS into the uninterrupted A domain of Ecm6 results in the generation of corresponding A domains with dual functions of methylation and adenylation.

Figure 8. The domain arrangement and overall structure of SrfA-C. (A) The domain arrangement of the SrfA-C module: C domain, A domain, T domain, TE domain, and linker regions that connect these domains are represented as green, orange, blue, pink, and gray, respectively. (B) The surface representation of the overall structure of SrfA-C (PDB ID: 2VSQ), as well as the colors of the domains and linker regions, are identical to that in (A).

Figure 9. Two typical cases of whole-module rearrangement. (A) Deleting the second module in SrfA-A and connecting modules 1 and 3 resulted in the production of a hexapeptide lacking a Leu residue. (B) By the insertion of a composite module composed of the T-E bidomain from module 4 and the C-A bidomain from module 5 in BpsB into modules 4 and 5 of BpsB, leading to the production of a novel octapeptide with three Hpg residues. Abbreviation: Hpg: hydroxyphenylglycine.

Table 2. Summary of several types of interrupted A domains with methylase activities.

Product	Interrupted A Domains	Interruption Site	Methylation Type	Reference
Microcystin-LR	McyA (A3-M_b-A3)	a8, a9	N-methylation	[59]
Pyochelin	PchF (A2-M_b-A2)	a8, a9	N-methylation	[51]
Kutzneride	KtzH (A4-M_s-A4)	a8, a9	O-methylation	[55]
Columbamides	ColG (A-M_s-M_b-A)	a8, a9	O-methylation, N-methylation	[60]
Micropeptin	McnC (A6-M_b-A6)	a8, a9	N-methylation	[61]
Thiocoraline	TioS (A3-M_b-A3) (A4-M_b-A4)	a8, a9	N-methylation	[53,56]
Thiocoraline	TioN(A-M_s-A)	a2, a3	S-methylation	[50]
Maremycins	MarQ (A-M_s-A)	a2, a3	S-methylation	[57]
Thalassospiramide	TtcC (A6-M_b-A6)	a2, a3	N-methylation	[62]
Thalassospiramide	TtmB (A6-M_b-A6)	a2, a3	N-methylation	[62]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, M.; Peng, Z.; Huang, Z.; Fang, J.; Li, X.; Qiu, X. Functional Diversity and Engineering of the Adenylation Domains in Nonribosomal Peptide Synthetases. Mar. Drugs 2024, 22, 349. https://doi.org/10.3390/md22080349

AMA Style

Zhang M, Peng Z, Huang Z, Fang J, Li X, Qiu X. Functional Diversity and Engineering of the Adenylation Domains in Nonribosomal Peptide Synthetases. Marine Drugs. 2024; 22(8):349. https://doi.org/10.3390/md22080349

Chicago/Turabian Style

Zhang, Mengli, Zijing Peng, Zhenkuai Huang, Jiaqi Fang, Xinhai Li, and Xiaoting Qiu. 2024. "Functional Diversity and Engineering of the Adenylation Domains in Nonribosomal Peptide Synthetases" Marine Drugs 22, no. 8: 349. https://doi.org/10.3390/md22080349

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Functional Diversity and Engineering of the Adenylation Domains in Nonribosomal Peptide Synthetases

Abstract

1. Introduction

2. Major Function and Functional Diversity of A Domains

2.1. Substrate-Activation Function of A Domain

2.1.1. Substrate Recognition Mechanism of A Domain

Amino Acid Recognition Mechanism of A Domain

Keto Acid Recognition Mechanism of A Domain

2.1.2. Catalytic Mechanism of the A Domain

2.2. Auxiliary Functions of A Domains

3. Engineering of A Domains

3.1. Site-Specific Mutation of Substrate Specificity Codes

3.2. Substitution of the A Domain

3.3. Substitution of the Recognition Subdomain of the A Domain

3.4. Domain Insertion

3.5. Substitution of C-A Bidomain

3.6. NRPS Engineering by Whole-Module Rearrangements

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI