Next Article in Journal
Steric Effects of Mesoporous Silica Supported Bimetallic Au-Pt Catalysts on the Selective Aerobic Oxidation of Aromatic Alcohols
Previous Article in Journal
Biological Activities of NHC–Pd(II) Complexes Based on Benzimidazolylidene N-heterocyclic Carbene (NHC) Ligands Bearing Aryl Substituents
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

The HD-Domain Metalloprotein Superfamily: An Apparent Common Protein Scaffold with Diverse Chemistries

Department of Biochemistry, Brandeis University, 415 South Street, Waltham, MA 02453, USA
*
Author to whom correspondence should be addressed.
Catalysts 2020, 10(10), 1191; https://doi.org/10.3390/catal10101191
Submission received: 10 September 2020 / Revised: 6 October 2020 / Accepted: 9 October 2020 / Published: 15 October 2020
(This article belongs to the Section Biocatalysis)

Abstract

:
The histidine–aspartate (HD)-domain protein superfamily contains metalloproteins that share common structural features but catalyze vastly different reactions ranging from oxygenation to hydrolysis. This chemical diversion is afforded by (i) their ability to coordinate most biologically relevant transition metals in mono-, di-, and trinuclear configurations, (ii) sequence insertions or the addition of supernumerary ligands to their active sites, (iii) auxiliary substrate specificity residues vicinal to the catalytic site, (iv) additional protein domains that allosterically regulate their activities or have catalytic and sensory roles, and (v) their ability to work with protein partners. More than 500 structures of HD-domain proteins are available to date that lay out unique structural features which may be indicative of function. In this respect, we describe the three known classes of HD-domain proteins (hydrolases, oxygenases, and lyases) and identify their apparent traits with the aim to portray differences in the molecular details responsible for their functional divergence and reconcile existing notions that will help assign functions to yet-to-be characterized proteins. The present review collects data that exemplify how nature tinkers with the HD-domain scaffold to afford different chemistries and provides insight into the factors that can selectively modulate catalysis.

1. Introduction

The histidine–aspartate (HD)-domain superfamily [1] (IPR003607) contains more than 318,000 metalloproteins that are involved in a wide array of functions including immunoresponse [2], nucleic acid metabolism [3,4,5], inflammation [6], virulence [6,7,8], stress response [9,10], and small molecule activation [11,12,13]. They are found across all domains of life and are typified by a tandem histidine–aspartate (HD) dyad that coordinates at least one (often two or three) metal ions. Although there are many uncharacterized HD-domain proteins, chemical diversion appears to be linked to details in the local protein environment, extra ligands, and genomic co-occurrence with partner proteins, all of which may serve as blueprints for their functional assignment.

1.1. Characteristics and General Classification of HD-Domain Proteins

The main traits that functionally differentiate HD-domain proteins are (i) the chemical nature of the metal ion/cofactor, (ii) cofactor nuclearity, (iii) supernumerary ligands, (iv) conserved amino acid sequence motif insertions or residues vicinal to the active site, (v) auxiliary domains with catalytic or regulatory roles, and (vi) interaction with protein partners. These six features appear to distinctly influence the range of chemistries these proteins perform. Despite variations in primary sequence, all HD-domain proteins share the HD residue dyad that coordinates transition metal ions such as Fe, Mn, Co, Mg, Cu, Zn, and Ni (Table 1) as well as a helical fold (Figure 1A) and functionally cluster based on sequence similarities (Figure 1B) [1,3,14].
The HD-domain superfamily fosters three different enzymatic classes: hydrolases, oxygenases, and lyases, with the hydrolases being the largest and best-characterized group. Hydrolases are further subdivided into (i) phosphatases, including dGTPases [38], RelA/SpoT [9,17], SAMHD1 [18,19], EF1143 [20,21], etc. and (ii) phosphodiesterases (PDEs), including exoribonucleases [39], PDEases [27,28,29,30,31,32,33,34], Cas proteins [23,24,25,26], and HD-GYPs [14,35,36,40] (Figure 1B). Phosphatases hydrolyze a multitude of (deoxy)nucleotide-based substrates that vary in the identity of their base(s) and the extent of phosphorylation (Figure 2) [3,5,16,22]. PDEs, on the other hand, degrade a variety of cyclic signaling molecules and single-stranded nucleic acids (Figure 2) [23,24,25,41]. Oxygenases and lyases are relatively recent additions to the HD-domain superfamily with only a handful of representatives to have been biochemically characterized. All identified oxygenases catalyze the oxidative cleavage of C-C/P bonds [11,12,13,15] (Figure 2), while the single known lyase is a cyanamide hydratase [37]. This functional plurality widens the chemical repertoire of the HD-domain superfamily, which is likely to harbor more oxygenases, lyases, or enzymes with novel chemistries.

1.2. HD-Domain Hydrolases: The Phosphatase Subfamily

HD-domain phosphatases play essential roles in regulating the cellular pool of (deoxy)ribonucleotides and signaling molecules involved in bacterial stress responses [3,4,10,16,42], such as (p)ppGpp and Ap4A (Figure 2). Phosphatases are further subdivided into mono-, di-, and triphosphatases with distinct structural features that may provide clues for the future classification of unknown HD-domain phosphatases. Interestingly, all have a strictly conserved arginine residue prior to the first histidine of the metal binding motif (Figure 3), which is pivotal for activity [16,18,22]. The exact chemical role of this arginine in catalysis has not yet been delineated. Its importance in hydrolysis most likely stems from the ability of this residue to ensure proper substrate positioning and/or to form intermolecular interactions with the substrate phosphate groups. Most biochemically characterized HD-domain mono- and diphosphatases are dimers in solution [3,5,16], which appears to be related to enzymatic function and may represent a regulatory mechanism for tuning activity. However, it is currently unknown if both sites are catalytically active or if one site allosterically activates the other. In contrast, triphosphatases are allosterically regulated by nucleotide binding to secondary sites, and most are active as tetramers [4,20] or hexamers [38].
In most cases, phosphate hydrolysis is supported by non-redox metal ions, with the most active cofactors being Co or Mn for monophosphatases, Mn for diphosphatases, and Mg or Mn for triphosphatases (Table 1). With a few notable exceptions (YqeK, SAMHD1, and OxsA), these metals are bound in a mononuclear configuration by a conserved “H…HD…D” motif (Figure 3).

1.2.1. Monophosphatases

YfbR and YGK1

YfbR is a dimeric 5′-deoxyribonucleotidase that is specific to 2′-deoxyribonucleotide-5′-monophosphates (dNMPs) and hydrolyzes dAMP with a kcat/KM of 25.6 × 103 M−1s−1 [3]. YfbR is most active with a mononuclear Co2+ cofactor, while Mn2+ and Cu2+ can also support activity to a lesser extent [3]. The Co ion is bound in an octahedral geometry by the HD-domain residues H33, H68, D69, and D137, the phosphate oxygen of the nucleotide, and a water molecule that is activated by Co2+ for nucleophilic attack of the phosphate group. YfbR shows no preference for the chemical nature of the substrate base because the latter makes no significant interactions with the protein [3]. However, YfbR exhibits specificity for deoxyribomonophosphates, afforded by steric hindrance from W19 and additional hydrogen bond contacts between the deoxyribose moiety with residues R18 and D77. Substitutions of R18, D77, W19, and the metal binding ligands completely inactivate the enzyme [3].
The HD-domain monophosphatase YGK1 has high structural similarity to YfbR and acts on dNMPs with catalytic efficiencies between 0.78 × 103 and 6.7 × 103 M−1s−1 [5]. Similar to YfbR, the cofactor is mononuclear and held to the protein polypeptide by H61, H89, D90, and D159 (Figure 3). The metal ions that support catalysis follow the trend: Mn2+ > Co2+ > Mg2+ [5].

1.2.2. Diphosphatases

YqeK

YqeK is an Ap4A hydrolase that symmetrically cleaves Ap4A to two adenosine diphosphates (ADPs) with a kcat/KM of 34 × 106 M−1s−1 [10]. Increased levels of Ap4A are linked to temperature sensitivity, stress response, and antibiotic resistance [43]. Other substrates include Ap5A, Gp4G, Ap4G, and Ap4U with kcat/KM values of 1.5 ×106, 15 × 106, 11 × 106, and 18×106 M−1s−1, respectively [10]. These catalytic efficiencies are similar to those observed by the ApaH family of Ap4A hydrolases, which are thought to be mutually exclusive from YqeK (i.e., apaH and yqeK genes do not co-occur in the same organism) [10].
YqeK attains a homodimeric configuration, and unlike other known HD-domain phosphatases, it binds a dimetal cofactor that, on the basis of the crystal structure, is diiron (Figure 3). The ligands to the diiron site include the conserved H21, H50, D51, and D127 residues as well as two additional histidines (H83, H109) that help coordinate the second metal ion (Figure 3) [5]. Thus, the ability to coordinate a dinuclear site can be diagnosed on the basis of primary sequence by the presence of six conserved residues making up the extended “H…HD…H…H…D” motif. The relevance of the diiron site in catalysis has not yet been explored, and the chemical nature of the active cofactor is presently unknown.

YpgQ

YpgQ and its orthologs belong to a smaller subfamily within the HD-domain phosphatases, which is designated as RnaY (IPR017705), that contain the RNase Y N-terminal region. While several of its homologs have been crystallographically characterized, YpgQ is the sole representative that has been biochemically characterized [16]. It is a homodimer that hydrolyzes oxy and deoxy nucleotide triphosphates (NTPs and dNTPs) and preferentially acts on dGTP and dATP with apparent kcat/KM values of 281 and 91 M−1s−1, respectively [16]. It harbors a catalytically essential mononuclear Mn cofactor coordinated by the invariant residues H29, H58, D59, and D124. Four other residues in the vicinity of the active site (R136, H158, F140, and H25) are proposed to contribute to substrate specificity [16]. R136 and H158 make hydrogen bond contacts with the 3′ and 2′ hydroxyl groups of the ribose sugar, respectively. Additionally, H158 and F140 stabilize the purine base via π–π stacking, while H25 hydrogen bonds to the phosphate group such that the β-phosphate is bound to a metal-activated water molecule (Figure 3).

SpoT/RelA

SpoT and RelA act on the secondary messengers (p)ppGpp, which are alarmones produced as part of the stringent response to amino acid starvation, temperature change, and other environmental cues (Figure 2 and Figure 3) [17,44]. Both proteins are believed to be dimers in solution [9]. SpoT is an Mn2+-dependent bifunctional enzyme with weak (p)ppGpp synthetic activity from its SYNTH domain and strong hydrolytic activity from its HD-domain [42,45]. RelA has only synthetic activity and lacks hydrolytic activity, which is most likely due to a divergent HD-domain (compared to that of SpoT) [42].

1.2.3. Triphosphatases

SAMHD1

The sterile alpha motif and HD-domain containing protein 1 (SAMHD1) is a sophisticated deoxynucleotide triphosphate triphosphohydrolase that regulates dNTP levels in mammalian cells and plays a key role in HIV restriction, innate immunity, cancer, and cell cycle [2,4,46]. It does so by depleting cellular dNTP levels via an allosterically driven mechanism [47]. It has a strict specificity for deoxyribonucleotides (dGTP > dCTP > dTTP > dATP) [18] but can also hydrolyze other substrates to a lesser extent, including Clofarabine-TP, Ara-CTP, Ara-ATP, and Ara-UTP [48]. Hydrolysis of dNTPs by SAMHD1 is Mg-dependent and occurs with an apparent catalytic constant of 1–2 s−1 [49], but the protein is also active with Mn. Optimal activity is contingent on the assembly of a homotetrameric complex that is afforded by the sequential binding of oxy and deoxyribonucleotides that act as allosteric activators [48]. SAMHD1 has two allosteric (AL) sites; AL-site 1 (D137, Q142, and R145) is tuned for binding both oxy and deoxy guanosine-based nucleotides, while AL-site 2 (N119, D330, N358, and R372) is selective for dNTPs [4].
The main catalytic core is held to the protein polypeptide by four invariant residues, H167, H206, D207, and D311, which until recently was well accepted to bind a single metal ion [18,19]. A new structure, however, shows that the active site might be binuclear harboring a Fe–Mg center with the H233 coordinating the second metal ion (i.e., Mg) (Figure 3). The two metal ions are bridged by D207 and the substrate α-phosphate [19]. Two histidine residues vicinal to the active site, H210 and H215, form salt bridges with the phosphate oxygens [18], and together with R164 and H233, are important for catalytic activity [18].

EF1143

EF1143 from Enterococcus faecalis, a bacterium found in the human intestinal tract of humans, regulates the cellular levels of DNA building molecules by depleting the dNTP pools [20]. It is a distant homolog of SAMHD1 and shares some structural and functional similarities to the latter. Both SAMHD1 and EF1143 are homotetramers, and their first allosteric site is specific to guanosine-based triphosphates [4,20,48]. However, there are several salient differences between the two enzymes. EF1143 does not require dNTP binding for tetramerization, and in contrast to SAMHD1, the first allosteric site of EF1143 is strictly specific for dGTP [4,20]. The active site is similar to that of SAMHD1, harboring residues H66, H110, D111, and D183 to coordinate a divalent metal ion. The Mg form of EF1143 is activated by dGTP and selectively hydrolyzes dATP and dCTP. In contrast, the Mn form does not require a nucleotide effector and hydrolyzes all dNTPs [20].

OxsA

OxsA catalyzes the formation of the antitumor, antiviral, and antibacterial nucleoside analogue oxetanocin-A (OXT) [22] (Figure 2). OxsA employs H31, H66, D67, and D132 to bind its metallocofactor for the sequential hydrolysis of phosphate groups from phosphorylated forms of OXT (i.e., OXT-P, OXT-PP, and OXT-PPP). The crystallographically observed cofactor nuclearity is dependent on the extent of substrate phosphorylation such that a mononuclear form catalyzes the hydrolysis of the singly phosphorylated OXT-P, while the binding of a second metal ion extends the hydrolytic capabilities to act on OXT-PP and OXT-PPP [22]. OxsA is most active with Co2+ with apparent kcat/KM values of 106, 221, and 525 M−1s−1 for OXT-P, OXT-PP, and OXT-PPP, respectively [22].
When OXT-P is bound, H66 occupies multiple orientations and can be replaced by water as a ligand to the metal ion. In contrast, when OXT-PP(P) is bound, H66 is a ligand to the active site metal. The second metal ion is octahedrally coordinated by D132, which bridges the two metal ions, water molecules, and the β and γ phosphate groups (Figure 3) [22]. Thus, it is hypothesized that upon sequential hydrolysis of the γ and β phosphates, OxsA loses the second metal ion, allowing OXT-P to occupy the second metal site for efficient hydrolysis. The substrate is stabilized via additional interactions between H75 and the C3′ hydroxymethyl group of OXT, as well as S78 and the endocyclic ring of OXT [22]. W17, a residue conserved both in OxsA and YfbR, is proposed to exclude ribonucleotide binding [22].

1.3. HD-Domain Hydrolases: The PDE Subfamily

PDEs can harbor both mononuclear and polynuclear cofactors. A common feature of all known polynuclear (di-or tri-nuclear) PDEs is an extra histidine residue in the active site such that their characteristic metal binding sequence is “H…HD…H…HH…D” (Figure 4). The role of this histidine in activity or structure has not been explicitly established. However, on the basis of structural studies, this residue makes additional hydrogen bond contacts to the substrate phosphate groups, suggesting a possible role in substrate binding.

1.3.1. HD-Domain PDEs Acting on DNA

Clustered regularly interspaced short palindromic repeats (CRISPR)-associated systems (Cas) are major players in prokaryotic adaptive immunity and RNA-based defense [50,51]. Type I CRISPR–Cas utilize a multicomponent system and recruit a single nuclease, Cas3, for the degradation of invader nucleic acids. The Cas3-associated gene can encode for a protein that has only the HD-domain (Cas3′’, I-A subtype), or more commonly, an N-terminal HD-domain fused to a Superfamily 2 helicase (Cas3). Type III-B CRISPR–Cas utilize Cas10 (Cmr2) for RNA-activated ssDNA cleavage [26].
The crystal structure of the Thermobifida fusca Cas3 shows a diiron active site (Figure 4), yet no activity with this cofactor has been demonstrated [23]. Cas3 proteins can be promiscuously activated by various metal ions, with most being activated by Ni or Co and, to a lesser extent, by other divalent metal ions (with the exception of Mg and Ca) [23,25]. In contrast, the Pyrococcus furiosus Cmr2 (dinuclear) and the Cas3′’ proteins (mononuclear in the published structures) exhibit an Mg-dependent PDE activity [23,25]. The nuclearity of the Cas3′’s may be an artifact due to the larger flexibility of the protein polypeptide, making binding of the second metal ion more transient. Of note, no correlation between helicase and PDE activities has been demonstrated to date.

1.3.2. HD-Domain PDEs Acting on Cyclic Mononucleotides

cAMP- and cGMP-specific PDEs (also referred to as PDEases) are essential regulators in cyclic nucleotide-dependent signal transduction in diverse physiological processes including immune response, neuronal activity, hypertension, and inflammatory response [30,31,52]. Currently, 21 genes encoding human PDEases have been identified and classified into 12 families according to their substrate specificities, pharmacological properties, and tissue localization [31,53]. These are further divided into three groups: cAMP-specific (PDE4, PDE7, PDE8, and PDE12), cGMP-specific (PDE5, PDE6, and PDE9) [31,41], and dual-specific (PDE1, PDE2, PDE3, PDE10, and PDE11) [31].
All PDEases are dimeric and have a conserved catalytic carboxy terminal domain as well as a variable regulatory amino terminal domain [30,54]. However, PDEases are also active as monomers; therefore, the functional significance of dimerization remains unknown [54]. On the basis of their binding affinity for divalent metal ions, PDEases are distinguished into two classes: Class I in mammals and flies and Class II in yeast and protozoans [55]. The most extensively studied are Class I PDEases, in which two metal ions (e.g., Zn and Mg) are octahedrally coordinated, forming a somewhat unconventional bimetallic site [30] Although the canonical motif suggests the binding of a single metal ion (M1) in the HD motif, stabilization of the second metal ion (M2) is accomplished via the aspartate of the HD motif and five water molecules, one of which is bridging M1 and M2. The bridging water molecule has been suggested to be a hydroxide, which can nucleophilically attack the phosphodiester bond [30]. In the crystal structures, the identity of M1 and M2 is often found to be Zn and Mg, respectively (Figure 4) [30]. Class I PDEs are active with either Mn or Mg but not with Zn. Therefore, although Zn can bind in the M1 position with high affinity, it cannot stimulate activity by itself or has inhibitory effects [31].
The substrate specificity in PDEases is afforded by a so-called “glutamine-switch” mechanism in which [30] a conserved glutamine in the vicinity of the active site can adopt two different orientations. In one orientation, it can form a bidentate hydrogen bond with the adenine ring (cAMP-specific) or two hydrogen bonds with guanine ring and two hydrogen bonds with neighboring alanine and tryptophan residues (cGMP-specific) [30,31]. In the dual-specific PDEases, the conserved glutamine has higher rotational flexibility and no orientation constraints, allowing it to adopt orientations for both substrates.

1.3.3. HD-Domain PDEs Acting on c-di-AMP

Cyclic-di-AMP is a second messenger essential in bacterial signaling and a critical player in bacterial physiology and pathogenesis [56,57]. PgpH performs the one-step hydrolysis of c-di-AMP to 5′pApA in an Mn-dependent fashion but cannot hydrolyze other cyclic dinucleotides (i.e., c-di-GMP) [6]. The active site Mn2+ ions are octahedrally coordinated by seven residues, H514, H543, D544, H580, H604, H605, and D648, as well as the two terminal oxygen atoms of the c-di-AMP phosphate group (Figure 4). The metal ions activate a water molecule opposite the scissile phosphate for the nucleophilic attack of phosphorus. Protonation of the resulting oxyanion terminates the reaction [6].

1.3.4. HD-Domain PDEs Acting on c-di-GMP and c-GAMP; the HD-GYP Subclass

HD-GYPs are a special subclass of the PDE subfamily and are functionally homologous to EAL proteins (typified by the glutamate-alanine-leucine residue triad) [1,58]. They can be single domain proteins or fusions to extra regulatory, sensory, or catalytic protein domains [8,59,60].
While cyclic-di-GMP is their most common substrate, the recently discovered hybrid dinucleotide, 3′3′c-GAMP, is also hydrolyzed by some HD-GYPs [59]. Out of the nine HD-GYPs encoded in Vibrio cholerae, VCA0681, VCA0931, and VCA0210 are the only HD-GYPs to hydrolyze both c-di-GMP and 3′3′c-GAMP. More recently, PmxA from Myxococcus xanthus was identified as a 3′3′c-GAMP specific PDE that is hardly active toward c-di-GMP or c-di-AMP [60]. Selectivity for 3′3′c-GAMP is attributed to a glutamine near the active site, although this residue is not conserved in VCA0681, VCA0931, and VCA0210, suggesting that the molecular origins for 3′3′cGAMP specificity may vary among HD-GYPs.
In addition to the seventh ligand added to their active site (i.e., an extra histidine adjacent to the last histidine of the motif), all active HD-GYPs have a glycine-tyrosine-proline (GYP) residue triad in a loop close to the active site (Figure 5) [14,35]. However, because single amino acid substitutions of each of the GYP domain residues to alanines hardly affect PDE activity [36], its role in catalysis and protein stability remains poorly understood. The GYP motif is considered important for interaction with the GGDEF cyclase (named after its highly conserved Gly-Gly-Asp-Glu-Phe sequence motif) [8] and serves as a substrate specificity element for the recognition of c-di-GMP and its hybrid 3′3′-cGAMP analog [40].
HD-GYPs differ on the basis of their active metal cofactor and possible catalytic outcomes. While most commonly harbor a dimetal cofactor, some incorporate a trinuclear cofactor by involving a glutamate residue as an eighth ligand to the site (Figure 5) [36]. The assembly of a dinuclear or trinuclear cofactor is presumed to afford different reaction outcomes. Dinuclear HD-GYPs can only perform a one-step hydrolysis, whereas trinuclear ones can perform a two-step hydrolysis, leading to the respective monophosphates. Metal ions that can stimulate hydrolysis are Fe, Mn, and, to a lesser extent, Co and Ni [40].
PmGH from Persephonella marina is the prototypical trinuclear HD-GYP and the first to be crystallographically characterized [36]. PmGH harbors a triiron cofactor with the third iron coordinated by the glutamate E185. The trimetal cofactor is additionally stabilized by three other crystallization molecules, invoking the possibility that other solvent molecules may be incorporated under physiological conditions (Figure 5). It is active with both Fe2+ and Mn2+. On the basis of primary amino acid sequence, PA4781 from Pseudomonas aeruginosa is also a putative trinuclear PDE; however, the available crystal structure shows two Ni ions in the active site at an elongated distance. PA4781 is unselective in its metal ion incorporation, has limited activity, and exhibits a preference for 5′-pGpG over c-di-GMP to form GMPs [14].
Only one structure of a dinuclear HD-GYP exists: Bd1817 from Bdellovibrio bacteriovorus. It harbors a diiron cofactor, but the presence of an asparagine instead of the last aspartate of the binding motif, a degraded GYP motif (Figure 5), and its complete inactivity toward c-di-GMP [35] do not allow for the inference of substrate positioning and specificity in dinuclear HD-GYPs.

1.4. HD-Domain Oxygenases

Most of the known HD-domain proteins are phosphohydrolases, but three members, namely myo-inositol oxygenase (MIOX), PhnZ, and TmpB, are monooxygenases and perform the oxidative cleavage of a C-X bond [11,12,13,15]. The discovery of this chemistry expands the catalytic repertoire of the HD-domain superfamily, and their conserved protein features may provide insight into the identification of yet-to-be-characterized HD-domain proteins as oxygenases.
The first discovered HD-domain oxygenase, MIOX, catalyzes the oxidative cleavage of a C-C bond of myo-inositol to form D-glucuronic acid (Figure 6) [11]. Myo-inositol is a precursor for inositol phosphoglycans, which act as insulin mediators, and altered inositol metabolism has been associated with diabetes. Therefore, the activity of MIOX is of increasing interest, as it presents a potential therapeutic target for treating both type-1 and type-2 diabetes.
PhnZ and TmpB were later established as oxygenases, demonstrating that MIOX is not a functional outlier [12,13]. Both PhnZ and TmpB are involved in the degradation of organophosphonates, which are compounds that serve as sources of inorganic phosphate for bacteria that occupy phosphate-limited environments (e.g., marine ecosystems) [12]. Unlike MIOX, PhnZ and TmpB act in tandem with the non-heme α-ketoglutarate (KG) dependent hydroxylases PhnY and TmpA, respectively, to cleave the C–P bond of their substrates (Figure 2) [12,13]. PhnY initiates the degradation of 2-aminoethylphosphonate (2-AEP) via the addition of a hydroxyl group to the C1 carbon in a stereospecific manner producing (R)-2-amino-1-hydroxyethyl phosphonate (OH-AEP) [12]. PhnZ performs the subsequent oxidative cleavage of the C-P bond of OH-AEP forming inorganic phosphate and glycine (Figure 2 and Figure 6) [12]. The TmpA/TmpB pathway is mechanistically similar, with the only difference being the nature of the substrate, i.e., 2-(trimethylammonio)ethyl phosphonate (TMAEP) for TmpA [13].
MIOX, PhnZ, and TmpB bind a catalytically essential diiron cofactor via the HD-domain sequence “H…HD…H…H…D” (Figure 6). Each iron is coordinated in an octahedral geometry and bridged by the carboxylate group of the first aspartate residue in the HD-domain sequence as well as a μ-oxo/hydroxo bridge (Figure 6) [15]. Unlike other dinuclear nonheme–iron oxygenases, which utilize the fully reduced FeII/FeII form of their cofactors, HD-domain oxygenases stabilize a mixed-valent FeII/FeIII state for the four-electron oxidation of a C-C/P bond and do not require an external reducing system for reactivation of the cofactor (i.e., after one substrate turnover the site returns to the FeII-FeIII form) [15]. Stabilization under the same redox conditions of the FeIIFeIII cofactor in oxygenases and the FeIIFeII cofactor in HD-domain hydrolases suggests that the HD-domain sites may tune activity through the modulation of cofactor reduction potentials [40].
The iron ion in the Fe1 site serves as a Lewis acid and binds the substrate such that the C-X bond is opposite the iron (Figure 6) [15]. Then, the second iron site (Fe2) is available to bind molecular oxygen, forming a FeIII/FeIII superoxo species that initiates oxidative cleavage by abstracting a hydrogen atom from the substrate. Following turnover, the active mixed-valent FeII/FeIII form is regenerated [12], and thus, there is no need for an external reducing system to reactivate the enzyme, which is a feature unique to HD-domain containing oxygenases.
Unlike MIOX, PhnZ and TmpB sequences contain a transient YxxE loop involved in substrate binding. Prior to substrate binding, the tyrosine (Y24 in PhnZ and Y30 in TmpB) is oriented toward the active site and binds to the Fe2 site, while the glutamate (E27 in PhnZ and E33 in TmpB) faces away from the active site [13,15]. Substrate binding induces a conformational change, positioning the glutamate within hydrogen bonding distance to the amino group of the substrate and causing the tyrosine–iron bond to break [13,15]. Dissociation of the tyrosine frees the Fe2 site for O2 binding and subsequent turnover and most likely serves as a protective mechanism to prevent oxidative inactivation of the active site (Figure 6).
Collectively, HD-domain oxygenases have catalytic and structural features that differ significantly not only from that of other nonheme–iron oxygenases, but also of HD-domain hydrolases. This divergence is useful as it can provide some descriptors to distinguish oxygenases from hydrolases within the HD-domain family. It is likely that these characteristics are conserved among all HD-domain oxygenases and may provide a critical first step into the characterization of other HD-domain proteins of unknown function.

1.5. Lyases

The HD-Domain Cyanamide Hydratase Ddi2

Cyanamide is a toxic nitrile compound used in agriculture both as a fertilizer and herbicide, as well as an alcohol deterrent for humans. Some soil-inhabiting microorganisms, such as Myrothecium verrucaria and Saccharomyces (S.) cerevisiae, have developed cyanamide biodegradation pathways as a likely defense mechanism against cyanamide-producing plants. The cyanamide hydratase Ddi2, although currently not well characterized, catalyzes the conversion of cyanamide to urea (Figure 7) and represents a functionally distinct subgroup of the HD-domain superfamily [37].
The S. cerevisiae Ddi2 has a unique HD metal binding motif, “H…HD”, in which the characteristic second aspartate residue is absent and T157 occupies that position. Ddi2 hydrates cyanamide with a catalytic efficiency of 566 M−1s−1 and utilizes residues H55, H88, and D89 to coordinate the active site Zn2+ [37]. The binding of cyanamide to the active site displaces a water molecule for direct coordination to the Zn2+ via a substrate nitrogen atom. It remains unclear whether substrate binding occurs by the amino nitrogen or the nitrile atom. However, it is likely that the nitrile atom binds the Zn directly in a similar fashion to carbonic anhydrase [37].

2. Conclusions

The HD-domain superfamily is multi-faceted with respect to its functional repertoire and structural diversity. Although apparently similar, HD-domain proteins can catalyze a variety of different reactions by incorporating protein-based and inorganic-based elements that tune their activities. The present review demonstrates that, to date, identification of the cofactor nuclearity, chemical nature, and substrate selectivity cannot be inferred solely on the basis of primary amino acid sequence or crystallography. This fact may be ascribed both to the plasticity of the HD motif to accommodate non-canonical residues or solvent ligands as well as the promiscuity of the motif for the coordination of divalent (and trivalent) metal ions.
To date, there is no established correlation between the chemical nature of the metallocofactor and the type of the reaction performed by the HD-domain enzymes. The employment of an Fe-based cofactor was initially thought to occur solely in oxygenases and be unfavorable for hydrolases due to the redox-active nature of Fe. However, more and more hydrolytic enzymes are found to incorporate catalytically relevant Fe-based cofactors, revoking this notion. Whereas some HD-domain hydrolases are selective for the types of divalent metal ions that can stimulate activity (e.g., PDEs1-11 HD-GYPs), others are rather promiscuous and can utilize most of the first-row transition metal ions (e.g., Cas3 PDEs). It is still puzzling why the metal ion selectivity is wide and why some of the proteins are highly selective, while others are highly unselective, especially when considering the high degree of conservation of the HD-domain binding motif. It could be that metal ion selection is linked to the environmental niches of the organisms in which the related HD-domain proteins are found or could reflect the apparent independence of a specific reaction on the exact chemical nature of the metal ion. Overall, there appears to be a loose relationship between the type of cofactor and the type of reaction catalyzed. All currently reported HD-GYP PDEs are dinuclear (or trinuclear), highlighting perhaps the importance of both metal ions in substrate binding and hydrolysis. All presently known oxygenases are also dinuclear. On the basis of their structure and mechanism, one of the metal ions is employed in molecular oxygen binding and the second one is involved in cosubstrate binding. However, cofactor selection on the basis of activity type breaks down for phosphatases, as both mono- and dinuclear active site configurations can act on similar (if not the same) substrates. There is an observed trend, in which bulkier substrates or compounds with multiple hydrolyzable bonds are most likely to be processed by multinuclear HD-domain enzymes. Presumably, in these cases the extended active site structure facilitates substrate binding and catalysis.
The diversity of the HD-domain metalloproteins is continuing to expand as more members become biochemically characterized, allowing for the identification of specific traits that can act as molecular descriptors and predictors of function. Within this ever-growing family, these features may aid in the assignment of HD-domain proteins with unknown activities as well as new hydrolases, oxygenases, lyases, or proteins of new unidentified chemistries.

Author Contributions

M.L., S.S., C.U., I.P., J.C., N.C., M.M., P.J., and M.-E.P. wrote the manuscript, A.M. retrieved and curated all the HD-domain crystal structures in the manuscript, M.L. prepared all the graphical content, M.M. performed the sequence similarity network analysis. All authors have read and agreed to the published version of the manuscript.

Funding

This research has been supported by the National Institutes of Health (Grant No. GM111978 and Grant No. GM126303 to M.-E.P.).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Aravind, L.; Koonin, E.V. The HD domain defines a new superfamily of metal-dependent phosphohydrolases. Trends Biochem. Sci. 1998, 23, 469–472. [Google Scholar] [CrossRef]
  2. Chen, S.; Bonifati, S.; Qin, Z.; St Gelais, C.; Wu, L. SAMHD1 Suppression of Antiviral Immune Responses. Trends Microbiol. 2018, 27, 254–267. [Google Scholar] [CrossRef]
  3. Zimmerman, M.D.; Proudfoot, M.; Yakunin, A.; Minor, W. Structural Insight into the Mechanism of Substrate Specificity and Catalytic Activity of an HD-Domain Phosphohydrolase: The 5′-Deoxyribonucleotidase YfbR from Escherichia coli. J. Mol. Biol. 2008, 378, 215–226. [Google Scholar] [CrossRef] [Green Version]
  4. Ji, X.; Tang, C.; Zhao, Q.; Wang, W.; Xiong, Y. Structural basis of cellular dNTP regulation by SAMHD1. Proc. Natl. Acad. Sci. USA 2014, 111, E4305–E4314. [Google Scholar] [CrossRef] [Green Version]
  5. Yang, J.; Wang, F.; Yang, D.; Zhou, K.; Liu, M.; Gao, Z.; Liu, P.; Dong, Y.; Zhang, J.; Liu, Q. Structural and biochemical characterization of the yeast HD domain containing protein YGK1 reveals a metal-dependent nucleoside 5ʹ-monophosphatase. Biochem. Biophys. Res. Commun. 2018, 501, 674–681. [Google Scholar] [CrossRef]
  6. Huynh, T.; Luo, S.; Pensinger, D.; Sauer, J.-D.; Tong, L.; Woodward, J. An HD-domain phosphodiesterase mediates cooperative hydrolysis of c-di-AMP to affect bacterial growth and virulence. Proc. Natl. Acad. Sci. USA 2015, 112, E747–E756. [Google Scholar] [CrossRef] [Green Version]
  7. Dow, J.M.; Fouhy, Y.; Lucey, J.F.; Ryan, R.P. The HD-GYP Domain, Cyclic Di-GMP Signaling, and Bacterial Virulence to Plants. Mol. Plant-Microbe Interact. 2006, 19, 1378–1384. [Google Scholar] [CrossRef] [Green Version]
  8. Ryan, R.P.; McCarthy, Y.; Andrade, M.; Farah, C.S.; Armitage, J.P.; Dow, J.M. Cell-cell signal-dependent dynamic interactions between HD-GYP and GGDEF domain proteins mediate virulence in Xanthomonas campestris. Proc. Natl. Acad. Sci. USA 2010, 107, 5989–5994. [Google Scholar] [CrossRef] [Green Version]
  9. Atkinson, G.C.; Tenson, T.; Hauryliuk, V. The RelA/SpoT Homolog (RSH) Superfamily: Distribution and Functional Evolution of ppGpp Synthetases and Hydrolases across the Tree of Life. PLoS ONE 2011, 6, e23479. [Google Scholar] [CrossRef] [Green Version]
  10. Minazzato, G.; Gasparrini, M.; Amici, A.; Cianci, M.; Mazzola, F.; Orsomando, G.; Sorci, L.; Raffaelli, N. Functional Characterization of COG1713 (YqeK) as a Novel Diadenosine Tetraphosphate Hydrolase Family. J. Bacteriol. 2020, 202, e00053-20. [Google Scholar] [CrossRef]
  11. Brown, P.M.; Caradoc-Davies, T.T.; Dickson, J.M.J.; Cooper, G.J.S.; Loomes, K.M.; Baker, E.N. Crystal structure of a substrate complex of myo-inositol oxygenase, a di-iron oxygenase with a key role in inositol metabolism. Proc. Natl. Acad. Sci. USA 2006, 103, 15032–15037. [Google Scholar] [CrossRef] [Green Version]
  12. Wörsdörfer, B.; Lingaraju, M.; Yennawar, N.H.; Boal, A.K.; Krebs, C.; Bollinger, J.M.; Pandelia, M.-E. Organophosphonate-degrading PhnZ reveals an emerging family of HD domain mixed-valent diiron oxygenases. Proc. Natl. Acad. Sci. USA 2013, 110, 18874–18879. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Rajakovich, L.J.; Pandelia, M.-E.; Mitchell, A.J.; Chang, W.-C.; Zhang, B.; Boal, A.K.; Krebs, C.; Bollinger, J.M. A New Microbial Pathway for Organophosphonate Degradation Catalyzed by Two Previously Misannotated Non-Heme-Iron Oxygenases. Biochemistry 2019, 58, 1627–1647. [Google Scholar] [CrossRef] [PubMed]
  14. Rinaldo, S.; Paiardini, A.; Stelitano, V.; Brunotti, P.; Cervoni, L.; Fernicola, S.; Protano, C.; Vitali, M.; Cutruzzolà, F.; Giardina, G. Structural Basis of Functional Diversification of the HD-GYP Domain Revealed by the Pseudomonas aeruginosa PA4781 Protein, Which Displays an Unselective Bimetallic Binding Site. J. Bacteriol. 2015, 197, 1525–1535. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Van Staalduinen, L.M.; McSorley, F.R.; Schiessl, K.; Séguin, J.; Wyatt, P.B.; Hammerschmidt, F.; Zechel, D.L.; Jia, Z. Crystal structure of PhnZ in complex with substrate reveals a di-iron oxygenase mechanism for catabolism of organophosphonates. Proc. Natl. Acad. Sci. USA 2014, 111, 5171–5176. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Jeon, Y.J.; Park, S.C.; Song, W.S.; Kim, O.-H.; Oh, B.-C.; Yoon, S.-I. Structural and biochemical characterization of bacterial YpgQ protein reveals a metal-dependent nucleotide pyrophosphohydrolase. J. Struct. Biol. 2016, 195, 113–122. [Google Scholar] [CrossRef]
  17. Hogg, T.; Mechold, U.; Malke, H.; Cashel, M.; Hilgenfeld, R. Conformational antagonism between opposing active sites in a bifunctional RelA/SpoT homolog modulates (p)ppGpp metabolism during the stringent response. Cell 2004, 117, 57–68. [Google Scholar] [CrossRef] [Green Version]
  18. Goldstone, D.C.; Ennis-Adeniran, V.; Hedden, J.J.; Groom, H.C.; Rice, G.I.; Christodoulou, E.; Walker, P.A.; Kelly, G.; Haire, L.F.; Yap, M.W.; et al. HIV-1 restriction factor SAMHD1 is a deoxynucleoside triphosphate triphosphohydrolase. Nature 2011, 480, 379–382. [Google Scholar] [CrossRef]
  19. Morris, E.R.; Caswell, S.J.; Kunzelmann, S.; Arnold, L.H.; Purkiss, A.G.; Kelly, G.; Taylor, I.A. Crystal structures of SAMHD1 inhibitor complexes reveal the mechanism of water-mediated dNTP hydrolysis. Nat. Commun. 2020, 11, 3165. [Google Scholar] [CrossRef]
  20. Vorontsov, I.I.; Minasov, G.; Kiryukhina, O.; Brunzelle, J.S.; Shuvalova, L.; Anderson, W.F. Characterization of the deoxynucleotide triphosphate triphosphohydrolase (dNTPase) activity of the EF1143 protein from Enterococcus faecalis and crystal structure of the activator-substrate complex. J. Biol. Chem. 2011, 286, 33158–33166. [Google Scholar] [CrossRef] [Green Version]
  21. Vorontsov, I.I.; Wu, Y.; DeLucia, M.; Minasov, G.; Mehrens, J.; Shuvalova, L.; Anderson, W.F.; Ahn, J. Mechanisms of Allosteric Activation and Inhibition of the Deoxyribonucleoside Triphosphate Triphosphohydrolase from Enterococcus faecalis. J. Biol. Chem. 2014, 289, 2815–2824. [Google Scholar] [CrossRef] [Green Version]
  22. Bridwell-Rabb, J.; Kang, G.; Zhong, A.; Liu, H.-w.; Drennan, C.L. An HD domain phosphohydrolase active site tailored for oxetanocin-A biosynthesis. Proc. Natl. Acad. Sci. USA 2016, 113, 13750–13755. [Google Scholar] [CrossRef] [Green Version]
  23. Huo, Y.; Nam, K.H.; Ding, F.; Lee, H.; Wu, L.; Xiao, Y.; Farchione, M.D.; Zhou, S.; Rajashankar, K.; Kurinov, I.; et al. Structures of CRISPR Cas3 offer mechanistic insights into Cascade-activated DNA unwinding and degradation. Nat. Struct. Mol. Biol. 2014, 21, 771–777. [Google Scholar] [CrossRef] [Green Version]
  24. Gong, B.; Shin, M.; Sun, J.; Jung, C.H.; Bolt, E.L.; van der Oost, J.; Kim, J.S. Molecular insights into DNA interference by CRISPR-associated nuclease-helicase Cas3. Proc. Natl. Acad. Sci. USA 2014, 111, 16359–16364. [Google Scholar] [CrossRef] [Green Version]
  25. Beloglazova, N.; Petit, P.; Flick, R.; Brown, G.; Savchenko, A.; Yakunin, A.F. Structure and activity of the Cas3 HD nuclease MJ0384, an effector enzyme of the CRISPR interference. EMBO J. 2011, 30, 4616–4627. [Google Scholar] [CrossRef]
  26. Benda, C.; Ebert, J.; Scheltema, R.A.; Schiller, H.B.; Baumgärtner, M.; Bonneau, F.; Mann, M.; Conti, E. Structural Model of a CRISPR RNA-Silencing Complex Reveals the RNA-Target Cleavage Activity in Cmr4. Mol. Cell 2014, 56, 43–54. [Google Scholar] [CrossRef] [Green Version]
  27. Pandit, J.; Forman, M.D.; Fennell, K.F.; Dillman, K.S.; Menniti, F.S. Mechanism for the allosteric regulation of phosphodiesterase 2A deduced from the X-ray structure of a near full-length construct. Proc. Natl. Acad. Sci. USA 2009, 106, 18225–18230. [Google Scholar] [CrossRef] [Green Version]
  28. Scapin, G.; Patel, S.B.; Chung, C.; Varnerin, J.P.; Edmondson, S.D.; Mastracchio, A.; Parmee, E.R.; Singh, S.B.; Becker, J.W.; Van der Ploeg, L.H.T.; et al. Crystal Structure of Human Phosphodiesterase 3B:  Atomic Basis for Substrate and Inhibitor Specificity. Biochemistry 2004, 43, 6091–6100. [Google Scholar] [CrossRef]
  29. Xu, R.X.; Hassell, A.M.; Vanderwall, D.; Lambert, M.H.; Holmes, W.D.; Luther, M.A.; Rocque, W.J.; Milburn, M.V.; Zhao, Y.; Ke, H.; et al. Atomic Structure of PDE4: Insights into Phosphodiesterase Mechanism and Specificity. Science 2000, 288, 1822–1825. [Google Scholar] [CrossRef]
  30. Zhang, K.Y.J.; Card, G.L.; Suzuki, Y.; Artis, D.R.; Fong, D.; Gillette, S.; Hsieh, D.; Neiman, J.; West, B.L.; Zhang, C.; et al. A glutamine switch mechanism for nucleotide selectivity by phosphodiesterases. Mol. Cell 2004, 15, 279–286. [Google Scholar] [CrossRef]
  31. Kawai, K.; Endo, Y.; Asano, T.; Amano, S.; Sawada, K.; Ueo, N.; Takahashi, N.; Sonoda, Y.; Nagai, M.; Kamei, N.; et al. Discovery of 2-(cyclopentylamino)thieno [3,2-d]pyrimidin-4(3H)-one derivatives as a new series of potent phosphodiesterase 7 inhibitors. J. Med. Chem. 2014, 57, 9844–9854. [Google Scholar] [CrossRef]
  32. Wang, H.; Yan, Z.; Yang, S.; Cai, J.; Robinson, H.; Ke, H. Kinetic and Structural Studies of Phosphodiesterase-8A and Implication on the Inhibitor Selectivity. Biochemistry 2008, 47, 12760–12768. [Google Scholar] [CrossRef] [Green Version]
  33. Liu, S.; Mansour, M.N.; Dillman, K.S.; Perez, J.R.; Danley, D.E.; Aeed, P.A.; Simons, S.P.; LeMotte, P.K.; Menniti, F.S. Structural basis for the catalytic mechanism of human phosphodiesterase 9. Proc. Natl. Acad. Sci. USA 2008, 105, 13309–13314. [Google Scholar] [CrossRef] [Green Version]
  34. Wang, H.; Liu, Y.; Hou, J.; Zheng, M.; Robinson, H.; Ke, H. Structural insight into substrate specificity of phosphodiesterase 10. Proc. Natl. Acad. Sci. USA 2007, 104, 5782–5787. [Google Scholar] [CrossRef] [Green Version]
  35. Lovering, A.L.; Capeness, M.J.; Lambert, C.; Hobley, L.; Sockett, R.E. The Structure of an Unconventional HD-GYP Protein from Bdellovibrio Reveals the Roles of Conserved Residues in this Class of Cyclic-di-GMP Phosphodiesterases. mBio 2011, 2, e00163-11. [Google Scholar] [CrossRef] [Green Version]
  36. Bellini, D.; Caly, D.L.; McCarthy, Y.; Bumann, M.; An, S.-Q.; Dow, J.M.; Ryan, R.P.; Walsh, M.A. Crystal structure of an HD-GYP domain cyclic-di-GMP phosphodiesterase reveals an enzyme with a novel trinuclear catalytic iron centre. Mol. Microbiol. 2014, 91, 26–38. [Google Scholar] [CrossRef] [Green Version]
  37. Li, J.; Jia, Y.; Lin, A.; Hanna, M.; Chelico, L.; Xiao, W.; Moore, S.A. Structure of Ddi2, a highly inducible detoxifying metalloenzyme from Saccharomyces cerevisiae. J. Biol. Chem. 2019, 294, 10674–10685. [Google Scholar] [CrossRef]
  38. Barnes, C.O.; Wu, Y.; Song, J.; Lin, G.; Baxter, E.L.; Brewster, A.S.; Nagarajan, V.; Holmes, A.; Soltis, S.M.; Sauter, N.K.; et al. The crystal structure of dGTPase reveals the molecular basis of dGTP selectivity. Proc. Natl. Acad. Sci. USA 2019, 116, 9333–9339. [Google Scholar] [CrossRef] [Green Version]
  39. Oussenko, I.A.; Sanchez, R.; Bechhofer, D.H. Bacillus subtilis YhaM, a member of a new family of 3′-to-5′ exonucleases in gram-positive bacteria. J. Bacteriol. 2002, 184, 6250–6259. [Google Scholar] [CrossRef] [Green Version]
  40. Sun, S.; Pandelia, M.-E. HD-[HD-GYP] Phosphodiesterases: Activities and Evolutionary Diversification within the HD-GYP Family. Biochemistry 2020, 59, 2340–2350. [Google Scholar] [CrossRef]
  41. Biswas, K.H.; Sopory, S.; Visweswariah, S.S. The GAF Domain of the cGMP-Binding, cGMP-Specific Phosphodiesterase (PDE5) Is a Sensor and a Sink for cGMP. Biochemistry 2008, 47, 3534–3543. [Google Scholar] [CrossRef] [PubMed]
  42. Hauryliuk, V.; Atkinson, G.C.; Murakami, K.S.; Tenson, T.; Gerdes, K. Recent functional insights into the role of (p)ppGpp in bacterial physiology. Nat. Rev. Microbiol. 2015, 13, 298–309. [Google Scholar] [CrossRef] [Green Version]
  43. Ji, X.; Zou, J.; Peng, H.; Stolle, A.-S.; Xie, R.; Zhang, H.; Peng, B.; Mekalanos, J.J.; Zheng, J. Alarmone Ap4A is elevated by aminoglycoside antibiotics and enhances their bactericidal activity. Proc. Natl. Acad. Sci. USA 2019, 116, 9578–9585. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Neidhardt, F.C. (Ed.) The Stringent Response. In Escherichia coli and Salmonella: Cellular and Molecular Biology; ASM Press: Washington, DC, USA, 1996; pp. 1458–1496. [Google Scholar]
  45. Xiao, H.; Kalman, M.; Ikehara, K.; Zemel, S.; Glaser, G.; Cashel, M. Residual guanosine 3′,5′-bispyrophosphate synthetic activity of relA null mutants can be eliminated by spoT null mutations. J. Biol. Chem. 1991, 266, 5980–5990. [Google Scholar]
  46. Mauney, C.H.; Hollis, T. SAMHD1: Recurring roles in cell cycle, viral restriction, cancer, and innate immunity. Autoimmunity 2018, 51, 96–110. [Google Scholar] [CrossRef]
  47. Ji, X.; Wu, Y.; Yan, J.; Mehrens, J.; Yang, H.; DeLucia, M.; Hao, C.; Gronenborn, A.M.; Skowronski, J.; Ahn, J.; et al. Mechanism of allosteric activation of SAMHD1 by dGTP. Nat. Struct. Mol. Biol. 2013, 20, 1304–1309. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Hollenbaugh, J.A.; Shelton, J.; Tao, S.; Amiralaei, S.; Liu, P.; Lu, X.; Goetze, R.W.; Zhou, L.; Nettles, J.H.; Schinazi, R.F.; et al. Substrates and Inhibitors of SAMHD1. PLoS ONE 2017, 12, e0169052. [Google Scholar] [CrossRef]
  49. Hansen, E.C.; Seamon, K.J.; Cravens, S.L.; Stivers, J.T. GTP activator and dNTP substrates of HIV-1 restriction factor SAMHD1 generate a long-lived activated state. Proc. Natl. Acad. Sci. USA 2014, 111, E1843–E1851. [Google Scholar] [CrossRef] [Green Version]
  50. Morisaka, H.; Yoshimi, K.; Okuzaki, Y.; Gee, P.; Kunihiro, Y.; Sonpho, E.; Xu, H.; Sasakawa, N.; Naito, Y.; Nakada, S.; et al. CRISPR-Cas3 induces broad and unidirectional genome editing in human cells. Nat. Commun. 2019, 10, 5302. [Google Scholar] [CrossRef]
  51. Pickar-Oliver, A.; Gersbach, C.A. The next generation of CRISPR–Cas technologies and applications. Nat. Rev. Mol. Cell Biol. 2019, 20, 490–507. [Google Scholar] [CrossRef]
  52. Boswell-Smith, V.; Spina, D.; Page, C.P. Phosphodiesterase inhibitors. Br. J. Pharmacol. 2006, 147, S252–S257. [Google Scholar] [CrossRef]
  53. Eskandari, N.; Mirmosayyeb, O.; Bordbari, G.; Bastan, R.; Yousefi, Z.; Andalib, A. A short review on structure and role of cyclic-3′,5′-adenosine monophosphate-specific phosphodiesterase 4 as a treatment tool. J. Res. Pharm. Pract. 2015, 4, 175–181. [Google Scholar] [CrossRef]
  54. Heikaus, C.C.; Pandit, J.; Klevit, R.E. Cyclic nucleotide binding GAF domains from phosphodiesterases: Structural and mechanistic insights. Structure 2009, 17, 1551–1557. [Google Scholar] [CrossRef] [Green Version]
  55. Tian, Y.; Cui, W.; Huang, M.; Robinson, H.; Wan, Y.; Wang, Y.; Ke, H. Dual specificity and novel structural folding of yeast phosphodiesterase-1 for hydrolysis of second messengers cyclic adenosine and guanosine 3′,5′-monophosphate. Biochemistry 2014, 53, 4938–4945. [Google Scholar] [CrossRef]
  56. Luo, Y.; Helmann, J.D. Analysis of the role of Bacillus subtilis σ(M) in β-lactam resistance reveals an essential role for c-di-AMP in peptidoglycan homeostasis. Mol. Microbiol. 2012, 83, 623–639. [Google Scholar] [CrossRef] [Green Version]
  57. Fahmi, T.; Port, G.C.; Cho, K.H. c-di-AMP: An Essential Molecule in the Signaling Pathways that Regulate the Viability and Virulence of Gram-Positive Bacteria. Genes 2017, 8, 197. [Google Scholar] [CrossRef] [Green Version]
  58. Galperin, M.Y.; Natale, D.A.; Aravind, L.; Koonin, E.V. A specialized version of the HD hydrolase domain implicated in signal transduction. J. Mol. Microbiol. Biotechnol. 1999, 1, 303–305. [Google Scholar]
  59. Gao, J.; Tao, J.; Liang, W.; Zhao, M.; Du, X.; Cui, S.; Duan, H.; Kan, B.; Su, X.; Jiang, Z. Identification and characterization of phosphodiesterases that specifically degrade 3′3′-cyclic GMP-AMP. Cell Res. 2015, 25, 539–550. [Google Scholar] [CrossRef]
  60. Wright, T.A.; Jiang, L.; Park, J.J.; Anderson, W.A.; Chen, G.; Hallberg, Z.F.; Nan, B.; Hammond, M.C. Second messengers and divergent HD-GYP phosphodiesterases regulate 3′,3′-cGAMP signaling. Mol. Microbiol. 2019, 113, 222–236. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure 1. (A) Helical structure of three HD-domain proteins. YqeK (PDB: 2O08) is a phosphatase, PhnZ (PDB: 4N6W) is an oxygenase, and Ddi2 (PDB: 6DKA) is a lyase. All exhibit a helical fold characteristic to HD-domain proteins despite their diverse functions. (B) Sequence similarity network (SSN) of the HD-domain superfamily depicting its size and functional clustering. The SSN was generated via the Enzyme Function Initiatives-Enzyme Similarity tool (EFI-EST) and visualized in Cytoscape. The SSN was generated by employing the IPR003607 family and tailored so that nodes represent sequences with ≥ 50% identity and an e-value of 5. The SSN was further refined to contain the major protein clusters (size-wise), which amount to 183,015 unique protein sequences. Edges between nodes represent an alignment score of 70. HD-domain phosphohydrolases (SpoT/RelA, SAMHD1, deoxyguanosine phosphatases (dGTPases), nucleotidyltransferases) are represented in green and blue, while hydratases are shown in yellow. PDEs are shown in red (HD-GYP proteins), light pink (exoribonucleases), orange (Cas proteins), and pink (PDEases). Oxygenases are shown in purple and their cluster, which consists of four nodes (372 sequences), is enlarged for visualization. Gray clusters contain proteins of unidentified function.
Figure 1. (A) Helical structure of three HD-domain proteins. YqeK (PDB: 2O08) is a phosphatase, PhnZ (PDB: 4N6W) is an oxygenase, and Ddi2 (PDB: 6DKA) is a lyase. All exhibit a helical fold characteristic to HD-domain proteins despite their diverse functions. (B) Sequence similarity network (SSN) of the HD-domain superfamily depicting its size and functional clustering. The SSN was generated via the Enzyme Function Initiatives-Enzyme Similarity tool (EFI-EST) and visualized in Cytoscape. The SSN was generated by employing the IPR003607 family and tailored so that nodes represent sequences with ≥ 50% identity and an e-value of 5. The SSN was further refined to contain the major protein clusters (size-wise), which amount to 183,015 unique protein sequences. Edges between nodes represent an alignment score of 70. HD-domain phosphohydrolases (SpoT/RelA, SAMHD1, deoxyguanosine phosphatases (dGTPases), nucleotidyltransferases) are represented in green and blue, while hydratases are shown in yellow. PDEs are shown in red (HD-GYP proteins), light pink (exoribonucleases), orange (Cas proteins), and pink (PDEases). Oxygenases are shown in purple and their cluster, which consists of four nodes (372 sequences), is enlarged for visualization. Gray clusters contain proteins of unidentified function.
Catalysts 10 01191 g001
Figure 2. Known substrates of HD-domain proteins. Phosphatases can remove one to three terminal phosphate groups from (deoxy)ribonucleotides or cleave (a)symmetrically polyphosphate containing nucleotides (represented in gray). The position of cleavage has been highlighted in red for substrates with four phosphates. PDEs hydrolyze phosphodiester bonds of cyclic (di)nucleotide substrates via either one-step hydrolysis (cleavage of one side of the diester bond) releasing a linearized product or two-step hydrolysis releasing individual nucleotides. PDEs can also act on RNA and DNA substrates. HD-domain oxygenases cleave a C-X bond (indicated in red).
Figure 2. Known substrates of HD-domain proteins. Phosphatases can remove one to three terminal phosphate groups from (deoxy)ribonucleotides or cleave (a)symmetrically polyphosphate containing nucleotides (represented in gray). The position of cleavage has been highlighted in red for substrates with four phosphates. PDEs hydrolyze phosphodiester bonds of cyclic (di)nucleotide substrates via either one-step hydrolysis (cleavage of one side of the diester bond) releasing a linearized product or two-step hydrolysis releasing individual nucleotides. PDEs can also act on RNA and DNA substrates. HD-domain oxygenases cleave a C-X bond (indicated in red).
Catalysts 10 01191 g002
Figure 3. Active sites of HD-domain phosphohydrolases. Mononuclear HD-domain phosphohydrolases utilize a conserved motif “H…HD…D” to bind a variety of metals including cobalt (pink), zinc (purple), magnesium (light green), nickel (dark green), or iron (orange). Small red spheres represent water molecules. The dinuclear phosphatase YqeK harbors two extra histidines between the HD and D residues to stabilize the second metal ion. Phosphatases are classified into mono-, di-, or triphosphohydrolases, labeled in light green, dark green, and blue, respectively. All phosphohydrolases have a conserved arginine (shown in teal), which is located typically three residues prior to the first histidine of the HD motif and in the vicinity of the oxygens of the substrate phosphate group. Other important residues are shown in pale blue and are described in the text.
Figure 3. Active sites of HD-domain phosphohydrolases. Mononuclear HD-domain phosphohydrolases utilize a conserved motif “H…HD…D” to bind a variety of metals including cobalt (pink), zinc (purple), magnesium (light green), nickel (dark green), or iron (orange). Small red spheres represent water molecules. The dinuclear phosphatase YqeK harbors two extra histidines between the HD and D residues to stabilize the second metal ion. Phosphatases are classified into mono-, di-, or triphosphohydrolases, labeled in light green, dark green, and blue, respectively. All phosphohydrolases have a conserved arginine (shown in teal), which is located typically three residues prior to the first histidine of the HD motif and in the vicinity of the oxygens of the substrate phosphate group. Other important residues are shown in pale blue and are described in the text.
Catalysts 10 01191 g003
Figure 4. Active sites of HD-domain PDEs. HD-domain PDEs utilize the conserved HD motif “H…HD…H…HH…D” to bind a di- or trinuclear metal center. The metal ions coordinated in their active sites are zinc (purple), magnesium (light green), nickel (dark green), or iron (orange). Small red spheres represent water molecules.
Figure 4. Active sites of HD-domain PDEs. HD-domain PDEs utilize the conserved HD motif “H…HD…H…HH…D” to bind a di- or trinuclear metal center. The metal ions coordinated in their active sites are zinc (purple), magnesium (light green), nickel (dark green), or iron (orange). Small red spheres represent water molecules.
Catalysts 10 01191 g004
Figure 5. Active site of HD-GYPs. HD-GYP proteins utilize an “H…HD…H…HH…D” motif that typically binds a dinuclear metal center. The third metal ion in the PmGH active site is stabilized by crystallization molecules shown in gray. In addition, these enzymes contain a GYP residue triad vicinal to the active site (shown in blue), the importance of which is currently unclear. Bd1817, which is inactive toward c-di-GMP, lacks the GYP tyrosine and the terminal aspartate is an asparagine.
Figure 5. Active site of HD-GYPs. HD-GYP proteins utilize an “H…HD…H…HH…D” motif that typically binds a dinuclear metal center. The third metal ion in the PmGH active site is stabilized by crystallization molecules shown in gray. In addition, these enzymes contain a GYP residue triad vicinal to the active site (shown in blue), the importance of which is currently unclear. Bd1817, which is inactive toward c-di-GMP, lacks the GYP tyrosine and the terminal aspartate is an asparagine.
Catalysts 10 01191 g005
Figure 6. Active sites of HD-domain oxygenases. Oxygenases utilize the “H…HD…H…H…D” motif to bind a diiron metal center. The substrate scissile bond is positioned above one of the iron sites, leaving the second site open for oxygen binding. PhnZ and TmpB contain an YxxE loop (green) in their primary sequence that is located vicinal to the active site, which upon substrate binding undergoes a conformational change to allow for oxygen binding and catalysis.
Figure 6. Active sites of HD-domain oxygenases. Oxygenases utilize the “H…HD…H…H…D” motif to bind a diiron metal center. The substrate scissile bond is positioned above one of the iron sites, leaving the second site open for oxygen binding. PhnZ and TmpB contain an YxxE loop (green) in their primary sequence that is located vicinal to the active site, which upon substrate binding undergoes a conformational change to allow for oxygen binding and catalysis.
Catalysts 10 01191 g006
Figure 7. Reaction and active site of the HD-domain hydratase Ddi2. Ddi2 utilizes a Zn metal center to convert cyanamide to urea. The terminal aspartate residue found in HD-domain proteins is a replaced by a threonine in Ddi2. The role of this threonine, T157 (valine in the crystal structure), is predicted to allow for substrate positioning or metal specificity [37].
Figure 7. Reaction and active site of the HD-domain hydratase Ddi2. Ddi2 utilizes a Zn metal center to convert cyanamide to urea. The terminal aspartate residue found in HD-domain proteins is a replaced by a threonine in Ddi2. The role of this threonine, T157 (valine in the crystal structure), is predicted to allow for substrate positioning or metal specificity [37].
Catalysts 10 01191 g007
Table 1. List of representative histidine–aspartate (HD)-domain proteins from the three known subclasses, oxygenases, phosphatases and phosphodiesterases (PDEs), that are crystallographically and biochemically characterized.
Table 1. List of representative histidine–aspartate (HD)-domain proteins from the three known subclasses, oxygenases, phosphatases and phosphodiesterases (PDEs), that are crystallographically and biochemically characterized.
SubclassesProteinNuclearityActive MetalChemistrySubstratePDB IDOriginReferences
oxygenasesMIOXdinuclearFeOxygenasemyo-inositol2HUOMus musculus[11]
PhnZdinuclearFeOxygenaseOH-AEP4MLMbacterium HF130_AEPn_1[12,15]
TmpBdinuclearFeOxygenaseOH-TMAEP6NPALeisingera caerulea[13]
phosphatasesYfbRmononuclearCoMonophosphatasedAMP2PAQEscherichia coli K-12[3]
YGK1mononuclearMnMonophosphatasedNMP5YOXSaccharomyces cerevisiae[5]
YqeKdinuclearFeDiphosphataseAp4A2O08Bacillus halodurans[10]
YpgQmononuclearMnDiphosphatasedNTP5DQVBacillus subtilis[16]
SpoTmononuclearMnDiphosphatase(p)ppGpp1VJ7Streptococcus dysgalactiae[17]
SAMHD1mononuclearMgTriphosphatasedNTP3U1NHomo sapiens[18,19]
EF1143mononuclearMgTriphosphatasedNTP4LRLEnterococcus faecalis V583[20,21]
OxsAMono/dinuclear *CoMono/Di/TriphosphataseOxetanocin-A5TK8Bacillus megaterium[22]
phosphodiesterasesCas3dinuclearCoPDEssDNA4QQWThermobifida fusca YX[23]
Cas3dinuclearNiPDEssDNA4Q2CThermobaculum terrenum[24]
Cas3’’dinuclearCaPDEssDNA3S4LMethanocaldococcus jannaschii[25]
Cas10dinuclearNi, MnPDEssDNA4W8YPyrococcus furiosus[26]
PDE1-3dinuclear #Mg, MnPDEcAMP, cGMP1TAZ, 3ITU, 1SO2Homo sapiens[27,28]
PDE4dinuclear #Mg, MnPDEcAMP1F0JHomo sapiens[29]
PDE5dinuclear #MgPDEcGMP1TBFHomo sapiens[30]
PDE7-8dinuclear #MnPDEcAMP4PM0, 3ECMHomo sapiens[31,32]
PDE9dinuclear #Mn, MgPDEcGMP3DY8Homo sapiens[33]
PDE10dinuclear #MgPDEcAMP, cGMP2OUNHomo sapiens[34]
PgpHdinuclearMnPDEc-di-AMP4S1BListeria monocytogenes[6]
Bd1817dinuclearFe *PDEc-di-GMP3TM8Bdellovibrio bacteriovorus[35]
PmGHtrinuclearFe, MnPDEc-di-GMP4MCWPersephonella marina EX-H1[36]
PA4781trinuclear &MgPDEc-di-GMP4R8ZPseudomonas aeruginosa PAO1[14]
lyasesDdi2mononuclearZnHydrataseCyanamide6DKASaccharomyces cerevisiae[37]
# Denotes proteins for which the crystal structure shows two active site metal ions at an average interatomic distance of ≈3.8 Å. The primary sequence suggests a mononuclear binding site. In phosphodiesterases (PDEs), the second metal ion is stabilized by the aspartate of the HD motif, a bridging hydroxide and four terminally ligated water molecules. & Although no experimental evidence currently exists, PA4781 belongs the trinuclear clade of HD-GYP proteins as inferred from its primary amino acid sequence. * Bd1817 is inactive toward Bis-(3′-5′)-cyclic guanosine monophosphate (c-di-GMP); therefore, the active metal ion refers to the metal ion observed in the crystal structure. OH-AEP stands for 1-hydroxy-2-aminoethylphosphonate, OH-TMAEP stands for 1-hydroxy-2-(trimethylammonio)ethylphosphonate, dNMP stands for deoxymonophosphate, in which N can be A, G, U, C, Ap4A stands for diadenosine tetraphosphate, cGMP stands for guanosine 3′,5′-cyclic monophosphate, cAMP stands for adenosine 3′,5′-cyclic monophosphate, c-di-AMP stands for Bis-(3′-5′)-cyclic adenosine monophosphate.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Langton, M.; Sun, S.; Ueda, C.; Markey, M.; Chen, J.; Paddy, I.; Jiang, P.; Chin, N.; Milne, A.; Pandelia, M.-E. The HD-Domain Metalloprotein Superfamily: An Apparent Common Protein Scaffold with Diverse Chemistries. Catalysts 2020, 10, 1191. https://doi.org/10.3390/catal10101191

AMA Style

Langton M, Sun S, Ueda C, Markey M, Chen J, Paddy I, Jiang P, Chin N, Milne A, Pandelia M-E. The HD-Domain Metalloprotein Superfamily: An Apparent Common Protein Scaffold with Diverse Chemistries. Catalysts. 2020; 10(10):1191. https://doi.org/10.3390/catal10101191

Chicago/Turabian Style

Langton, Michelle, Sining Sun, Chie Ueda, Max Markey, Jiahua Chen, Isaac Paddy, Paul Jiang, Natalie Chin, Amy Milne, and Maria-Eirini Pandelia. 2020. "The HD-Domain Metalloprotein Superfamily: An Apparent Common Protein Scaffold with Diverse Chemistries" Catalysts 10, no. 10: 1191. https://doi.org/10.3390/catal10101191

APA Style

Langton, M., Sun, S., Ueda, C., Markey, M., Chen, J., Paddy, I., Jiang, P., Chin, N., Milne, A., & Pandelia, M. -E. (2020). The HD-Domain Metalloprotein Superfamily: An Apparent Common Protein Scaffold with Diverse Chemistries. Catalysts, 10(10), 1191. https://doi.org/10.3390/catal10101191

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop