Fine-Tuning of Sequence Specificity by Near Attack Conformations in Enzyme-Catalyzed Peptide Hydrolysis

Sadiq, S. Kashif

doi:10.3390/catal10060684

Open AccessArticle

Fine-Tuning of Sequence Specificity by Near Attack Conformations in Enzyme-Catalyzed Peptide Hydrolysis

by

S. Kashif Sadiq

^1,2

¹

Heidelberg Institute for Theoretical Studies, Schloss-Wolfsbrunnenweg 35, 69118 Heidelberg, Germany

²

Infection Biology Unit, Universitat Pompeu Fabra, Carrer Doctor Aiguader 88, Barcelona Biomedical Research Park, 08003 Barcelona, Spain

Catalysts 2020, 10(6), 684; https://doi.org/10.3390/catal10060684

Submission received: 13 April 2020 / Revised: 31 May 2020 / Accepted: 8 June 2020 / Published: 18 June 2020

(This article belongs to the Special Issue Quantum Chemical Modelling of Enzymatic Reactions)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

The catalytic role of near attack conformations (NACs), molecular states that lie on the pathway between the ground state (GS) and transition state (TS) of a chemical reaction, is not understood completely. Using a computational approach that combines Bürgi–Dunitz theory with all-atom molecular dynamics simulations, the role of NACs in catalyzing the first stages of HIV-1 protease peptide hydrolysis was previously investigated using a substrate that represents the recognized SP1-NC cleavage site of the HIV-1 Gag polyprotein. NACs were found to confer no catalytic effect over the uncatalyzed reaction there (

Δ Δ G_{N}^{‡} \sim

0 kcal/mol). Here, using the same approach, the role of NACs across multiple substrates that each represent a further recognized cleavage site is investigated. Overall rate enhancement varies by

| Δ Δ G^{‡} | \sim

12–15 kcal/mol across this set, and although NACs contribute a small and approximately constant barrier to the uncatalyzed reaction (<

Δ

G

_{N}^{‡ u}

> = 4.3 ± 0.3 kcal/mol), they are found to contribute little significant catalytic effect (

| Δ Δ G_{N}^{‡} | \sim

0–2 kcal/mol). Furthermore, no correlation is exhibited between NAC contributions and the overall energy barrier (

R^{2}

= 0.01). However, these small differences in catalyzed NAC contributions enable rates to match those required for the kinetic order of processing. Therefore, NACs may offer an alternative and subtle mode compared to non-NAC contributions for fine-tuning reaction rates during complex evolutionary sequence selection processes—in this case across cleavable polyproteins whose constituents exhibit multiple functions during the virus life-cycle.

Keywords:

enzyme catalysis; bimolecular reactions; molecular dynamics; HIV-1 protease; near attack conformations; enzyme specificity

1. Introduction

According to Pauling [1], rate enhancement due to enzyme catalysis stems from the tighter binding of the transition state (TS) as compared to the ground state (GS) of a substrate undergoing chemical reaction. This is thermodynamically equivalent to a negative activation free energy difference (

Δ Δ

G

^{‡}

) between the catalyzed (

Δ

G

^{‡ c}

) and the uncatalyzed (

Δ

G

^{‡ u}

) reaction where:

Δ Δ G^{‡} = Δ G^{‡ c} - Δ G^{‡ u} .

(1)

Activation rates are describable within a generalized transition state theory (TST) framework [2] which extends the original formulation of TST [3] such that the rate of a chemical reaction is defined as:

k (T) = γ (T) (k_{B} T / h) {(C^{0})}^{1 - n} e x p [- Δ G^{‡} / R T],

(2)

where T is the temperature, R is the gas constant,

C^{0}

, the standard state concentration of the reactant, n the order of the reaction,

Δ G^{‡}

is the standard-state free energy of activation, and

γ (T)

is the generalized transmission coefficient. Rate enhancement via enzyme catalysis can then be afforded by lowering

Δ G^{‡}

and/or raising

γ (T)

. The molecular origin of rate increase can stem from a number of processes such as quantum mechanical tunneling [4,5,6], non-equilibrium dynamics [7,8,9,10,11,12], changes in substrate flexibility between the TS and the GS [13,14,15], preorganization of electrostatic interactions that favor formation of the TS [16,17,18,19,20,21,22] and the thermodynamic stabilization of near attack conformations (NACs)—GS conformations that lie on the transition path closer to the TS [13,23,24,25,26,27,28,29,30].

The role of NACs in catalyzing enzymatic reactions has recently been examined for nucleophilic bimolecular reactions [31], specifically enzyme-catalyzed peptide hydrolysis by HIV-1 protease. The catalyzed reaction proceeds via a likely general-acid/general base (GA/GB) mechanism [32] that forms a gem-diol intermediate in a two-step process. The first step constitutes the largest energy barrier [33] and involves nucleophilic water attack of the lytic peptide bond carbon atom. This is facilitated by hydrogen bonding between the monoprotonated D25 residue in the catalytic aspartic dyad (D25/D25

^{'}

) with the adjacent carbonyl oxygen and between D25

^{'}

and the nucleophilic water (Figure 1). Thus, this step may make use of NACs along the pathway to form a primary transition state (TS). NAC formation was previously studied by developing an all-atom explicit solvent molecular dynamics simulation framework [31] in conjunction with the Bürgi–Dunitz theory [34,35,36,37]. This enables identification of NACs in terms of a nucleophilic attack distance threshold (

d_{a}

≤ 3.2 Å) and a narrow attack angle range (100° ≤

α

≤ 110°) with respect to the relevant carbonyl group (Figure 1). The activation barrier (

Δ G^{‡}

) can then be separated into NAC (

Δ G_{N}^{‡}

) and non-NAC (

Δ G_{n N}^{‡}

) components:

Δ G^{‡} = Δ G_{N}^{‡} + Δ G_{n N}^{‡} .

(3)

The catalytic contribution of any component (X) of the activation barrier is then:

Δ Δ G_{X}^{‡} = Δ G_{X}^{‡ c} - Δ G_{X}^{‡ u} .

(4)

Multiple sequence specificity, high selectivity, and finely-tuned rate variation are significant phenomena across biochemical regulation processes and peptide hydrolysis by HIV-1 protease is an excellent example of this. HIV-1 Gag and GagPol precursor polyproteins consist of multiple protein domains sequentially connected via peptide bonds at their inter-protein junctions. Gag consists of matrix (MA), capsid (CA), spacer peptide 1 (SP1), nucleocapsid (NC), spacer peptide 2 (SP2), and protein p6. Additionally, GagPol is synthesized via a -1 frameshift with a ratio of 1:20 compared to Gag [38], which allows continued sequential translation after NC into a trans-frame-region (TFR) and then precursor Pol that contains the viral protease (PR), reverse-transcriptase (RT and RH), and integrase (IN). The viral protease recognizes and cleaves multiple sequences within Gag and GagPol [39], notably at junctions: MA-CA, CA-SP1, SP1-NC, NC-SP2, SP2-p6, TFR-PR, PR-RT, RT-RH, and RH-IN.

Turnover number,

k_{cat}

, has been experimentally studied widely for HIV-1 protease for various substrates that constitute these sequences—and shows a well-defined rate ordering that ranges over 100-fold from slowest to fastest [39]. Nonetheless, the molecular origin of such differences remains not completely understood.

It has been found that for the enzyme reaction catalyzed by HIV-1 protease bound to the SP1-NC (alternatively, termed p2-NC) octapeptide cleavage substrate and a catalytic water molecule [31] that NACs do not contribute to rate enhancement (

Δ Δ G_{N}^{‡ c}

∼ 0 kcal/mol), whilst non-NAC contributions account for (

Δ Δ G_{n N}^{‡ c} \sim

−15 kcal/mol). As transmission effects are minor (up to

γ \sim

10

^{3}

), electrostatic preorganization of the active site is likely the major component of rate enhancement in HIV-1 protease at this cleavage junction.

Nonetheless, the role of NAC thermodynamics in affecting catalytic rate changes amongst differentially recognized substrates has not hitherto been examined. NACs do pose a small but significant barrier in both the catalyzed and uncatalyzed reactions for the SP1-NC cleavage junction [31]. Therefore, it is hypothesized that by exhibiting variation across different recognized substrates NACs may play some role in controlling enzyme specificity through changes in

k_{cat}

. This hypothesis is evaluated here by comparing experimentally determined and estimated

Δ G^{‡}

with computed

Δ G_{N}^{‡}

and derived

Δ G_{n N}^{‡}

values across the range of nine above-mentioned cleavage junctions with varying

k_{cat}

, through a previously established computational approach that makes use of explicit solvent molecular dynamics simulations to compute mole fractions of NAC-formation during peptide hydrolysis in both the enzyme-bound and enzyme-free substrate systems [31].

2. Results

2.1. Differential Nucleophilic Water Binding

Computation of the free energy of NAC-formation from a ground state (GS) first requires a suitable definition of the GS for the nucleophilic water molecule. In order to define this, the distance (

d_{n}

) of the nearest water molecule to the catalytic center (Figure 1) was calculated across the ensemble of MD simulations for each of the enzyme-bound substrate systems.

The density distributions of

d_{n}

reveal mostly two distinct bell-shaped regions of non-zero density for each system, corresponding to a state where a nucleophilic water molecule is bound (NWB) in the catalytic site and to when a nucleophilic water molecule is unbound (NWU). The latter density peak is due to the presence of a previously characterized [31] structural non-nucleophilic water molecule that becomes the nearest water molecule to the catalytic center when the nucleophilic water molecule leaves the catalytic site (Figure 2). These two regions are sharply partitioned by a region of near-zero density (dashed black line) for all systems. Thus, a nearest water density description within such a partition threshold (

d_{n}^{p}

) is a suitable definition of the GS for a nucleophilic water molecule in each enzyme–substrate complex (Table 1).

However, the profiles of the distributions for different systems do vary. The MA-CA, CA-SP1, SP1-NC, NC-SP2, and RH-IN distributions share a similar profile: the NWB and NWU states are characterized by a single major peak between ∼1.5–3.5 Å and a single minor peak between ∼5–6.5 Å, respectively. The TFR-PR and RT-RH distributions exhibit a single broad peak at 2.7 Å and 3.1 Å respectively corresponding to the NWB state and one of similar density at 6.1 Å and 5.8 Å respectively, corresponding to NWU. The SP2-p6 system exhibits a single large NWB-peak at 2.4 Å but only trace densities of the NWU state—nonetheless, these constitute two very small peaks at 5.8 Å and 7.4 Å. Thus, for the SP2-p6 system, the nucleophilic water molecule almost never escapes the catalytic site, whilst there are also rare instances when there is neither a water molecule in the GS nor a structural water molecule between the ligand and the protease. Finally, the PR-RT system is the only one for which the NWU state has a significantly higher density than the NWB state. A minor NWB-peak is exhibited at 3.4 Å with an additional trace density peak at 1.8 Å. A much larger peak corresponding to the NWU state is exhibited at 6.3 Å.

Three distinct thermodynamic groups emerge (Table 1) for the free energy of nucleophilic water binding (

Δ

G

_{n w b}

). For the MA-CA, SP1-NC, NC-SP2, TFR-PR, and RT-RH systems,

Δ

G

_{n w b}

ranges between −0.1 to −1.1 kcal/mol. Despite the error of the calculation being relatively small in these cases (≤0.3 kcal/mol), these values are not significantly differentiable from thermal noise,

k_{b} T \sim

0.6 kcal/mol at 300 K. A second group corresponds to significantly attractive binding thermodynamics, consisting of CA-SP1 (−1.9 ± 0.5 kcal/mol), SP2-p6 (−3.7 ± 1.2 kcal/mol) and RH-IN (−2.2 ± 0.7 kcal/mol). These systems exhibit favorable binding beyond thermal noise. By contrast, the PR-RT system alone exhibits unfavorable binding association beyond thermal noise (2.0 ± 0.3 kcal/mol).

However, care has to be taken when interpreting these binding free energies quantitatively, especially those with significant favorable or unfavorable values. This is because the frequency of reversible transitions between bound and unbound states varies across the system, implying that an approximate equilibrium ensemble may not be exhibited for every system. Therefore, a simple mole fraction of each state may not be a sufficient descriptor of the free energy of water binding. For example, the favorable

Δ

G

_{n w b}

of CA-SP1, SP2-p6 and RH-IN systems are likely to be overestimated as unbinding of the nucleophilic water molecule is rare even with the 10 μs of aggregate sampling performed here. This is indicative of a significantly higher kinetic barrier for water dissociation, but unlikely to the extent that the thermodynamics would be unfavorable. Similarly, the unfavorable

Δ

G

_{n w b}

of the PR-RT system is also likely to be overestimated; rapid dissociation of the water molecule is exhibited during equilibration, thus production simulations are initiated in a water unbound state and only rare events of subsequent water binding are observed.

Nonetheless, despite these limitations, the partitioning of the nucleophilic water states into unbound and bound remains well-defined, thus the bound sub-ensemble of each system provides an accurate representation of the GS from which subsequent mole fractions of desired states, including NACs, can be computed.

Similarly, the GS of the enzyme-free substrate systems could also be characterized in terms of a distance metric (

d_{o}

)—in this case, the distance of the nearest water molecule to the midpoint of the lytic peptide bond. The density distributions of

d_{o}

expectedly reveal a single-peaked bell-shaped distribution between 3.6–4.2 Å for all systems decaying rapidly and with near-zero density (

ρ (d_{o}) \leq

0.001) beyond 6 Å (Figure 3). A threshold of

d_{o}^{p}

= 6 Å was therefore set to partition bound from unbound regions—therefore, in the enzyme-free substrate systems, it can be considered that there is always a nucleophilic water molecule proximal to the lytic peptide bond and thus effectively ‘bound’ in the GS.

2.2. Analysis of the Hydrogen Bond Network

Four distinct hydrogen bonds (hb

_{1}

-hb

_{4}

) have previously been characterized in the interaction between the catalytic aspartic acid dyad, the nucleophilic water molecule, and the peptide substrate [31] (Figure 4A). However, this gives rise to a hydrogen bond network of potentially 16 states, when combined, it was previously shown for the SP1-NC system that only 12 of these were ever populated (Figure 4B)—due to the mutual exclusivity of hb

_{1}

and hb

_{2}

. In order to explore the role of hydrogen bonding across the various peptide systems, the population density distribution of each putative state within the hydrogen bond network was computed for the NAC state and the ground state (GS) in general and compared with the network in the unbound state (Figure 4C). Population densities are reported in terms of the potential of mean force (G in kcal/mol) derived from the relative mole fractions of each hydrogen bond state, normalized with respect to the density of the given conformation in each system—thus the row vectors in each system sum to unity in terms of population density.

Only the same 12 hydrogen bond states as reported before [31] are ever populated across all nine systems—therefore, mutual exclusivity of hb

_{1}

and hb

_{2}

is preserved across all systems. For all systems, the most densely populated hydrogen bond states when the nucleophilic water molecule is unbound are states 1 (no hydrogen bond) and 9 (only the inter-aspartyl hydrogen bond—hb

_{2}

). By comparison, hydrogen bond state 5, involving hb

_{1}

, between protonated D25 and the peptide bond carbonyl, is less densely populated in the absence of a bound water molecule. As expected, states involving hydrogen bonding with a bound water molecule are not exhibited except rare events involving hb

_{2}

- or hb

_{3}

-type bonding when the water molecule is at the periphery of the binding site.

The principal change in the hydrogen bond distribution when going from the unbound to the GS is a substantial shift in density towards state 2 (between water and D25

^{'}

—only hb

_{3}

) and state 10 (hb

_{2}

and hb

_{3}

)—therefore, the additional presence of hb

_{3}

compared to the unbound state. However, other hydrogen bond states are also populated, albeit less densely, in the GS. In particular, there is a notable shift towards states containing hb

_{1}

—for example, states 5–8. There is qualitatively little difference in hydrogen bond distribution between the NAC and GS—the predominant shift with respect to the unbound state is still towards states 2 and 10 as well as a range of other less densely populated states.

There is a lack of hydrogen bond state density for the PR-RT system compared to the other systems. This may be due to the fact that the overall population density of the bound state is very small and therefore due to improper sampling of the hydrogen bond distribution.

Overall, hydrogen bond analysis reveals that NACs do not exhibit a dissimilar hydrogen bond distribution to the GS but the GS in general does promote additional adoption of states involved in the general acid/general base (GA/GB) mechanism for catalysis (specifically hb

_{1}

and hb

_{3}

) as compared to the unbound state. Thus, nucleophilic water binding into the catalytic site—by increasing the probability of forming hb

_{3}

in turn promotes formation of hb

_{1}

, priming the water for successful nucleophilic attack.

2.3. Thermodynamic Decomposition of Activation Free Energy Contributions

The free energy of NAC formation was calculated for the enzyme-bound (

Δ

G

_{N}^{‡ c}

) and enzyme-free (

Δ

G

_{N}^{‡ u}

) substrate systems by computing the mole fraction of NACs compared to the GS in each case. All systems exhibit a relatively small enzyme-bound NAC free energy (

Δ

G

_{N}^{‡ c}

) but one which is significant over thermal noise. A range of 2.7 kcal/mol is exhibited, from SP1-NC, the most unfavorable (

Δ

G

_{N}^{‡ c}

= 4.6 ± 0.2 kcal/mol—as reported previously [31]) to TFR-PR, the least unfavorable (

Δ

G

_{N}^{‡ c}

= 1.9 ± 0.2 kcal/mol). By comparison, the enzyme-free NAC free energies (

Δ

G

_{N}^{‡ u}

) are of similar overall magnitude but exhibit a smaller range of 1.0 kcal/mol with SP2-p6 the most unfavorable (

Δ

G

_{N}^{‡ u}

= 4.8 ± 0.3 kcal/mol) and MA-CA the least unfavorable (

Δ

G

_{N}^{‡ u}

= 3.8 ± 0.1 kcal/mol).

In order to determine the sensitivity of these results to the distance threshold that partitions the bound from unbound nucleophilic water molecule states,

Δ

G

_{N}^{‡ c}

and

Δ

G

_{N}^{‡ u}

were computed across a range of both

d_{n}^{p}

and

d_{o}^{p}

for each of the enzyme-bound and enzyme-free systems, respectively (Figure 5). All enzyme-bound systems exhibit robust insensitivity to variation in

d_{n}^{p}

between 3.8–5.0 Å. Conversely, all enzyme-free systems expectedly exhibit significant sensitivity to variation in

d_{o}^{p}

in a range of small

d_{o}^{p}

values (3.0–4.5 Å), but then plateau to robust insensitivity for

d_{o}^{p}

> 4.5 Å. This further validates the choices of

d_{n}^{p}

and

d_{o}^{p}

for the various systems.

The difference in NAC free energy (

Δ Δ

G

_{N}^{‡}

) between the catalyzed and uncatalyzed systems (Table 2) determines the contribution of NACs to the catalytic effect. These results suggest that NACs contribute only marginally to catalysis in three systems: MA-CA (−1.7 ± 0.2 kcal/mol), SP2-p6 (−1.7 ± 0.4 kcal/mol), and TFR-PR (−2.2 ± 0.3 kcal/mol). In the other systems, the NAC contribution is negligible and/or indistinguishable from thermal noise.

3. Discussion

The overall activation energy barrier for uncatalyzed peptide hydrolysis—based on an experimentally determined half-life of ∼500 years [42]—is estimated at

Δ

G

^{‡ u}

∼ 30 kcal/mol [31]. Furthermore, as this is likely to be independent of the sequence of flanking amino acids, this value was assigned to all uncatalyzed cleavage substrates in the current study. HIV-1 protease thus exhibits enormous catalytic power, lowering this barrier to

Δ

G

^{‡ c}

∼ 15–18 kcal/mol (a significant reduction of

| Δ Δ

G

^{‡} |

∼ 12–15 kcal/mol) across the range of substrates it cleaves (Table 2). The results reported here support previous studies [31] and further reinforce the idea that NACs do not contribute significantly to the overall catalytic effect (

| Δ Δ

G

_{N}^{‡} |

∼ 0–2 kcal/mol). Rather, non-NAC effects—such as electrostatic preorganization, which is known to play a powerful role in enzyme catalysis [16,17,18,19,20,21,22], are thus likely the main source of the catalytic power in HIV-1 protease, contributing a reduction of

| Δ Δ

G

_{n N}^{‡} |

∼ 10–15 kcal/mol (Table 2). This range is likely to originate from the differing electrostatic properties of the various cleaved peptide sequences and their subsequent differences on the preorganization of the bound enzyme-substrate complex.

The NAC free energy in the uncatalyzed reaction (

Δ

G

_{N}^{‡ u}

) varies by only 1.0 kcal/mol in the calculations reported here. Therefore, when taking into account thermal noise (∼0.6 kcal/mol) all systems can be interpreted to have near-identical uncatalyzed NAC free energies, characterized by a mean value of <

Δ

G

_{N}^{‡ u}

> = 4.3 ± 0.3 kcal/mol. Then, assuming the overall uncatalyzed reaction barrier is similar across different sequences, the uncatalyzed non-NAC contribution (

Δ

G

_{n N}^{‡ u}

) is also nearly identical across the different systems, with a mean value of <

Δ

G

_{n N}^{‡ u}

> = 25.7 ± 0.3 kcal/mol.

However, this is not the case in the catalyzed systems. Experimentally, variation of up to three orders of magnitude occurs across

k_{cat}

[39] for the substrates of HIV-1 protease—from which the above-mentioned variation in catalyzed activation energy barrier is estimated (

Δ

G

^{‡ c}

(max)–

Δ

G

^{‡ c}

(min) ∼ 3 kcal/mol). This is similar to the range of variation observed in the computed

Δ

G

_{N}^{‡ c}

. If non-NAC contributions were similar to each other in the catalyzed systems, then variation in the overall barrier would also be correlated to variation in

Δ

G

_{N}^{‡ c}

. This would be consistent with NACs exhibiting a ‘correlative’ mode of fine-tuning the catalyzed reaction. However, decomposition into NAC (

Δ

G

_{N}^{‡ c}

) and non-NAC (

Δ

G

_{n N}^{‡ c}

) components shows that both components exhibit a similar range of variation to the overall barrier and

Δ

G

^{‡ c}

does not correlate with

Δ

G

_{N}^{‡ c}

, exhibiting a correlation coefficient of only

R^{2}

= 0.01 (Figure 6A). This implies sequence-dependent variation of both the NAC and non-NAC contributions across the various substrates in the catalyzed reaction and suggests that differences in activation barrier from cleavage sequence changes are due to the combined and significant variations of both components.

Nonetheless, a subtle role for NACs may still emerge when considering the variation in both components. In the context of the comparably large reductions in energy barrier that non-NAC effects are capable of inducing, their ability to fine-tune such reductions to match the optimally required catalyzed rates across the various substrates may be limited. This is highlighted by comparing non-NAC contributions to the activation barrier with each other (Figure 6B). All the peptide sequences studied here may be grouped into three distinct non-NAC activation energy bands with means at 15.3, 13.3 and 11.2 kcal/mol—where significant gaps (∼2 kcal/mol) exist between the different bands—therefore, no overlap exists when taking into account thermal noise. These bands appear to be too coarsely-tuned to achieve the required differences between the various substrates, whilst the ordering of substrates by non-NAC contributions is not consistent with that of the overall catalyzed energy barriers. Given these non-NAC contributions, invariant NAC contributions in the catalyzed systems, as estimated for the uncatalyzed systems, would resolve neither of these two issues.

Thus, minor variations in the catalyzed NAC free energies (

Δ

G

_{N}^{‡ c}

) exhibit a ‘complementary’ mode of fine-tuning both the differences between and the absolute values of the overall energy barriers to match the optimal requirements for correctly ordered maturation. Furthermore, the existence of a small thermodynamic barrier by NACs enables the catalyzed barrier to reach the virologically relevant region (Figure 6B-yellow band) with the given non-NAC contributions—and thus to meet the overall required timescale for virion maturation.

Why the HIV-1 system has evolved multiple cleavage sequences that result in sometimes very similar

k_{cat}

is not fully understood. One plausible explanation could be that because sequence change can have an effect on both

k_{cat}

and

K_{M}

and the reaction kinetics of polyprotein cleavage [43,44] is directed by the combination of both, different sequences could control each of these parameters differentially. However, several cleavage sequences recognized by HIV-1 protease both have similar

k_{cat}

and

K_{M}

to each other and are therefore not consistent with the above-mentioned explanation. For example, this applies to the MA-CA (SGNY-PIVQ), SP1-NC (ATIM-MQRG), TFR-PR (SFNF-PQIT), and PR-RT (TLNF-PISP) junctions [39], yet these junctions consist of significantly varying amino acid sequences.

Another possible explanation could be because amino acid sequences can face selection pressure for multiple biomolecular functions not restricted to the given enzymatic reaction. In the case of polyprotein cleavage by HIV-1 protease, each cleavage site corresponds to the C- and N-termini of its adjunct proteins—several of which are involved in structural and functional interactions beyond cleavage alone. This may impose restrictions on which amino acids are selected in the cleavage sequence—emergent sequences being able to fulfill both the required enzymatic rate as well as the other biomolecular functions. In particular, electrostatic preorganization effects would be highly susceptible to the electrostatic profile of the substrate. Sequence changes selected by other functions may thus correspond to abrupt changes in the enzymatic barrier that are modulated by smaller NAC changes. These large reductions due to non-NAC contributions combined with the fine tuning conferred by NACs may help to fulfill the required enzymatic rate using a different sequence.

Comparing the SP1-NC and TFR-PR substrates serves as a good example to illustrate this. The SP1-NC cleavage site is one of the fastest cleaved sites by HIV-1 protease, sharing the same specificity rate constant as the TFR-PR system [39]. However, it exhibits a non-NAC free energy, which is 2.7 kcal/mol lower than the TFR-PR system (Table 2). The nucleocapsid (NC) protein is essential for condensation of viral RNA [45]—the positively charged arginine in its N-terminus region (MQRG-) is likely to aid non-specific binding to the negatively charged RNA. As part of the SP1-NC cleavage sequence, this residue may also alter the electrostatic balance in the HIV-1 protease active site, contributing to the computed low non-NAC barrier exhibited for this junction. This in turn would lead to mistimed fast cleavage without the additional barrier afforded by the NAC component (4.6 kcal/mol) that restores cleavage rate to the required value. Conversely, the TFR-PR substrate achieves a similar

k_{cat}

with a different sequence, selected in part due to the structural requirements of the HIV-1 protease N-terminus in forming an interdigitated dimer interface [46] as well as a self-associated precursor protease during intramolecular autocatalysis [47]. This less positive sequence may contribute to the larger non-NAC energy barrier with respect to SP1-NC. However, the same overall energy barrier can still be achieved by this sequence by allowing for a significantly reduced NAC contribution with respect to the enzyme-free system (

Δ Δ

G

_{N}^{‡} \sim

−2 kcal/mol).

The degree of decoupling between NAC and non-NAC contributions upon single amino-acid mutations has not been investigated here. It is unknown whether a given amino acid mutation in the cleavage region would affect both NAC and non-NAC contributions simultaneously. If so, this would give rise to a more complex picture of fine-tuning where multiple mutations whose sum of NAC and non-NAC contributions kept the desired rate constant. However, single mutations may exist that alter predominantly either NAC or non-NAC contributions. For example, some hydrophobic-hydrophobic mutations may affect the activation barrier more through the NAC rather than the non-NAC mode, whilst hydrophilic-charged changes may alter the non-NAC barrier but not significantly affect catalytic water entry or NAC formation. Both scenarios could easily be accommodated by the fact that

k_{cat}

depends not only on the immediate residues that juxtapose the lytic bond, but on at least the P4-P4

^{'}

subsites that constitute the octapeptide cleavage sequence [41].

Future studies that elucidate this might therefore link the role of NAC and non-NAC contributions to the step-wise evolution of cleavage junctions, where sequences that were not initially lytic junctions became so by mutations that first contributed large discrete non-NAC reductions in the energy barrier and whose rate constants were then fine-tuned by subsequent mutations that altered the NAC contribution. This decoupling might also in part account for the role of compensatory mutations [48,49] that restore viral fitness when antiretroviral therapy causes drug resistance mutations deleterious to fitness [50] to emerge.

4. Materials and Methods

The role of NAC formation was investigated by performing and comparing ensembles of all-atom explicit solvent molecular dynamics simulations of HIV-1 protease bound to each of a set of eight recognized octapeptide substrates (enzyme-bound) representing inter-protein cleavage junctions (Table 1) in addition to one octapeptide substrate (SP1-NC) that was already previously simulated and reported [31]. These nine enzyme-bound systems were further compared to corresponding simulations of the apo-ligand (enzyme-free) systems in explicit solvent.

4.1. Initial Preparation

Initial structures were taken from the 1KJ4, 1F7A, 1TSU, 1KJF, 1KJ4, 1KJ4, 1KJG, and 1KJH crystal structures of peptide-bound HIV-1 protease complexes for MA-CA, CA-SP1, NC-SP2, SP2-p6, TFR-PR, PR-RT, RT-RH, and RH-IN junctions, respectively [51,52,53] and, when required, mutated to match the corresponding peptide sequences. Crystallographic water molecules were preserved. An additional water molecule was inserted between the lytic peptide bond and the catalytic dyad as is expected in the general acid/general base (GA/GB) cleavage mechanism [32]. The inactive catalytic dyad D25N was converted into catalytically active D25 form with a monoprotonated state [54,55]. Hydrogen atoms were added, the systems were electrically neutralized (0.15 M NaCl) and explicitly solvated with TIP3P water [56] and topologies were generated using the Leap module of AMBER 14 [57]. The standard AMBER force field for bioorganic systems (ff03) [58] was used to describe all protein parameters. All equilibration and production simulations were performed using ACEMD [59].

4.2. Molecular Dynamics Equilibration and Production Simulation Protocol

The molecular dynamics equilibration and simulation protocol for all respective enzyme-bound and enzyme-free systems were identical to those previously reported for the SP1-NC system [31] and are fully described therein. Production ensembles of 100 × 100 ns and 10 × 1 μs were generated for each enzyme-bound and corresponding enzyme-free apo-ligand system, respectively, in the NVT ensemble with temperature maintained at 300 K. Coordinate snapshots were generated every 100 ps and 10 ps, respectively. Experimental accuracy of the molecular simulation protocol for the HIV-1 protease has been previously validated using NMR S

^{2}

order parameters [60]. The effects of forcefield variation are small and have been accounted for in a previous study [31].

4.3. Analysis

For the enzyme-bound and enzyme-free systems, the nearest water distance (

d_{n}

and

d_{o}

respectively) was defined as the distance of the oxygen atom WAT:O of the nearest water molecule from the center of the catalytic site (defined as the geometric center between the four atoms: p1:C, p1

^{'}

:N, D25:CG and D25

^{'}

:CG) and the center of the lytic peptide bond (p1:C-p1

^{'}

:N), respectively (Figure 1). The ground state (GS) was characterized by the nearest water molecule being within an appropriate distance cutoff (

d_{n}^{p}

and

d_{o}^{p}

) from the respective centers in the enzyme-bound and enzyme-free systems. These thresholds were chosen after analysis of the density distributions for nucleophilic water binding (see Results). The nucelophile attack distance (

d_{a}

) was defined as the distance between the carbon atom of the lytic peptide bond and the oxygen atom of the nearest water molecule (WAT

_{n u c}

:O

_{n u c}

-p1:C). The nucelophile attack angle (

α

) was defined as the angle between the

d_{a}

vector and the vector corresponding to the carbonyl bond adjacent to the lytic peptide bond (WAT

_{n u c}

:O

_{n u c}

-p1:C-p1:O). NACs were characterized in terms of Bürgi–Dunitz criteria corresponding to an angle range of 100° ≤

α

≤ 110° and distance threshold of

d_{a}

≤ 3.2 Å. A hydrogen bond network was analyzed in the catalytic site of the enzyme-bound systems—characterized by cooperative combinations of a set of four distinct hydrogen bonds

h b

= { hb

_{1}

, hb

_{2}

, hb

_{3}

, hb

_{4}

} and resulting in a set of 12 non-zero density hydrogen bond states s = {1,…,12 } (Figure 4). The threshold for a hydrogen bond was a donor–acceptor distance ≤3.5 Å and donor–hydrogen-acceptor angle of ≥150°.

Probability densities (

ρ (d_{n})

and

ρ (d_{o})

) were calculated by binning ensemble data along the corresponding reaction coordinate space, respectively (

d_{n}

and

d_{o}

) using kernel density estimation with an Epanechnikov kernel and bandwidth parameter h = 0.75. The mole fraction (

Γ_{M}

) of a given macrostate (M) was computed as

Γ_{M} = m / N

, where m is the number of datapoints within the above-mentioned boundary partitions of the macrostate in the corresponding reaction coordinate space, and N, the total number of datapoints in the given ensemble. The potential of mean force for a given state (

G_{M}

) was calculated as

G_{M} = - k_{B} T l n (Γ_{M})

. Free energy differences (

Δ

G) between various macrostates were calculated from the ratios of the corresponding mole fractions according to

Δ G = - k_{B} T l n (Γ_{M 2} / Γ_{M 1})

, where

Γ_{M 1}

and

Γ_{M 2}

are mole fractions of any given states

M 1

and

M 2

, respectively. Hydrogen bond state mole fractions (

Γ_{s}

) within a given macrostate were calculated as

Γ_{s} = n_{s} / m

, where

n_{s}

is the number of datapoints within the criteria for each hydrogen bond state s within the subset of datapoints (m) comprising state M. Care was taken to integrate the possible degenerate configurations pertaining to each distinct hydrogen bond that arise from structural symmetries imposed by molecular rotation—as previously reported [31].

The complete ensemble for each system was used to perform the hydrogen bond (Figure 4) and free energy variation (Figure 5) analyses. Reported free energy values, as well as their errors (Table 1 and Table 2), were calculated by partitioning each ensemble into five subsets in both the enzyme-bound and enzyme-free systems; mole fractions were calculated independently in each subset and averaged to yield means and standard deviations. The only exception to this was the calculation of the NAC free energy for the enzyme-bound PR-RT system for which very few absolute counts were exhibited—thus the ensemble was divided into only two subsets to compute means and errors here.

Estimates for the overall catalyzed energy barrier (

Δ

G

^{‡ c}

) for all substrates except NC-SP2, were made based on converting experimental

k_{cat}

values [39] using Equation (2) and assuming

γ

= 1 at 300 K. NC-SP2 was not measured in [39], but was measured in a similar study at 310 K [41], as were several other cleavage junctions at this temperature [40]. Therefore,

k_{cat}

for NC-SP2 at 300 K was estimated by multiplying the

k_{cat}

ratio of NC-SP2/CA-SP1 from [40,41] to the

k_{cat}

value of CA-SP1 in the main set [39] from which

Δ

G

^{‡ c}

was subsequently calculated.

5. Conclusions

Computational studies have previously revealed the existence of a small but significant thermodynamic barrier contributed by the formation of near attack conformations (NACs) that lie on the transition path of the peptide hydrolysis reaction catalyzed by HIV-1 protease [31]. However, NACs were also found there to confer no catalytic effect because the thermodynamic barrier to their formation was equivalent in the uncatalyzed reaction. In the current study, the role of NACs has been further explored across a range of substrates cleaved by the HIV-1 protease, using the same all-atom ensemble molecular dynamics simulation approach coupled to Bürgi–Dunitz theory for characterizing nucleophilic attack of a water molecule on the lytic peptide bond.

This study supports the previous findings that NACs play little or no role in the catalytic effect induced by HIV-1 protease—catalytic barrier reduction is thus entirely dominated by non-NAC contributions. Nonetheless, the functional role of NACs may emerge when considering them together with non-NAC contributions as well as the virological requirements for the ordering of cleavage rates across the different substrates. For HIV-1, the kinetic order of cleavage is tightly regulated [61] to achieve correct architectural reorganization into a mature virion [62] within the physiologically required timescale for infection [43,44].

The findings reported here suggest that NAC contributions, whilst small, are also largely invariant across multiple substrates in the uncatalyzed reaction. Similarly, non-NAC contributions, even though large, are also relatively invariant because the uncatalyzed barrier may be independent of peptide sequence. The catalyzed reactions, however, present a different picture because a range of variation (∼3 kcal/mol) exists in the overall energy barrier and both non-NAC and NAC contributions are shown to vary by ∼3 kcal/mol across the range of substrates studied. Although non-NAC contributions dominate the reduction in the overall barrier, they appear to be grouped in discrete energy bands that are too coarsely separated to account for either the small variations or the correct ordering of the overall barriers across the substrates. The small variation in NAC contributions thus may constitute a complementary fine-tuning mechanism to rectify both of these issues.

Biological systems face evolutionary selection pressure on multiple fronts and viruses, in particular, are well-known for their parsimony in achieving macromolecular functionality. Together, such a combination of coarse-reduction by non-NAC and complementary fine-tuning by NAC contributions, respectively, may thus provide a way for controlling required enzymatic rates, whilst under sequence selection pressure for other biomolecular functions. There is likely a trade-off in selecting amino-acid sequences for these functions and those required to directly optimize the given enzymatic reaction. This selection pressure may make it difficult to fine-tune the necessary differences in enzymatic rate by solely using changes in non-NAC contributions. NACs offer an alternative mode to accommodate such differences whilst modulating the rate to meet that required biologically for optimal processing at the corresponding site.

Establishing NAC effects have proven computationally challenging because rigorous characterization at the limit of a classical forcefield requires sufficient sampling to observe ground state (GS) motions towards the transition state. This has challenged the exploration of relative NAC effects in similar but non-identical systems. Here, sufficient sampling of the GS is made possible for most systems by reversible water entry to the active-site with around 10 μs of aggregate sampling per enzyme-bound system. Other systems of interest for exploring NAC contributions may therefore be accessible with current computational power.

Furthermore, future studies using quantum mechanical/molecular mechanics (QM/MM) approaches [22] may test the accuracy of the classical observations exhibited here in more detail and also establish a quantum analog of the nucleophilic attack criteria given by Bürgi–Dunitz theory, from which NAC contributions could be better decoupled from subsequent steps along the reaction pathway.

Nonetheless, the observation here that, for the HIV-1 protease system, relative enzyme specificity is in part directed by fine-tuned nucleophilic water NAC contributions to the catalyzed reaction barrier, implies that similar NAC-specificity effects may exist in other enzyme–substrate reactions and suggests a novel mechanism for fine-tuning and controlling such reactions.

Funding

The author acknowledges support from amfAR Mathilde Krim Fellowship in Basic Biomedical Research number 108680, from the Volkswagen Foundation ‘Experiment! Funding Initiative’ Grant No. 93874 and from the Klaus Tschira Foundation.

Acknowledgments

The author thanks Gianni De Fabritiis for previous use of the GPUGRID.net infrastructure as well as Peter Coveney, Natalia Gabrielli, Andreas Meyerhans, Gilles Mirambeau and Sébastien Lyonnais for valuable discussions.

Conflicts of Interest

The author declares no conflict of interest.

References

Pauling, L. Molecular architecture and biological reactions. Chem. Eng. News 1946, 24, 1375–1377. [Google Scholar] [CrossRef] [Green Version]
Garcia-Viloca, M.; Gao, J.; Karplus, M.; Truhlar, D. How enzymes work: Analysis by modern rate theory and computer simulations. Science 2004, 303, 186–195. [Google Scholar] [CrossRef] [PubMed]
Eyring, H.; Stearn, A. The application of the theory of absolute reaction rates to proteins. Chem. Rev. 1939, 24, 253–270. [Google Scholar] [CrossRef]
Cui, Q.; Karplus, M. Quantum mechanics/molecular mechanics studies of triosephosphate isomerase-catalyzed reactions: Effect of geometry and tunneling on proton-transfer rate constants. J. Am. Chem. Soc. 2002, 124, 3093–3124. [Google Scholar] [CrossRef]
Masgrau, L.; Roujeinikova, A.; Johannissen, L.; Hothi, P.; Basran, J.; Ranaghan, K.; Mulholland, A.; Sutcliffe, M.; Scrutton, N.; Leys, D. Atomic description of an enzyme reaction dominated by proton tunneling. Science 2006, 312, 237–241. [Google Scholar] [CrossRef] [Green Version]
Gao, J.; Truhlar, D. Quantum mechanical methods for enzyme kinetics. Ann. Rev. Phys. Chem. 2002, 53, 467–505. [Google Scholar] [CrossRef] [Green Version]
Careri, G.; Fasella, P.; Gratton, E.; Jencks, W. Statistical time events in enzymes: A physical assessment. Crit. Rev. Biochem. Mol. Biol. 1975, 3, 141–164. [Google Scholar] [CrossRef]
Welch, G.; Somogyi, B.; Damjanovich, S. The role of protein fluctuations in enzyme action: A review. Progr. Biophys. Mol. Biol. 1982, 39, 109. [Google Scholar] [CrossRef]
Olsson, M.; Parson, W.; Warshel, A. Dynamical contributions to enzyme catalysis: Critical tests of a popular hypothesis. Chem. Rev. 2006, 106, 1737–1756. [Google Scholar] [CrossRef]
Eisenmesser, E.; Millet, O.; Labeikovsky, W.; Korzhnev, D.; Wolf-Watz, M.; Bosco, D.; Skalicky, J.; Kay, L.; Kern, D. Intrinsic dynamics of an enzyme underlies catalysis. Nature 2005, 438, 117–121. [Google Scholar] [CrossRef]
Boehr, D.; McElheny, D.; Dyson, H.; Wright, P. The dynamic energy landscape of dihydrofolate reductase catalysis. Science 2006, 313, 1638–1642. [Google Scholar] [CrossRef]
Henzler-Wildman, K.A.; Thai, V.; Lei, M.; Ott, M.; Wolf-Watz, M.; Fenn, T.; Pozharski, E.; Wilson, M.A.; Petsko, G.A.; Karplus, M.; et al. Intrinsic motions along an enzymatic reaction trajectory. Nature 2007, 450, 838–844. [Google Scholar] [CrossRef]
Bruice, T. A view at the millennium: The efficiency of enzymatic catalysis. Accounts. Chem. Res. 2002, 35, 139–148. [Google Scholar] [CrossRef] [PubMed]
Falzone, C.; Wright, P.; Benkovic, S. Dynamics of a flexible loop in dihydrofolate reductase from Escherichia coli and its implication for catalysis. Biochemistry 1994, 33, 439–442. [Google Scholar] [CrossRef] [PubMed]
Osborne, M.; Schnell, J.; Benkovic, S.; Dyson, H.; Wright, P. Backbone dynamics in dihydrofolate reductase complexes: Role of loop flexibility in the catalytic mechanism. Biochemistry 2001, 40, 9846–9859. [Google Scholar] [CrossRef] [PubMed]
Villà, J.; Warshel, A. Energetics and dynamics of enzymatic reactions. J. Phys. Chem. B 2001, 105, 7887–7907. [Google Scholar] [CrossRef]
Warshel, A. Electrostatic origin of the catalytic power of enzymes and the role of preorganized active sites. J. Biol. Chem. 1998, 273, 27035–27038. [Google Scholar] [CrossRef] [Green Version]
Cui, Q.; Elstner, M.; Karplus, M. A theoretical analysis of the proton and hydride transfer in liver alcohol dehydrogenase (LADH). J. Phys. Chem. B 2002, 106, 2721–2740. [Google Scholar] [CrossRef]
Hansson, T.; Nordlund, P.; Åqvist, J. Energetics of nucleophile activation in a protein tyrosine phosphatase. J. Mol. Biol. 1997, 265, 118–127. [Google Scholar] [CrossRef]
Warshel, A.; Sharma, P.; Kato, M.; Xiang, Y.; Liu, H.; Olsson, M. Electrostatic basis for enzyme catalysis. Chem. Rev. 2006, 106, 3210–3235. [Google Scholar] [CrossRef]
Kienhöfer, A.; Kast, P.; Hilvert, D. Selective stabilization of the chorismate mutase transition state by a positively charged hydrogen bond donor. J. Am. Chem. Soc. 2003, 125, 3206–3207. [Google Scholar] [CrossRef] [PubMed]
Lyne, P.; Mulholland, A.; Richards, W. Insights into chorismate mutase catalysis from a combined QM/MM simulation of the enzyme reaction. J. Am. Chem. Soc. 1995, 117, 11345–11350. [Google Scholar] [CrossRef]
Shurki, A.; Štrajbl, M.; Villà, J.; Warshel, A. How much do enzymes really gain by restraining their reacting fragments? J. Am. Chem. Soc. 2002, 124, 4097–4107. [Google Scholar] [CrossRef] [PubMed]
Štrajbl, M.; Shurki, A.; Kato, M.; Warshel, A. Apparent NAC effect in chorismate mutase reflects electrostatic transition state stabilization. J. Am. Chem. Soc. 2003, 125, 10228–10237. [Google Scholar] [CrossRef]
Bruice, T.C.; Lightstone, F. Ground state and transition state contributions to the rates of intramolecular and enzymatic reactions. Acc. Chem. Res. 1999, 32, 127–136. [Google Scholar] [CrossRef]
Bruice, T.C.; Benkovic, S.J. Chemical basis for enzyme catalysis. Biochemistry 2000, 39, 6267–6274. [Google Scholar] [CrossRef]
Schowen, R. How an enzyme surmounts the activation energy barrier. Proc. Natl. Acad. Sci. USA 2003, 100, 11931–11932. [Google Scholar] [CrossRef] [Green Version]
Hur, S.; Bruice, T.C. The near attack conformation approach to the study of the chorismate to prephenate reaction. Proc. Natl. Acad. Sci. USA 2003, 100, 12015–12020. [Google Scholar] [CrossRef] [Green Version]
Hur, S.; Bruice, T.C. The mechanism of catalysis of the chorismate to prephenate reaction by the Escherichia coli mutase enzyme. Proc. Natl. Acad. Sci. USA 2002, 99, 1176–1181. [Google Scholar] [CrossRef] [Green Version]
Hur, S.; Bruice, T.C. Comparison of formation of reactive conformers (NACs) for the Claisen rearrangement of chorismate to prephenate in water and in the E. coli mutase: The efficiency of the enzyme catalysis. J. Am. Chem. Soc. 2003, 125, 5964–5972. [Google Scholar] [CrossRef]
Sadiq, S.K.; Coveney, P.V. Computing the role of near attack conformations in an enzyme-catalyzed nucleophilic bimolecular reaction. J. Chem. Theor. Comput. 2015, 11, 316–324. [Google Scholar] [CrossRef] [PubMed]
Park, H.; Suh, J.; Lee, S. Ab initio studies on the catalytic mechanism of aspartic proteinases: Nucleophilic versus general acid/general base mechanism. J. Am. Chem. Soc. 2000, 122, 3901–3908. [Google Scholar] [CrossRef]
Piana, S.; Bucher, D.; Carloni, P.; Rothlisberger, U. Reaction mechanism of HIV-1 protease by hybrid Car-Parrinello/classical MD simulations. J. Phys. Chem. B 2004, 108, 11139–11149. [Google Scholar] [CrossRef]
Bürgi, H.B.; Dunitz, J.D.; Shefter, E. Geometrical reaction coordinates. II. Nucleophilic addition to a carbonyl group. J. Am. Chem. Soc. 1973, 95, 5065–5067. [Google Scholar] [CrossRef]
Bürgi, H.B.; Lehn, J.M.; Wipff, G. Ab initio study of nucleophilic addition to a carbonyl group. J. Am. Chem. Soc. 1974, 96, 1956–1957. [Google Scholar] [CrossRef]
Dunitz, J.D.; Lehn, J.M.; Wipff, G. Stereochemistry of reaction paths at carbonyl centres. Tetrahedron 1974, 30, 1563–1572. [Google Scholar]
Bürgi, H.B.; Dunitz, J.D.; Shefter, E. Chemical reaction paths. IV. Aspects of O⋯C=O interactions in crystals. Acta. Crystall. Sect. B 1974, 30, 1517–1527. [Google Scholar] [CrossRef]
Shehu-Xhilaga, M.; Crowe, S.M.; Mak, J. Maintenance of the Gag/Gag-Pol ratio is important for human immunodeficiency virus type 1 RNA dimerization and viral infectivity. J. Virol. 2001, 75, 1834–1841. [Google Scholar] [CrossRef] [Green Version]
Maschera, B.; Darby, G.; Palu, G.; Wright, L.L.; Tisdale, M.; Myers, R.; Blair, E.D.; Furfine, E.S. Human immunodeficiency virus mutations in the viral protease that confer resistance to saquinavir increase the dissociation rate constant of the protease-saquinavir complex. J. Biol. Chem. 1996, 271, 33231–33235. [Google Scholar] [CrossRef] [Green Version]
Tözsér, J.; Bláha, I.; Copeland, T.D.; Wondrak, E.M.; Oroszlan, S. Comparison of the HIV-1 and HIV-2 proteinases using oligopeptide substrates representing cleavage sites in Gag and Gag-Pol polyproteins. FEBS Lett. 1991, 281, 77–80. [Google Scholar] [CrossRef] [Green Version]
Fehér, A.; Weber, I.T.; Bagossi, P.; Boross, P.; Mahalingam, B.; Louis, J.M.; Copeland, T.D.; Torshin, I.Y.; Harrison, R.W.; Tözsér, J. Effect of sequence polymorphism and drug resistance on two HIV-1 Gag processing sites. Eur. J. Biochem. 2002, 269, 4114–4120. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Radzicka, A.; Wolfenden, R. Rates of uncatalyzed peptide bond hydrolysis in neutral solution and the transition state affinities of proteases. J. Am. Chem. Soc. 1996, 118, 6105–6109. [Google Scholar] [CrossRef]
Sadiq, S.K.; Könnyü, B.; Müller, V.; Coveney, P.V. Reaction kinetics of catalyzed competitive heteropolymer cleavage. J. Phys. Chem. B 2011, 115, 11017–11027. [Google Scholar] [CrossRef]
Könnyü, B.; Sadiq, S.K.; Turányi, T.; Hírmondó, R.; Müller, B.; Kräusslich, H.G.; Coveney, P.V.; Müller, V. Gag-Pol processing during HIV-1 virion maturation: A systems biology approach. PLoS Comput. Biol. 2013, 9, e1003103. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Mirambeau, G.; Lyonnais, S.; Coulaud, D.; Hameau, L.; Lafosse, S.; Jeusset, J.; Borde, I.; Reboud-Ravaux, M.; Restle, T.; Gorelick, R.J.; et al. HIV-1 protease and reverse transcriptase control the architecture of their nucleocapsid partner. PLoS ONE 2007, 2, e669. [Google Scholar] [CrossRef]
Wlodawer, A.; Miller, M.; Jaskólski, M.; Sathyanarayana, B.K.; Baldwin, E.; Weber, I.T.; Selk, L.M.; Clawson, L.; Schneider, J.; Kent, S.B.H. Conserved folding in retroviral proteases: Crystal structure of a synthetic HIV-1 protease. Science 1989, 245, 616–621. [Google Scholar] [CrossRef]
Sadiq, S.K.; Noé, F.; De Fabritiis, G. Kinetic characterization of the critical step in HIV-1 protease maturation. Proc. Natl. Acad. Sci. USA 2012, 109, 20449–20454. [Google Scholar] [CrossRef] [Green Version]
Nijhuis, M.; Schuurman, R.; de Jong, D.; Erickson, J.; Gustchina, E.; Albert, J.; Schipper, P.; Gulnik, S.; Boucher, C.A.B. Increased fitness of drug resistant HIV-1 protease as a result of acquisition of compensatory mutations during suboptimal therapy. AIDS 1999, 13, 2349–2359. [Google Scholar] [CrossRef]
Mammano, F.; Trouplin, V.; Zennou, V.; Clavel, F. Retracing the evolutionary pathways of human immunodeficiency virus type 1 resistance to protease inhibitors: Virus fitness in the absence and in the presence of drug. J. Virol. 2000, 74, 8524–8531. [Google Scholar] [CrossRef] [Green Version]
Martinez-Picado, J.; Savara, A.V.; Sutton, L.; D’Aquila, R.T. Replicative fitness of protease inhibitor-resistant mutants of human immunodeficiency virus type 1. J. Virol. 1999, 73, 3744–3752. [Google Scholar] [CrossRef] [Green Version]
Prabu-Jeyabalan, M.; Nalivaika, E.; Schiffer, C.A. Substrate shape determines specificity of recognition for HIV-1 protease: Analysis of crystal structures of six substrate complexes. Structure 2002, 10, 369–381. [Google Scholar] [CrossRef]
Prabu-Jeyabalan, M.; Nalivaika, E.; Schiffer, C.A. How does a symmetric dimer recognize an asymmetric substrate? A substrate complex of HIV-1 protease. J. Mol. Biol. 2000, 301, 1207–1220. [Google Scholar] [CrossRef] [PubMed]
Prabu-Jeyabalan, M.; Nalivaika, E.; King, N.M.; Schiffer, C.A. Structural basis for coevolution of a human immunodeficiency virus type 1 nucleocapsid-p1 cleavage site with a V82A drug-resistant mutation in viral protease. J. Virol. 2004, 78, 12446–12454. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Wittayanarakul, K.; Hannongbua, S.; Feig, M. Accurate prediction of protonation state as a prerequisite for reliable MM-PB (GB) SA binding free energy calculations of HIV-1 protease inhibitors. J. Comput. Chem. 2008, 29, 673–685. [Google Scholar] [CrossRef]
Kovalskyy, D.; Dubyna, V.; Mark, A.E.; Korenelyuk, A. A molecular dynamics study of the structural stability of HIV-1 protease under physiological conditions: The role of Na+ ions in stabilizing the active site. Proteins Struct. Funct. Bioinf. 2005, 58, 450–458. [Google Scholar] [CrossRef] [Green Version]
Jorgensen, W.L.; Chandrasekhar, J.; Madura, J.D.; Impey, R.W.; Klein, M.L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983, 79, 926–935. [Google Scholar] [CrossRef]
Case, D.A.; Cheatham, T.E., III; Darden, T.; Gohlke, H.; Luo, R.; Merz, K.M., Jr.; Onufriev, A.; Simmerling, C.; Wang, B.; Woods, R.J. The Amber biomolecular simulation programs. J. Comput. Chem. 2005, 26, 1668–1688. [Google Scholar] [CrossRef] [Green Version]
Duan, Y.; Wu, C.; Chowdhury, S.; Lee, M.C.; Xiong, G.; Zhang, W.; Yang, R.; Cieplak, P.; Luo, R.; Lee, T. A point-charge force field for molecular mechanics simulations of proteins based on condensed-phase quantum mechanical calculations. J. Comput. Chem. 2003, 24, 1999–2012. [Google Scholar] [CrossRef]
Harvey, M.J.; Giupponi, G.; Fabritiis, G.D. ACEMD: Accelerating biomolecular dynamics in the microsecond time scale. J. Chem. Theor. Comput. 2009, 5, 1632–1639. [Google Scholar] [CrossRef] [Green Version]
Sadiq, S.K.; De Fabritiis, G. Explicit solvent dynamics and energetics of HIV-1 protease flap opening and closing. Proteins 2010, 78, 2873–2885. [Google Scholar] [CrossRef]
Pettit, S.C.; Henderson, G.J.; Schiffer, C.A.; Swanstrom, R. Replacement of the P1 amino acid of human immunodeficiency virus type 1 Gag processing sites can inhibit or enhance the rate of cleavage by the viral protease. J. Virol. 2002, 76, 10226–10233. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Sundquist, W.I.; Kräusslich, H.G. HIV-1 assembly, budding, and maturation. Cold Spring Harb. Perspect. Med. 2012, 2, a006924. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Representation of a near attack conformation (NAC) in HIV-1 protease by a nucleophilic water molecule, as described previously [31]. NACs are characterized in terms of Bürgi–Dunitz criteria corresponding to a nucleophile attack angle (WAT

_{n u c}

:O

_{n u c}

-p1:C-p1:O) range (100° ≤

α

≤ 110°) and distance (WAT

_{n u c}

:O

_{n u c}

-p1:C) threshold (

d_{a}

≤ 3.2 Å). The red dot represents the catalytic center (geometric center of p1:C, p1

^{'}

:N, D25:CG and D25

^{'}

:CG),

d_{n}

represents the nearest water distance to the catalytic center. The ground state corresponds to a nucleophilic water molecule within the bound perimeter (dashed blue circle) in the catalytic site (

d_{n} \leq d_{n}^{p}

).

Figure 1. Representation of a near attack conformation (NAC) in HIV-1 protease by a nucleophilic water molecule, as described previously [31]. NACs are characterized in terms of Bürgi–Dunitz criteria corresponding to a nucleophile attack angle (WAT

_{n u c}

:O

_{n u c}

-p1:C-p1:O) range (100° ≤

α

≤ 110°) and distance (WAT

_{n u c}

:O

_{n u c}

-p1:C) threshold (

d_{a}

≤ 3.2 Å). The red dot represents the catalytic center (geometric center of p1:C, p1

^{'}

:N, D25:CG and D25

^{'}

:CG),

d_{n}

represents the nearest water distance to the catalytic center. The ground state corresponds to a nucleophilic water molecule within the bound perimeter (dashed blue circle) in the catalytic site (

d_{n} \leq d_{n}^{p}

).

Figure 2. Normalized density distributions (

ρ (d_{n})

) of the nearest water distance (

d_{n}

) to the catalytic center across all enzyme-bound substrate systems. The bound-unbound partition distance (

d_{n}^{p}

) is shown (dashed black line).

Figure 2. Normalized density distributions (

ρ (d_{n})

) of the nearest water distance (

d_{n}

) to the catalytic center across all enzyme-bound substrate systems. The bound-unbound partition distance (

d_{n}^{p}

) is shown (dashed black line).

Figure 3. Normalized density distributions (

ρ (d_{o})

) of the nearest water distance (

d_{o}

) to the midpoint of the lytic peptide bond across all enzyme-free substrate systems. The bound-unbound partition distance (

d_{o}^{p}

), chosen as 6 Å, is shown (dashed black line).

Figure 3. Normalized density distributions (

ρ (d_{o})

) of the nearest water distance (

d_{o}

) to the midpoint of the lytic peptide bond across all enzyme-free substrate systems. The bound-unbound partition distance (

d_{o}^{p}

), chosen as 6 Å, is shown (dashed black line).

Figure 4. (A) Four distinct hydrogen bonds that occur in the HIV-1 protease catalytic site when a nucleophilic water molecule is bound and (B) twelve possible combinations of hydrogen bond states that exhibit non-zero density as represented in a previous study [31]. (C) Thermodynamic variation of catalytic site hydrogen bond state distribution across different substrates when the nucleophilic water molecule is in the NAC, GS, and unbound states.

Figure 5. Free energy of NAC formation (A) for all enzyme-bound (

Δ

G

_{N}^{‡ c}

) and (B) enzyme-free (

Δ

G

_{N}^{‡ u}

) substrate systems when varying the threshold

d_{n}^{p}

and

d_{o}^{p}

that partitions bound and unbound nucleophilic water states, respectively.

Figure 5. Free energy of NAC formation (A) for all enzyme-bound (

Δ

G

_{N}^{‡ c}

) and (B) enzyme-free (

Δ

G

_{N}^{‡ u}

) substrate systems when varying the threshold

d_{n}^{p}

and

d_{o}^{p}

that partitions bound and unbound nucleophilic water states, respectively.

Figure 6. (A) Plot of experimentally derived values for the activation barrier (

Δ

G

^{‡ c}

) against calculated values for the free energy of NAC formation in the catalyzed reaction (

Δ

G

_{N}^{‡ c}

). (B) Comparison of NAC (red),

Δ

G

_{N}^{‡ c}

, and non-NAC (blue),

Δ

G

_{n N}^{‡ c}

, free energy contributions. All systems lie within one of three distinct

Δ

G

_{n N}^{‡ c}

bands separated by significantly more than thermal noise (black lines with blue bands). Only one non-NAC band is immediately within the virologically relevant region (yellow) for differential enzyme specificity. NAC contributions are small (<5 kcal/mol) and within ∼3 kcal/mol of each other, but serve to fine-tune

Δ

G

^{‡ c}

for all substrates to both match the optimal barrier differences and reach the relevant barrier region required by the virus—neither of which is achieved, in general, by non-NAC contributions alone.

Figure 6. (A) Plot of experimentally derived values for the activation barrier (

Δ

G

^{‡ c}

) against calculated values for the free energy of NAC formation in the catalyzed reaction (

Δ

G

_{N}^{‡ c}

). (B) Comparison of NAC (red),

Δ

G

_{N}^{‡ c}

, and non-NAC (blue),

Δ

G

_{n N}^{‡ c}

, free energy contributions. All systems lie within one of three distinct

Δ

G

_{n N}^{‡ c}

bands separated by significantly more than thermal noise (black lines with blue bands). Only one non-NAC band is immediately within the virologically relevant region (yellow) for differential enzyme specificity. NAC contributions are small (<5 kcal/mol) and within ∼3 kcal/mol of each other, but serve to fine-tune

Δ

G

^{‡ c}

for all substrates to both match the optimal barrier differences and reach the relevant barrier region required by the virus—neither of which is achieved, in general, by non-NAC contributions alone.

Table 1. Free energies of nucleophilic water binding (

Δ

G

_{n w b}

) in the enzyme-bound substrate complexes based on assigned partition thresholds,

d_{n}^{p}

. All energies are in kcal/mol.

Table 1. Free energies of nucleophilic water binding (

Δ

G

_{n w b}

) in the enzyme-bound substrate complexes based on assigned partition thresholds,

d_{n}^{p}

. All energies are in kcal/mol.

System	Cleavage Sequence	$d_{n}^{p}$ (Å)	$Δ$ G $_{nwb}$
MA-CA	SGNY-PIVQ	4.6	−0.4 ± 0.1
CA-SP1	ARVL-AEAM	4.2	−1.9 ± 0.5
SP1-NC	ATIM-MQRG	4.6	−1.0 ± 0.1
NC-SP2	RQAN-FLGK	4.6	−1.1 ± 0.3
SP2-p6	PYNF-LQSR	4.6	−3.7 ± 1.2
TFR-PR	SFNF-PQIT	4.6	−0.4 ± 0.2
PR-RT	TLNF-PISP	4.6	2.0 ± 0.3
RT-RH	AETP-YVDG	4.6	−0.1 ± 0.2
RH-IN	RKIL-FLDG	4.6	−2.2 ± 0.7

Table 2. Activation free energies in both enzyme-catalyzed and uncatalyzed systems decomposed in terms of NAC and non-NAC contributions. All energies are in kcal/mol. Errors for quantities computed in the simulations are shown. Derived and estimated quantities are reported without errors.

	Uncatalyzed			Catalyzed
System	$Δ$ G $^{‡ u}$	$Δ$ G $_{N}^{‡ u}$	$Δ$ G $_{nN}^{‡ u}$	$Δ$ G $^{‡ c}$ $^{a}$	$Δ$ G $_{N}^{‡ c}$	$Δ$ G $_{nN}^{‡ c}$	$Δ Δ$ G $^{‡}$	$Δ Δ$ G $_{N}^{‡}$	$Δ Δ$ G $_{nN}^{‡}$
MA-CA	∼30 $^{b}$	3.8 ± 0.1	26.2	15.7	2.1 ± 0.1	13.6	−14.3	−1.7 ± 0.2	−12.6
CA-SP1		4.4 ± 0.1	25.6	17.0	3.7 ± 0.2	13.3	−13.0	−0.7 ± 0.3	−12.3
SP1-NC		4.6 ± 0.1	25.4	15.3	4.6 ± 0.2	10.7	−14.7	0.0 ± 0.3	−14.7
NC-SP2		4.3 ± 0.1	25.7	16.7	3.8 ± 0.2	12.9	−13.3	−0.5 ± 0.3	−12.8
SP2-p6		4.8 ± 0.3	25.2	18.4	3.1 ± 0.1	15.3	−11.6	−1.7 ± 0.4	−9.9
TFR-PR		4.1 ± 0.1	25.9	15.3	1.9 ± 0.2	13.4	−14.7	−2.2 ± 0.3	−12.5
PR-RT		3.9 ± 0.1	26.1	15.8	4.2 ± 0.5	11.6	−14.2	0.3 ± 0.6	−14.5
RT-RH		4.7 ± 0.4	25.3	17.2	3.7 ± 0.4	13.5	−12.8	−1.0 ± 0.8	−11.8
RH-IN		4.4 ± 0.1	25.6	16.6	3.5 ± 0.1	13.1	−13.4	−0.9 ± 0.2	−12.5

^{a}

based on experimentally measured and estimated (see Materials and Methods)

k_{cat}

values [39,40,41];

^{b}

based on an experimentally determined half-life of ∼500 years [42].

© 2020 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Sadiq, S.K. Fine-Tuning of Sequence Specificity by Near Attack Conformations in Enzyme-Catalyzed Peptide Hydrolysis. Catalysts 2020, 10, 684. https://doi.org/10.3390/catal10060684

AMA Style

Sadiq SK. Fine-Tuning of Sequence Specificity by Near Attack Conformations in Enzyme-Catalyzed Peptide Hydrolysis. Catalysts. 2020; 10(6):684. https://doi.org/10.3390/catal10060684

Chicago/Turabian Style

Sadiq, S. Kashif. 2020. "Fine-Tuning of Sequence Specificity by Near Attack Conformations in Enzyme-Catalyzed Peptide Hydrolysis" Catalysts 10, no. 6: 684. https://doi.org/10.3390/catal10060684

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Fine-Tuning of Sequence Specificity by Near Attack Conformations in Enzyme-Catalyzed Peptide Hydrolysis

Abstract

1. Introduction

2. Results

2.1. Differential Nucleophilic Water Binding

2.2. Analysis of the Hydrogen Bond Network

2.3. Thermodynamic Decomposition of Activation Free Energy Contributions

3. Discussion

4. Materials and Methods

4.1. Initial Preparation

4.2. Molecular Dynamics Equilibration and Production Simulation Protocol

4.3. Analysis

5. Conclusions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI