Next Article in Journal
Androgen Receptors in Human Breast Cancer and Female Canine Mammary Tumors
Previous Article in Journal
Hydrogen Bonding of Trialkyl-Substituted Urea in Organic Environment
Previous Article in Special Issue
Two Methods for Superposing the Structures of Like-Molecule Assemblies: Application to Peptide and Protein Oligomers and Aggregates
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Dynamical Asymmetry in SARS-CoV2 Protease Reveals the Exchange Between Catalytic Activity and Stability in Homodimers

1
INFN and Department of Physics, University of Rome Tor Vergata, 00133 Rome, Italy
2
Department of Environment and Health, Istituto Superiore di Sanità, 00161 Rome, Italy
3
Department of Experimental Medicine, University of Rome Tor Vergata, 00133 Rome, Italy
4
Unit of Chemical-Physics Fundamentals in Chemical Engineering, Department of Science and Technology for Sustainable Development and One Health, Università Campus Bio-Medico of Rome, 00128 Rome, Italy
5
Unit of Electrotechnics, Faculty of Engineering, Università Campus Bio-Medico of Rome, 00128 Rome, Italy
*
Authors to whom correspondence should be addressed.
Molecules 2025, 30(7), 1412; https://doi.org/10.3390/molecules30071412 (registering DOI)
Submission received: 21 January 2025 / Revised: 17 March 2025 / Accepted: 20 March 2025 / Published: 22 March 2025

Abstract

:
The molecular approach to understanding the mechanisms of emerging diseases, like COVID-19, has largely accelerated the search for successful therapeutical strategies. In this work, we present an extensive molecular dynamics (MD) analysis of two forms of the SARS-CoV-2 main protease MPro. We analyzed the free form (apo) and compared the results with those coming from the (holo) form bound to the inhibitor Boceprevir, an FDA-approved drug repurposed for COVID-19 therapy. We applied Dynamic Cross Correlation (DCC) analysis to the MD simulations to trace the concerted motion patterns within the protein structure. Although symmetric, the homodimer in the bound form showed clearly asymmetric dynamical behavior. In particular, the presence of concerted motions was detected in the protomer where the expulsion of the substrate from the active site happened. Such behavior was not observed in the same time lapses in the apo form. These results highlight a sort of ‘symmetry breaking’, making a symmetric structure to display functional induced asymmetric behavior in response to a perturbation. This highly coordinated dynamics in response to an external cue confirms the character of ‘complex molecular machines’ of biopolymers.

1. Introduction

As the outbreak of the novel coronavirus disease (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) hit with an unprecedented global impact, it was evident a better understanding of molecular mechanisms behind virus infection was a winning strategy to reach an efficient therapy [1,2].
The main viral protease (MPro), also known as 3ClPro, is one of the key targets for drug development against SARS-CoV-2 [3]. This enzyme plays a crucial role in viral replication and is therefore an attractive target for therapeutic interventions.
The MPro protease appears to be an attractive pharmacological target due to its essential role in processing polyproteins translated from viral RNA [4]. This protease operates at no fewer than 11 cleavage sites on the large polyprotein 1ab. It is a dimer composed of two identical subunits that together form two active sites. Inhibiting the activity of this enzyme would block viral replication. Moreover, since no known human proteases exhibit a similar cleavage specificity, such inhibitors are unlikely to be toxic to humans. For this reason, numerous studies have therefore focused on the search for potential inhibitors [4,5,6]. Scientific research has also focused on identifying molecules that could prevent its dimerization. It was, in fact, discovered that in solution, this protease does not perform its enzymatic function when present as a monomer [7,8,9]. A very recent study demonstrated that the MPro dimer in complex with peptides mimicking the substrate adopts an asymmetric structure: the binding of substrate to a single subunit reveals allosteric communication between the active sites. Furthermore, arginine 4 and 298 are found to be crucial factors in the transition from a symmetric to an asymmetric dimer [10].
From a structural point of view, each monomer subunit, which comprises three different domains (I, II and III), is unstable [7]. Dimerization is promoted by domain III aggregation (see Figure 1) and the oligomeric form is stabilized by the tight binding between the C and N terminus of the two chains and by the salt bridge between Glu290 of one chain and Arg4 of the second chain [11]. Since the dimerization is a key step in the catalytic function of the MPro, targeting the dimer interface is a potentially winning strategy to reduce the viral infection [12].
The protein functional region resides (in each chain) at the intersection between domains I and II and it is characterized by a catalytic dyad formed by His41 and Cys145, which is strongly conserved among coronaviruses. MPro binding is allosterically regulated and different allosteric sites have been identified on its surface. Alzyoud and coworkers [13] detected two classes of MPro allosteric sites: distal and dimerization sites. Binding to these sites of potential allosteric inhibitors has been demonstrated to alter the catalytic function through conformational changes that involve both subunits. In SARS-CoV MPro, the co-operation of the two monomers resulted in being quite peculiar and, in fact, functional asymmetry was detected by Chen and co-workers in 2003 [14]. Such a feature was later confirmed for SARS-CoV2 MPro by Iida et al. in 2021 [15].
Molecular dynamics (MD) is the key computational technique to describe the protein function through its dynamics. Starting from the knowledge of atoms dynamics according to classical force fields, it allows gaining insights about the structure–function relationship in protein molecules, the protein–protein and protein–ligand interactions [16,17,18,19,20]. Since the outbreak of the COVID-19 pandemic, molecular dynamics (MD) has been applied to the molecular components of SARS-CoV-2, specifically the spike protein and MPro, to explore molecular mechanisms and develop viable and effective therapeutic strategies [2,14,21,22,23,24,25,26,27,28,29,30].
In this work, we move forward by providing a two-fold methodology for the punctual analysis of MPro MD simulations so as to identify crucial regions and residues for the enzyme activity. We applied the analysis to the MD simulations of MPro (free and bound to the inhibitor Boceprevir, an FDA-approved drug for COVID-19 therapeutical applications [31]). Boceprevir is a protease inhibitor originally used to treat hepatitis C [32] and it has been investigated against SARS-CoV-2 because of the following: (1) SARS-CoV-2 protease shares structural similarities with the HCV protease, which is the target of Boceprevir; (2) in vitro studies have shown that Boceprevir can bind to the active site of MPro, preventing the maturation of viral proteins [33]; and (3) since Boceprevir is already approved for human use, it has a well-known safety profile, shortening the timeline for drug repurposing.
In this way, we were able to compare the dynamics of apo and holo (inhibited) forms and so identify a dynamical signature of inhibition. This approach allows defining the properties of good candidates for COVID-19 therapy targeting MPro.
The method we applied is the computation of the Dynamical Cross Correlation (DCC) matrix based on single-residue trajectories along with MD simulations, while at the same time giving some theoretical hints on the character of biopolymers as complex systems. The Pearson correlation coefficient between the trajectories of residue pairs allows detecting concerted residue motions by tracing out the allosteric pathways in a dynamic way [34,35,36].
The identification of communication pathways in MPro plays a pivotal role in the comprehension of its action and in the identification of allosteric inhibitors [13,30]. Understanding collective motions aids in comprehending how different regions of a protein communicate during its function and how to disrupt this communication. Thus, findings from this study may have significant implications for developing effective therapeutic strategies targeting this vital enzyme in SARS-CoV-2.

2. Materials and Methods

2.1. Structure Identification

We applied the analysis on two molecular structures of MPro from the RCSB PDB server.
The PDB file of the apo protein was generated from the mmCIF file available in the Protein Data Bank (6y2e). Specifically, we used the mmCIF file corresponding to the biological assembly (i.e., a homodimer protein) from the PDB server https://www.rcsb.org (access 10 January 2024). The server https://mmcif.pdbj.org (access 10 January 2024) was used to convert the mmCIF format to PDB format. This server uses, for the conversion, the MAXIT tool from the RCSB PDB server.
For the complex with Boceprevir, we used the 7BRP pdb file.

2.2. Molecular Dynamics Simulations

Classical MD simulations were performed by employing the open-source software GROMACS 2021.7 [37] with the CHARMM36 force field [38]. We built two different systems, one for the apo MPro and one for MPro in complex with Boceprevir. The ligand (Boceprevir) force field was computed with CGenFF version 2.5 [39,40,41] and translated to be used in GROMACS with the help of a python script (cgenff_charmm2gmx.py) by Justin Lemkul downloaded from Github. Each of the systems was placed in a cubic box with a side equal to 10 nm and we considered the presence of periodic boundary conditions. Each box was filled with TIP3P water molecules and an appropriate number of counterions necessary for the system neutrality (0.15 M). The simulation procedure was as follows: after the minimization, which was performed with the steepest descent algorithm, we equilibrated each system for 1 ns first in the NVT and then in the NPT ensemble. The successive production runs each consisted of 300 ns long simulation in the NPT ensemble at 310 K, by employing the v-rescale thermostat [19], with a coupling time of 0.1 ps. The pressure was kept constant at 1 bar, with a coupling time of 2 ps, and an isothermal compressibility of 4.5·10−5 bar−1 by means of the Berendsen barostat [42]. The bonds involving hydrogen atoms were constrained according to the LINCS algorithm [43]. The radius of the sphere, which defines the list for the pairwise interactions, was set to 1.0 nm. Consistently, the van der Waals interaction was also cut off at 1.0 nm. The particle mesh Ewald method [44] for the long-range electrostatic interactions was employed using a 1.0 nm real-space cutoff. A time step of 2 fs was used to integrate the equations of motion.
Three distinct simulations were performed for the two forms. The MD data analysis was carried out with standard GROMACS tools. The MD simulations were performed on LEONARDO GPU cluster at CINECA.

2.3. Molecular Dynamics Analysis

2.3.1. DCC Analysis

We applied a Canonical Analysis of Motion to the MD frames to identify concerted motions of protein regions [45].
At first, we computed the Displacement Matrix (DISPL), an m × p matrix, m being the number of frames and p the number of residues: its i-th row contains the displacement values (Euclidean distances between the alpha carbons of the same residue at different times) for all residues at time t i with respect to t i 1 . The first row was calculated with respect to the cystallographic PDB structure.
The DISPL columns report the single-residue displacement along time, somehow representative of the single-residue trajectory.
Once the DISPL is computed, it is possible to compute the Dynamic Cross Correlation (DCC) matrix, whose generic element D C C i j is the Pearson correlation coefficient between the displacement vectors of the i-th and j-th residues, computed as follows:
DCC i , j = Δ r i t Δ r j t Δ r i t 2   Δ r j t 2
where r i t represents the i-th residue alpha-carbon position as a function of time while Δ r i t   is the residue displacement in the frame at t (generic element of the matrix DISPL). DCC allows for a deep understanding of the MD simulations, revealing the concerted motions between distal residues [2,46,47]. In other words, if the correlation between the displacement of the two vectors for the two residues is high, that means those two residues move in a concerted motion. So, a high value of DCC i j means that residues i-th and j-th move together.
To better emphasize the coordinated movements within and between different regions of the protein MPro, we identified specific areas in the dynamic correlation map (DCC map) based on unique findings, as depicted in Figure 2. The color scale reports the value of the DCC in terms of hot (cold) colors corresponding to high (low) values of correlation coefficients between a pair of residues (hot values → high probability of concerted motions, cold values → low probability of concerted motions). So, the large blue regions correspond to a pair of residues whose displacement vectors are completely uncorrelated (correlation coefficient ~0. On the other hand, red spots correspond to residues with highly correlated (concerted) motions, for instance, residues in the same alpha-helix.
For this analysis, we grouped domain I + II together (A1 and B1 in the two chains, respectively), as they are responsible for the catalytic activity of MPro, and considered them separately from domain III (A2 and B2). Region AB comprises the interactions between domain III of chain A and chain B, emerging as dynamically correlated.
Finally, we also considered motions within the single chains (A and B in Figure 2).

2.3.2. Protein Contact Network Analysis

The protein contact networks are obtained from molecular dynamics frames (pdb files), according to the method widely explained in a previous paper [48]. In brief, for each frame, the alpha-carbons’ positions are extracted and the adjacency matrix of mutual distances between residues is computed. A link exists between two residues if their distance falls between 4 and 8 Å. The adjacency matrix is the descriptor of the protein contact network; it is a binary matrix whose generic element is one if there is a link between the corresponding nodes. From matrix A, it is possible to extract several descriptors and in particular the betweenness centrality (btw). The btw is a protein contact network descriptor that measures the importance of a residue in mediating communication between other residues in the network. Specifically, it measures the number of shortest paths that pass through a given residue according to the following equation:
b t w i = v V ,   v i u V , u i σ u v i σ u v
where σ u v is the number of shortest paths connecting nodes u and v and σ u v i is the number of shortest paths connecting the two nodes through node i.

3. Results

Figure 3 reports the RMSD for the three simulations for the apo and holo forms. In the insets, the corresponding evolution of the protein’s radius of gyration (RH) is reported.
The three simulations for the holo form start from a condition of double occupancy of the catalytic sites. All systems reached stability in the last 100 ns of the MD simulations as demonstrated by the standard deviations value of the RMSD and RH calculated in the range (200–300) ns (less than 8%).
On the basis of the events that take place at the inhibitor binding sites, the MD time course can be divided into different phases.
For instance, in Figure 4, the case of holo simulation #1 (Figure 3B, green) has been quantitatively analyzed by evaluating the distances between the two inhibitor molecules and the two protein subunits. In particular, an asymmetric dynamic behavior can be observed and three different MD phases identified (Figure 4A) as follows:
  • PHASE I: in the first 72 ns, both chains retain their substrates; at t ≈ 72 ns, chain B loses its substrate;
  • PHASE II: from t ≈ 72 ns to 200 ns, only chain A retains its substrate; at t = 200 ns, chain A also loses its substrate;
  • PHASE III: from t = 200 ns, both chains are unbound.
Figure 4. (A) Simulation 1 of the MD of the holo form: the figure reports the distance between substrates and their corresponding catalytic phases. Three phases are detected, corresponding to the asymmetric occupancy of the two catalytic sites. (B) DCC for the whole MD simulation. (C) DCC for phase I. (D) DCC for phase II. (E) DCC for phase III. In red rectangles, the AB region of the map, reporting data of correlation between motions of residues in domain III of chain A and chain B.
Figure 4. (A) Simulation 1 of the MD of the holo form: the figure reports the distance between substrates and their corresponding catalytic phases. Three phases are detected, corresponding to the asymmetric occupancy of the two catalytic sites. (B) DCC for the whole MD simulation. (C) DCC for phase I. (D) DCC for phase II. (E) DCC for phase III. In red rectangles, the AB region of the map, reporting data of correlation between motions of residues in domain III of chain A and chain B.
Molecules 30 01412 g004
In order to characterize the details of the inter-subunit interaction along the simulation time course, the DCC matrix has been evaluated and the profiles for the whole simulation time and the three individual stages are shown in Figure 4B–E. The first phase (stage I, Figure 4C) is marked by a general increase in correlated movements between individual residues, suggesting a high degree of coordination across the protein structure. Following substrate 2 detachment in stage II, there is a reduction in the correlation, indicative of reduced concerted motion (Figure 4D).
In stage III (Figure 4E), there is a general reduction in the correlation, approaching the values of phase I. This pattern suggests a dynamic process where initial coordinated enzyme “shaking” facilitates the first substrate ejection (in chain B), followed by a quieter phase, and then a partial resurgence of correlated movements, associated with the release of the second substrate molecule (from chain A). Notably, the AB region displays weak but significant correlations in stage I, which disappear in stage II and reappear, though fainter, in phase III, highlighting its potential role in substrate interactions.
To better characterize these dynamics, we applied the DCC method over a time range of 10 ns (100 frames). The results are reported in Figure 5 and Figure 6.
Figure 5 reports the DCC’s time course for the whole holo molecule, the two chains and the region of interaction (AB in Figure 2).
The whole protein, chain B and region AB experience two peaks of correlation around 40 and 70 ns, the latter being the highest value (0.18 for chain B and 0.27 for the AB region).
Going more in detail (Figure 6), the regions experiencing these two peaks are A2 (reaching at 70 ns the highest value of 0.34, Figure 6B) and B1 (Figure 6C). Meanwhile, B2 exhibits a lower peak at 40 ns and a noticeable peak at 70 ns (0.28, Figure 6D). Region A1 shows similar peaks, but at quite lower values of the correlation coefficient at different times (0.14 at 20 ns and 0.15 at 60 ns, Figure 6A).
We verified similar patterns for two additional MD simulations of the holo form of MPro (see Supplementary Files).
Despite the great qualitative impact, the DCC representations in Figure 5 and Figure 6 do not allow a quantitative interpretation of the molecular events in which the protein domains are involved. For such reason, we performed a coarse-graining analysis of such data, calculating the average DCC value (<DCC>) in the different regions identified in Figure 2. In this way, a single DCC value, representing each particular polypeptide region, quantitatively summarizes the temporal correlation of the residues belonging to that area along the simulation.
The first result is the quite different behavior of the two chains (A, B), in the holo and in the apo forms. As shown in Figure 7, the <DCC> values in the apo protein are roughly the same in the three phases, oscillating around 0.08. On the contrary, in the holo form, both chains are characterized by higher <DCC> values in the first phase (>0.11). Then, after the release of the inhibitor molecule from chain B, a rapid decrease in the average correlation values is observed in phase II, followed by a slight increase in phase III, involving the sole chain A (in blue, in Figure 7). A second important feature emerging from the <DCC> analysis concerns the AB area of the diagram reported in Figure 2. This region represents the correlation between domains belonging to different protein chains. The high <DCC> value observed in the holo form during phase I (in green, Figure 7) suggests that synchronic motions occur between residues within the catalytic domains of chain B (located in the middle of domains I and II), with those of the oligomerization domain of chain A (i.e., domain III).
More details on the intra-domain residues correlation have been obtained by evaluating the <DCC> values for the individual A1, A2, B1 and B2 regions (Figure 2) of both the holo (Figure 8A) and apo (Figure 8B) protein forms.
At first glance, the correlation of the residue trajectories appears very homogeneous only for the apo MPro, which displays marginal oscillation of the <DCC> values as a function of time (Figure 8B). In this case, two striking pieces of evidence emerge as follows: (i) a symmetry between the two chains and (ii) a larger contribution of the A2 and B2 region to the average correlation of each protein chain. This last feature is interesting, since the A2 and B2 areas correspond to domain III (see Figure 2), which is the main domain responsible for the MPro oligomerization process. High intra-domain correlation values thus suggest that the cooperative organization of the A2 and B2 residues takes place to facilitate the interaction of the two chains at the monomer–monomer interface. The case of the holo protein is more complex. According to the values reported in Figure 8A, regions A2 and B1 play a main role during phase I of the simulation. Then, large changes occur, with the <DCC> values in A2 oscillating from—32% in phase II, to the recovery of about + 20% in phase III. Interestingly, a similar trend occurs in the case of A1, despite it being to a lower extent. Different behavior was instead found for the residues of the B1 region. In this case, we observe a decrease in the <DCC> value after the release of Boceprevir (phase II and III).
In order to elucidate the role of the dimeric interface in constituting the active dimer, we have analyzed the btw values of the residues of the holo samples at different MD frames. Residues with high btw are important for maintaining the overall connectivity and communication within the protein structure; they often act as “bridges” in transmitting conformational information between different regions of the protein. They are, therefore, prime candidates for involvement in allosteric mechanisms [48].
In Figure 9, we have shown the btw values for the amino acids of the holo protease at t = 30 ns, i.e., during phase I of simulation #1 (panel A). In the same figure (panel B), we have reported the changes in this parameter with respect to those obtained at 30 ns at two other MD times (150 ns and 240 ns). In this graph, the most relevant changes occur in some particular residues of chain B, as evidenced by the rectangles in gray. These changes occur when subunit B releases the inhibitor (from phase I to phase II) but the same B residues are also involved when movements of Boceprevir are in subunit A (from phase I to phase III).

4. Discussion

Symmetry is an intriguing feature of oligomers. The number of configurations that such proteins may assume is impressive [49,50] and, since the crystal structure of hemoglobin was solved, it was clear that the symmetric arrangement of oligomers has enormous implications in their biological function. Indeed, the first model of allosterism (MWC) was based on symmetry and it is a milestone in the chemistry of protein–ligand interaction [51]. Yet, at the microscopic level, the so-called “induced-fit” concept has introduced the idea that the crystallographic symmetry of oligomeric proteins and enzymes may be broken, once the first ligand/substrate molecule binds one subunit.
The case of homodimers is very peculiar as they represent the simplest form of subunit interaction: each subunit can exclusively interact/control its own mate; thus, there is no chance for those extended conformational changes that characterize the cooperative functional dynamics of oligomers containing more than two subunits. Moreover, the identity between the two protomers generates a global structural symmetry to the dimer. Nevertheless, it has been found that the two equivalent active sites of homo-dimers are rarely both occupied at the same time, a strange behavior that suggests an intrinsic functional asymmetry for these proteins. Pai and coworkers have provided a rationale for such dynamic asymmetry in the case of a homo-dimeric dehalogenase [52,53]. In particular, they demonstrated that the entropic cost of substrate binding in one subunit is counterbalanced by a more disordered state in the other monomer, such “structural information” being transmitted and shared between the two protomers through an allosteric pathway. Such findings indicate that the degree of motional coordination among the residues in the empty subunit differs from that of the substrate-bound monomer. Similar results were also found for the case of SARS 3C-like proteinase where the two protomers of the dimer are asymmetric and only one protomer is active at one time [14].
The analysis of MD dynamics for the two forms of protease in our study highlights a distinct difference between the dynamics in the presence (holo) and in the absence (apo) of a ligand. In the apo form, with both binding sites empty, a symmetrical pattern between the two chains was found (Figure 7). In particular, the two domains comprising the catalytic residues move in a concerted motion region (regions A1 and B1 in Figure 8), while the third domain (A2 and B2), responsible for the protease dimerization, moves on its own. The two chains of the holo form display, instead, a more complex behavior, the evolution of the DCC pattern depending on the presence/absence of a single or both ligand molecules. For instance, during phase I, higher average correlations occur in chain B with respect to chain A (region B1 in Figure 5B and Figure 7). This means that in attempting to clear out the binding site, the residues in chain B cooperate all together until the ligand is expulsed. Interestingly, during such a phase, strong cooperation between the two chains also takes place, as demonstrated by the high <DCC> values observed in the AB region (Figure 7, phase I, green bar). The release of the second ligand molecule, at the end of phase II, is less dramatic and, in fact, the average DCC values of each holo protein domain progressively re-align to those of the apo form (Figure 8). This peculiar behavior of the protease, in which the ligand is released during the simulation but later rebinds to the active site, has also been observed with other inhibitors in the study conducted by Komatsu and collaborators [54]. This finding suggests that the absence of one ligand confers the whole protein with an enhanced conformational dynamic. On the other hand, such behavior is also compatible with the already mentioned concept of “entropic reservoir”: the free subunit can have a regulatory role by means of an entropy compensation mechanism, confirming the crucial functional role played by empty subunits in homo-dimeric enzymes [51,52].
The important role of the dimer interface in constituting the active molecule and transmitting information through the two monomers is also evident from the analysis of the btw values (Figure 9). We have seen that in our case, the values of the btw of some peculiar residues of chain B change a lot with respect to chain A (Figure 9). Some of these amino acids (ARG 4, SER 10, GLY 11 and GLU 14), as evidenced in Figure 9B, belong to the so-called allosteric site on the distal region of the protein [13]. Identifying and modifying those residues with high btw could help alter or engineer allosteric pathways to modulate protein function since several efforts have been performed to identify drugs able to bind both in the catalytic and distal allosteric binding sites [55].
In conclusion, the two identical polypeptide chains of homodimers confer them with the peculiar symmetry observed in the crystallographic models. Such symmetry is retained in the apo form, but it is immediately lost when the dimer interacts with ligands and a “dynamic asymmetry” emerges. This feature is accompanied by a strong inter-chain correlation and appears as a biological strategy to optimize the performance of such enzymatic systems, confirming that initial binding to one active site allosterically perturbs the second one and generates an asymmetric dimer [10].
The two peaks observed at 40 ns and 70 ns in the detailed DCC’s time course can be interpreted as a “tipping point”, i.e., critical thresholds before a dramatic change in the protein structure and function (the loss of the first substrate molecule). This is justified by the fact these events in complex systems dynamics are detected by peaks in variance and correlation [56].
Finally, together with allostery, the intrinsic dynamic asymmetry of homodimers would explain their widespread presence in the protein world [52].
The emergence of a ‘dynamic asymmetry’ from a ‘structural symmetry’ is a signature of the onset of complexity at the molecular level, confirming the peculiar location of biopolymers on the edge between simplicity and complexity physics [57].

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/molecules30071412/s1, Figures S1 and S5: Simulation 2 (S1) and 3 (S5) of the MD of the holo form: the figure reports the distance between substrates and their corresponding catalytic phases. Figures S2, S6 and S8: DCC for the whole MD simulation, DCC for phase I, DCC for phase II, DCC for phase III for holo simulations 2 (S2), 3(S6) and apo simulation 1 (S8). Figures S3, S4, S7 and S8: DCC variation in molecular dynamics of the holo form for simulation 2 (S3 and S4) and simulation 3 (S7 and S8).

Author Contributions

V.M. and L.D.P. equally contributed as first authors; conceptualization, L.D.P. and A.G.; methodology L.D.P., V.M. and M.P.; software, V.M., L.D. and M.P.; validation, G.M. and A.D.V.; formal analysis, G.M. and A.D.V.; resources, G.M. and V.M.; writing—original draft preparation, L.D.P.; writing—review and editing, all authors; visualization, A.D.V. and G.M.; supervision, G.M. and A.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Italian Ministry of University and Research (MUR), competitive PRIN2022 grant, prot. No. 2022PJKF88.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Yan, W.; Zheng, Y.; Zeng, X.; He, B.; Cheng, W. Structural biology of SARS-CoV-2: Open the door for novel therapies. Signal Transduct. Target. Ther. 2022, 7, 26. [Google Scholar]
  2. Parida, P.K.; Paul, D.; Chakravorty, D. The natural way forward: Molecular dynamics simulation analysis of phytochemicals from Indian medicinal plants as potential inhibitors of SARS-CoV-2 targets. Phytother. Res. 2020, 34, 3420–3433. [Google Scholar] [PubMed]
  3. Hu, Q.; Xiong, Y.; Zhu, G.-H.; Zhang, Y.-N.; Zhang, Y.W.; Huang, P.; Ge, G.B. The SARS-CoV-2 main protease (Mpro): Structure, function, and emerging therapies for COVID-19. MedComm 2022, 3, e151. [Google Scholar] [PubMed]
  4. Duan, Y.; Wang, H.; Yuan, Z.; Yang, H. Structural biology of SARS-CoV-2 Mpro and drug discovery. Curr. Opin. Struct. Biol. 2023, 82, 102667. [Google Scholar]
  5. Biernacki, K.; Ciupak, O.; Daśko, M.; Rachon, J.; Flis, D.; Budka, J.; Inkielewicz-Stępniak, I.; Czaja, A.; Rak, J.; Demkowicz, S. Development of potent and effective SARS-CoV-2 main protease inhibitors based on maleimide analogs for the potential treatment of COVID-19. J. Enzyme Inhib. Med. Chem. 2024, 39, 2290910. [Google Scholar]
  6. Wang, H.; Yang, Q.; Liu, X.; Xu, Z.; Shao, M.; Li, D.; Duan, Y.; Tang, J.; Yu, X.; Zhang, Y.; et al. Structure-based discovery of dual pathway inhibitors for SARS-CoV-2 entry. Nat. Commun. 2023, 14, 7574. [Google Scholar]
  7. Almutairi, G.O.; Malik, A.; Alonazi, M.; Khan, J.M.; Alhomida, A.S.; Khan, M.S.; Alenad, A.M.; Altwaijry, N.; Alafaleq, N.O. Expression, purification, and biophysical characterization of recombinant MERS-CoV main (Mpro) protease. Int. J. Biol. Macromol. 2022, 209, 984–990. [Google Scholar]
  8. Silvestrini, L.; Belhaj, N.; Comez, L.; Gerelli, Y.; Lauria, A.; Libera, V.; Mariani, P.; Marzullo, P.; Ortore, M.G.; Piccionello, A.P.; et al. The dimer-monomer equilibrium of SARS-CoV-2 main protease is affected by small molecule inhibitors. Sci. Rep. 2021, 11, 9283. [Google Scholar]
  9. Nashed, N.T.; Aniana, A.; Chiliveri, R.G.S.C.; Louis, J.M. Modulation of the monomer-dimer equilibrium and catalytic activity of SARS-CoV-2 main protease by a transition-state analog inhibitor. Commun. Biol. 2022, 5, 160. [Google Scholar]
  10. Fornasier, E.; Fabbian, S.; Shehi, H.; Enderle, J.; Gatto, B.; Volpin, D.; Biondi, B.; Bellanda, M.; Giachin, G.; Sosic, A.; et al. Allostery in homodimeric SARS-CoV-2 main protease. Commun. Biol. 2024, 7, 1435. [Google Scholar]
  11. Yang, H.; Rao, Z. Structural biology of SARS-CoV-2 and implications for therapeutic development. Nat. Rev. Microbiol. 2021, 19, 685–700. [Google Scholar] [PubMed]
  12. Goyal, B.; Goyal, D. Targeting the Dimerization of the Main Protease of Coronaviruses: A Potential Broad-Spectrum Therapeutic Strategy. ACS Comb. Sci. 2020, 22, 297–305. [Google Scholar]
  13. Alzyoud, L.; Ghattas, M.A.; Atatreh, N. Allosteric binding sites of the SARS-CoV-2 main protease: Potential targets for broad-spectrum anti-Coronavirus agents. Drug Des. Dev. Ther. 2022, 16, 2463–2478. [Google Scholar]
  14. Chen, H.; Wei, P.; Huang, C.; Tan, L.; Liu, Y.; Lai, L. Only one protomer is active in the dimer of SARS 3C-like proteinase. J. Biol. Chem. 2006, 281, 13894–13898. [Google Scholar]
  15. Iida, S.; Fukunishi, Y. Asymmetric dynamics of dimeric SARS-CoV-2 and SARS-CoV main proteases in an apo form: Molecular dynamics study on fluctuations of active site, catalytic dyad, and hydration water. BBA Adv. 2021, 1, 100016. [Google Scholar] [CrossRef] [PubMed]
  16. Adcock, S.A.; McCammon, J.A. Molecular dynamics: Survey of methods for simulating the activity of proteins. Chem. Rev. 2006, 106, 1589–1615. [Google Scholar] [PubMed]
  17. Hollingsworth, S.A.; Dror, R.O. Molecular dynamics simulation for all. Neuron 2018, 99, 1129–1143. [Google Scholar] [CrossRef]
  18. Karplus, M.; Petsko, G.A. Molecular dynamics simulations in biology. Nature 1990, 347, 631–639. [Google Scholar]
  19. Bussi, G.; Donadio, D.; Parrinello, M. Canonical sampling through velocity rescaling. J. Chem. Phys. 2007, 126, 014101. [Google Scholar]
  20. Karplus, M. Molecular dynamics simulations of biomolecules. Acc. Chem. Res. 2002, 35, 321–323. [Google Scholar]
  21. Tiwari, G.; Chauhan, M.S.; Sharma, D. In silico study of inhibition activity of boceprevir drug against 2019-nCoV main protease. Z. Naturforschung C 2024, 79, 1–12. [Google Scholar] [CrossRef]
  22. Shekaari, A.; Jafari, M. Structural dynamics of COVID-19 main protease. J. Mol. Struct. 2021, 1223, 129235. [Google Scholar] [PubMed]
  23. Dehury, B.; Raina, V.; Misra, N.; Suar, M. Effect of mutation on structure, function and dynamics of receptor binding domain of human SARS-CoV-2 with host cell receptor ACE2: A molecular dynamics simulations study. J. Biomol. Struct. Dyn. 2021, 39, 7231–7245. [Google Scholar]
  24. Hadi-Alijanvand, H.; Di Paola, L.; Hu, G.; Leitner, D.M.; Verkhivker, G.M.; Sun, P.; Poudel, H.; Giuliani, A. Biophysical Insight into the SARS-CoV2 Spike–ACE2 Interaction and Its Modulation by Hepcidin through a Multifaceted Computational Approach. ACS Omega 2022, 7, 17024–17042. [Google Scholar] [PubMed]
  25. Ray, D.; Le, L.; Andricioaei, I. Distant Residues Modulate Conformational Opening in SARS-CoV-2 Spike Protein. Cold Spring Harb. Lab. 2021, 118, e2100943118. [Google Scholar] [CrossRef]
  26. Brielle, E.S.; Schneidman-Duhovny, D.; Linial, M. The SARS-CoV-2 exerts a distinctive strategy for interacting with the ACE2 human receptor. Viruses 2020, 12, 497. [Google Scholar] [CrossRef]
  27. Baig, A.M.; Khaleeq, A.; Syeda, H. Elucidation of cellular targets and exploitation of the receptor-binding domain of SARS-CoV-2 for vaccine and monoclonal antibody synthesis. J. Med. Virol. 2020, 92, 2792–2803. [Google Scholar]
  28. Turoňová, B.; Sikora, M.; Schürmann, C.; Hagen, W.J.H.; Welsch, S.; Blanc, F.E.C.; von Bülow, S.; Gecht, M.; Bagola, K.; Hörner, C.; et al. In situ structural analysis of SARS-CoV-2 spike reveals flexibility mediated by three hinges. Science 2020, 370, 203–208. [Google Scholar]
  29. Ali, A.; Vijayan, R. Dynamics of the ACE2-SARS-CoV-2/SARS-CoV spike protein interface reveal unique mechanisms. Sci. Rep. 2020, 10, 14214. [Google Scholar] [CrossRef]
  30. Carli, M.; Sormani, G.; Rodriguez, A.; Laio, A. Candidate binding sites for allosteric inhibition of the SARS-CoV-2 main protease from the analysis of large-scale molecular dynamics simulations. J. Phys. Chem. Lett. 2021, 12, 65–72. [Google Scholar]
  31. Göhl, M.; Zhang, L.; El Kilani, H.; Sun, X.; Zhang, K.; Brönstrup, M.; Hilgenfeld, R. From repurposing to redesign: Optimization of boceprevir to highly potent inhibitors of the SARS-CoV-2 main protease. Molecules 2022, 27, 4292. [Google Scholar] [CrossRef]
  32. Venkatraman, S. HCV: The Journey from Discovery to a Cure. Top. Med. Chem. 2019, 31, 293–315. [Google Scholar]
  33. Kneller, D.W.; Li, H.; Phillips, G.; Weiss, K.L.; Zhang, Q.; Arnould, M.A.; Jonsson, C.B.; Surendranathan, S.; Pavathareddy, J.; Blakeley, M.P.; et al. Covalent narlaprevir- and boceprevir-derived hybrid inhibitors of SARS-CoV-2 main protease. Nat. Commun. 2022, 13, 2268. [Google Scholar] [CrossRef] [PubMed]
  34. Ichiye, T.; Karplus, M. Collective motions in proteins: A covariance analysis of atomic fluctuations in molecular dynamics and normal mode simulations. Proteins 1991, 11, 205–217. [Google Scholar] [CrossRef] [PubMed]
  35. Lange, O.F.; Grubmüller, H. Full correlation analysis of conformational protein dynamics. Proteins 2008, 70, 1294–1312. [Google Scholar] [CrossRef]
  36. Liu, X.; Golden, L.C.; Lopez, J.A.; Shepherd, T.R.; Yu, L.; Fuentes, E.J. Conformational Dynamics and Cooperativity Drive the Specificity of a Protein-Ligand Interaction. Biophys. J. 2019, 116, 2314–2330. [Google Scholar] [CrossRef]
  37. Abraham, M.J.; Murtola, T.; Schulz, R.; Páll, S.; Smith, J.C.; Hess, B.; Lindahl, E. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 2015, 1–2, 19–25. [Google Scholar] [CrossRef]
  38. Best, R.B.; Zhu, X.; Shim, J.; Lopes, P.E.M.; Mittal, J.; Feig, M.; Mackerell, A.D., Jr. Optimization of the additive CHARMM all-atom protein force field targeting improved sampling of the backbone φ, ψ and side-chain χ(1) and χ(2) dihedral angles. J. Chem. Theory Comput. 2012, 8, 3257–3273. [Google Scholar] [CrossRef]
  39. Vanommeslaeghe, K.; Hatcher, E.; Acharya, C.; Kundu, S.; Zhong, S.; Shim, J.; Darian, E.; Guvench, O.; Lopes, P.; Vorobyov, I.; et al. CHARMM general force field: A force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields. J. Comput. Chem. 2010, 31, 671–690. [Google Scholar] [CrossRef]
  40. Vanommeslaeghe, K.; Raman, E.P.; MacKerell, A.D., Jr. Automation of the CHARMM General Force Field (CGenFF) II: Assignment of bonded parameters and partial atomic charges. J. Chem. Inf. Model. 2012, 52, 3155–3168. [Google Scholar] [CrossRef]
  41. Vanommeslaeghe, K.; MacKerell, A.D., Jr. Automation of the CHARMM General Force Field (CGenFF) I: Bond perception and atom typing. J. Chem. Inf. Model. 2012, 52, 3144–3154. [Google Scholar] [CrossRef] [PubMed]
  42. Berendsen, H.J.C.; Postma, J.P.M.; van Gunsteren, W.F.; DiNola, A.; Haak, J.R. Molecular dynamics with coupling to an external bath. J. Chem. Phys. 1984, 81, 3684–3690. [Google Scholar]
  43. Hess, B.; Bekker, H.; Berendsen, H.J.C.; Fraaije, J.G.E.M. LINCS: A linear constraint solver for molecular simulations. J. Comput. Chem. 1997, 18, 1463–1472. [Google Scholar]
  44. Darden, T.; York, D.; Pedersen, L. Particle mesh Ewald: An N·log(N) method for Ewald sums in large systems. J. Chem. Phys. 1993, 98, 10089–10092. [Google Scholar]
  45. Briki, F.; Genest, D. Canonical analysis of correlated atomic motions in DNA from molecular dynamics simulation. Biophys. Chem. 1994, 52, 35–43. [Google Scholar] [PubMed]
  46. McCammon, J.A. Protein dynamics. Rep. Prog. Phys. 1984, 47, 1. [Google Scholar]
  47. da Penha Valente, R.P.; de Souza, R.C.; de Medeiros Muniz, G.; Ferreira, J.E.V.; de Miranda, R.M.; Lima, A.H.L.E.; da Silva Gonçalves Vianez, J.L., Jr. Using Accelerated Molecular Dynamics Simulation to elucidate the effects of the T198F mutation on the molecular flexibility of the West Nile virus envelope protein. Sci. Rep. 2020, 10, 9625. [Google Scholar]
  48. Di Paola, L.; Mei, G.; Di Venere, A.; Giuliani, A. Disclosing Allostery Through Protein Contact Networks. Methods Mol. Biol. 2021, 2253, 7–20. [Google Scholar]
  49. Levy, E.D.; Erba, E.B.; Robinson, C.V.; Teichmann, S.A. Assembly reflects evolution of protein complexes. Nature 2008, 453, 1262–1265. [Google Scholar]
  50. Perica, T.; Chothia, C.; Teichmann, S.A. Evolution of oligomeric state through geometric coupling of protein interfaces. Proc. Natl. Acad. Sci. USA 2012, 109, 8127–8132. [Google Scholar]
  51. Changeux, J.-P. Allostery and the Monod-Wyman-Changeux model after 50 years. Annu. Rev. Biophys. 2012, 41, 103–133. [Google Scholar] [CrossRef] [PubMed]
  52. Kim, T.H.; Mehrabi, P.; Ren, Z.; Sljoka, A.; Ing, C.; Bezginov, A.; Ye, L.; Pomès, R.; Prosser, R.S.; Pai, E.F. The role of dimer asymmetry and protomer dynamics in enzyme catalysis. Science 2017, 355, eaag2355. [Google Scholar] [CrossRef]
  53. Mehrabi, P.; Di Pietrantonio, C.; Kim, T.H.; Sljoka, A.; Taverner, K.; Ing, C.; Kruglyak, N.; Pomès, R.; Pai, E.F.; Prosser, R.S. Substrate-Based allosteric Regulation of a homodimeric enzyme. J. Am. Chem. Soc. 2019, 141, 11540–11556. [Google Scholar] [CrossRef]
  54. Komatsu, T.S.; Okimoto, N.; Koyama, Y.M.; Hirano, Y.; Morimoto, G.; Ohno, Y.; Taiji, M. Drug binding dynamics of the dimeric SARS-CoV-2 main protease, determined by molecular dynamics simulation. Sci. Rep. 2020, 10, 16986. [Google Scholar] [CrossRef] [PubMed]
  55. Fischer, C.; Feys, J.R. SARS-CoV-2 Mpro Inhibitors: Achieved Diversity, Developing Resistance and Future Strategies. Future Pharmacol. 2023, 3, 80–107. [Google Scholar] [CrossRef]
  56. Scheffer, M.; Bascompte, J.; Brock, W.A.; Brovkin, V.; Carpenter, S.R.; Dakos, V.; Held, H.; van Nes, E.H.; Rietkerk, M.; Sugihara, G. Early-warning signals for critical transitions. Nature 2009, 461, 53–59. [Google Scholar] [CrossRef]
  57. Frauenfelder, H.; Wolynes, P.G. Biomolecules: Where the physics of complexity and simplicity meet. Phys. Today 1994, 47, 58–64. [Google Scholar] [CrossRef]
Figure 1. Pictorial representation of the domains of MPro: (A) domain distribution along the sequence of the monomer; (B) domains in the ribbon structure of the single chain; (C) domain spatial distribution in the apo dimer (in blue and red the two subunits); in yellow, the catalytic site residues (HIS 41 and CYS 145, pdb file 6y2e); (D) domain spatial distribution in the holo dimer with the inhibitors reported in green (pdb file 7brp).
Figure 1. Pictorial representation of the domains of MPro: (A) domain distribution along the sequence of the monomer; (B) domains in the ribbon structure of the single chain; (C) domain spatial distribution in the apo dimer (in blue and red the two subunits); in yellow, the catalytic site residues (HIS 41 and CYS 145, pdb file 6y2e); (D) domain spatial distribution in the holo dimer with the inhibitors reported in green (pdb file 7brp).
Molecules 30 01412 g001
Figure 2. Identification of regions in the DCC maps (for the apo form) to better highlight the concerted motions between and within domains and chains. A1 and B1 refer to the region including domain I + II in both chains, whereas A2 and B2 comprise domain III in the chains. A and B comprise single chains. Region AB includes interactions between domain III of chain A and chain B.
Figure 2. Identification of regions in the DCC maps (for the apo form) to better highlight the concerted motions between and within domains and chains. A1 and B1 refer to the region including domain I + II in both chains, whereas A2 and B2 comprise domain III in the chains. A and B comprise single chains. Region AB includes interactions between domain III of chain A and chain B.
Molecules 30 01412 g002
Figure 3. RMSD for the three simulations for the apo (A) and the holo (B) forms. In the insets, the trends of the protein’s radius of gyration as a function of time are shown for all simulations. The bold lines represent the result of curve fits obtained using a smoothing factor of 20% through the center of the data (Kaleidagraph Sinergy software 5.0).
Figure 3. RMSD for the three simulations for the apo (A) and the holo (B) forms. In the insets, the trends of the protein’s radius of gyration as a function of time are shown for all simulations. The bold lines represent the result of curve fits obtained using a smoothing factor of 20% through the center of the data (Kaleidagraph Sinergy software 5.0).
Molecules 30 01412 g003
Figure 5. DCC variation in molecular dynamics of the holo form of MPro. (A) Whole molecule. (B) Chain A. (C) Chain B. (D) Interaction zone AB.
Figure 5. DCC variation in molecular dynamics of the holo form of MPro. (A) Whole molecule. (B) Chain A. (C) Chain B. (D) Interaction zone AB.
Molecules 30 01412 g005
Figure 6. DCC variation in molecular dynamics of the holo form of MPro. (A) Region A1. (B) Region A2. (C) Region B1. (D) Region B2.
Figure 6. DCC variation in molecular dynamics of the holo form of MPro. (A) Region A1. (B) Region A2. (C) Region B1. (D) Region B2.
Molecules 30 01412 g006
Figure 7. Average DCC values in the different simulation phases for the holo (filled bars) and apo (dotted bars) forms. The blue and red color are associated with chain A and chain B, respectively. The green bars correspond to the <DCC> values evaluated in region AB (see Figure 2). The bars correspond to the standard deviations of the values.
Figure 7. Average DCC values in the different simulation phases for the holo (filled bars) and apo (dotted bars) forms. The blue and red color are associated with chain A and chain B, respectively. The green bars correspond to the <DCC> values evaluated in region AB (see Figure 2). The bars correspond to the standard deviations of the values.
Molecules 30 01412 g007
Figure 8. Average <DCC> values evaluated for regions A1 (blue), A2 (cyan), B1 (red), B2 (orange), in the case of the holo (A) and apo (B) protein.
Figure 8. Average <DCC> values evaluated for regions A1 (blue), A2 (cyan), B1 (red), B2 (orange), in the case of the holo (A) and apo (B) protein.
Molecules 30 01412 g008
Figure 9. (A): btw values at 30 ns are reported for residues of holo chain A (1–300) and B (301–600); in (B), the difference in bwt values at 150 ns (green) and 240 ns (red) compared to those at t = 30 ns is reported for each residue. The rectangles in gray highlight the regions with the highest values of Δbtw.
Figure 9. (A): btw values at 30 ns are reported for residues of holo chain A (1–300) and B (301–600); in (B), the difference in bwt values at 150 ns (green) and 240 ns (red) compared to those at t = 30 ns is reported for each residue. The rectangles in gray highlight the regions with the highest values of Δbtw.
Molecules 30 01412 g009
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Minicozzi, V.; Giuliani, A.; Mei, G.; Domenichelli, L.; Parise, M.; Di Venere, A.; Di Paola, L. The Dynamical Asymmetry in SARS-CoV2 Protease Reveals the Exchange Between Catalytic Activity and Stability in Homodimers. Molecules 2025, 30, 1412. https://doi.org/10.3390/molecules30071412

AMA Style

Minicozzi V, Giuliani A, Mei G, Domenichelli L, Parise M, Di Venere A, Di Paola L. The Dynamical Asymmetry in SARS-CoV2 Protease Reveals the Exchange Between Catalytic Activity and Stability in Homodimers. Molecules. 2025; 30(7):1412. https://doi.org/10.3390/molecules30071412

Chicago/Turabian Style

Minicozzi, Velia, Alessandro Giuliani, Giampiero Mei, Leonardo Domenichelli, Mauro Parise, Almerinda Di Venere, and Luisa Di Paola. 2025. "The Dynamical Asymmetry in SARS-CoV2 Protease Reveals the Exchange Between Catalytic Activity and Stability in Homodimers" Molecules 30, no. 7: 1412. https://doi.org/10.3390/molecules30071412

APA Style

Minicozzi, V., Giuliani, A., Mei, G., Domenichelli, L., Parise, M., Di Venere, A., & Di Paola, L. (2025). The Dynamical Asymmetry in SARS-CoV2 Protease Reveals the Exchange Between Catalytic Activity and Stability in Homodimers. Molecules, 30(7), 1412. https://doi.org/10.3390/molecules30071412

Article Metrics

Back to TopTop