Molecular Recognition and Self-Organization in Life Phenomena Studied by a Statistical Mechanics of Molecular Liquids, the RISM/3D-RISM Theory

Masatake Sugita; Itaru Onishi; Masayuki Irisa; Norio Yoshida; Fumio Hirata

doi:10.3390/molecules26020271

,

and

¹

Department of Computer Science, School of Computing, Tokyo Institute of Technology, W8-76, 2-12-1, Ookayama Meguro-ku, Tokyo 152-8550, Japan

²

Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, Iizuka, Fukuoka 820-8502, Japan

³

Department of Chemistry, Kyushu University, Fukuoka, Fukuoka 812-8581, Japan

⁴

Theoretical and Computational Molecular Science, Institute for Molecular Science, Okazaki, Aichi 444-8585, Japan

Molecules2021, 26(2), 271;https://doi.org/10.3390/molecules26020271

This article belongs to the Special Issue Molecular Recognition and Self-Assembly in Chemistry and Medicine

Version Notes

Order Reprints

Review Reports

Abstract

There are two molecular processes that are essential for living bodies to maintain their life: the molecular recognition, and the self-organization or self-assembly. Binding of a substrate by an enzyme is an example of the molecular recognition, while the protein folding is a good example of the self-organization process. The two processes are further governed by the other two physicochemical processes: solvation and the structural fluctuation. In the present article, the studies concerning the two molecular processes carried out by Hirata and his coworkers, based on the statistical mechanics of molecular liquids or the RISM/3D-RISM theory, are reviewed.

Keywords:

molecular recognition; self-organization; RISM/3D-RISM theory; water; solvation; fluctuation; selective ion binding; enzymatic reaction; drug screening; protein

1. Introduction

There are two molecular processes that are essential for living bodies to maintain their life. Those are the molecular recognition and self-organization processes [1,2].

The molecular recognition is the process in which a biomolecule such as protein binds a ligand, or a small molecule, at its active site with non-covalent bonds. A typical example of such processes is seen in an enzymatic reaction. An enzymatic reaction proceeds essentially in three steps: (1) binding substrate (reactant) molecules at the active site, (2) experiencing a chemical reaction involving the recombination of atoms, (3) releasing product molecules from the enzyme to be ready for the next catalytic cycle. The first and third steps above are nothing but the molecular recognition and reverse processes, respectively, while the second process is an ordinary chemical reaction driven by electronic structure changes in reactants. A characteristic of an enzymatic reaction, which distinguishes it from ordinary chemical reactions in solution, is the formation of an enzyme–substrate complex referred to as the Michaelis–Menten complex [3]. The formation of the Michaelis–Menten complex is essentially a thermodynamic process governed by the free energy change between the bound and unbound states of a pair of protein and ligand molecules. Inclusion of the equilibrium constant of the complex formation in the rate constant of the chemical reaction features an enzymatic reaction. Although we have selected just one process, or an enzymatic reaction, as an example of the molecular recognition processes, all other functions that are performed by protein, such as transportation of water molecules and ions through cell membranes, are related in one way or the other to the molecular recognition process [4,5,6,7,8]. As such, binding of drug molecules to a target molecule is nothing but a molecular recognition process [9,10,11,12,13,14,15,16].

The molecular recognition process in a biological system is distinguished from that in gas phase in one important point, which concerns “water” or “solvation” [2,6,7,8,9,10,11,12,13,14,15,16,17]. In the usual situation, one or a few water molecules and a few small ions are bound or recognized at the active site of an enzyme or protein before a substrate molecule is bound. Substrate molecules are most likely solvated by water, as well. So, in order for protein to bind substrate molecules, some of the water molecules should be removed or “desolvated” from the active site. The free energy change associated with the desolvation process, referred to as “desolvation free energy,” constitutes a very important part of the free energy change associated with the formation of a Michaelis–Menten complex.

The self-organization is another important characteristic of biomolecules. Two prototypical examples of the process are (1) the formation of the cell membrane, and (2) the protein folding [1,2,17,18,19,20]. The process prepares a construction of molecules in which biomolecules function as an enzyme, a signal transducer, an inhibitor, and so on. Cell membranes are a sort of lipid bilayer constructed from several ingredients, including phospholipid as a major component [17]. The phospholipid has an amphiphilic characteristic, in which the hydrophilic head-groups consist of phosphate groups, while the hydrophobic tail-groups are alkyl chains. Those molecules align to make a membrane in such a way as head-to-head and tail-to-tail are close together. At a glance, such a configuration among lipid molecules may not be stable thermodynamically as a membrane due to the two physical causes: the electrostatic repulsion among head groups and the reduced entropy originated from the ordering of the molecules. Then, why do the lipid molecules self-assemble themselves? The short answer to the question is “water” or “solvation.” Another example of the self-organization is the protein folding. Protein changes its conformation from native state to unfolded state when the thermodynamic condition is changed. However, it recovers its unique native conformation reversibly when the thermodynamic state is returned back into the native condition. The phenomenon called “protein folding” was first found by C. Anfinsen [18,19,20]. The process is also counterintuitive, considering that the molecule should climb up the huge entropy barrier to reach a unique native conformation. Of course, there are interactions among the atoms in protein, either attractive or repulsive, and they may contribute to overcome the entropy barrier. However, it can be readily proved by performing the molecular dynamics (MD) simulation of a protein in a vacuum that it is not the case. If one performs the MD simulation of a protein in a vacuum starting from the structure picked up from the protein data bank (PDB), it will quickly collapse into an unidentified non-native conformation within few picoseconds. But then, why does a protein fold into a native conformation when the thermodynamic condition is brought into the native state? The quick answer to the question is again “water” or “solvation” [2,21]. The solvation free energy, including the electrostatic as well as the hydrophobic interactions involving water molecules, plays crucial roles for protein to fold into its native state.

There is another physicochemical process that is concerned with both the molecular recognition and self-organization, which is the structural fluctuation [22,23,24,25]. Under normal condition in living cells, the structure of an enzyme is in spatial as well as temporal fluctuation around its native conformation due to thermal motion. A substrate molecule may not be bound at the active site of many of conformations in a fluctuated state, since it may not be comfortable geometrically as well as energetically in terms of the free energy including that of solvation. Two popular models to describe the molecular recognition process taking the structural fluctuation into account are “induced fitting” and “conformation selection” [26,27]. “Induced fitting” sees the binding process as a temporal process in which a host molecule opens its gate or mouth consisting of amino acid residues for accommodating a guest molecule, induced by a perturbation due to the guest molecule. The conformational selection, on the other hand, interprets the binding process in terms of an ensemble of the host molecules, including those having the conformation ready for binding the guest molecule. The binding process is a stochastic process depending on the probability of a guest molecule to find the host molecule in the conformation that is favorite for binding. The structural fluctuation also plays a crucial role in self-organization processes in enzymatic reactions [28]. An enzyme as a catalyst should restore its original structure to complete its reaction, and be ready for the next reaction cycle [29]. The process is nothing but a self-organization process. Without this process, the protein would stay in a structure that does not function as an enzyme, and it loses its activity as a catalyst.

In the present paper, the theoretical as well as computational studies concerning the molecular recognition and self-organization processes in life phenomena, carried out by authors’ group, are reviewed. The theory requires a treatment of water in molecular detail as was implied in the previous paragraphs. For that purpose, we have employed the statistical mechanics of molecular liquids, or the XRISM and 3D-RISM theories, developed based on the RISM theory originated by Chandler and Andersen [2,21,30,31,32,33,34,35].

2. Brief Review of the 3D-RISM/RISM Theory

Let us begin the section with asking the following questions to the readers. “What is the structure of liquid?” “How can the structure of liquid be characterized?” These questions are non-trivial, because unlike molecules and crystal, the liquid state does not form a structure of definite shape. One can readily define the structure of a molecule by giving the bond lengths, bond angles, and dihedral angles even for the most complex molecule like protein. The crystalline structure of solid can be also defined unambiguously by giving the lattice constants. However, molecules in liquids are in continuous diffusive motion, and thereby the definite geometry among the molecules cannot be defined. In such a case, we can only use the statistical or probabilistic language [2].

The probabilistic language to characterize the structure of liquids is the distribution functions, which are nothing but the moments of the density field,

ν (r) = \sum_{i} δ (r - r_{i})

, with respect to the Boltzmann weight. If there is no field applied to the system, the first moment or the average density is just constant everywhere in the system, namely,

ρ (r) \equiv ⟨ ν (r) ⟩ = ρ = N / V

where V and N are the volume of the container and the number of molecules in the system, respectively, and

⟨ \cdot \cdot \cdot ⟩

indicates the thermal average. So, the average density does not convey any information with respect to the liquid. However, if you look at the second moment,

ρ (r, r^{'}) = ⟨ ν (r) ν (r^{'}) ⟩

, this quantity carries the structural information of liquids. The quantity is referred to as the density pair distribution function, which has essentially the same physical meaning as the radial distribution function (RDF) obtained from X-ray diffraction measurement. The density pair distribution function

ρ (r, r^{'})

is proportional to the probability density of finding two molecules at the two positions r and r’ at the same time, and it becomes just a product of the average density when the distance of the two position becomes so large that there is no “correlation” between the density of the two positions as in Equation (1).

\lim_{| r - r^{'} | \to \infty} ρ (r, r^{'}) \to ρ (r) ρ (r^{'}) (= ρ^{2} in uniform liquids)

(1)

The quantity

g (r, r^{'}) = ρ (r, r^{'}) / ρ^{2}

represents a “correlation” of the density at the two positions r and r’. So, it is referred to as the pair correlation functions (PCFs), or the radial distribution functions when the liquid density is uniform and the translational invariance is implied. We further define a function called the “total correlation function” by

h (r, r^{'}) = g (r, r^{'}) - 1

, which represents the correlation of the density “fluctuations” at the two positions r and r’ (Equation (2)),

h (r, r^{'}) = ⟨ δ ν (r) δ ν (r^{'}) ⟩ / ρ^{2}

(2)

where

δ ν (r) (= ν (r) - ρ)

denotes the density fluctuation. The main task of the liquid state theory is to find an equation which governs the function

g (r, r^{'})

or

h (r, r^{'})

based on the statistical mechanics, and to solve the equation.

As is briefly described in the Introduction, an “exact” equation referred to as the Ornstein–Zernike equation, which relates

h (r, r^{'})

with another correlation function called the direct correlation function

c (r, r^{'})

, can be “derived” from the grand canonical partition function by means of the functional derivatives. Our theory to describe the molecular recognition starts from the Ornstein–Zernike equation generalized to a solution of polyatomic molecules, or the molecular Ornstein–Zernike (MOZ) equation (Equation (3)) [36]:

h (1, 2) = c (1, 2) + \int c (1, 3) ρ h (3, 2) d (3)

(3)

where h(1,2) and c(1,2) are the total and direct correlation functions, respectively, and the numbers in the parenthesis represent the coordinates of molecules in the liquid system, including both the position R and the orientation Ω. The boldface letters of the correlation functions indicate that they are matrices consisting of the elements labeled by the species in the solution. In the simple case of a binary mixture, the equation can be written down labeling the solute by “u” and solvent by “v” as in Equations (4) and (5). (It is straightforward to generalize the equations to the multi-component mixtures.)

h_{v v} (1, 2) = c_{v v} (1, 2) + \int c_{v v} (1, 3) ρ_{v} h_{v v} (3, 2) d (3) + \int c_{v u} (1, 3) ρ_{u} h_{u v} (3, 2) d (3)

(4)

h_{u v} (1, 2) = c_{u v} (1, 2) + \int c_{u v} (1, 3) ρ_{v} h_{v v} (3, 2) d (3) + \int c_{u u} (1, 3) ρ_{u} h_{u v} (3, 2) d (3)

(5)

h_{u u} (1, 2) = c_{u u} (1, 2) + \int c_{u v} (1, 3) ρ_{v} h_{v u} (3, 2) d (3) + \int c_{u u} (1, 3) ρ_{u} h_{u u} (3, 2) d (3)

(6)

By taking the limit of infinite dilution (

ρ_{u} \to 0

), one gets Equations (7) and (8),

h_{v v} (1, 2) = c_{v v} (1, 2) + \int c_{v v} (1, 3) ρ_{v} h_{v v} (3, 2) d (3)

(7)

h_{u v} (1, 2) = c_{u v} (1, 2) + \int c_{u v} (1, 3) ρ_{v} h_{v v} (3, 2) d (3)

(8)

The equations depend essentially on six coordinates in the Cartesian space, and it includes a sixfold integral. This integral is the one that prevents the theory from applications to polyatomic molecules. It is the interaction site model and the RISM approximation proposed by Chandler and Andersen [35] that enabled one to solve the equations. The idea behind the model is to project the functions onto the one-dimensional space along the distance between the interaction sites, usually placed at the center of atoms, by taking the statistical average over the angular coordinates of molecules with fixing the separation between a pair of interaction site (Equation (9)).

f_{α γ} (r) = \frac{1}{Ω^{2}} \int δ (R_{1} + l_{1}^{α}) δ (R_{2} + l_{2}^{γ} - r) f (1, 2) d (1) d (2)

(9)

where

l_{1}^{α}

is the vector displacement of site α in molecule i from the molecular center R_i. It follows that

R_{1} + l_{1}^{α} = r_{1}^{α}

denotes the position of site α in molecule i. The angular average represented by Equation (9) is called Chandler–Andersen transformation [2]. The angular average of the second terms in Equations (7) and (8) is formidable, but the RISM approximation (Equation (10))

c (1, 2) \approx \sum_{α γ} c_{α γ} (| r_{1}^{α} - r_{2}^{γ} |)

(10)

allows one to perform the angular average to lead the RISM equation (Equation (11))

ρ h ρ = ω \times c \times ω + ω \times c \times ρ h ρ

(11)

where the asterisk denotes the convolution integrals, that is expressed by Equation (12).

f * g = \int f (r_{1}, r_{3}) g (r_{3}, r_{2}) d r_{3}

(12)

Hereafter, solvent density is denoted by

ρ

instead of

ρ_{v}

. The new function

ω

, which appeared in Equation (11) during its derivation, is called the “intramolecular” correlation function, which is defined for a pair of atoms α and γ in a molecule, expressed by Equation (13).

ω_{α γ} (r) = ρ δ_{α γ} δ (r) + (1 - δ_{α γ}) δ (r - l_{α γ})

(13)

in which

δ_{α γ}

and

δ (r)

are the Kronecker and Dirac delta functions, respectively. By virtue of the Dirac delta function, the term

δ (r - l_{α γ})

imposes a distance constraint

l_{α γ}

between the pair of atoms. So, giving the distance constraints to all pairs of atoms in a molecule defines the molecular structure or geometry in terms of trigonometry. This is the way the molecular structure is incorporated into the RISM theory.

The 3D-RISM equation for the solute–solvent system at infinite dilution can be derived from Equation (8) by performing the Chandler–Andersen transformation just for the coordinate of “solvent,” not for that of solute [2,21,33,34]. The equation reads Equation (14) as,

h_{γ} (r) = \sum_{γ^{'}} [ω_{{γ γ}^{'}}^{v v} (| r - r^{'} |) + ρ h_{{γ γ}^{'}}^{v v} (| r - r^{'} |)] c_{γ^{'}} (r^{'}) d r^{'}

(14)

where

h_{γ} (r)

and

c_{γ^{'}}

(r^{'})

are the total and direct correlation functions of site γ and γ’, respectively, of solvent molecules at two positions r and r’ in the Cartesian coordinate, the origin of which is placed at an arbitrary position, usually inside the protein. The functions

ω_{γ^{'} γ}^{v v}

(r) and

h_{γ^{'} γ}^{v v}

(r) are the correlation functions for solvent molecules, which appear in Equation (11). These equations can be applied to the molecular recognition process. If one views the solute molecule as a “source of external force” exerted on solvent molecules, then

ρ g (r) = (ρ g (r) + ρ)

is identified as the density distribution of solvent molecules in the “external force.” This identification called “Percus trick” is the key concept that made the formulation of the molecular recognition process possible by means of statistical mechanics [36].

The equations described above contain two unknown functions, h(r) and c(r). Therefore, they are not closed without another equation that relates the two functions. Several approximations have been proposed for the closure relations: HNC, PY, MSA, and so on [36]. The HNC closure can be obtained from the diagrammatic expansion of the pair correlation functions with respect to the density and discarding a set of diagrams called the “bridge diagrams,” which have multifold integrals. It should be noted that the terms kept in the HNC closure relation still include those up to the infinite orders of the density. Alternatively, the relation has been derived from the linear response of a free energy functional to the density fluctuation created by a molecule fixed in the space within the Percus trick. The HNC closure relation reads Equation (15),

h (r) = \exp (- u (r) / k_{B} T + h (r) - c (r)) - 1

(15)

where

k_{B}

and T are the Boltzmann constant and temperature, respectively, and

u (r)

the interaction potential between a pair of atoms in the system. Equation (15) is the relation that incorporates the physical and chemical characteristics of the system into the theory through

u (r)

. The PY approximation can be obtained from the HNC relation just by linearizing the factor

\exp [h (r) - c (r)]

. The HNC closure has been quite successful for describing the structure and thermodynamics of liquids and solutions, including water. However, the approximation is notorious in the low density regime. The drawback becomes fatal sometimes when one tries to apply the theory to associating liquid mixtures or solutions, especially of dilute concentration, because a solution of “dilute” concentration is equivalent to “low density” liquid for the minor component. In order to get rid of the problem, Kovalenko and Hirata proposed the following approximation, or the KH closure expressed by Equation (16) [33,34]:

g (r) = {\begin{array}{l} \exp (d (r)) & for d (r) \leq 0 \\ 1 + d (r) & for d (r) > 0 \end{array}

(16)

where

d (r) = - u (r) / k_{B} T + h (r) - c (r)

. The approximation turns out to be quite successful even for the mixture of complex liquids.

The procedure of solving the equations consists of two steps. We first solve the RISM equation, Equation (11), for

h_{γ^{'} γ}^{v v}

(r) of a solvent or a mixture of solvents in cases of solutions. Then, we solve the 3D-RISM equation, Equation (14), for

h_{γ} (r)

of a protein-solvent (solution) system, inserting

h_{γ^{'} γ}^{v v}

(r) for the solvent into Equation (14), which was calculated in the first step. Considering the definition g(r) = h(r) + 1, g(r) thus obtained is the three-dimensional distribution of solvent molecules around a protein in terms of the interaction site representation of a solvent or a mixture of solvents in cases of solutions. The so-called solvation free energy can be obtained from the distribution function through Equations (17) and (18) corresponding, respectively, to the two closure relations described above, Equation (15) and Equation (16) [33,37]:

Δ μ_{HNC} = ρ^{v} k_{B} T \sum_{γ} \int d r [\frac{1}{2} h_{γ}^{u v} {(r)}^{2} - c_{γ}^{u v} (r) - \frac{1}{2} h_{γ}^{u v} (r) c_{γ}^{u v} (r)]

(17)

Δ μ_{HNC} = ρ^{v} k_{B} T \sum_{γ} \int d r [\frac{1}{2} h_{γ}^{u v} {(r)}^{2} Θ (- h_{γ}^{u v} (r)) - c_{γ}^{u v} (r) - \frac{1}{2} h_{γ}^{u v} (r) c_{γ}^{u v} (r)]

(18)

where Θ denotes the Heaviside step function. The other thermodynamic quantities concerning solvation can be readily obtained from the standard thermodynamic derivative of the free energy except for the partial molar volume [38,39,40].

The partial molar volume, which is a very important quantity to probe the response of the free energy (or stability) of protein to pressure, including so-called “pressure denaturation,” is not a “canonical” thermodynamic quantity for the (V,T) ensemble, since the volume is an independent thermodynamic variable of the ensemble. The partial molar volume of protein at infinite dilution (Equation (19)) can be calculated from the Kirkwood–Buff equation [Kirkwood–Buff] generalized to the site–site representation of liquid and solutions [38,39,40],

\bar{V} = k_{B} T χ_{T} [1 - ρ \sum_{γ} \int c_{γ} (r) d r]

(19)

where

χ_{T}

is the isothermal compressibility of pure solvent or solution, which is obtained from the site–site correlation functions of solutions. In the following, we show an application of the theory described above in order to demonstrate the robustness of the theory.

The example is the partial molar volume of protein, which can be calculated using Equation (19) from h(r), or equivalently from c(r) calculated by the 3D-RISM equation. The partial molar volume of several proteins in water, which are treated frequently in the literature, is depicted against the molecular weight in Figure 1 [40]. Also plotted in the same figure are the experimental data corresponding to the theoretical results. It can be readily seen that the theory reproduces the experimental results in quantitative manner. The reader may think that the results can be reproduced just by a simple consideration of the geometry of protein, or the exclusion volume of protein. However, it will not be the case. Why? It is because the partial molar volume is a “thermodynamic quantity,” not a “geometrical quantity.” The partial molar volume is a quantity that reflects all the solvent–solvent and solute–solvent interactions as well as all the configurations of water molecules in the system. On the other hand, the geometrical volume accounts just for the simplified (hardcore type) repulsive interaction between solute and solvent. All the other factors, such as the attractive interactions between solute and solvent and the solvent reorganization, are entirely missing. The volume changes due to the solvent reorganization are especially important for the partial molar volume of protein, because it is related to the “cavity” volume in protein. As has been well studied, a protein has many internal cavities in which water molecules can or cannot be bound. It can be explained with a simple “thought experiment” concerning the partial molar volume of protein.

Figure 1. Partial molar volume of proteins in water plotted against molecular weight. (The figure was reprinted from Ref. [40]. Copyright (2005) American Chemical Society.).

The thought experiment is to dissolve a protein into water. If one puts a protein molecule into water, some of the cavities in the protein may accommodate water molecules, but others may not. If the cavity is not filled by water, then the cavity space will contribute to the partial molar volume of the protein. On the other hand, if the space accommodates water molecules as a result of solvent reorganization, it will cause a negative contribution to the entire volume of solution, and compensate the increase due to the cavity volume. This compensation is significant: if a cavity is filled by one water molecule, it causes the reduction of the volume by 18 cm³/mol. Therefore, if a theory is not able to take account the reorganization of water molecules induced by protein, it will fail to predict the partial molar volume. The quantitative agreement between the experimental and theoretical results shown in the figure demonstrate that the theory is capable of accounting for all the solute–solvent and solvent–solvent interactions as well as solvent reorganization induced by protein.

In the following sections, it is demonstrated how the RISM/3D-RISM theory is capable of describing the molecular recognition and reorganization processes.

3. Molecular Recognition in Life Phenomena

3.1. Recognition of Water Molecules by Protein

It is a well-documented fact that water is essential for living systems to maintain their life [2,41,42]. In order to clarify the role of water in living systems in molecular detail, many scientists in the field of X-ray and neutron diffraction measurement have been trying to determine the position and orientation of water molecules around and inside biomolecules, or protein and DNA [43,44,45]. Nevertheless, the task is not so easy even for the modern experimental technologies to determine the position of water molecules due essentially to the limited resolution of those experiments. It is because water molecules at the surface of protein are not always bound firmly to some specific sites of the biomolecules, but exchange the positions quite frequently. In fact, this flexibility and fluctuation of water molecules are essential for living systems to maintain their life. The diffraction measurement can only locate some water molecules that have significant residence time at some specific positions of the biomolecules. It was Imai et al. who broke through this difficult problem by means of the molecular theory of solvation, or the RISM/3D-RISM theory, which was described briefly in the preceding section [46].

Imai et al. carried out the RISM/3D-RISM calculation for a hen egg-white lysozyme immersed in water and obtained the 3D-distribution function of oxygen and hydrogen of water molecules around and inside the protein. The native 3D structure of the protein was taken from the Protein Data Bank (PDB). The protein was known to have a cavity composed of the residues from Y53 to I58 and from A82 to S91, in which four water molecules were determined by means of the X-ray diffraction measurement [47]. In the calculation, those water molecules are not included explicitly.

In Figure 2 depicted by green surfaces or spots are

g (r)

of oxygen atoms of water molecules using an isosurface representation, which is very similar to the electron density map obtained from the X-ray crystallography. They have drawn g(r), which is greater than a threshold value. The right, center, and left figures correspond, respectively, to g(r) > 2.0, g(r) > 4.0, and g(r) > 8.0. Since g(r) is unity in the bulk, the left figure indicates that the probability of finding those water molecules at the surface is more than twice as large, compared to the bulk water. As such, the water molecules depicted in the right figure have eight times higher probability to be found than those in the bulk. The water molecules are those bound firmly to some specific atom of the protein due to, say, the hydrogen bonds, and they are quite rare, as one can see from the figure. In this sense, the threshold values play the role of the “temperature” in the X-ray diffraction measurement: if you lower the temperature, you can observe more water molecules, which have weaker interaction with protein. The results suggest that the X-ray and neutron diffraction communities have acquired a powerful theoretical tool to analyze their data to locate the position and orientations of water molecules, since the theory also provides the distribution of hydrogen atoms of water molecules.

Figure 2. 3D-distribution g(r) of water around a protein (lysozyme). (The figure is reprinted from Ref. [46]. Copyright (2005) American Chemical Society.)

The results depicted in Figure 2 are what Imai et al. had expected before they actually carried out the calculation, although the results were entirely new by themselves in the history of statistical mechanics. Entirely unexpected was that they observed some peaks of water distribution in a cavity “inside” the protein, which are surrounded by the residues from Y53 to I58 and from A82 to S91. The results are shown in Figure 3. The left picture in Figure 3 shows the isosurfaces of g(r) > 8 for water oxygen (green) and hydrogen (pink) in the cavity. In the figure, only the surrounding residues are displayed, except for A82 and L83, which are located in the front side. There are four distinct peaks of water oxygen and seven distinct peaks of water hydrogen in the cavity. The spots colored by green and pink indicate water oxygen and hydrogen, respectively. From the isosurface plots, Imai et al. reconstructed the most probable model of the hydration structure. It is shown in the center of Figure 3, where the four water molecules are numbered in the order from the left. Water 1 is hydrogen bonding to the main-chain oxygen of Y53 and the main-chain nitrogen of L56. Water 2 forms hydrogen bonds to the main-chain nitrogen of I56 and the main-chain oxygen of L83, which is not drawn in the figure. Water 3 and 4 also form hydrogen bonds with protein sites, the former to the main-chain oxygen of S85 and the latter to the main-chain oxygens of A82 (not displayed) and of D87. There is also a hydrogen bond network among Water 2, 3, and 4. The peak of the hydrogen between Water 3 and 4 does not appear in the figure because it is slightly less than 8, which means the hydrogen bond is weaker or looser than the other hydrogen-bonding interactions. Although the hydroxyl group of S91 is located at the center of the four water molecules, it makes only weak interactions with them.

Figure 3. The 3D-distribution g(r) of water inside the active site of protein: green surface, oxygen; purple surface, hydrogen. (The figure is reprinted from Ref. [46]. Copyright (2005) American Chemical Society.)

It is interesting to compare the hydration structure obtained by the RISM/3D-RISM theory with crystallographic water sites of X-ray structure [47]. The crystallographic water molecules in the cavity are depicted on the right of Figure 3 showing four water sites in the cavity, much as the RISM/3D-RISM theory has probed. Moreover, the water distributions obtained from the theory and experiment are very similar to each other. Thus, it is concluded that the RISM/3D-RISM theory can predict the water-binding sites with great success. This was the first occasion in the history of theoretical physics to probe a little molecule recognized by a cavity of a biomolecule.

When the results were published in the JACS Communications, the authors of the paper suggested that the method could be extended to the molecular recognition of other small ligands, including drug compounds, just by considering those molecules as a component of aqueous solution. The suggestion was criticized in an article in the Royal Society of Chemistry, saying “it is too much to extrapolate from the analysis of just one cavity in one protein and claim that the method is robust and widely applicable” [48]. The following few topics reviewed in rest of this section are answers to the criticism.

3.2. Noble Gas Recognized by Protein

Molecular recognition by protein, or ligand binding, is one of the most fundamental functions of protein in the biological process, as was emphasized in the Introduction section of this article, taking the substrate binding at active sites of an enzyme as an example. Another important example of ligand binding by protein is the binding of a drug compound to inhibit a protein activity. In either case, a ligand to be bound is a chemical compound that in general has a complicated molecular structure, consisting of many atoms with or without (partial) charges. In order for a ligand to be bound by a receptor, the geometrical shape and charge distribution of the ligand should be matched with those of the active site (or cavity) of the receptor. It is the idea behind the bioinformatic-based drug screening, or the key-and-lock concept. There is no question about that. However, it is not the entire story. The active sites or cavities of protein are not empty before a ligand comes into them. They are usually filled with some water molecules, an example of which was discussed in the preceding section. In order for a ligand to be accommodated in the cavity, one or a few water molecules should be disposed from the cavity to make a space. That process is a thermodynamic process requiring consumption of free energy, referred to as “desolvation free energy” [49,50,51,52,53].

The methodology described in the previous section can be applied to the process with a slight modification, and provides a powerful theoretical tool to realize the ligand binding by protein. The modification to be made is just to change the solvent from pure water to an aqueous solution containing ligand molecules. Presented in this section are the results for noble gases [54], which are the simplest models of non-polar ligands.

Shown in Figure 4 are the 3D distribution functions of xenon and water (oxygen and hydrogen) around lysozyme, calculated by the RISM/3D-RISM theory for lysozyme in a water–xenon mixture at the concentration of 0.001 M. The molecular surface of the protein is drawn with blue color. The regions of g(r) > 8 are painted with different colors for different species: yellow, xenon; red, water oxygen; white, water hydrogen. Of course, the surface painted with blue color is covered by water molecules weakly bound to the protein, which are not shown. A number of well-defined peaks, yellow and red spots, are found for xenon and water oxygen at the surface of the protein, which are separated from each other. The result demonstrates the great capability of the RISM/3D-RISM theory to predict the “preferential binding” of ligands. The distributions of ligand and water are simultaneously found in this result, which means that the peak of either the ligand or water is found at each site depending on the ratio of their affinities to the site. Actually, Figure 4 indicates that there are water- and xenon-preferred sites on the protein surface. Similar results were obtained for the other gases and the other concentrations.

Figure 4. The 3D-distribution (g(r)) of Xe and water molecules in lysozyme: red surface, water oxygen; white surface, water hydrogen; yellow surface, xenon (a). The X-ray results are painted in orange (b). (The figure is reprinted from Ref. [54]. Copyright (2007) American Chemical Society).

It is of interest to compare the distribution of xenon obtained by the RISM/3D-RISM theory with the xenon sites in the X-ray structure [55], even though their conditions are not the same: the former is an aqueous solution under atmospheric pressure, while the latter is crystal under xenon gas pressure of 12 bar. There are two binding sites of xenon in lysozyme: one corresponds to the binding pocket of native ligands, which is referred to as the substrate binding site, and the other is located in a cavity inside the protein, which is referred to as the internal site. The theoretical results of the 3D distribution of xenon are compared with the X-ray xenon site at the substrate binding site in Figure 4a. The location of a high and sharp peak found by the theory is in perfect agreement with the X-ray xenon site. Shown in Figure 4b are the results at the internal site. The xenon peak found there is actually a minor one. Nevertheless, the location is again consistent with the X-ray site. It is of interest to note that the peaks of water are shifted off from the xenon binding site.

Shown in Figure 5 is the size dependence of the coordination number of noble gases at the two binding sites, which was calculated at the concentration of 0.001 M. At the substrate binding site, the coordination number becomes exponentially larger as the size of gas increases (Figure 5a). At the internal site, the coordination number becomes larger with increasing the gas size up to s ~3.4 Å, while it decreases in the region where

σ > 3.4

Å (Figure 5b). As a result, argon has the largest binding affinity to the internal site. These results demonstrate that the RISM/3D-RISM theory has the ability to describe ligand size selectivity in the binding, or molecular recognition. Although there are no corresponding experimental data, the present results serve as a representative test case.

Figure 5. The size dependence of noble gases bound at (a) the substrate binding site, and (b) the internal site. (The figure is reprinted from Ref. [54]. Copyright (2007) American Chemical Society).

3.3. Selective Ion-Binding by Protein

Selective ion binding by protein plays an essential role in a variety of physiological processes. The binding of calcium ions by some protein initiates the process to induce the muscle contraction and enzymatic reactions [56,57]. The initial process of the information transmission through an ion channel is an ion binding at the pore of the transporter molecule [58]. The ion binding plays an important role, sometimes even in the folding process of a protein by inducing the secondary structure [59]. Such processes feature the highly selective ion binding by proteins. Therefore, it is of great interest for the life sciences to elucidate the origin of the ion selectivity in molecular detail. For that purpose, a RISM/3D-RISM calculation was carried out for aqueous solutions of three different electrolytes, CaCl₂, NaCl, and KCl, and for four different mutants of human lysozyme: wild-type, Q86D, A92D, Q86D/A92D, which have been studied experimentally by Kuroki and Yutani [60,61,62].

Shown in Figure 6 are the 3D-distributions of water molecules and the cations inside and around the cleft under concern, which consists of amino acid residues from Q86 to A92. The area where the distribution function, g(r), is greater than five is painted with a different color for each species: oxygen of water, red; Na⁺ ion, yellow; Ca²⁺ ion, orange; K⁺ ion, purple. For the wild type of protein in the aqueous solutions of all the electrolytes studied, CaCl₂, NaCl and KCl, there are no significant distributions (g(r) > 5) observed for the ions inside the cleft, as is seen in the figure in the upper left. The Q86D mutant shows essentially the same behavior as that of the wild type, but with the water distribution being changed slightly (upper center figure). A trace of yellow spot is seen, which suggests a slight possibility of an Na⁺ ion bound in the middle of the binding site, but it will not be large enough to make any significant contribution to the distribution. In place, a rather large distribution that corresponds to water oxygen is observed, as is indicated with the red color in the figure. The distribution covers faithfully the region where the crystallographic water molecules were detected, as shown with the spheres colored gray. There is a small difference between the theory and the experiment. That is the crystallographic water bound to the backbone of Asp-91. The theory does not reproduce the water molecule, the reason for which is not known. Except for the small difference, the theoretical observation is in accord with the experimental finding, especially in the respect that the protein with the wild-type sequence does not bind either Na⁺ or Ca²⁺.

Figure 6. The 3D-distribution of ions in lysozyme. Upper left, wild type; upper middle, Q86D mutant; upper right, A92D mutant. lower left, Q86D /A92D mutant. The theoretical result for the Q86D /A92D mutant is compared with the result of the X-ray crystallography in the lower right two panels: The binding sites are closed up in the insets of the two panels. (The figure is reprinted from Refs. [60,61]. Copyright (2006) and (2007) American Chemical Society.)

On the other hand, the A92D mutant in the NaCl solution exhibits distinct distributions of an Na⁺ ion bound in the recognition site, which is consistent with the finding by the experiment (upper right figure). The Na⁺ ion is bound primarily to the carbonyl oxygen atoms of Asp-92 and has the distribution around the atoms. There is a water distribution in the active site, but the form of the distribution is altered entirely from that in the wild type. The change in the distribution indicates that the Na⁺ ion bound in the active site is not naked, but is hydrated by water molecules. The mutant shows no indication of a K⁺ ion bound to the site. (The results are not shown.) It suggests that the A92D mutant is able to discriminate an Na⁺ ion from a K⁺ ion. The finding demonstrates the capability of the RISM/3D-RISM theory to realize the ion selectivity by protein.

Shown in the lower panels are the 3D-distributions of Ca²⁺ ions and of water oxygen in the ion binding site of the holo-Q86D/A92D mutant. The mutant is well regarded experimentally as a calcium binding protein. The mutant, in fact, exhibits a strong calcium binding activity as can be seen in the figure. The calcium ion is bound by the carboxyl groups of the three Asp residues and is distributed around the oxygen atoms. The 3D-distribution of water at the center of the triangle made by the three carbonyl oxygen atoms is reduced dramatically, indicating that the Ca²⁺ ion is coordinated by the oxygen atoms directly, not with water molecules between them. The Ca²⁺ ion, however, is not entirely naked, because the persistent water distribution is seen at least at two positions where original water molecules were located in the wild type of the protein.

The results obtained in the study gave a great confidence to the authors to apply the method to actual enzymatic reactions in which ions as cofactors play important roles. The following topics concern the role of ions in an enzymatic reaction.

3.4. Molecular Recognition in an Enzymatic Reaction: Role of Mg²⁺ Ions in DNA Hydrolysis Reaction by Restriction Enzyme EcoRV

It is known that small ions such as magnesium ions play an important role as “cofactors” in enzymatic reactions [63]. Moreover, their effect is extremely sensitive to the type of ion. For example, in the case of Escort, one of the type-II restriction enzymes that decompose DNA by hydrolysis reaction, the reaction does not proceed just by replacing Mg²⁺ ion with Ca²⁺ ion [64]. This suggests that the effect of the cofactor cannot be explained simply by the general physics called the “Coulomb interaction,” but is extremely specific to ionic species, the atomic detail of binding position (or distribution) in the active site, and its fluctuation. Moreover, the distribution of ions and their fluctuations are closely correlated (conjugated) with the structural fluctuations of proteins, and the binding position (or distribution) must naturally change as the reaction progresses. So far, research on hydrolysis reactions with restriction enzymes has been actively conducted to determine the positions of small ions at various stages of the reaction through techniques such as X-ray crystallography, but consensus has not yet been reached [65,66,67,68]. The main reason for this is that the structural fluctuations of the protein–DNA complex and the ion distribution are closely coupled to each other. In other words, the ion position changes significantly as the reaction proceeds.

In order to clarify the position of Mg²⁺ ions and their role in the hydrolysis reaction due to the restriction enzyme (EcoRV), Onishi et al. carried out a RISM/3D-RISM calculation and a molecular dynamics (MD) simulation for the precursor state of the reaction [69].

Figure 7 shows the distribution of ions in the EcoRV–DNA complex determined using the 3D-RISM/KH theory. In the figure, the distribution of ions is shown by the orange network structure. It is interesting to compare this result with the experimental result based on X-ray crystallography (Figure 7b) [68]. First, the ion positions, I*, II*, and III* determined by the experiment, correspond roughly to the peaks, I^†, II^†, III^† of the distribution estimated by the RISM/3D-RISM calculation. On the other hand, the theoretical result due to RISM/3D-RISM shows another peak (IV^†) of the distribution that was not detected by X-ray crystallography. The height of this peak is about a half that of the other peaks, indicating that the position is not the most comfortable position in the equilibrium condition. It may be the reason why the ion position was not detected experimentally by the X-ray analysis. Nevertheless, it is quite unlikely that the ion at the position with such high probability is doing nothing in the reaction.

Figure 7. The 3D-distribution (g(r)) of ions in the EcoRV–DNA complex: (a) 3D-RISM/KH calculation, (b) Experimental results (Ref. [68]). (The figure is reprinted from Ref. [69]. Copyright (2018) American Chemical Society.).

Therefore, in order to clarify the role of the ion at the position (IV^†), an MD simulation was performed with the structure in which one Mg²⁺ ion was placed at this position as the initial structure. The result is shown in Figure 8. The initial structure of the MD simulation is shown in Figure 8a. (The initial structure also shows the arrangement of water molecules obtained by the RISM/3D-RISM calculation.) Figure 8b shows the structure after one nanosecond.

Figure 8. MD simulation: (a) initial structure, (b) structure after 1 nanosecond (equilibrium structure). (The figure is reprinted from Ref. [69]. Copyright (2018) American Chemical Society.)

As a result of this MD simulation, the following changes took place in the structure of the EcoRV–DNA complex and in the arrangement of the solvent (water and ions).

The phosphate group (scissile phosphate group) involved in DNA cleavage was twisted, and moved to the position suitable for nucleophilic attack.
The Mg²⁺ ion at the position IV^† moved to position “B” in the figure, and a water molecule serving as the substrate was placed at the proper position in the reaction.
The position and orientation of one water molecule changed, and it moved to the position where it could act as a nucleophile.

In the study, the equilibrium structure obtained by the MD simulation was referred to as the M structure. Interestingly, the M structure is very similar to the structure of BamHI-DNA obtained by X-ray crystallography in both the structure of the protein–DNA complex and the ion-binding position (Figure 9) [70]. In particular, the position of EcoRV in the M structure almost overlaps with the position of BamHI (Figure 9b).

Figure 9. Comparison of (a) the MD initial structure and (b) the equilibrium structure (M structure) of EcoRV–DNA complex with BamHI-DNA structure (DNA, ball and stick; protein, stick; EcoRV, green; BamHI, blue; position of Mg²⁺ ion in MD simulation of EcoRV-DNA, green sphere (each label of ions in the initial structure has the same meaning as that in Figure 7); position of Ca²⁺ ion in BamHI-DNA, yellow sphere). [The figure is reprinted from Ref. [69]].

BamHI is a protein similar to EcoRV, but Mg²⁺ is replaced by Ca²⁺ and has no catalytic activity for hydrolysis reactions. For this reason, BamHI has been assigned to the structure immediately before the hydrolysis reaction, and the results of this study are in harmony with this experimental hypothesis [70].

3.5. Molecular Recognition in Drug Screening

The in silico drug discovery is the most interesting field where the molecular recognition may play a crucial role.

The prediction of the ligand binding sites and affinities is the starting point for the drug discovery [71,72]. Therefore, a large number of computational as well as experimental approaches have been devoted to solve the problem [73,74,75,76]. The computational methodologies are classified into two categories or stages. One is the prediction of ligand binding sites in a target protein. The binding sites are found, in the most common cases, based on a purely geometric analysis of the protein structure, in which cavities or clefts in the protein are detected and regarded as the potential binding sites [73]. The binding sites can also be predicted by the bioinformatics-based methodologies such as the multiple alignment of the amino acid sequences for a protein family [77]. The other category is the docking of a ligand molecule at the binding sites that are already known or predicted in advance. Possible docking structures are then evaluated based on a force field or a scoring function [74,75,76].

Although such docking programs are increasingly popular in the fields of bioscience and pharmacology [78], the theoretical methodologies based on the physical chemistry are not fully developed. One of the least developed methodologies is how to account for the effect of water in the binding affinity or free energy. Water plays multiple roles in the binding affinity. For examples, bulk water exerts the reaction field, acting on a pair of ligand and receptor molecules. This effect includes the electrostatic screening and the hydrophobic interaction between protein and ligand molecules. Individual water molecules can act as integral molecular components of the complex [79]. In fact, water molecules are often found at the binding interface of protein–ligand complexes mediating with the hydrogen bonds or simply filling void spaces. Water around and inside a protein molecule regulates the structural fluctuation of the biomolecule, which of course has a significant effect on the binding affinity. In spite of such significance of water molecules, the effect of water has conventionally been treated at the level of continuum solvent models [74,75,76]. In so many words, it is obvious that such models will not account for the desolvation free energy properly. On the other hand, it is quite reasonable to expect that the RISM/3D-RISM theory may play a great role in this step of drug discovery. In fact, the theory has been applied to in silico drug discovery in a variety of ways [12,80,81,82]. As an example of those applications of the RISM/3D-RISM theory to drug screening, the study carried out by Hasegawa et al. is briefly reviewed, which takes “Pim1 kinase” as a target protein and “triazolopyridazine” as well as its derivatives as inhibitors [16].

The Pim1 kinase belongs to the Pim (Proviral Integration-site MulV) family along with Pim1, 2, and it is a Serine/Threonine Kinase. The 3D structure of a Pim1 kinase (PDBID: 3BGQ) with an inhibitor is shown in Figure 10. The enzyme is expressed widely in our body due to its phosphorylation activity of substrates concerning a variety of bio-functions such as cell cycle, apoptosis, and differentiation [12,83,84,85]. In particular, the enzyme is expressed excessively in malignant tumors such as leukemia, lymphoma, and prostatic carcinoma [84,85,86,87,88]. On the other hand, a mouse, the Pim1 kinase of which is knocked out, did not show any indication as a phenotype [89,90,91,92,93]. Therefore, it is considered that such a tumor may be treated by an inhibitor of the enzyme, with minor side effects. Although there have been some reports about the compounds that are tightly bound to the Pim1 kinase, they have not been commercialized yet as an actual cancer drug [89,90].

Figure 10. (a) A 3D structure of the Pim1 kinase and triazolo pyridazine inhibitor termed as VX2 in the Protein Data Bank (PDB). The PDB code is 3BGQ. We refer to the VX2 as a ligand 1, in this study, for the sake of simplicity of the terminology. (b) Chemical structure of the VX2. Circled part of the ligand is termed as triazolo[4,3-b]pyridazine scaffold. This is a common part of all the ligands that are applied in this study. (The figure is reprinted from Ref. [16]. Copyright (2017) American Chemical Society.)

The experimental values of the inhibition constant (

K_{i}

), which is a measure of the binding affinity, observed by Grey et al. are listed in Table 1. [94]

Table 1. The list of ligands and the inhibition constant, K_i.

Δ G_{b i n d, \exp}

is the experimental value of the binding free energy estimated from -RTlnK_i at the temperature T = 300 K.

Δ Δ G_{b i n d, \exp}

is the binding free energy relative to that of Ligand No. 1.

There are two points that should be remembered when one tries to perform the rational drug screening based on the RISM/3D-RISM theory. Firstly, the criteria for screening drug candidates should be the binding affinity, which is defined by

K_{A} = \exp [\frac{- Δ G_{b i n d}}{k_{B} T}]

(20)

where

K_{A}

is the equilibrium constant of the reaction,

[r e c e p t e r] + [l i g a n d] ⇄ [c o m p l e x]

(21)

and

Δ G_{b i n d}

is the binding free energy defined by

Δ G_{b i n d} = G_{c o m} - (G_{r e c} + G_{l i g})

(22)

where

G_{r e c}

,

G_{l i g}

and

G_{c o m}

are the free energies of the receptor (protein), ligand (drug), and their complex, respectively. Unfortunately, the RISM/3D-RISM theory cannot be applied directly to calculate those free energies. However, one may take an alternative route to find

Δ G_{b i n d}

using the thermodynamic cycle depicted in the following schematic picture (Figure 11).

Figure 11. Standard thermodynamic cycle for the protein–ligand binding in aqueous solution. (The figure is reprinted from Ref. [16]. Copyright (2017) American Chemical Society.)

The binding free energy is calculated from the cycle as

Δ G_{b i n d} = Δ G_{s o l u t e} + Δ G_{d e s o l v}

(23)

where

Δ G_{s o l u t e}

is the binding free energy of solutes in vacuum defined by

Δ G_{s o l u t e} = Δ E_{s o l u t e} - T Δ S_{s o l u t e}

(24)

in which

Δ E_{s o l u t e}

and

T Δ S_{s o l u t e}

are the energetic and entropic contributions. Those quantity can be calculated readily by means of the molecular mechanics.

Δ G_{d e s o l v}

is the desolvation free energy defined by

Δ G_{d e s o l v} = Δ μ_{c o m} - (Δ μ_{r e c} + Δ μ_{l i g})

(25)

where

Δ μ_{c o m}

,

Δ μ_{r e c}

, and

Δ μ_{l i g}

are the solvation free energy of the complex, the receptor, and the ligand, respectively. Those can be calculated from the RISM/3D-RISM theory using Equations (17) and (18).

The other point of concern in the RISM/3D-RISM calculation is the structural fluctuation of protein [2]. As will be described in Section 4 in the present paper, the structure of protein is in spatial as well as temporal fluctuation, and many of the fluctuated states of protein may not bind the drug molecule to be examined. On the other hand, in the ordinary RISM/3D-RISM calculation, the structure of protein is assumed to be rigid, meaning that all the bond lengths, bond angles, and dihedral angles are fixed, that is, no structural fluctuation is allowed. Therefore, it is very likely that the structure (atomic coordinates) of protein picked up from the Protein Data Bank (PDB) may not be accurate enough for the ligand to be bound. In order to take the structural fluctuation into consideration, Hasegawa et al. carried out the MD simulation of the protein in water (details of the calculation are omitted here) [16].

The binding free energies of the 16 compounds to the target protein, calculated based on the MM/3D-RISM/KH method, are shown in Table 2, and plotted against the corresponding experimental values [94] in Figure 12. A rather high correlation (~0.69) is observed between the theoretical and experimental results. This result demonstrates that the MM/3D-RISM/KH method is applicable to the problem of compound screening and lead optimization, where relative affinity among the compounds has significance. However, the theoretical results show some systematic deviation from the experimental values toward the positive side. There are several conceivable causes for the systematic deviation: insufficient sampling of the conformational space of the protein, insufficient accuracy of the RISM/3D-RISM theory for estimating the solvation free energy, inadequate solution conditions, and so on. In contrast to the method that directly estimates the binding free energy, the method based on a thermodynamic cycle requires several theoretical methodologies for estimating each component of the binding free energy; inter-atomic interactions of solute, solvation free energy, conformational entropy, and external entropy. Since each method for estimating the component of the binding free energy has its own approximation, each method may systematically under- or overestimate the thermodynamic quantity. These systematic errors seem to give rise to the unphysical positive value of the binding free energy.

Table 2. Binding free energy and its components obtained from MM/RISM/3D-RISM method:

Δ E_{s o l u t e}

, the change of the interaction energy upon binding;

- T Δ S_{s o l u t e}

, the contribution from the entropy change;

Δ G_{s o l v}

, the desolvation free energy;

Δ G_{b i n d_c a l c}

, the theoretical estimate of the binding free energy;

Δ G_{b i n d_\exp}

, the experimental value of the binding free energy estimated from

K_{i}

[94].

Figure 12. Correlation between calculated and experimental values of binding free energy for all the ligands. Correlation coefficient R = 0.69. The figures are colored in different manners based on the structural feature of the ligands. (a) Ligands that include CF₃ on the meta position of the phenyl ring are colored with red. Other ligands are colored with blue. (b) Ligands that have cyclo-hexane, cyclo-butane, and cyclo-propane are colored with purple, orange, and green, respec-tively. (The figure is reprinted from Ref. [16]. Copyright (2017) American Chemical Society).

Among those components of the binding free energies, the conformational entropy deserves special attention because the scientists in the community of molecular dynamics simulation are experiencing a hard time to find converged results for the quantity, even after sampling a sub-micro-second length of trajectory. The situation reminds us of Levinthal’s estimate of the conformational degrees of freedom, which amounts to ~10⁵⁰ for a small protein having ~100 amino acids [95]. It may be impossible to find the converged results for the conformational entropy by the standard method of the simulation. An analytical approach based on the RISM/3D-RISM method combined with the generalized Langevin theory described in the following section may have an advantage in that respect.

4. Structural Fluctuation and Reorganization of Protein in Water

4.1. The Theory of Structural Fluctuation of Protein

The structural fluctuation of protein coupled with the density fluctuation of solvent (water or ion) plays an essential role in its functional expression. By combining the 3D-RISM theory and the generalized Langevin theory [96], Kim and Hirata proposed a new theory that describes the structural fluctuations of protein coupled with the density fluctuations of solvent, with the purpose of constructing a computational scientific methodology [2,23].

This theory is described by the following Langevin-type equation.

M_{α} \frac{d^{2} Δ R_{α} (t)}{d t^{2}} = - k_{B} T \sum_{β} {(L^{- 1})}_{α β} \cdot Δ R_{β} (t) - \int_{0}^{t} Γ_{α β} (t - s) \cdot \frac{P_{β} (s)}{M_{β}} + W_{α} (t)

(26)

In Equation (26),

Δ R_{α}

represents the fluctuation (displacement from the equilibrium structure) of the

α - t h

atom in the protein and is defined by

Δ R_{α} (t) = R_{α} (t) - ⟨ R_{α} ⟩

, where

R_{α} (t)

and

⟨ R_{α} ⟩

denote the coordinate of atom

α

and its ensemble average. In this equation, the first term on the right side is the restoring force (Hook type) proportional to the displacement, the second term is the memory term (friction term), and the third term is the random force originated from thermal motion. Included in the first term is the variance–covariance matrix, whose elements are defined by the following equation.

L_{α β} = ⟨ Δ R_{α} Δ R_{β} ⟩

(27)

If one ignores the second and third terms on the right side of Equation (26), it has the form of a so-called harmonic oscillator. Therefore, Equation (26) expresses a physical picture in which the motion of this harmonic oscillator is driven by a random force resulting from thermal motion and attenuated by a frictional force proportional to the velocity. If the protein is placed in a vacuum,

k_{B} T {(L^{- 1})}_{α β}

in the first term of Equation (26) is a so-called force constant (Hessian), which is expressed by the second derivative of the potential energy with respect to the atomic coordinates, or displacement. However, since the actual protein is in aqueous solution, it is not just the mechanical potential energy of the interatomic interaction. It must be the second derivative of the potential of mean force (or free energy) including the solvation free energy. That is,

k_{B} T {(L^{- 1})}_{α β} = \frac{\partial^{2} F ({R})}{\partial Δ R_{α} Δ R_{β}}

(28)

F ({R}) = U ({R}) + Δ μ ({R})

(29)

In Equations (28) and (29), {R} represents the structure (atomic coordinate) of the protein, and

F ({R})

,

U ({R})

and

Δ μ ({R})

are the potential of mean force (or free energy), potential energy, and solvation free energy, respectively. That is, the structural fluctuation of protein described by Equation (26) must be the fluctuation in the free energy surface including the solvent. The free energy surface has a quadratic form, as can be seen by performing the second integral on the displacement of the coordinates in Equation (28). However, the free energy surface is a quadric surface in a multidimensional space spanned by the coordinates of the number of atoms (~10⁴) in the protein.

F ({R}) = \frac{1}{2} k_{B} T \sum_{α, β} Δ R_{α} \cdot {(L^{- 1})}_{α β} \cdot Δ R_{β}

(30)

This also means that the structural fluctuations in the free energy surface defined by Equation (30) have a Gaussian distribution. That is,

w (Δ R_{1}, Δ R_{2}, \dots, Δ R_{N}) = \sqrt{\frac{‖ A ‖}{{(2 π)}^{3 N}}} \exp [- \frac{1}{2} \sum_{α} \sum_{β} A_{α β} Δ R_{α} Δ R_{β}]

(31)

where

A_{α β}

is an element of the matrix expressed by the following equation, and

‖ A ‖

is its determinant.

A_{α β} = \frac{\partial^{2} F ({R})}{\partial Δ R_{α} \partial Δ R_{β}}

(32)

The physics implied by Equation (31) for the distribution of protein structural fluctuations is consistent with the experimental results of small-angle X-rays and neutron scattering [97]. If Equation (31) is transformed into Fourier space, the left side corresponds to the intensity (structural factor) of the scattering experiment, while the right side becomes a Gaussian function concerning the wave vector. Therefore, when the logarithm of both sides is taken and plotted against the square of wave vector, a straight line having a negative slope is obtained. Such a plot is called a Guinier plot in the field of small-angle X-ray scattering experiments, and Equation (31) perfectly matches such an experiment [2,97].

The expression of Equation (28) regarding Hessian, which defines the structural fluctuation of protein, gives a great perspective for describing the structural response to thermodynamic perturbation and its dynamics. This is because the solvation–free-energy surface contained in the expression can be obtained by the RISM/3D-RISM theory developed by Hirata and his coworkers [2]. The linear response theory that describes the structural response of proteins to thermodynamic perturbations such as “temperature,” “pressure,” “denaturing agents,” and “amino acid substitutions” is derived as follows. First, the change in free energy due to thermodynamic perturbation is represented by the following formula [23]:

F ({R}) = \frac{1}{2} k_{B} T \sum_{α, β} Δ R_{α} \cdot {(L^{- 1})}_{α β} \cdot Δ R_{β} - \sum_{α} Δ R_{α} \cdot f_{α}

(33)

The first term on the right side of Equation (33) is the free energy of the non-perturbed system expressed by Equation (30). On the other hand, the second term is the change in free energy due to perturbation, which is expressed by perturbation (

f_{α}

) and structural displacement (

Δ R_{α}

) due to the perturbation. Applying the variational principle (

\partial F / \partial Δ R_{α} = 0

) to this equation gives the following equation that expresses the structural response of the protein to the thermodynamic perturbation [23,24,98]:

⟨ Δ R_{α} ⟩ = {(k_{B} T_{0})}^{- 1} \sum_{β} {⟨ Δ R_{α} Δ R_{β} ⟩}_{0} \cdot f_{β}

(34)

According to Equation (34), the average structure (atomic coordinates) changes,

⟨ Δ R_{α} ⟩

, responding to the thermodynamic perturbation, and its response function is the variance–covariance matrix

{⟨ Δ R_{α} Δ R_{β} ⟩}_{0}

of the non-perturbed system. Equation (34) can be further extended to the “non-linear” region by applying the idea of “analytical continuation.” To do this, we first divide the perturbation (

f_{β}

) in Equation (34) into several steps so that the individual perturbation stays within the linear regime. Letting the j-th perturbation be

f_{β}^{j}

, the response to the perturbation is expressed by the following equation [2,24]:

{⟨ Δ R_{α} ⟩}_{j + 1} = {(k_{B} T_{j})}^{- 1} \sum_{β} {⟨ Δ R_{α} Δ R_{β} ⟩}_{j} \cdot f_{β}^{j}

(35)

The response function (

{⟨ Δ R_{α} Δ R_{β} ⟩}_{j}

) contained in Equations (34) and (35) is related to the free energy of the protein by Equation (28), and the free energy is, in turn, given by the RISM/3D-RISM theory as a function of the structure (atomic coordinate) of the protein. Therefore, it is possible to calculate the structural response of the protein to thermodynamic perturbations [2,24].

4.2. Incoherent Elastic Neutron Scattering

The structural fluctuations of the protein are directly reflected in the mean square displacement,

M = \sum_{α} ⟨ Δ R_{α}^{2} ⟩

, obtained from the incoherent scattering of neutrons. When this mean square displacement is plotted against temperature, it increases linearly with temperature, but its slope changes rapidly around 230 K. Various physical interpretations such as “glass transition” [99], “harmonic to unharmonic transition” [100], and “alpha to beta transition” [101] have been given to this behavior, but no conclusion has been reached yet.

Based on the theory summarized in the preceding section, Hirata gave a new physical interpretation of the mean square displacement of proteins and its temperature dependence [2,25].

The structure factor obtained from the neutron incoherent (elastic) scattering experiment is defined by the following equation.

S^{E I S F} (Q, ω = 0) \equiv \int w^{i n c} ({Δ R}) \exp (- i Q \cdot Δ R) d Δ R

(36)

The experiment is carried out by irradiating powdered material with thermal neutrons. Therefore, the material can be considered to satisfy the spatial isotropic condition as in the case of the solution system. In Equation (36),

w^{i n c} ({Δ R})

is essentially a probability distribution function of structural fluctuations defined in Equation (31), but only the hydrogen atom (H) has a large scattering cross section with respect to neutrons. Further, given that it is incoherent scattering, we obtain:

w^{i n c} ({Δ R}) = {(2 π L_{α α})}^{- 3 n / 2} \exp [- \frac{1}{2} \sum_{α}^{n} \frac{Δ R_{α}^{2}}{L_{α α}}]

(37)

Here,

α

signifies the hydrogen atom,

L_{α α}

means the diagonal term of the variance/covariance matrix, and is given by the following equation.

L_{α α} = ⟨ Δ R_{α}^{2} ⟩

(38)

By substituting Equation (37) into Equation (36) and considering the “isotropicity” of fluctuations, the integration of Equation (36) can be performed analytically, and the following formula for the incoherent scattering factor of neutrons is obtained.

S^{E I S F} (Q, ω = 0) = \prod_{α}^{n} \exp (- \frac{L_{α α}}{2} Q^{2})

(39)

or taking the logarithm,

\log S^{E I S F} (Q, ω = 0) = - \frac{1}{2} (\sum_{α}^{n} L_{α α}) Q^{2} = - \frac{1}{2} M Q^{2}

(40)

Equation (40) means that the incoherent neutron scattering factor decreases linearly with the square of the wave vector (

Q^{2}

), and predicts the experimental results. Furthermore, since it is intuitively clear that the “structural fluctuation” reflected in the slope (mean square displacement, M) increases with temperature, the behavior shown in Figure 13 below is expected. The behavior of this figure is also in qualitative agreement with the result of the incoherent neutron scattering experiment [100].

Figure 13. Wave number and temperature dependence of neutron structure factor. (a) Theoretical prediction: T₁ < T₂ < T₃ < T₄. (b) Experimental data Ref. [100]. ((a) is reprinted from Ref. [25]. Copyright (2018) Elsevier. (b) is reprinted from Ref. [100]. Copyright (2008) Elsevier.).

Now, considering the Equations (27)–(31) and (34) described above, the following equation regarding the temperature dependence of the mean square displacement is obtained [2,25].

M = k_{B} T \sum_{α}^{n} (\frac{1}{K_{α α}^{E} + K_{α α}^{W}})

(41)

In (41),

K_{α α}^{E}

and

K_{α α}^{W}

mean the “force constant” related to the restoring force of the structural fluctuation and are defined by the following equations concerning the interatomic interactions in the protein and the hydration free energy, respectively.

K_{α α}^{E} = \frac{\partial^{2} U ({R})}{\partial Δ R_{α} \partial Δ R_{α}} K_{α α}^{W} = \frac{\partial^{2} Δ μ ({R})}{\partial Δ R_{α} \partial Δ R_{α}}

(42)

It can be readily imagined that the contribution from hydration,

K_{α α}^{W}

, to the force constant acts to weaken the interatomic interaction in the protein. For example, hydrogen bonds between amino acid residues or skeletal atoms in a protein are loosened or replaced by hydrogen bonds with water molecules. Then, it can be concluded that the signs of

K_{α α}^{E}

and

K_{α α}^{W}

must be opposite. Considering this conclusion and Equation (37), it can be concluded that the temperature dependence of the mean square displacement (M) of a protein in solvent is in general larger than that in a vacuum. Furthermore, it is expected that this change will occur rather abruptly at a certain temperature. This is because when the temperature of the protein solution is lowered, the fluctuation of the protein and water itself freezes, and the fluctuation is governed by the interaction energy inside the protein

U ({R})

in Equation (29)). It is hypothesized that the freezing temperature of this protein structure (and water) is around 230 K. In other words, it can be considered that protein fluctuations are dominated by “energy elasticity” (

K_{α α}^{E}

) at low temperatures, while at

T < 230 K

by “solvent induced elasticity” (

K_{α α}^{W}

). This is merely a hypothesis so far, but the hypothesis is depicted conceptually in Figure 14 with corresponding experimental results by Kataoka and Nakagawa [101].

Figure 14. (a) Temperature dependence of the mean square displacement deduced from Equation (41). (b) Experimental results obtained by Nakagawa and Kataoka [102]. ((a) is reprinted from Ref. [25]. Copyright (2018) Elsevier. (b) is reprinted from Ref. [102]. Copyright (2010) Physical Society of Japan.).

5. Protein Folding as an Example of Self-Organization Process

The theory of the “structural fluctuation” of protein, sketched in the previous sections, can be applied to describe the self-organization process that is one of the two elementary processes essential in life phenomena. This section is devoted to explaining an application of the theory to the protein folding as an example of the self-organization processes [103].

Under certain thermodynamic conditions (solvent environment), a protein reversibly refolds (folds) from a random coil state to its native structure (or tertiary structure) unique to its amino acid sequence (or primary structure). The process is not a spontaneous process if one focuses just on the protein structure itself, say in a vacuum, since it is a process that requires climbing up the large entropy barrier associated with ordering the structure from the disordered state of the denatured state to the native state. It is the solvent that plays a crucial role to make the process reversible. Let us quote the entire statement written by C. Anfinsen in order to explain his finding concerning the protein folding, which is called Anfinsens’s thermodynamic hypothesis [18,19]:

This hypothesis states that three-dimensional structure of a native protein in its normal physiological milieu (solvent, pH, ionic strength, presence of other components such as metal ions or prosthetic groups, temperature, and other) is the one in which the Gibbs free energy of the whole system is lowest; that is, the native conformation is determined by the totality of interatomic interactions and hence by the amino acid sequence, in a given environment [18].

Since the protein can perform its unique function only in its native conformation, the “folding” mechanism is one of the most important problems in the biophysics field as a fundamental process of life phenomenon [18,19,20,103]. Hirata and his coworkers applied the theory of “protein structure fluctuation” that has been described so far to the problem and proposed a new picture and methodology for the folding mechanism [103].

The new methodology and picture of protein folding derived from the theory described in the previous sections are as follows.

(A) Under certain thermodynamic conditions, proteins have a distribution around their equilibrium structure. The distribution is a Gaussian distribution described by Equation (31), and the variance/covariance matrix that characterizes the distribution is expressed by Equation (27).

(B) Structural changes of proteins associated with changes in thermodynamic conditions lead to changes in the mean value (equilibrium structure) and the half width (variance–covariance matrix) of this Gaussian distribution. The linear or non-linear response theory presented in Equation (34) or (35) can be applied to describe the structural change due to a thermodynamic perturbation.

The picture of protein folding deduced from Equation (35) is drawn conceptually in Figure 15.

Figure 15. Schematic diagram showing the structural transition of proteins. (The figure is reprinted from Ref. [103]. Copyright (2018) American Institute of Physics.)

Figure 15 includes two popular models of protein folding practiced in the community, the two-state model and the intermediate-state model. If the structural distribution (

{R_{N}}

) is a Gaussian distribution with peaks only at the two structures, the native structure (

{R_{N}}

) and the denatured state (

{R_{N}}

), as the thermodynamic condition (e.g., pressure) changes, then it can be considered as a two-state model. The intermediate state is, so to speak, a transition state, and the probability distribution of that state is so small that it can be neglected. On the other hand, if there is a significant distribution between the native structure and the denatured state, the structural transition is via the intermediate state. In order to know what kind of structural change occurs, it is necessary to solve Equation (35) by giving an atomic interaction model (or Hamiltonian) of an aqueous protein solution. In Equation (35), the perturbation (

f_{j}^{β}

) is an input for changing thermodynamic conditions, while the variance–covariance matrix

{⟨ Δ R Δ R ⟩}_{α β}

can be calculated as the second derivative of the free energy surface with respect to the atomic coordinate of protein, based Equation (28).

6. Summary and Perspective

In the present article, the theoretical as well as computational studies on the molecular recognition and self-organization processes in life phenomena, carried out based on the statistical mechanics theory of solvation, or the RISM/3D-RISM theory, were reviewed. The theory provides the information concerning the position as well as orientation of small molecules around and inside protein in terms of the probability distribution of atoms, in a manner similar to the electron distribution detected by the X-ray crystallography. It was demonstrated with a few examples that the theory is able to probe a ligand molecule, including water, recognized by protein at its active site or a cavity in atomistic detail. Water molecules recognized by protein at its active site or a cavity are of special importance, since those water molecules play multiple roles when protein expresses its function, i.e., as substrates and nucleophiles in enzymatic hydrolysis reactions, controlling the ion mobility in an ion channel, and so on. They also make an important contribution to the binding affinity of a drug compound to a target protein, since the compound may not be accommodated at the active site of protein unless one or a few water molecules in the cavity are disposed from the cavity.

It was also clarified in the article that water plays a crucial role in the structural fluctuation of protein, which in turn makes an essential contribution to the self-organization of the biomolecule, or the protein folding. It can be readily imagined that a protein in a vacuum undergoes a plastic deformation with any finite perturbation, mechanical or thermodynamic. It is water that makes the protein structure elastic, and assists the biomolecule to refold into the native conformation. The new concept of elasticity referred to as solvent-induced elasticity was introduced in the article. The solvent-induced elasticity is the key to understanding why an enzyme restores its original conformation after having completed its role as a catalyst.

Author Contributions

The paper reviews the original works carried out by Hirata and his collaborators, and each section of the paper is contributed by the coauthors. Conceptualization, F.H.; writing-review and editing, F.H.; writing-original draft of Section 2, F.H. and N.Y.; writing-original draft of Section 3, N.Y., I.O., M.I., and M.S.; writing-original draft of Section 4, F.H.; writing-original draft of Section 5, F.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

F.H. was a fellow in the Toyota Physical and Chemical Research Institute from 1 April 2016 to 31 March 2020. The studies related to the structural fluctuation of protein, reviewed in this paper, were carried out mainly during the stay at the institute. He is grateful to the financial support from the institute. Writing this paper received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

Lehn, J.-M. From Molecular Recognition towards Molecular Information Processing and Self-Organization. Angewante Chem. 1990, 29, 1304–1319. [Google Scholar] [CrossRef]
Hirata, F. Exploring Life Phenomena with Statistical Mechanics of Molecular Liquids; CRC Press: Boca Raton, FL, USA, 2020. [Google Scholar]
Michaelis, L.; Menten, M. Die kinetik der invertinwirkung. Biochem. Z 1913, 49, 333–369. [Google Scholar]
Preston, G.M.; Carroll, T.P.; Guggino, W.B.; Agre, P. Appearance of water channels in Xenopus oocytes expressing red cell CHIP28 protein. Science 1992, 256, 385–387. [Google Scholar] [CrossRef]
Doyle, D.A.; Cabral, J.M.; Pfuetzner, R.A.; Kuo, A.; Gulbis, J.M.; Cohen, S.L.; Chait, B.T.; MacKinnon, R. The structure of the potassium channel: Molecular basis of K⁺ conduction and selectivity. Science 1998, 280, 69–77. [Google Scholar] [CrossRef]
Phongphanphanee, S.; Yoshida, N.; Oiki, S.; Hirata, F. The ‘Ambivalent’ Snug-fit Sites in the KcsA Potassium Channel Probed by 3D-RISM Theory. Pure Appl. Chem. 2014, 86, 97–104. [Google Scholar] [CrossRef][Green Version]
Phongphanphanee, S.; Yoshida, N.; Hirata, F. On the proton exclusion of aquaporins: A statistical mechanics study. J. Am. Chem. Soc. 2008, 130, 1540–1541. [Google Scholar] [CrossRef]
Phongphanphanee, S.; Rungrotmongkol, T.; Yoshida, N.; Hannonbua, S.; Hirata, F. Proton transport through the influenza A M2 channel: Three-dimensional reference interaction site model study. J. Am. Chem. Soc. 2010, 132, 9782–9788. [Google Scholar] [CrossRef]
Imai, T.; Oda, K.; Kovalenko, A.; Hirata, F.; Kidera, A. Ligand mapping on protein surfaces by the 3D-RISM theory: Toward Computational Fragment-based Drug Design. J. Am. Chem. Soc. 2009, 131, 12430–12440. [Google Scholar] [CrossRef] [PubMed]
Imai, T.; Isogai, H.; Seto, T.; Kovalenko, A.; Hirata, F. Theoretical Study of Volume Changes Accompanying Xenon-Lysozyme Binding: Implications for the Molecular Mechanism of Pressure Reversal of Anesthesia. J. Phys. Chem. B 2006, 110, 12149–12154. [Google Scholar] [CrossRef] [PubMed]
Kiyota, Y.; Yoshida, N.; Hirata, F. A New Approach for Investigating the Molecular Recognition of Protein: Toward Structural-Based Drug Design Based on the 3D-RISM Theory. J. Chem. Theor. Comp. 2011, 7, 3803–3815. [Google Scholar] [CrossRef] [PubMed]
Phanich, J.; Rungrotmongkol, T.; Sindhikara, D.; Phongphanphanee, S.; Yoshida, N.; Hirata, F.; Kungwan, N.; Spot Hannongbua, S. A 3D-RISM/RISM Study of the Oseltamivir Binding Efficiency with the Wild-type and Resistance-Associated Mutant Forms of the Viral Influenza B Neuraminidase. Protein Sci. 2016, 25, 147–158. [Google Scholar] [CrossRef] [PubMed]
Sugita, M.; Hamano, M.; Kasahara, K.; Kikuchi, T.; Hirata, F. New Protocol for Predicting the Ligand-Biding Site and Mode Based on the 3D-RISM/KH Theory. J. Chem. Theory Comput. 2020, 16, 2864–2876. [Google Scholar] [CrossRef] [PubMed]
Sugita, M.; Hirata, F. Predicting the binding free energy of the inclusion process of 2-hydroxypropyl-b-cyclodextrin and small molecules by means of the MM/3D-RISM method. J. Phys. Condens. Matter 2016, 28, 384002–384013. [Google Scholar] [CrossRef] [PubMed]
Hayashino, Y.; Sugita, M.; Arima, H.; Irie, T.; Kikuchi, T.; Hirata, F. Predicting the Binding Mode of 2-Hydroxypropyl-β-Cyclodextrin (HPβCD) to Cholesterol by Means of the MD Simulation and the 3D-RISM/KH Theory. J. Phys. Chem. B 2018, 122, 5716–5725. [Google Scholar] [CrossRef] [PubMed]
Hasegawa, T.; Sugita, M.; Kikuchi, T.; Hirata, F. A Systematic Analysis of the Binding Affinity bewteen the Pim-1 Kinase and Its Inhibitors Based on the MM/3D-RISM/KH Method. J. Chem. Inf. Modeling 2017, 57, 2789–2798. [Google Scholar] [CrossRef] [PubMed]
Sackmann, E. Physical Basis of Self-organization and function of membranes: Physics of vesicles. In Hand Book of Biological Physics; Lipowsky, R., Sackmann, E., Eds.; Elsevier Science B.V.: Amsterdam, The Netherlands, 1995; Chapter 5. [Google Scholar]
Anfinsen, C.B. Principle that Govern the Folding of Protein Chain. Science 1973, 181, 223–230. [Google Scholar] [CrossRef]
Arai, M.; Kuwajima, K. Role of the Molten Gloubule State in Protein Folding. Adv. Protein Chem. 2000, 53, 209–282. [Google Scholar]
Bryngelson, J.D.; Onuchic, J.N.; Socci, N.D.; Wolynes, P.G. Funnels, Pathways, and the Energy Landscape of Protein Folding: A Synthesis. Proteins 1995, 21, 167–195. [Google Scholar] [CrossRef]
Hirata, F. (Ed.) Molecular Theory of Solvation; Kluwer: Dordrecht, The Netherlands, 2003. [Google Scholar]
Terazima, M.; Kataoka, M.; Ueoka, R.; Okamoto, Y. (Eds.) Molecular Science of Fluctuations toward Biological Functions; Springer: Berlin/Heidelberg, Germany, 2015. [Google Scholar]
Kim, B.; Hirata, F. Structural fluctuation of protein in water around its native state: A new statistical mechanics formulation. J. Chem. Phys. 2012, 138, 054108. [Google Scholar] [CrossRef]
Hirata, F.; Akasaka, K. Structural fluctuation of proteins induced by thermodynamic perturbation. J. Chem. Phys. 2015, 142, 044110. [Google Scholar] [CrossRef]
Hirata, F. On the interpretation of the temperature dependence of the mean sqaure of displacement (MSD) of protein, obtained from the incoherent neutron scattering. J. Mol. Liq. 2018, 270, 218–226. [Google Scholar] [CrossRef]
Jonson, K. Role of Induced Fit in Enzyme Specificity: A Molecular Forward/Reverse Switch. J. Biol. Chem. 2008, 283, 26297–26301. [Google Scholar] [CrossRef] [PubMed]
Hammes, G.G.; Chang, Y.-C.; Oas, T.G. Conformational selection or induced fit: A flux description of reaction mechanism. Proc. Natl. Acad. Sci. USA 2009, 106, 13737–13741. [Google Scholar] [CrossRef] [PubMed]
Henzler-Widman, K.; Kern, D. Dynamic Personality of Protein. Nature 2007, 450, 964–972. [Google Scholar] [CrossRef]
Yoshida, M.; Muneyuki, E.; Hisabori, T. ATP synthase—A marvellous rotray engine of the cell. Nat. Rev. Mol. Cell Biol. 2001, 2, 669–677. [Google Scholar] [CrossRef] [PubMed]
Hirata, F.; Rossky, P.J. An Extended RISM Equation for Molecular Polar Liquids. Chem. Phys. Lett. 1981, 83, 329–334. [Google Scholar] [CrossRef]
Hirata, F.; Pettitt, B.M.; Rossky, P.J. Application of an Extended RISM Equation to Dipolar and Quadrupolar Fluids. J. Chem. Phys. 1982, 77, 509–520. [Google Scholar] [CrossRef]
Beglov, D.; Roux, B. An integral equation to describe the solvation of polar molecules in liquid water. J. Phys. Chem. B 1997, 101, 782–7826. [Google Scholar] [CrossRef]
Kovalenko, A.; Hirata, F. Three-Dimensional Density Profiles of Water in Contact with a Solute of Arbitraly Shape: A RISM Approach. Chem. Phys. Lett. 1998, 290, 237–244. [Google Scholar] [CrossRef]
Kovalenko, A.; Hirata, F. Self-Consistent Description of a Metal-Water Interface by the Kohn-Sham Density Functional Theory and the Three-Dimensional Reference Interaction Site Model. J. Chem. Phys. 1999, 110, 10095–10112. [Google Scholar] [CrossRef]
Chandler, D.; Andersen, H.C. Optimized Cluster Expansions for Classical Fluids. 2. Theory of Molecular Liquids. J. Chem. Phys. 1972, 57, 1930–1931. [Google Scholar] [CrossRef]
Hansen, J.P.; McDonald, I.R. Theory of Simple Liquids; Academic Press: London, UK, 1986. [Google Scholar]
Singer, S.J.; Chandler, D. Free-energy Functions in the Extended RISM Approximation. Mol. Phys. 1985, 55, 621–625. [Google Scholar] [CrossRef]
Imai, T.; Kinoshita, M.; Hirata, F. Theoretical Study for Partial Molar Volume of Amino Acids in Aqueous Solution: Implacation of Ideal Fluctuation Volume. J. Chem. Phys. 2000, 112, 9469–9478. [Google Scholar] [CrossRef]
Imai, T.; Kovalenko, A.; Hirata, F. Solvation Thermodynamics of Protein Studied by the 3D-RISM Theory. Chem. Phys. Lett. 2004, 395, 1–6. [Google Scholar] [CrossRef]
Imai, T.; Kovalenko, A.; Hirata, F. Partial Molar Volume of Proteins Studied by the Three-Dimensional Reference Interaction Site Model Theory. J. Phys. Chem. B 2005, 109, 6658–6665. [Google Scholar] [CrossRef]
Franks, F. Water: Compehensive Treatise; Plenum: New York, NY, USA, 1972. [Google Scholar]
Eisenberg, D.; Kauzman, W. The Structure and Properties of Water; Clarendon: Oxford, UK, 1969. [Google Scholar]
Nakasako, M. Water-Protein interactions from high-resolution protein crystalography. Philos. Trans. Biol. Sci. 2004, 359, 1191–1206. [Google Scholar] [CrossRef]
Arai, S.; Chatake, T.; Ohhara, T.; Kurihara, K.; Tanaka, I.; Suzuki, N.; Fujimoto, Z.; Mizuno, H.; Niimura, N. Complicated water orientation in the minor groove of the B-DNA decamer d(CCATTAATGG) observed by neutron diffraction measurement. Nucleic Acids Res. 2005, 33, 3017–3024. [Google Scholar] [CrossRef][Green Version]
Niimura, N.; Arai, S.; Kurihara, K.; Chatake, T.; Tanaka, I.; Bau, R. Recent results on hydrogen and hydration in biology studied by neutron macromolecular crystallography. Cell. Mol. Life Sci. 2005, 62, 285–300. [Google Scholar] [CrossRef]
Imai, T.; Hiraoka, R.; Kovalenko, A.; Hirata, F. Water molecules in protein cavity detected by a statistical-mechanical theory. J. Am.Chem. Soc. 2005, 127, 15334–15335. [Google Scholar] [CrossRef]
Wilson, K.P.; Malcolm, B.A.; Matthews, B.W. Structural and thermodynamic analysis of compensating mutations within the core of chicken egg white lysozyme. J. Biol. Chem. 1992, 267, 10842. [Google Scholar] [CrossRef]
Evans, J. Probing for Water in Protein Cavities; Royal Society of Chemistry: London, UK, 2005. [Google Scholar]
Lazaridis, T. Inhomogeneous Fluid Approach to Solvation Thermodynamics, 1. Theory. J. Phys. Chem. B 1998, 102, 3531–3541. [Google Scholar] [CrossRef]
Lazaridis, T. Inhomogeneous Fluid Approach to Solvation Thermodynamics, 2. Application to simple Fluids. J. Phys. Chem. B 1998, 102, 3542–3550. [Google Scholar] [CrossRef]
Young, T.T.; Abel, R.R.; Kim, B.B.; Beme, B.J.B.; Friesner, R.A.R. Motifs for Molecular Recognition Exploiting Hydrophobic Enclosure in Protein-Ligad Binding. Proc. Natl. Acad. Sci. USA 2007, 104, 808–813. [Google Scholar] [CrossRef] [PubMed]
Sindhikara, D.J.; Yoshida, N.; Hirata, F. Placevent: An Algorithm for Prediction of Explcit Solvent Atom Distribution–Application to HIV-1 Protease and F-ATP Synthase. J. Comput. Chem. 2012, 33, 1536–1543. [Google Scholar] [CrossRef]
Sindhikara, D.J.; Hirata, F. Analysis of Biomolecular Solvation Site by 3D-RISM Theory. J. Phys. Chem. B 2013, 117, 6718–6723. [Google Scholar] [CrossRef]
Imai, T.; Hiraoka, R.; Seto, T.; Kovalenko, A.; Hirata, F. Three-Dimensional Distribution Function Theory for the Prediction of Protein-Ligand Binding Sites and Affinities: Application to the Binding of Noble Gases to Hen Egg-White Lysozyme in Aqueous Solution. J. Phys. Chem. B 2007, 111, 11585–11591. [Google Scholar] [CrossRef]
Prangé, T.; Schiltz, M.; Pernot, L.; Colloc’h, N.; Longhi, S.; Bourguet, W.; Fourme, R. Exploring hydrophobic sites in proteins wth xenon and krypton. Proteins Struct. Funct. Genet. 1998, 30, 6–73. [Google Scholar] [CrossRef]
Herzberg, O.; James, M.N. Structure of the calcium regulatory muscle protein troponin-C at 2.8-A resolution. Nature 1985, 313, 653–659. [Google Scholar] [CrossRef]
Ikura, M.; Clore, G.M.; Gronenborn, A.M.; Zhu, G.; Klee, C.B.; Bax, A. Solution structure of a calmodulin-target peptide complex by multidimensional NMR. Science 1992, 256, 632–638. [Google Scholar] [CrossRef]
Hille, B. Ionic Channels of Excitable Membranes; Sinauer Associates: Shunderland, MA, USA, 2001. [Google Scholar]
Tsuda, S.; Ogura, K.; Hasegawa, Y.; Yagi, K.; Hikichi, K. H-1-NMR study of rabbit skeltal-mascle troponin-C-NG²⁺-induced conformational change. Biochemistry 1990, 29, 4951–4958. [Google Scholar] [CrossRef]
Yoshida, N.; Phongphanphanee, S.; Maruyama, Y.; Imai, T.; Hirata, F. Selective Ion-Binding by Protein Probed with the 3D-RISM Theory. J. Am. Chem. Soc. 2006, 128, 12042–12043. [Google Scholar] [CrossRef] [PubMed]
Yoshida, N.; Phongphanphanee, S.; Hirata, F. Selective Ion-Binding by Protein Probed with the Statistical Mechanical Integral Equation Theory. J. Phys. Chem. B 2007, 111, 4588–4595. [Google Scholar] [CrossRef]
Kuroki, R.; Yutani, K. Structural and thermodynamic responses of mutations at a Ca²⁺ binding site engineered into human lysozyme. J. Biol. Chem. 1998, 273, 34310–34315. [Google Scholar] [CrossRef]
Cowan, J.A. Structure and Catalytic Chemistry of Magnetisium-Dependent Enzymes. Biometals 2002, 15, 225–235. [Google Scholar] [CrossRef] [PubMed]
Kostrewa, D.; Winkler, F.K. Mg²⁺ Binding to the Active Site of EcoRV Endonuclease Endonuclease: A Crystallographic Study of Complexes with Substrate and Product DNA at 2Å Resolution. Biochemistry 1995, 34, 683–696. [Google Scholar] [CrossRef] [PubMed]
Vipond, I.B.; Baldwin, G.S.; Halford, S.E. Divalent Metal Ions at the Active Sites of the EcoRV and EcoRI Restriction Endonucleases. Biochemistry 1995, 34, 697–704. [Google Scholar] [CrossRef] [PubMed]
Groll, D.H.; Jeltsch, A.; Selent, U.; Pingoud, A. Does the Restriction Endonuclease EcoRV Employ a Two-Metal-Ion Mechanism for DNA Cleavage? Biochemistry 1997, 36, 11389–11401. [Google Scholar] [CrossRef] [PubMed]
Horton, N.C.; Perona, J.J. Making the Most of Metal Ions. Nat. Struct. Biol. 2001, 8, 290–293. [Google Scholar] [CrossRef]
Horton, N.; Perona, J.J. DNA Cleavage by EcoRV Endonuclease: Two Metal Ions in Three Metal Ion Binding Sites. Biochemistry 2004, 8, 6841–6857. [Google Scholar] [CrossRef]
Onishi, I.; Sunaba, S.; Yoshida, N.; Hirata, F. Role of Mg²⁺ Ions in DNA Hydrolysis by EcoRV, Studied by the 3D-Reference Interaction Site Model and Molecular Dynamics. J. Phys. Chem. B 2018, 122, 9061–9075. [Google Scholar] [CrossRef]
Viadiu, H.; Aggarwal, A.K. The role of metals in catalysis by the restriction endonuclease BamHI. Nat. Struc. Biol. 1998, 5, 910–916. [Google Scholar] [CrossRef] [PubMed]
Kitchen, B.D.; Decornez, H.; Furr, J.R.; Bajorath, J. Docking and scoring in virtual screening for drug discovery: Meyhods and applications. Nat. Rev. Drug Discov. 2004, 3, 935–949. [Google Scholar] [CrossRef] [PubMed]
Klebe, G. Virtual ligand screening: Strategies, perspectives and limitations. Drug Discov. Today 2006, 11, 580–594. [Google Scholar] [CrossRef] [PubMed]
Sotriffer, C.; Klebe, G. Identification and mappinf of small-molecule binding sites in proteins: Computational tools for structure-based drug design. Farmaco 2002, 57, 243–251. [Google Scholar] [CrossRef]
Gohlke, H.; Klebe, G. Approaches to the description and prediction of the binding affinity of small- molecule ligands to macromolecular receptors. Angew. Chem. Int. Ed. 2002, 41, 2645–2676. [Google Scholar] [CrossRef]
Halperin, I.; Ma, B.; Wolfson, H.; Nussinov, R. Principles of docking: An overview of search algorithms and a guide to scoring functions. Proteins Struct. Funct. Genet. 2002, 47, 409–443. [Google Scholar] [CrossRef]
Brooijmans, N.; Kuntz, I.D. Molecular recofnition and docking algorithms. Annu. Rev. Biophys. Biomol. Struct. 2003, 32, 335–373. [Google Scholar] [CrossRef]
Lichtarge, O.; Sowa, M.E. Evolutionary predictions of binding surfaces and interactions. Curr. Opin. Struct. Biol. 2002, 12, 21–27. [Google Scholar] [CrossRef]
Sousa, S.F.; Fernandes, P.A.; Ramos, M.J. Protein-ligand docking: Current status and future challenges. Proteins: Struct. Funct. Genet. 2006, 65, 15–26. [Google Scholar] [CrossRef]
Ladbury, J.E. Just add water! The effect of water on the specificity of protein-ligand binding sites and its potential application to drug design. Chem. Biol. 1996, 3, 973–980. [Google Scholar] [CrossRef]
Bachmann, M.; Möröy, T. The Serine/Threonine Kinase Pim-1. Int. J. Biochem. Cell Biol. 2005, 37, 726–730. [Google Scholar] [CrossRef]
Merkel, A.L.; Meggers, E.; Ocker, M. PIM1 Kinase as a Target for Cancer Therapy. Expert Opin. Investig. Drugs 2012, 21, 425–436. [Google Scholar] [CrossRef]
Imai, T.; Miyashita, N.; Sugita, Y.; Kovalenko, A.; Hirata, F.; Kidera, A. Functionality Mapping on Internal Surfaces of Multidrug Transporter AcrB Based on Molecular Theory of Solvation: Implication for Drug Effux Ptahway. J. Phys. Chem.B. 2011, 115, 8288–8295. [Google Scholar] [CrossRef]
Blinov, N.; Huang, W.; Nikolic, D.; Wishart, D. 3D-RISM-Docc: A New Fragment-Based Drug Design Protocol. J. Chem. Theor. Comp. 2012, 8, 3356–3372. [Google Scholar]
Nikolic, D.; Blinov, N.; Nikolic, D. Biomolecular Recognition Based on 3D Molecular Theory of Solvation. Biophys. J. 2014, 106, 411A. [Google Scholar]
Zhukova, Y.N.; Alekseeva, M.G.; Zakharevich, N.V.; Shtil, A.A.; Danilenko, V.N. Pim Family of Protein Kinases: Structure, Functions, and Roles in Hematopoietic Malignancies. Mol. Biol. 2011, 45, 695–703. [Google Scholar] [CrossRef]
Liang, C.; Li, Y.-Y. Use of Regulators and Inhibitors of Pim-1, a Serine/Threonine Kinase, for Tumour Therapy (Review). Mol. Med. Rep. 2014, 9, 1–10. [Google Scholar] [CrossRef]
Tursynbay, Y.; Zhang, J.; Li, Z.; Tokay, T.; Zhumadilov, Z.; Wu, D.; Xie, Y. Pim-1 Kinase as Cancer Drug Target: An Update (Review). Biomed. Rep. 2015, 4, 1–7. [Google Scholar] [CrossRef]
Shah, N.; Pang, B.; Yeoh, K.-G.; Thorn, S.; Chen, C.S.; Lilly, M.B.; Salto-Tellez, M. Potential Roles for the PIM1 Kinase in Human Cancer—A Molecular and Therapeutic Appraisal. Eur. J. Cancer. 2008, 44, 2144–2151. [Google Scholar] [CrossRef]
Valdman, A.; Fang, X.; Pang, S.-T.; Ekman, P.; Egevad, L. Pim-1 Expression in Prostatic Intraepithelial Neoplasia and Human Prostate Cancer. Prostate 2004, 60, 367–371. [Google Scholar] [CrossRef]
Laird, P.W.; van der Lugt, N.M.T.; Clarke, A.; Domen, J.; Linders, K.; McWhir, J.; Berns, A.; Hooper, M. In Vivo Analysis of Pim-1 Deficiency. Neucleic Acids Res. 2003, 21, 4750–4755. [Google Scholar] [CrossRef] [PubMed]
Domen, J.; van der Lugt, N.M.T.; Laird, P.W.; Saris, C.J.M.; Clarke, A.R.; Hooper, M.L.; Berns, A. Impaired Interleukin-3 Response in Pim-1-Deficient Bone Marrow-Derived Mast Cells. Blood 1993, 82, 1445–1452. [Google Scholar] [CrossRef] [PubMed]
Burger, M.T.; Nishiguchi, G.; Han, W.; Lan, J.; Simmons, R.; Atallah, G.; Ding, Y.; Tamez, V.; Zhang, Y.; Mathur, M.; et al. Identification of N-(4-((1 R,3 S,5, S)-3-Amino-5-Methylcyclohexyl)Pyridin-3-Y1)-6-(2,6-Difluorophenyl)-5-Fluoropicolinamide(PIM447), a potent and selective proviral insertion site of moloney murine leukemia (PIM) 1, 2, and 3 kinase inhibitor in clinical trials for hematological malignancies. J. Med. Chem. 2015, 2, 8373–8386. [Google Scholar]
Wan, X.; Zhang, W.; Li, L.; Xie, Y.; Li, W.; Huang, N. A New Target for an Old Drug: Identifying Mitoxantrone as a Nanomolar Inhibitor of PIM1 Kinase via Kinome-Wide Selectivity Modeling. J. Med. Chem. 2013, 56, 2619–2629. [Google Scholar] [CrossRef]
Grey, R.; Pierce, A.C.; Bemis, G.W.; Jacobs, M.D.; Moody, C.S.; Jajoo, R.; Mohal, N.; Green, J. Structure-Based Design of 3-Aryl-6-Amino-Triazolo[4,3-B]Pyridazine Inhibitors of Pim-1 Kinase. Bioorg. Med. Chem. Lett. 2009, 19, 3019–3022. [Google Scholar] [CrossRef]
Levinthal, C. Are there pathways for protein folding? J. Chim. Phys. 1968, 65, 44–45. [Google Scholar] [CrossRef]
Mori, H. Transport, Collective Motion, and Brownian Motions. Prog. Theor. Phys. 1965, 33, 423–455. [Google Scholar] [CrossRef]
Kataoka, M.; Nishi, I.; Fujisawa, T.; Ueki, T.; Tokunaga, F.; Goto, Y. Structural Characterization of the Moten Globule and Native States of Apomyoglobin by Solution X-ray Scattring. J. Mol. Biol. 1995, 249, 215–228. [Google Scholar] [CrossRef]
Ikeguchi, M.; Ueno, J.; Sato, M.; Kidera, A. Protein Structural Change upon ligand binding. Phys. Rev. Lett. 2005, 94, 78102. [Google Scholar] [CrossRef]
Doster, W.; Cusack, S.; Petry, W. Dynamical Transition of Myoglobin Revealed by Ineleastic Neutron Scattering. Nature 1989, 337, 754–756. [Google Scholar] [CrossRef]
Nakagawa, H.; Joti, Y.; Kitao, A.; Kataoka, M. Hydration Affects Both Harmonic and Anharmonic Nature of Protein Dynamics. Biophys. J. 2008, 95, 2916–2923. [Google Scholar] [CrossRef] [PubMed]
Chen, G.; Fenimore, P.W.; Frauenfelder, H.; Mezai, F. Protein fluctuations explored by inelastic neutron scattering and dielectric relaxation spectroscopy. Philos. Mag. 2008, 88, 3877–3883. [Google Scholar] [CrossRef]
Nakagawa, H.; Kataoka, M. Percolation of Hydration Water as a Control of Protein Dynamics. J. Phys. Soc. Jpn. 2010, 79, 083801–083805. [Google Scholar] [CrossRef]
Hirata, F.; Sugita, M.; Yoshida, M.; Akasaka, K. Structural Fluctuation of Protein and Anfinsen’s Thermodynamic Hypothesis. J. Chem. Phys. 2018, 148, 20901–20909. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Partial molar volume of proteins in water plotted against molecular weight. (The figure was reprinted from Ref. [40]. Copyright (2005) American Chemical Society.).

Figure 2. 3D-distribution g(r) of water around a protein (lysozyme). (The figure is reprinted from Ref. [46]. Copyright (2005) American Chemical Society.)

Figure 3. The 3D-distribution g(r) of water inside the active site of protein: green surface, oxygen; purple surface, hydrogen. (The figure is reprinted from Ref. [46]. Copyright (2005) American Chemical Society.)

Figure 4. The 3D-distribution (g(r)) of Xe and water molecules in lysozyme: red surface, water oxygen; white surface, water hydrogen; yellow surface, xenon (a). The X-ray results are painted in orange (b). (The figure is reprinted from Ref. [54]. Copyright (2007) American Chemical Society).

Figure 5. The size dependence of noble gases bound at (a) the substrate binding site, and (b) the internal site. (The figure is reprinted from Ref. [54]. Copyright (2007) American Chemical Society).

Figure 6. The 3D-distribution of ions in lysozyme. Upper left, wild type; upper middle, Q86D mutant; upper right, A92D mutant. lower left, Q86D /A92D mutant. The theoretical result for the Q86D /A92D mutant is compared with the result of the X-ray crystallography in the lower right two panels: The binding sites are closed up in the insets of the two panels. (The figure is reprinted from Refs. [60,61]. Copyright (2006) and (2007) American Chemical Society.)

Figure 7. The 3D-distribution (g(r)) of ions in the EcoRV–DNA complex: (a) 3D-RISM/KH calculation, (b) Experimental results (Ref. [68]). (The figure is reprinted from Ref. [69]. Copyright (2018) American Chemical Society.).

Figure 8. MD simulation: (a) initial structure, (b) structure after 1 nanosecond (equilibrium structure). (The figure is reprinted from Ref. [69]. Copyright (2018) American Chemical Society.)

Figure 9. Comparison of (a) the MD initial structure and (b) the equilibrium structure (M structure) of EcoRV–DNA complex with BamHI-DNA structure (DNA, ball and stick; protein, stick; EcoRV, green; BamHI, blue; position of Mg²⁺ ion in MD simulation of EcoRV-DNA, green sphere (each label of ions in the initial structure has the same meaning as that in Figure 7); position of Ca²⁺ ion in BamHI-DNA, yellow sphere). [The figure is reprinted from Ref. [69]].

Figure 10. (a) A 3D structure of the Pim1 kinase and triazolo pyridazine inhibitor termed as VX2 in the Protein Data Bank (PDB). The PDB code is 3BGQ. We refer to the VX2 as a ligand 1, in this study, for the sake of simplicity of the terminology. (b) Chemical structure of the VX2. Circled part of the ligand is termed as triazolo[4,3-b]pyridazine scaffold. This is a common part of all the ligands that are applied in this study. (The figure is reprinted from Ref. [16]. Copyright (2017) American Chemical Society.)

Figure 11. Standard thermodynamic cycle for the protein–ligand binding in aqueous solution. (The figure is reprinted from Ref. [16]. Copyright (2017) American Chemical Society.)

Figure 12. Correlation between calculated and experimental values of binding free energy for all the ligands. Correlation coefficient R = 0.69. The figures are colored in different manners based on the structural feature of the ligands. (a) Ligands that include CF₃ on the meta position of the phenyl ring are colored with red. Other ligands are colored with blue. (b) Ligands that have cyclo-hexane, cyclo-butane, and cyclo-propane are colored with purple, orange, and green, respec-tively. (The figure is reprinted from Ref. [16]. Copyright (2017) American Chemical Society).

Figure 13. Wave number and temperature dependence of neutron structure factor. (a) Theoretical prediction: T₁ < T₂ < T₃ < T₄. (b) Experimental data Ref. [100]. ((a) is reprinted from Ref. [25]. Copyright (2018) Elsevier. (b) is reprinted from Ref. [100]. Copyright (2008) Elsevier.).

Figure 14. (a) Temperature dependence of the mean square displacement deduced from Equation (41). (b) Experimental results obtained by Nakagawa and Kataoka [102]. ((a) is reprinted from Ref. [25]. Copyright (2018) Elsevier. (b) is reprinted from Ref. [102]. Copyright (2010) Physical Society of Japan.).

Figure 15. Schematic diagram showing the structural transition of proteins. (The figure is reprinted from Ref. [103]. Copyright (2018) American Institute of Physics.)

Table 1. The list of ligands and the inhibition constant, K_i.

Δ G_{b i n d, \exp}

is the experimental value of the binding free energy estimated from -RTlnK_i at the temperature T = 300 K.

Δ Δ G_{b i n d, \exp}

is the binding free energy relative to that of Ligand No. 1.

Table 1. The list of ligands and the inhibition constant, K_i.

Δ G_{b i n d, \exp}

is the experimental value of the binding free energy estimated from -RTlnK_i at the temperature T = 300 K.

Δ Δ G_{b i n d, \exp}

is the binding free energy relative to that of Ligand No. 1.

No.	Ki (nM)	$Δ G_{b i n d, \exp}$ (kcal/mol)	$Δ Δ G_{b i n d, \exp}$ (kcal/mol)	No.	Ki (nM)	$Δ G_{b i n d, \exp}$ (kcal/mol)	$Δ Δ G_{b i n d, \exp}$ (kcal/mol)
1	11	−10.93	0.00	9	18	−10.63	0.30
2	44	−10.10	0.83	10	320	−8.92	2.01
3	94	−9.65	1.28	11	430	−8.74	2.19
4	21	−10.54	0.39	12	210	−9.17	1.76
5	160	−9.33	1.60	13	410	−8.77	2.16
6	49	−10.03	0.89	14	980	−8.25	2.68
7	100	−9.61	1.32	15	50	−10.02	0.91
8	1800	−7.89	3.04	16	54	−9.98	0.95

Table 2. Binding free energy and its components obtained from MM/RISM/3D-RISM method:

Δ E_{s o l u t e}

, the change of the interaction energy upon binding;

- T Δ S_{s o l u t e}

, the contribution from the entropy change;

Δ G_{s o l v}

, the desolvation free energy;

Δ G_{b i n d_c a l c}

, the theoretical estimate of the binding free energy;

Δ G_{b i n d_\exp}

, the experimental value of the binding free energy estimated from

K_{i}

[94].

Table 2. Binding free energy and its components obtained from MM/RISM/3D-RISM method:

Δ E_{s o l u t e}

, the change of the interaction energy upon binding;

- T Δ S_{s o l u t e}

, the contribution from the entropy change;

Δ G_{s o l v}

, the desolvation free energy;

Δ G_{b i n d_c a l c}

, the theoretical estimate of the binding free energy;

Δ G_{b i n d_\exp}

, the experimental value of the binding free energy estimated from

K_{i}

[94].

No.	$Δ E_{s o l u t e}$ (kcal/mol)	$- T Δ S_{s o l u t e}$ (kcal/mol)	$Δ G_{s o l v}$ (kcal/mol)	$Δ G_{b i n d_c a l c}$ (kcal/mol)		$Δ G_{b i n d_\exp}$ (kcal/mol)
No.	$Δ E_{s o l u t e}$ (kcal/mol)	$- T Δ S_{s o l u t e}$ (kcal/mol)	$Δ G_{s o l v}$ (kcal/mol)	Ave.	Std. err.	$Δ G_{b i n d_\exp}$ (kcal/mol)
1	−56.18	8.06	48.02	0.26	1.33	−10.93
2	−51.81	8.05	45.39	1.62	1.94	−10.10
3	−42.22	12.85	34.76	5.38	1.27	−9.65
4	−50.34	7.95	42.56	0.17	2.96	−10.54
5	−57.74	12.83	50.09	5.17	1.69	−9.33
6	−60.98	12.57	52.86	4.45	1.41	−10.03
7	−60.04	15.29	52.37	7.62	1.37	−9.61
8	−44.7	9.96	39.27	4.53	2.13	−7.89
9	−42.49	6.54	36.64	0.69	1.99	−10.63
10	−54.48	11.11	48.08	4.71	2.46	−8.92
11	−43.69	9.79	39.51	5.61	1.27	−8.74
12	−44.39	8.87	39.81	4.29	1.07	−9.17
13	−55.57	9.6	0.61	4.63	1.67	−8.77
14	−52.04	10.11	46.95	5.02	1.16	−8.25
15	−59.06	13.3	49.45	3.7	1.35	−10.02
16	−53.51	9.88	46.16	2.53	1.86	−9.98

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Molecular Recognition and Self-Organization in Life Phenomena Studied by a Statistical Mechanics of Molecular Liquids, the RISM/3D-RISM Theory

Abstract

1. Introduction

2. Brief Review of the 3D-RISM/RISM Theory

3. Molecular Recognition in Life Phenomena

3.1. Recognition of Water Molecules by Protein

3.2. Noble Gas Recognized by Protein

3.3. Selective Ion-Binding by Protein

3.4. Molecular Recognition in an Enzymatic Reaction: Role of Mg²⁺ Ions in DNA Hydrolysis Reaction by Restriction Enzyme EcoRV

3.5. Molecular Recognition in Drug Screening

4. Structural Fluctuation and Reorganization of Protein in Water

4.1. The Theory of Structural Fluctuation of Protein

4.2. Incoherent Elastic Neutron Scattering

5. Protein Folding as an Example of Self-Organization Process

6. Summary and Perspective

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

Molecular Recognition and Self-Organization in Life Phenomena Studied by a Statistical Mechanics of Molecular Liquids, the RISM/3D-RISM Theory

Abstract

1. Introduction

2. Brief Review of the 3D-RISM/RISM Theory

3. Molecular Recognition in Life Phenomena

3.1. Recognition of Water Molecules by Protein

3.2. Noble Gas Recognized by Protein

3.3. Selective Ion-Binding by Protein

3.4. Molecular Recognition in an Enzymatic Reaction: Role of Mg2+ Ions in DNA Hydrolysis Reaction by Restriction Enzyme EcoRV

3.5. Molecular Recognition in Drug Screening

4. Structural Fluctuation and Reorganization of Protein in Water

4.1. The Theory of Structural Fluctuation of Protein

4.2. Incoherent Elastic Neutron Scattering

5. Protein Folding as an Example of Self-Organization Process

6. Summary and Perspective

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

3.4. Molecular Recognition in an Enzymatic Reaction: Role of Mg²⁺ Ions in DNA Hydrolysis Reaction by Restriction Enzyme EcoRV