Next Article in Journal
Suitable Method for Improving Friction Performance of Magnetic Wheels with Metal Yokes
Previous Article in Journal
Structural–Parametric Synthesis of Path-Generating Mechanisms and Manipulators
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Numerical Integrator for Kinetostatic Folding of Protein Molecules Modeled as Robots with Hyper Degrees of Freedom

Department of Electrical & Computer Engineering, College of Engineering & Computer Science, University of Michigan–Dearborn, 2057 Tony England ELB, Dearborn, MI 48128, USA
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Robotics 2024, 13(10), 150; https://doi.org/10.3390/robotics13100150
Submission received: 12 August 2024 / Revised: 27 September 2024 / Accepted: 30 September 2024 / Published: 2 October 2024
(This article belongs to the Special Issue Bioinspired Robotics: Toward Softer, Smarter and Safer)

Abstract

:
The kinetostatic compliance method (KCM) models protein molecules as nanomechanisms consisting of numerous rigid peptide plane linkages. These linkages articulate with respect to each other through changes in the molecule dihedral angles, resulting in a kinematic mechanism with hyper degrees of freedom. Within the KCM framework, nonlinear interatomic forces drive protein folding by guiding the molecule’s dihedral angle vector towards its lowest energy state in a kinetostatic manner. This paper proposes a numerical integrator that is well suited to KCM-based protein folding and overcomes the limitations of traditional explicit Euler methods with fixed step size. Our proposed integration scheme is based on pseudo-transient continuation with an adaptive step size updating rule that can efficiently compute protein folding pathways, namely, the transient three-dimensional configurations of protein molecules during folding. Numerical simulations utilizing the KCM approach on protein backbones confirm the effectiveness of the proposed integrator.

1. Introduction

Kinetostatic modeling is a cornerstone in the design, analysis, motion control, and optimization of robotic systems [1]. This approach has proven effective in such diverse robotics applications as design of underactuated origami grippers [2], tendon-driven parallel continuum robots [3], and concentric push–pull robots [4].
Kinetostatic modeling has also proven remarkably effective in determining the transient and final three-dimensional structures of proteins during folding [5,6,7]. The kinetostatic approach to protein folding leverages the fundamental concept that proteins can be represented as nanomechanisms consisting of rigid peptide plane linkages (see, e.g., [7,8,9,10,11,12,13]). These linkages form a kinematic chain with hyper degrees of freedom; for instance, even a very small synthetic protein such as chignolin has nine peptide planes, corresponding to eighteen degrees of freedom in the backbone chain [14]. This allows them to articulate with respect to each other through rotations in dihedral angles centered at alpha-Carbon atoms, as illustrated in Figure 1 (see Section 2.1 for further details). Hence, it should be no surprise that techniques from robotic motion planning [11,15,16] and control theory [12,13,17,18,19] have been utilized to investigate the problem of protein folding/unfolding.
When modeling proteins as kinematic nanomechanisms, the kinetostatic compliance method (KCM) views protein folding as a process where the molecular configuration changes due to electrostatic and van der Waals nonlinear interatomic forces [20]. In this model, the peptide linkages articulate with respect to each other through dihedral angle rotations. Since its introduction, the KCM has proven to be a valuable tool for understanding hydrogen bond formation and its impact on protein mobility [21]. Furthermore, it has enabled the design of peptide-based nanorobots and nanomachines [22,23,24,25]. Indeed, KCM-based folding simulations can provide quantitative prediction of mobility and range of motion in protein-based nanomachines such as parallel nanorobotic platforms [26].
To predict and control robotic system trajectories, roboticists typically employ dynamic models expressed as ordinary or partial differential equations. Therefore, devising an appropriate integration scheme considering a robot’s specific equations of motion and application is crucial. Table 1 provides a sample overview of numerical integration algorithms used in the robotics literature, ranging from implicit Euler methods for tendon-driven robots to Lie group variational integrators for underwater autonomous vehicles.
While the KCM approach offers computational advantages for simulating kinetostatic protein folding, its reliance on explicit Euler integration with a fixed step size hinders its efficiency. Indeed, each integration step necessitates intensive calculations of interatomic forces, incorporation of physical constraints, and conversion to kinetostatic input torques. Consequently, the explicit Euler method’s convergence to a folded conformation is often computationally expensive due to the excessive number of required iterations. To overcome these limitations, a fast and stable numerical algorithm is essential for accelerating large-scale KCM-based protein folding simulations.
In this paper, we develop an effective numerical integration technique for KCM-based protein folding. Our proposed integrator relies on the explicit pseudo-transient continuation framework ( Ψ TC, our choice of Ψ TC for abbreviating “pseudo-transient continuation”, follows the pioneering work of Kelley, Coffey, and collaborators; see, e.g., [35,36]). Ψ TC is an efficient numerical method for calculating steady-state solutions of systems arising from ODE- or PDE-based transient dynamics [35,36,37,38,39,40]. The technique adapts the step sizes base on the underlying dynamics and proximity to the steady state. These updates are rooted in variations of the switched evolution relaxation (SER) method by Mulder and van Leer [41]. Thanks to its robust convergence and stability properties, Ψ TC has been applied in diverse fields such as fluid dynamics [42], radiation transport [43], and magnetohydrodynamics [44].
  • Contributions of the Paper: The present paper advances the KCM-based protein folding framework by introducing an explicit Ψ TC numerical integrator equipped with adaptive step-size control tailored to kinetostatic protein folding. Unlike previous KCM studies that exclusively relied on explicit Euler integrators with a fixed step size, our approach offers enhanced efficiency and accuracy. We provide rigorous analysis of the numerical stability and convergence properties of the explicit Ψ TC integrator within the kinetostatic protein folding context. Leveraging these properties, our method converges to folded protein conformations with fewer computational steps than traditional KCM approaches. Hence, the proposed method results in a significant reduction of the computational cost associated with KCM-based folding numerical simulations. Furthermore, our proposed numerical integration technique has potential beyond kinetostatic protein folding. Leveraging the similarity between the protein kinematic structure in the KCM framework and robotic manipulators with hyper degrees of freedom [45], our method is also applicable to multi-section continuum robots [46,47]. Moreover, its potential for fast numerical integration makes it a strong candidate for real-time implementation of model predictive controllers in soft robotic arms [48].
This paper significantly expands upon a preliminary version presented as a conference poster [49]. In this paper, we delve deeper into the Ψ TC numerical integration algorithm by providing a detailed description and discussing its numerical properties in the context of kinetostatic protein folding. Additionally, we present a detailed analysis (absent from the conference poster) investigating the stability and convergence properties of the proposed Ψ TC numerical integrator. Finally, the algorithm effectiveness is rigorously assessed through more extensive numerical simulations on protein backbone chains with many degrees of freedom.
The rest of this paper is organized as follows. First, in Section 2 we provide an overview of the kinematics of protein molecules and the KCM framework for modeling the protein folding process. In Section 3, we present the problem statement and elaborate the computational needs of kinetostatic protein folding. Next, in Section 4 we present our explicit Ψ TC integration scheme and an adaptive step-size control strategy for the underlying kinetostatic protein folding process. After presenting the numerical simulation results in Section 5, we conclude the paper with future research directions and final remarks in Section 6.
  • Notation: We denote the set of all non-negative real numbers by R + . Given a vector x in R M and a real constant p 1 , we denote its p-norm by | x | p . Given an integer M and matrix A in R M × M , we let ϱ ( A ) denote the spectral radius of A . We say that a sequence { x k } k = 1 R M converges q-linearly to L if there exists κ ( 0 , 1 ) such that lim k | x k + 1 L | | x k L | = κ .

2. Background

In this section we provide a summary of the KCM framework for modeling the kinetostatic folding of protein molecules in vacuo. To prevent ambiguity, we note that “conformation” is the established term in the field of biochemistry for denoting the spatial arrangement of a protein molecule’s kinematic structure; conversely, the field of robot kinematics predominantly employs “configuration” for the same concept. In this paper, we utilize “conformation” and “configuration” interchangeably.

2.1. Nano-Linkage-Based Kinematic Model of Protein Molecules

Proteins are macromolecules with complex structures and intricate dynamics. These long molecular chains consist of peptide planes that are interconnected by chemical bonds. The functionality of protein molecules depends critically on their three-dimensional (3D) structure (i.e., their conformation), which can be directly deduced from the linear sequence of amino acids forming their polypeptide chain. For brevity, the following presentation is confined to the protein backbone.
A schematic of the protein polypeptide backbone chain is depicted in Figure 1 and Figure 2. As can be seen from the figure, each individual peptide plane is formed by six coplanar atoms that are covalently bonded (visualized as red lines in Figure 2). The coplanarity assumption about the α -carbon C α , carbonyl CO, amide nitrogen NH, and another α -carbon is motivated by the wealth of related high-resolution X-ray crystallographic data (see, e.g., [50]). This fundamental assumption underpins the conceptualization of protein molecules as intricate nanomechanisms with numerous degrees of freedom (see, e.g., [5,8,12,13,51]). Accordingly, the key building blocks of the protein kinematic chain are the rigid peptide planes, which function as rigid nanoscale linkages (see Figure 1 and Figure 2).
As illustrated in Figure 1 and Figure 2, the rotation of the nano-linkages in the protein kinematic chain with respect to each other is facilitated by the central carbon atoms, which are denoted by C α and commonly known as the alpha-Carbon atoms. Indeed, C α atoms play the role of hinges connecting peptide planes together; C α carbon forms chemical bonds with four distinct constituents: the nitrogen, hydrogen, and carbonyl carbon atoms of the peptide bond and a variable side chain represented by SR. The first and last C α atoms are bonded to an N-terminus (an amino group)–peptide plane pair and a C-terminus (a carboxyl group)–peptide plane pair, respectively.
  • Protein Configuration Vector: The nano-kinematic structure of a protein molecule comprising N 1 peptide planes can be comprehensively represented by a set of bond lengths and dihedral angle pairs. Considering the ith peptide plane 1 i N 1 , each pair of these dihedral angles represent rotations about the covalent bonds N C α , namely, θ 2 i + 1 , and C α C , i.e., θ 2 i + 2 (Figure 2). Additionally, the dihedral angle vector
    θ : = [ θ 1 , , θ 2 N ]
    represents the configuration of the backbone chain. Therefore, the vector θ encompassing the dihedral angles resides within the 2 N -dimensional configuration space Q . This space, formally defined as Q : = S 1 × × S 1 , represents the Cartesian product of 2 N individual unit circles S 1 , each corresponding to a single dihedral angle within the backbone chain.
Each dihedral angle in the conformation vector θ in Equation (1) corresponds to a single degree of freedom in the protein molecule’s kinematic chain. For each degree of freedom, we can define a unit vector u j , 1 j 2 N . These vectors align with the axes of rotation around which the nano-kinematic chain can rotate. As illustrated in Figure 2, the unit vectors u 2 i and u 2 i + 1 represent the directions along two crucial bonds within the ith peptide plane: the C α C bond and the N C α bond. These bonds define the primary rotation axes within the chain’s segments. Serving as key structural references, u 1 and u 2 N denote the unit vectors associated with the amino terminus (N-terminus) and carboxyl terminus (C-terminus), respectively, of the polypeptide chain. These termini define the endpoints of the entire nano-kinematic chain. As illustrated in Figure 1, the N-terminus and C-terminus can be considered as the base and end-effector, respectively, of the protein nano-kinematic mechanism.
While dihedral angles define the rotational degrees of freedom within a protein’s kinematic chain, additional information is necessary to fully capture the spatial orientation of the rigid peptide nano-linkages. This information comes in the form of body vectors, denoted by b j , 1 j 2 N . These vectors comprehensively define the relative spatial arrangement of coplanar atoms within each peptide plane. Consequently, any relative positional relationship between two atoms within a plane can be mathematically expressed as a linear combination of their corresponding body vectors, i.e., k 1 m b 2 i + k 2 m b 2 i + 1 , where the coefficients k 1 m and k 2 m , 1 m 4 are constants specific to the atom pair and are identical throughout the peptide chain (see, e.g., [5,6] for more details). The body vectors b j and unit vectors u j act in concert to provide a comprehensive depiction of the protein molecule’s conformation as it dynamically varies with respect to the vector θ , which encompasses the dihedral angles.
  • Protein Forward Kinematics: The intricate interplay between the dihedral angle vector θ and a protein’s kinematic structure can be elegantly described through a framework of rotational matrix transformations [8,52]. This formalism offers a comprehensive and computationally efficient approach for representing protein conformation as a function of its internal torsional degrees of freedom. A reference configuration, designated by θ = 0 with reference unit vectors u j 0 and reference body vectors b j 0 , 1 j 2 N , serves as the starting point for the following transformations:
u j ( θ ) = X ( θ , u j 0 ) u j 0 b j ( θ ) = X ( θ , u j 0 ) b j 0
which compute the mapping from the dihedral angle space to the protein kinematic configuration. Each of the transformation matrices X ( θ , u j 0 ) , 1 j 2 N in (2) is defined using sequential matrix multiplication:
X ( θ , u j 0 ) : = r = 1 j R ( θ j , u j 0 )
where the rotation matrix R ( θ j , u j 0 ) describes the rotation about the vector u j 0 with angle θ j and the matrix Ξ ( θ , u j 0 ) determines the molecule kinematic structure using the dihedral angle conformation vector θ . In Equation (3), the rotation matrix R ( θ j , u j 0 ) SO(3) describes the rotation about the direction provided by the unit vector u j 0 with an angle equal to θ j . We remark that the special orthogonal group SO3 is the set of all rotational matrices about the origin of three-dimensional Euclidean space; furthermore, any rotation matrix R ( α , v ^ ) , where α is an angle and v ^ = [ v ^ x , v ^ y , v ^ z ] is a unit vector, can be written as
R ( α , v ^ ) = v ^ x 2 V α + C α v ^ x v ^ y V α v ^ z S α v ^ x v ^ z V α + v ^ y S α v ^ x v ^ y V α + v ^ z S α v ^ y 2 V α + C α v ^ y v ^ z V α v ^ x S α v ^ x v ^ z V α v ^ y S α v ^ y v ^ z V α + v ^ x S α v ^ z 2 V α + C α ,
where V α : = 1 cos ( α ) , C α : = cos ( α ) , and S α : = sin ( α ) .
Subsequent to the computation of body vectors b j ( θ ) (Equation (2)) and anchoring the N -terminal atom at the origin, the Cartesian coordinates of the kth-peptide plane atoms can be determined by
r i ( θ ) = j = 1 i b j ( θ ) , 1 i 2 N 1 ,
where the indices i = 2 k 1 and i = 2 k correspond to the nitrogen and C α atoms, respectively.

2.2. Kinetostatic Compliance Folding

The KCM framework leverages the established experimental observation that it is possible to capture the essence of protein folding dynamics while neglecting inertial forces (see, e.g., [18,19,51,53]). According to the KCM, protein dihedral angles evolve kinetostatically under the influence of interatomic force fields.
  • Interatomic Force Fields Responsible for Kinetostatic Folding: Consider a peptide chain composed of N a atoms and N 1 peptide planes. The dihedral angle vector, denoted by θ defined in Equation (1), encodes the conformational state of the chain. The Cartesian coordinates of any two atoms a i , a j within the chain are represented by r i ( θ ) , r j ( θ ) , respectively (Equation (5)). Their Euclidean distance, a fundamental metric for characterizing interatomic interactions, is then calculated as d i j ( θ ) : = | r i ( θ ) r j ( θ ) | 2 . The specific parameters pertaining to atom charges, van der Waals radii, interatomic distances, dielectric constant, potential well depths, and force weights are available in [6] and the provided references.
Within this framework, the total free energy responsible for protein folding, denoted by G ( θ ) , decomposes into electrostatic and van der Waals contributions as expressed in
G ( θ ) : = G elec ( θ ) + G vdw ( θ ) ,
where G elec ( θ ) and G vdw ( θ ) represent the protein’s electrostatic potential energy and van der Waals interatomic potential energy, respectively (detailed expressions can be found in [5]). Consequently, the net forces acting on each atom a i , 1 i N a due to Coulombic and van der Waals interactions can be obtained through negative gradients of the respective individual energy terms: F i elec ( θ ) = r i G elec and F i vdw ( θ ) = r i G vdw .
  • Kinetostatic Folding Torque Vector: The KCM-based modeling framework [5] necessitates calculation of the resultant forces and torques acting on each of the N 1 peptide planes within the protein molecule. These computed forces and torques are subsequently concatenated into a 6 N -dimensional vector F ( θ ) , which serves as the generalized force vector driving the protein folding process. However, in order to direct the actual changes in the protein’s configuration, the vector F ( θ ) needs to be mapped to an equivalent 2 N -dimensional torque vector, denoted as τ ( θ ) . This mapping translates the generalized force vector into the specific torsional modifications exerted on the dihedral angles, ultimately governing the process of kinetostatic protein folding. The mathematical expression for the kinetostatic folding torque vector is expressed by
    τ ( θ ) = J ( θ ) F ( θ ) ,
    where J ( θ ) R 6 N × 2 N represents the molecule chain Jacobian at conformation θ . This matrix, acting as a bridge between generalized forces and torques, translates the 6 N -dimensional force vector into the corresponding 2 N -dimensional torque vector dictating the dihedral angle modifications (see [5] for detailed derivations). Consequently, the changes in dihedral angles at each protein conformation take place under the direct influence of the kinetostatic folding torque vector acting on the peptide backbone. Specifically, the vector τ ( θ ) R 2 N , which aligns with the steepest-descent direction of the total free energy G ( θ ) in the protein conformation landscape, guides the dihedral angle variations during folding.
  • Folded Protein Conformations: Within the context of protein folding, at each local minimum θ of the aggregate free energy function G ( θ ) defined in Equation (6), the corresponding torque vector τ ( θ ) vanishes identically; this signifies the protein molecule’s kinetostatic stationarity at folded conformations, where the absence of net kinetostatic torque reflects the balanced internal forces within the protein structure. Equation (7) specifies the torque vector τ ( θ ) aligned with the steepest-descent direction of the free energy gradient in the conformational landscape of the protein molecule. As described below, Kazerounian and colleagues [5,8] in their pioneering KCM framework leveraged a normalized kinetostatic folding torque vector in an iterative manner.
  • Successive Kinetostatic Folding Iteration: Within the KCM framework, dihedral angles evolve kinetostatically under the influence of the kinetostatic folding torque vector resulting from the interatomic force fields. In particular, given an unfolded protein molecule conformation θ 0 , the established kinetostatic compliance method relates joint torques to dihedral angle changes via the following numerical scheme (see, e.g., [5,51]):
    θ k + 1 = θ k + κ 0 τ ( θ k ) | τ ( θ k ) | , k Z 0 +
    where Z 0 + denotes the set of non-negative integers, the -norm of τ ( θ k ) is defined as | τ ( θ k ) | : = max i | τ i ( θ k ) | , and τ ( θ k ) = J ( θ k ) F ( θ k ) according to Equation (7).
In the dihedral angle update iterations provided by (8), the normalized torque vector τ ( θ k ) | τ ( θ k ) | governs the incremental updates to the dihedral angles at each conformation θ k .
As demonstrated by [12], the iteration provided by Equation (8) can be considered as an explicit Euler integration with time step κ 0 . Crucially, the integration time step κ 0 , which is determined through a heuristic procedure, has to be chosen sufficiently small to prevent extensive angle variations (see Section 2.3 for the resulting computational implications). Finally, we remark that the interaction between joint moments and dihedral angle variations in the kinetostatic folding process can theoretically lead to redundant calculations if not handled properly. However, our proposed method in this paper avoids this issue through the use of the explicit pseudo-transient continuation ( Ψ TC) integrator incorporating an adaptive step-size updating rule. This adaptive step size ensures that the step size increases as the system approaches a stable conformation, helping to avoid unnecessary iterations and redundant recalculations of joint moments. This also reduces the computational complexity, as the system requires fewer torque evaluations in regions of the energy landscape near equilibrium.
  • Convergence Criterion: The iterative process governed by Equation (8) continues until the molecule’s aggregated free energy G ( θ ) converges to the vicinity of a local minimum within the free energy landscape. Convergence is achieved numerically when the kinetostatic folding torque vector norm falls below a predefined tolerance τ tol > 0 , i.e., when | τ ( θ k ) | 2 < τ tol , where | τ ( θ k ) | 2 denotes the Euclidean norm of τ ( θ k ) .

2.3. Computational Burden of Kinetostatic Folding Iterations

A flowchart of the successive kinetostatic folding iteration is depicted in Figure 3. The most computationally intensive procedure at each conformation of the protein molecule (highlighted in red) consists of the electrostatic and van der Waals force computations. If exact interatomic force calculations are desired, then the computational complexity of this step for a molecule with N a atoms is quadratic (i.e., of order O ( N a 2 ) ) [5]. The computational complexity of numerical algorithms can be defined using big O notation, as follows: consider any two real-valued functions h 1 ( · ) and h 2 ( · ) ; if there exist a real number a 0 > 0 and real number z 0 such that the inequality | h 1 ( z ) | a 0 | h 2 ( z ) | is satisfied for all z z 0 , then we say that h 1 ( z ) = O ( h 2 ( z ) ) . Moreover, to encode more physical constraints in protein folding numerical simulations, such as entropy-loss constraints [12], it is necessary to solve a box-constrained convex quadratic program (QP) at each conformation of the protein molecule. The computational cost of such a convex QP using a state-of-the-art interior-point QP solver is of order O ( N 3 ) [54], where there are N 1 peptide planes in the protein backbone chain.
The variation of dihedral angles in the flowchart (the dashed border) needs to be governed by a proper kinetostatic folding update rule. For the conventional KCM iteration, the dihedral angle update rule governing the kinetostatic folding is provided by Equation (8). However, preventing numerical instability and large dihedral angle variations requires choosing sufficiently small values for the integration time step κ 0 . This inevitably results in an excessive number of iterations for convergence to a folded protein molecule configuration. Consequently, there is a need for departure from explicit Euler integrators using fast and stable numerical algorithms for accelerating large-scale KCM-based protein folding simulations.

3. Problem Statement: Protein Folding Pathway Computation Problem (PFPCP)

In this section, we consider the kinetostatic folding of protein molecules and formulate the main problem addressed in the paper.
  • Protein Folding Pathway Computation Problem (PFPCP): Consider a protein backbone chain with N 1 peptide planes and dihedral angle vector θ , as provided by Equation (1). Furthermore, consider the KCM-based dynamics of protein folding provided by the following initial value problem:
    θ ˙ = F cl ( θ ) , θ ( 0 ) = θ 0
    where F cl ( θ ) : = τ ( θ ) | τ ( θ ) | , in which the kinetostatic folding torque vector τ ( θ ) is provided by Equation (7). Moreover, the dihedral angle vector θ 0 is an unfolded initial conformation that belongs to the domain of attraction of a folded conformation θ f . We devise a numerical integration scheme to integrate Equation (9) to the steady state, namely, the folded conformation θ f , and obtain the sequence of protein folding pathway samples θ k k = 0 N s , where N s is the smallest integer satisfying | F cl ( θ k ) | 2 < τ tol , for all k N s .
Remark 1.
As discussed in Section 2.2, any folded protein conformation θ f is an asymptotically stable equilibrium point of the KCM nonlinear dynamical system in Equation (9) (see, e.g., [5,6,7]). This signifies the protein molecule’s kinetostatic stationarity in folded conformations, where the absence of net kinetostatic torque reflects the balanced internal forces within the protein structure. Therefore, at any folded conformation θ f , we have
F cl ( θ f ) = 0 ,
which provides a convergence criterion for KCM iterations. Specifically, given a desired tolerance τ tol , if N s is the smallest integer that satisfies
| F cl ( θ k ) | 2 < τ t o l , f o r a l l k N s ,
then N s represents the terminal iteration step number in Equation (8) at which the aggregated free energy of the molecule converges to a sufficiently close vicinity of a free-energy-landscape local minimum. The tolerance should be chosen according to the accuracy required for the protein’s folded structure and the computational efficiency; too high a value could result in premature convergence before the protein reaches its stable conformation, while too low a value could unnecessarily prolong the computation time.
In this paper we use the norm of the vector | F cl ( θ k ) | 2 to assess convergence. As the theoretical justification for such a convergence criterion, we remark that many gradient-based optimization methods (see, e.g., [55]) have relied on the norm of the gradient as a widely used criterion to determine convergence. This criterion is based on the principle that at a local minimum, the gradient of the objective function should approach zero. Similarly, in our method we use the norm | F cl ( θ k ) | 2 to assess convergence. When this norm becomes sufficiently small (below a predefined threshold τ tol ), it indicates that the system has reached a point where further changes in the dihedral angles are negligible, signaling that the protein has reached its folded state. The value of τ tol is selected to balance computational efficiency and accuracy while avoiding premature termination of the iterations.
The conventional approach [5,6,12,51] for solving PFPCP relies on using the explicit Euler integration scheme with a fixed step size κ 0 , resulting in the successive kinetostatic folding iteration in Equation (8). As discussed in Section 2.3, selection of the step size κ 0 in Equation (8) dictates a tradeoff between the accuracy and stability of the predicted folding pathway samples and the number of integration steps.
To prevent numerical instability and large dihedral angle variations, in the explicit Euler scheme it is necessary to choose a sufficiently small integration time step κ 0 . This inevitably results in an excessive number of iterations, requiring kinetostatic folding torque computations with a large computational burden; in other words, the larger the number of iterations N s for convergence to a folded conformation of a protein molecule with N a atoms, the larger the needed number of interatomic electrostatic and van der Waals force calculations, which is of order O ( N s · N a 2 ) (see Section 2.3 for further details). Moreover, to encode more physical constraints in protein folding numerical simulations, it is necessary to perform more calculations in each iteration, which grows with the number of steps N s for convergence to the folded conformations. For instance, to encode entropy loss constraints [12], it is necessary to solve a box-constrained convex QP at each conformation of the protein molecule. The computational cost of such a convex QP using a state-of-the-art interior-point QP solver is of order O ( N 3 ) [54]. Therefore, the computational cost for encoding the entropy loss constraints through the whole kinetostatic folding simulations will be of order O ( N s · N 3 ) .
To address this issue in PFPCP, we present our Ψ TC-based numerical integration scheme in the next section.
Remark 2.
In all-atom molecular dynamics simulations of protein folding, symplectic integrators [56,57] offer numerical stability and geometric advantages over methods that rely on a fixed step size. However, their application is impeded in KCM-based folding simulations due to the KCM framework’s reliance on the negligible role of inertial forces compared to electrostatic and van der Waals interactions [18,19,51,53]. The absence of inertial effects renders symplectic integrators inapplicable in this context.

4. Explicit Ψ TC Numerical Integration for KCM-Based Protein Folding

Our goal in this section is to solve the PFPCP problem outlined in Section 3. We introduce an explicit Ψ TC numerical integration scheme with an adaptive step size tailored to KCM-based protein folding to solve the PFPCP. We begin by presenting the explicit Ψ TC solution to the PFPCP under a fixed step size and analyzing its convergence as well as stability properties in Section 4.1. Thereafter, in Section 4.2 we develop a step size adaptation rule based on the widely used switched evolution relaxation (SER) technique [41]. The flowchart of our Ψ TC scheme is depicted in Figure 4 and elaborated in Section 4.1 and Section 4.2.

4.1. Explicit Ψ TC Numerical Integration with Fixed Step Size

Let us consider the initial value problem in Equation (9) associated with the PFPCP formulated in Section 3. The explicit Ψ TC integration with fixed step size for solving the PFPCP can be described using the following steps (see the flowchart in Figure 4).
(Step 1) Initiation: We start with choosing a step size δ 0 , a positive constant ϵ , and a given tolerance τ tol . Next, we consider the iterator k and set it equal to 0. Moreover, we initiate the internal state of the Ψ TC scheme by assigning δ 0 F cl ( θ 0 ) to Z 0 . Note that Z k is the Ψ TC internal state vector in the kth, k 0 , iteration which is updated along with the protein molecule configuration vector θ k .
  • (Step 2) Predictor–Corrector Computation: After initiation, we run the Ψ TC scheme through the following predictor–corrector numerical iteration:
    θ k + 1 0 = θ k + Z k ,
    Z k + 1 = ω k ϵ F cl θ k + 1 0 + Z k ,
    θ k + 1 = θ k + Z k + 1 ,
    where
    ω k : = δ k ϵ + δ k for all k 0 .
In the Ψ TC scheme with a fixed step size, where δ k = δ 0 and ω k = δ 0 ϵ + δ 0 , the step size δ k is fixed for all iterator values k; see Section 4.2 for updating the step size at each iteration using an adaptive strategy. In other words, the SER update step (Step 4 in Section 4.2) is skipped in the Ψ TC integration with a fixed step size. Additionally, note that there is only one folding vector field F cl ( · ) computation at each predictor–corrector step of the Ψ TC iteration in (Section 4.1).
  • (Step 3) Checking Convergence: After each iteration, the convergence criterion is checked by comparing ρ : = | F cl ( θ k + 1 0 ) | 2 against the given desired tolerance τ tol . The iteration is terminated when ρ < τ tol .
  • Convergence Properties of the Explicit Ψ TC Scheme with Fixed Step Size: Considering the kinetostatic protein folding dynamics in Equation (9) and the asymptotic stability of the folded conformation θ f , our proposed Ψ TC numerical integrator computes the protein folding pathway from a given initial unfolded conformation θ 0 in the dihedral angle space. To establish the convergence properties of the explicit Ψ TC scheme in (Section 4.1), we leverage the following key properties of the underlying KCM-based folding dynamics as detailed in the literature (e.g., [5,6,12,13,51]):
P1
Any folded conformation θ f is a locally asymptotically stable equilibrium for (9).
P2
Given any folded conformation θ f , the folding vector field F cl ( · ) is uniformly bounded and uniformly Lipschitz continuously differentiable in a neighborhood N θ f of θ f .
Under Properties P1 and P2 and according to Theorem 2.1 of [38], the explicit Ψ TC scheme for Equation (9) with fixed step size enjoys the following properties:
C1
For any given positive constant ϵ in the explicit Ψ TC scheme with fixed step size, any Δ 0 > 0 , and for any sufficiently small fixed step-size δ 0 > 0 in (Section 4.1), there is an integer N s such that | θ N s θ f | p < Δ 0 , | θ N s 0 θ f | p < Δ 0 , and | Z N s | p < Δ 0 , where | · | p is an arbitrary p-norm on R 2 N (where p 1 ). Furthermore, F cl ( θ k ) is uniformly bounded for all 0 k N s .
C2
If, in addition to P1 and P2, the Jacobian of the folding torque vector field F cl ( · ) has negative real eigenvalues at the folded conformation θ f , and if the positive constant ϵ also satisfies ϵ ϱ F cl θ ( θ f ) < 4 3 , then there exists a p -norm, namely, a norm | · | p on R 4 N , such that [ θ k , Z k ] converges q-linearly to [ θ f , 0 ] (see the Notation in Section 1).
Property C1 guarantees that for any given prescribed accuracy Δ 0 it is always possible to choose a small enough step size δ 0 such that the protein configuration converges to an arbitrarily small vicinity of the folded molecule conformation determined by Δ 0 . Furthermore, the internal state of the integrator is guaranteed to remain bounded. Additionally, according to Property C2, if a certain spectral radius inequality condition is satisfied, then the protein configuration computed by the numerical integrator is guaranteed to converge to the folded conformation with a q-linear rate (see the Notation in Section 1).

4.2. Switched Evolution Relaxation (SER) Step Size Adaptation Rule

To dynamically change the step size of the Ψ TC integrator in each iteration, we adopt the SER method originally introduced by Mulder and van Leer [41] and subsequently employed within the Ψ TC framework [35,36]. The intuition underlying the SER step size adaptation rule is that when the initial step sizes are small, the integration is close to the explicit Euler scheme with fixed step size. When near the folded protein configuration θ f , the time step safely grows while keeping the protein molecule conformations within the domain of attraction of θ f . The details of the SER step size update rule can be stated as follows:
  • (Step 4) Step Size Update: We monitor the logarithmic change in the folding vector field F cl ( · ) from each iteration k to the next one k + 1 by computing the logarithmic growth monitoring variable σ k according to
    σ k = log | F cl ( θ k + 1 ) | 2 log | F cl ( θ k ) | 2 .
If the logarithmic change in the folding vector field satisfies σ k > 1 2 , then we update the integrator step-size using
δ k + 1 = min δ max , δ k | F cl ( θ k ) | 2 | F cl ( θ k + 1 ) | 2 ,
where δ max > 0 is a design parameter that sets an upper limit on the growth of step sizes in the explicit Ψ TC algorithm. On the other hand, if the logarithmic change in the folding vector field satisfies σ k 1 2 , then we do not update the step size.
Step 4 dynamically adjusts the step size δ k at each iteration. In particular, it requires that the logarithmic change in the protein folding vector field F cl ( · ) be monitored. When this logarithmic change satisfies the inequality in (14), then the step size δ k is updated. Accordingly, the parameter ω k provided by (13) is adjusted. This parameter is then utilized in the correction step of the predictor–corrector computations in (12a).
The SER step size update strategy prioritizes capturing critical transient conformations early in the protein folding process while gradually increasing the step size as the molecule approaches the folded conformation θ f . In the SER update rule provided by (15), comparison with δ max safeguards the step size δ k against becoming too large. Furthermore, the logarithmic monitoring variable σ k prevents the steps from very rapid oscillations. Additionally, as stated in the proof of Theorem 2.1, the early iterations of the Ψ TC scheme in [38] differ from its explicit Euler counterpart by O ( δ 0 2 ) , where the constant in the O-term is merely dependent on the folding vector field F cl ( · ) in a neighborhood of the initial unfolded conformation θ 0 .
Remark 3.
The proposed ΨTC method in this paper falls within the category of explicit ΨTC schemes (see, e.g., [38,39]). Explicit ΨTC schemes are generally preferred over implicit ones for protein folding pathway computations because they circumvent the need to solve linear systems. In contrast, implicit ΨTC schemes (see, e.g., [35,36]) require solving such systems, which is computationally expensive in this context due to the necessity of calculating the Jacobian of the nonlinear folding torque vector field F cl ( · ) at each conformation along the pathway.

5. Numerical Simulations

In this section, we present kinetostatic protein folding simulation results that validate our proposed Ψ TC numerical integration scheme for computing both transient and final protein conformations during the kinetostatic folding process. The simulations benchmark the performance of our Ψ TC integrator against the explicit Euler integration scheme, which is the only integrator used thus far in the kinetostatic folding literature [5,6,12,51]. Our kinetostatic folding simulations examine how unfolded protein conformations converge towards α -helix secondary structures [58]. The α -helix structure is characterized by a tightly coiled arrangement of the amino acid chain, which is a prevalent structural motif in proteins.
We conducted our simulations following the guidelines of Protofold I [6,51], utilizing an Intel® Core i7-6770HQ CPU at 2.60GHz. In our simulations, we considered three scenarios with protein molecule backbone chains consisting of 15, 30, and 50 peptide planes. These choices result in dihedral angle spaces with 32, 62, and 102 dimensions, respectively. These three protein backbone chains have N a = 78 , N a = 153 , and N a = 253 atoms, respectively. As elaborated in Section 2.2, we let the backbone chain fold kinetostatically under the influence of the interatomic force vector field associated with the initial value problem provided in Equation (9).
Using the three aforementioned protein backbone chains, we carried out two distinct groups of kinetostatic protein folding simulations associated with Ψ TC and explicit Euler integrator schemes. In the first set, we simulated the folding pathways of the three protein backbone chains using the explicit Ψ TC integrator described in Section 4. The integration scheme for Ψ TC is illustrated in the flowchart in Figure 4. The Ψ TC integrator design parameters were ϵ , δ 0 , and δ max , which are provided in Table 2.
Figure 5, Figure 6 and Figure 7 present the kinetostatic folding simulation results from the explicit Ψ TC integration schemes applied to each of the three protein backbone chains. These figures show the free energy G ( θ ) in kcal / mol for the backbone chain of protein molecules with dihedral angle vectors of dimensions 32, 62, and 102, respectively, along with their transient conformations along the folding pathway. Additionally, the figures illustrate the step size evolution δ k for the explicit Ψ TC scheme in each of the three cases. The step size δ k was updated according to the SER update rule provided by Equation (15). As the protein molecules approach their final folded conformations, the step size δ k increases and eventually saturates at δ max , which is the safeguard design parameter in the SER update rule.
Next, we performed kinetostatic folding simulations on the aforementioned three backbone chains using the explicit Euler scheme provided by Equation (8) with two different sets of fixed step sizes κ 0 . The initial protein conformations were all chosen to be the same pre-coiled backbone chains as their counterparts in the Ψ TC-based numerical simulations; in other words, to guarantee uniform starting conditions across both groups of numerical integration schemes, identical initial conformations for the protein backbone chains were adopted as pre-coiled structures near the α -helix configurations.
We compared the obtained results from our proposed Ψ TC scheme against the explicit Euler integrator method with a fixed step size. Two different step sizes were chosen for the explicit Euler schemes; the first one was set equal to the initial step size for the explicit Ψ TC scheme, namely, κ 0 δ 0 = 1 (see the Euler-1 column in Table 2), while the second step-size was set equal to half of the initial step-size for the explicit Ψ TC scheme, namely, κ 0 δ 0 = 0.5 (see the Euler-2 column in Table 2). As discussed in Section 2.3, the size of κ 0 in the explicit Euler integrator dictates a tradeoff between the stability and accuracy of the obtained folding pathways and the convergence speed of the explicit Euler schemes.
The plots in Figure 8 illustrate the free energy G ( θ ) in kcal / mol of the backbone chain of these protein molecules with dihedral angle vectors of dimension 32, 62, and 102, respectively. As can be seen from the figures, in the case of κ 0 δ 0 = 1 (i.e., explicit Euler with a larger step size), the oscillations and instability in the protein free energy profile computed from the explicit Euler scheme (the continuous curves) are due to the large step size chosen for this method. In the case of κ 0 δ 0 = 0.5 (i.e., the explicit Euler with a smaller step size), the protein free energy (the dash-dot-dashed curves) does not suffer such oscillations, but struggles to converge to the local minimum associated with the α -helix folded structure even after 1800 iterations. These results are in contrast to the stable and faster convergence of the Ψ TC-based results, where convergence takes place around iteration k = 1400 .
Finally, another notable fact in the simulations is the proximity of the obtained results during the first few steps of these three different numerical schemes. Indeed, using the proof in [38], Theorem 2.1, we can state that the early iterations of the Ψ TC scheme differ from its explicit Euler counterparts by O ( δ 0 2 ) , where the constant in the O-term is merely dependent on the folding vector field F cl ( · ) in a neighborhood of the initial unfolded conformation θ 0 . The relationship between the accuracy of the iterative computation and the speed of convergence in our Ψ TC method is closely tied to the adaptive step size mechanism. Higher accuracy requires smaller values of the stopping threshold τ tol , which increases the number of iterations. However, our adaptive step size approach dynamically adjusts the step size to accelerate convergence in the early stages and ensure accuracy in later stages, balancing the tradeoff between accuracy and computational efficiency.

6. Conclusions

In this paper, we have considered protein molecules modeled as nanomechanisms with hyper degrees of freedom consisting of numerous rigid peptide plane linkages. To compute the protein transient and final folded conformations during kinetostatic folding, we propose an explicit Ψ TC numerical integration technique with step size adaptation tailored to the underlying kinetostatic folding. Our proposed numerical scheme departs from the established literature on the KCM framework, where numerical simulations of kinetostatic protein folding have exclusively relied on explicit Euler schemes with a fixed step size. Moreover, this paper provides the numerical stability and convergence properties of the developed explicit Ψ TC numerical algorithm in the context of kinetostatic protein folding. Thanks to its convergence and stability properties, our proposed Ψ TC numerical algorithm can reduce the computational burden on kinetostatic folding numerical simulations.
Our current simulations focused primarily on in vacuo scenarios and excluded protein side chains, which simplifies the model. Future work will look to include these aspects in order to better capture the full dynamics of protein folding. Additionally, while our method improves computational efficiency, its performance in real-life applications for very large protein systems could be further optimized. We intend to explore this in our future studies, particularly examining the performance of the integrator in solute to mimic more realistic biological conditions. As future research directions, we envision that the proposed methodology will have potential applications for efficient numerical investigation of kinetostatic protein folding under solvation effects and entropy loss constraints as well as in the design and numerical simulation of protein-based nanorobots/nanomachines. Another direction for future research includes investigating the β -pleated sheet structure, which plays a crucial role in protein folding by facilitating the reorientation of the polypeptide chain. Remarkably, even small proteins such as chignolin, a synthetic mini-protein consisting of just ten amino acids, can demonstrate robust beta-hairpin folding.
In addition to kinetostatic protein folding, our proposed Ψ TC-based scheme has potential applications for developing high-fidelity numerical integrators for soft robotic mechanisms [59,60,61]. A frequently encountered issue in soft robot arms and cable-driven soft locomotive mechanisms involves numerical integration of the implicit nonlinear differential equations resulting from their elastica models (see, e.g., [60,61]). Remarkably, the protein kinematic structure in the KCM framework is exactly the same as robotic manipulators with hyper degrees of freedom, as described in the work of Mochiyama et al. (see, e.g., [45]). This type of kinematic modeling has also been used for multisection continuum robots (see, e.g., [46]). Therefore, the methodology developed in this paper is also applicable to such multisection continuum robotic mechanisms with nonlinear and high-dimensional joint space dynamics, provided that their closed-loop dynamics satisfy the same continuity and boundedness properties as the KCM-based protein folding vector field discussed here. The potential of this integrator in robotics and control systems is particularly promising, but relatively unexplored thus far. To date, apart from the work presented in this paper, one notable application in the robotics domain has been detailed in [62]. Their study conducted numerical simulations on the model predictive control (MPC) of a quadrotor, The results highlight the ability of our proposed Ψ TC integrator, which can significantly reduce computational times compared to the traditional quadratic programming solvers utilized in MPC schemes.

Author Contributions

Conceptualization, A.M.; methodology, A.M.; software, A.M.; formal analysis, A.K., K.Z. and A.M.; investigation, A.M.; resources, A.M.; data curation, A.K. and K.Z.; writing—original draft preparation, A.K., K.Z. and A.M.; writing—review and editing, A.M.; visualization, A.K. and K.Z.; supervision, A.M.; project administration, A.M.; funding acquisition, A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Science Foundation (NSF) through award number CMMI-2153744.

Data Availability Statement

Dataset available on request from the authors.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Ling, M.; Howell, L.L.; Cao, J.; Chen, G. Kinetostatic and dynamic modeling of flexure-based compliant mechanisms: A survey. Appl. Mech. Rev. 2020, 72, 030802. [Google Scholar] [CrossRef]
  2. Firouzeh, A.; Paik, J. An under-actuated origami gripper with adjustable stiffness joints for multiple grasp modes. Smart Mater. Struct. 2017, 26, 055035. [Google Scholar] [CrossRef]
  3. Lilge, S.; Burgner-Kahrs, J. Kinetostatic modeling of tendon-driven parallel continuum robots. IEEE Trans. Robot. 2022, 39, 1563–1579. [Google Scholar] [CrossRef]
  4. Childs, J.A.; Rucker, C. A Kinetostatic Model for Concentric Push–Pull Robots. IEEE Trans. Robot. 2024, 40, 554–572. [Google Scholar] [CrossRef] [PubMed]
  5. Tavousi, P.; Behandish, M.; Ilieş, H.T.; Kazerounian, K. Protofold II: Enhanced model and implementation for kinetostatic protein folding. ASME J. Nanotechnol. Eng. Med. 2015, 6, 034601. [Google Scholar] [CrossRef]
  6. Tavousi, P. On the Systematic Design and Analysis of Artificial Molecular Machines. Ph.D. Thesis, University of Connecticut, Mansfield, CT, USA, 2016. [Google Scholar]
  7. Mohammadi, A.; Al Janaideh, M. Sign gradient descent algorithms for kinetostatic protein folding. In Proceedings of the 2023 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS), Abu Dhabi, United Arab Emirates, 9–13 October 2023; pp. 1–6. [Google Scholar] [CrossRef]
  8. Kazerounian, K.; Latif, K.; Rodriguez, K.; Alvarado, C. Nano-kinematics for analysis of protein molecules. ASME J. Mech. Des. 2005, 127, 699–711. [Google Scholar] [CrossRef]
  9. Gohil, M.K.; Chakraborty, A.; Dasgupta, B. Hyper-redundant robots and bioinformatics: Modelling loops in RNA. In Proceedings of the 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Budapest, Hungary, 9–12 October 2016; pp. 3694–3700. [Google Scholar] [CrossRef]
  10. Diez, M.; Petuya, V.; Martínez-Cruz, L.A.; Hernández, A. Biokinematic protein simulation by an adaptive dihedral angle approach. Mech. Mach. Theory 2013, 69, 105–114. [Google Scholar] [CrossRef]
  11. Ekenna, C.; Thomas, S.; Amato, N.M. Adaptive local learning in sampling-based motion planning for protein folding. BMC Syst. Biol. 2016, 10, 165–179. [Google Scholar] [CrossRef]
  12. Mohammadi, A.; Spong, M.W. Quadratic optimization-based nonlinear control for protein conformation prediction. IEEE Control Syst. Lett. 2022, 6, 373–378. [Google Scholar] [CrossRef]
  13. Mohammadi, A.; Spong, M.W. Chetaev Instability Framework for Kinetostatic Compliance-Based Protein Unfolding. IEEE Control Syst. Lett. 2022, 6, 2755–2760. [Google Scholar] [CrossRef]
  14. Maruyama, Y.; Mitsutake, A. Analysis of structural stability of chignolin. J. Phys. Chem. B 2018, 122, 3801–3814. [Google Scholar] [CrossRef] [PubMed]
  15. Madden, C.; Bohnenkamp, P.; Kazerounian, K.; Ilieş, H.T. Residue level three-dimensional workspace maps for conformational trajectory planning of proteins. Int. J. Robot. Res. 2009, 28, 450–463. [Google Scholar] [CrossRef]
  16. Lee, S.; Chirikjian, G.S. Pose Analysis of Alpha-Carbons in Proteins. Int. J. Robot. Res. 2005, 24, 183–210. [Google Scholar] [CrossRef]
  17. Arkun, Y.; Gür, M. Protein folding using coarse-grained optimal control and molecular dynamics. IFAC Proc. Vol. 2011, 44, 14213–14216. [Google Scholar] [CrossRef]
  18. Arkun, Y.; Erman, B. Prediction of optimal folding routes of proteins that satisfy the principle of lowest entropy loss: Dynamic contact maps and optimal control. PLoS ONE 2010, 5, e13275. [Google Scholar] [CrossRef]
  19. Arkun, Y.; Gür, M. Combining optimal control theory and molecular dynamics for protein folding. PLoS ONE 2012, 7, e29628. [Google Scholar] [CrossRef]
  20. Kazerounian, K.; Ilies, H. The Evolving Role of Robot Kinematics in Bio-Nanotechnology. In Proceedings of the International Symposium on Advances in Robot Kinematics, Bilbao, Spain, 30 June–4 July 2024; pp. 77–87. [Google Scholar] [CrossRef]
  21. Shahbazi, Z.; Ilieş, H.T.; Kazerounian, K. Hydrogen bonds and kinematic mobility of protein molecules. ASME J. Mech. Robot. 2010, 2, 021009. [Google Scholar] [CrossRef]
  22. Chorsi, M.T.; Tavousi, P.; Mundrane, C.; Gorbatyuk, V.; Ilieş, H.; Kazerounian, K. One Degree of Freedom 7-R Closed Loop Linkage as a Building Block of Nanorobots. In Advances in Robot Kinematics 2020; Springer: Berling/Heidelberg, Germany, 2020; pp. 41–48. [Google Scholar]
  23. Chorsi, M.T.; Tavousi, P.; Mundrane, C.; Gorbatyuk, V.; Kazerounian, K.; Ilies, H. Kinematic design of functional nanoscale mechanisms from molecular primitives. J. Micro-Nano-Manuf. 2021, 9, 021005. [Google Scholar] [CrossRef]
  24. Mundrane, C.; Chorsi, M.; Vinogradova, O.; Ilieş, H.; Kazerounian, K. Exploring electric field perturbations as the actuator for nanorobots and nanomachines. In Proceedings of the International Symposium on Advances in Robot Kinematics, Bilbao, Spain, 26–30 June 2022; pp. 257–265. [Google Scholar] [CrossRef]
  25. Chorsi, M.S.; Linthicum, W.; Pozhidaeva, A.; Mundrane, C.; Mulligan, V.K.; Chen, Y.; Tavousi, P.; Gorbatyuk, V.; Vinogradova, O.; Hoch, J.C.; et al. Ultra-confined controllable cyclic peptides as supramolecular biomaterials. Nano Today 2024, 56, 102247. [Google Scholar] [CrossRef]
  26. Hamdi, M.; Ferreira, A. Multiscale design and modeling of protein-based nanomechanisms for nanorobotics. Int. J. Robot. Res. 2009, 28, 436–449. [Google Scholar] [CrossRef]
  27. Testard, N.J.; Chevallereau, C.; Wenger, P. Comparison of explicit and implicit numerical integrations for a tendon-driven robot. In Proceedings of the International Conference on Cable-Driven Parallel Robots 2023, Nantes, France, 25–28 June 2023; pp. 234–245. [Google Scholar] [CrossRef]
  28. Gibson, C.; Murphey, T.D. Geometric integration of impact during an orbital docking procedure. In Proceedings of the 2010 IEEE International Conference on Automation Science and Engineering, Toronto, ON, Canada, 21–24 August 2010; pp. 928–932. [Google Scholar] [CrossRef]
  29. Pekarek, D.; Marsden, J.E. Variational collision integrators and optimal control. In Proceedings of the 18th International Symposium on Mathematical Theory of Networks & Systems (MTNS), Blacksburg, VI, USA, 28 July–1 August 2008. [Google Scholar]
  30. Fan, T.; Schultz, J.; Murphey, T. Efficient computation of higher-order variational integrators in robotic simulation and trajectory optimization. In Algorithmic Foundations of Robotics XIII; Springer: Berling/Heidelberg, Germany, 2018; pp. 689–706. [Google Scholar] [CrossRef]
  31. Braun, D.J.; Goldfarb, M. Simulation of constrained mechanical systems—Part II: Explicit numerical integration. J. Appl. Mech. 2012, 79, 041018. [Google Scholar] [CrossRef]
  32. Fang, L.; Kissel, A.; Zhang, R.; Negrut, D. On the use of half-implicit numerical integration in multibody dynamics. ASME J. Comput. Nonlinear Dyn. 2023, 18, 014501. [Google Scholar] [CrossRef]
  33. Till, J.; Aloi, V.; Rucker, C. Real-time dynamics of soft and continuum robots based on Cosserat rod models. Int. J. Robot. Res. 2019, 38, 723–746. [Google Scholar] [CrossRef]
  34. Nordkvist, N.; Sanyal, A.K. A Lie group variational integrator for rigid body motion in SE(3) with applications to underwater vehicle dynamics. In Proceedings of the 49th IEEE Conference on Decision and Control (CDC), Atlanta, GA, USA, 15–17 December 2010; pp. 5414–5419. [Google Scholar] [CrossRef]
  35. Kelley, C.T.; Keyes, D.E. Convergence analysis of pseudo-transient continuation. SIAM J. Numer. Anal. 1998, 35, 508–523. [Google Scholar] [CrossRef]
  36. Coffey, T.S.; Kelley, C.T.; Keyes, D.E. Pseudotransient continuation and differential-algebraic equations. SIAM J. Sci. Comput. 2003, 25, 553–569. [Google Scholar] [CrossRef]
  37. Han, T.M.; Han, Y. Solving implicit equations arising from Adams-Moulton methods. BIT Numer. Math. 2002, 42, 336–350. [Google Scholar] [CrossRef]
  38. Kelley, C.; Liao, L.Z. Explicit pseudo-transient continuation. Pac. J. Optim. 2013, 9, 77–91. [Google Scholar]
  39. Han, T.; Han, Y. Numerical solution for super large scale systems. IEEE Access 2013, 1, 537–544. [Google Scholar] [CrossRef]
  40. Ceze, M.; Fidkowski, K.J. Constrained pseudo-transient continuation. Int. J. Numer. Methods Eng. 2015, 102, 1683–1703. [Google Scholar] [CrossRef]
  41. Mulder, W.A.; van Leer, B. Experiments with implicit upwind methods for the Euler equations. J. Comput. Phys. 1985, 59, 232–246. [Google Scholar] [CrossRef]
  42. Ceze, M.; Fidkowski, K. Pseudo-transient continuation, solution update methods, and CFL strategies for DG discretizations of the RANS-SA equations. In Proceedings of the 21st AIAA Computational Fluid Dynamics Conference, San Diego, CA, USA, 24–27 June 2013; p. 2686. [Google Scholar] [CrossRef]
  43. Shestakov, A.I.; Offner, S.S. A multigroup diffusion solver using pseudo transient continuation for a radiation-hydrodynamic code with patch-based AMR. J. Comput. Phys. 2008, 227, 2154–2186. [Google Scholar] [CrossRef]
  44. Rashidi, S.; Esfahani, J.A.; Maskaniyan, M. Applications of magnetohydrodynamics in biological systems—A review on the numerical studies. J. Magn. Magn. Mater. 2017, 439, 358–372. [Google Scholar] [CrossRef]
  45. Mochiyama, H.; Shimemura, E.; Kobayashi, H. Shape control of manipulators with hyper degrees of freedom. Int. J. Robot. Res. 1999, 18, 584–600. [Google Scholar] [CrossRef]
  46. Jones, B.A.; Walker, I.D. Kinematics for multisection continuum robots. IEEE Trans. Robot. 2006, 22, 43–55. [Google Scholar] [CrossRef]
  47. Seleem, I.A.; El-Hussieny, H.; Ishii, H. Imitation-Based Motion Planning and Control of a Multi-Section Continuum Robot Interacting with the Environment. IEEE Robot. Autom. Lett. 2023, 8, 1351–1358. [Google Scholar] [CrossRef]
  48. Bruder, D.; Fu, X.; Gillespie, R.B.; Remy, C.D.; Vasudevan, R. Data-driven control of soft robots using Koopman operator theory. IEEE Trans. Robot. 2020, 37, 948–961. [Google Scholar] [CrossRef]
  49. Kacem, A.; Zbiss, K.; Mohammadi, A. A Numerical Integrator for Forward Dynamics Simulations of Folding Process for Protein Molecules Modeled as Hyper-Redundant Robots. In Proceedings of the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, MI, USA, 1–5 October 2023. [Google Scholar]
  50. Finkelstein, A.V.; Ptitsyn, O. Protein Physics: A Course of Lectures; Elsevier: Amsterdam, The Netherlands, 2016. [Google Scholar]
  51. Kazerounian, K.; Latif, K.; Alvarado, C. Protofold: A successive kinetostatic compliance method for protein conformation prediction. ASME J. Mech. Des. 2005, 127, 712–717. [Google Scholar] [CrossRef]
  52. Alvarado, C.; Kazerounian, K. On the rotational operators in protein structure simulations. Prot. Eng. 2003, 16, 717–720. [Google Scholar] [CrossRef]
  53. Adolf, D.B.; Ediger, M.D. Brownian dynamics simulations of local motions in polyisoprene. Macromolecules 1991, 24, 5834–5842. [Google Scholar] [CrossRef]
  54. Wang, Y.; Boyd, S. Fast evaluation of quadratic control-Lyapunov policy. IEEE Trans. Control Syst. Technol. 2010, 19, 939–946. [Google Scholar] [CrossRef]
  55. Fliege, J.; Vaz, A.I.F.; Vicente, L.N. Complexity of gradient descent for multiobjective optimization. Optim. Methods Softw. 2019, 34, 949–959. [Google Scholar] [CrossRef]
  56. Tranchida, J.; Plimpton, S.J.; Thibaudeau, P.; Thompson, A.P. Massively parallel symplectic algorithm for coupled magnetic spin dynamics and molecular dynamics. J. Chem. Phys. 2018, 372, 406–425. [Google Scholar] [CrossRef]
  57. Gray, S.K.; Noid, D.W.; Sumpter, B.G. Symplectic integrators for large scale molecular dynamics simulations: A comparison of several explicit methods. J. Chem. Phys. 1994, 101, 4062–4072. [Google Scholar] [CrossRef]
  58. Scholtz, J.M.; Baldwin, R.L. The mechanism of alpha-helix formation by peptides. Annu. Rev. Biophys. Biomol. Struct. 1992, 21, 95–118. [Google Scholar] [CrossRef] [PubMed]
  59. Coevoet, E.; Morales-Bieze, T.; Largilliere, F.; Zhang, Z.; Thieffry, M.; Sanz-Lopez, M.; Carrez, B.; Marchal, D.; Goury, O.; Dequidt, J.; et al. Software toolkit for modeling, simulation, and control of soft robots. Adv. Robot. 2017, 31, 1208–1224. [Google Scholar] [CrossRef]
  60. Armanini, C.; Dal Corso, F.; Misseroni, D.; Bigoni, D. From the elastica compass to the elastica catapult: An essay on the mechanics of soft robot arm. Proc. R. Soc. A Math. Phys. Eng. Sci. 2017, 473, 20160870. [Google Scholar] [CrossRef]
  61. Bern, J.M.; Banzet, P.; Poranne, R.; Coros, S. Trajectory optimization for cable-driven soft robot locomotion. In Proceedings of the Robotics: Science and Systems, Delft, The Netherlands, 15–19 July 2019; Volume 1. [Google Scholar] [CrossRef]
  62. Calogero, L.; Pagone, M.; Rizzo, A. Enhanced Quadratic Programming via Pseudo-Transient Continuation: An Application to Model Predictive Control. IEEE Control. Syst. Lett. 2024, 8, 1661–1666. [Google Scholar] [CrossRef]
Figure 1. The kinematic structure of the protein backbone chain is similar to that of robotic mechanisms with hyper degrees of freedom. Specifically, C α atoms play the role of hinges connecting peptide planes together. These C α atoms are kinematically the same as universal joints with two degrees of freedom. In kinetostatic protein folding, the peptide linkages articulate with respect to each other through dihedral angular variations facilitated by the C α atoms.
Figure 1. The kinematic structure of the protein backbone chain is similar to that of robotic mechanisms with hyper degrees of freedom. Specifically, C α atoms play the role of hinges connecting peptide planes together. These C α atoms are kinematically the same as universal joints with two degrees of freedom. In kinetostatic protein folding, the peptide linkages articulate with respect to each other through dihedral angular variations facilitated by the C α atoms.
Robotics 13 00150 g001
Figure 2. Modeling protein backbone chains as nano-kinematic structures in which peptide planes act as rigid links interconnected by revolute joints centered at the alpha carbon atoms, each offering two degrees of freedom. The nano-kinematic chain can rotate about the unit vectors u j , 1 j 2 N , which align with the C α C and N C α bonds. The body vectors, denoted by b j and 1 j 2 N , comprehensively define the relative spatial arrangement of coplanar atoms within each peptide plane. The body and unit vectors act in concert to provide a comprehensive description of the protein molecule’s 3D configuration as the dihedral angles θ j and 1 j 2 N rotate under the influence of interatomic forces.
Figure 2. Modeling protein backbone chains as nano-kinematic structures in which peptide planes act as rigid links interconnected by revolute joints centered at the alpha carbon atoms, each offering two degrees of freedom. The nano-kinematic chain can rotate about the unit vectors u j , 1 j 2 N , which align with the C α C and N C α bonds. The body vectors, denoted by b j and 1 j 2 N , comprehensively define the relative spatial arrangement of coplanar atoms within each peptide plane. The body and unit vectors act in concert to provide a comprehensive description of the protein molecule’s 3D configuration as the dihedral angles θ j and 1 j 2 N rotate under the influence of interatomic forces.
Robotics 13 00150 g002
Figure 3. Flowchart of the conventional kinetostatic folding iteration. The variation of dihedral angles in the flowchart (within the dashed borders) is governed by Equation (8). The most computationally intensive procedure at each conformation of the protein molecule (highlighted in red) consists of the electrostatic and van der Waals force computations.
Figure 3. Flowchart of the conventional kinetostatic folding iteration. The variation of dihedral angles in the flowchart (within the dashed borders) is governed by Equation (8). The most computationally intensive procedure at each conformation of the protein molecule (highlighted in red) consists of the electrostatic and van der Waals force computations.
Robotics 13 00150 g003
Figure 4. Flowchart of the proposed explicit Ψ TC integrator for computing protein folding pathways in kinetostatic folding simulations. The Ψ TC algorithm consists of four main steps: (Step 1) Initiation; (Step 2) Predictor–Corrector Computations; (Step 3) Checking Convergence; and, (Step 4) SER-based Step Size Update. In the Ψ TC integration scheme with a fixed step size, (Step 4) is skipped.
Figure 4. Flowchart of the proposed explicit Ψ TC integrator for computing protein folding pathways in kinetostatic folding simulations. The Ψ TC algorithm consists of four main steps: (Step 1) Initiation; (Step 2) Predictor–Corrector Computations; (Step 3) Checking Convergence; and, (Step 4) SER-based Step Size Update. In the Ψ TC integration scheme with a fixed step size, (Step 4) is skipped.
Robotics 13 00150 g004
Figure 5. Simulation results associated with the Ψ TC integrator. The free energy of the backbone chain of a protein molecule with a 32-dimensional dihedral angle vector (blue curve; G ( θ ) on the right axis), its transient conformations along the folding pathway, and the step size of the explicit Ψ TC scheme (black curve; δ k on the left axis). From left to right, the five plotted protein backbone chain conformations correspond to iteration numbers k = 100 , k = 300 , k = 500 , k = 1000 , and k = 1400 , respectively.
Figure 5. Simulation results associated with the Ψ TC integrator. The free energy of the backbone chain of a protein molecule with a 32-dimensional dihedral angle vector (blue curve; G ( θ ) on the right axis), its transient conformations along the folding pathway, and the step size of the explicit Ψ TC scheme (black curve; δ k on the left axis). From left to right, the five plotted protein backbone chain conformations correspond to iteration numbers k = 100 , k = 300 , k = 500 , k = 1000 , and k = 1400 , respectively.
Robotics 13 00150 g005
Figure 6. Simulation results associated with the Ψ TC integrator. The free energy of the backbone chain of a protein molecule with a 62-dimensional dihedral angle vector (blue curve; G ( θ ) on the right axis), its transient conformations along the folding pathway, and the step size of the explicit Ψ TC scheme (black curve; δ k on the left axis). From left to right, the five plotted protein backbone chain conformations correspond to iteration numbers k = 100 , k = 300 , k = 500 , k = 1000 , and k = 1400 , respectively.
Figure 6. Simulation results associated with the Ψ TC integrator. The free energy of the backbone chain of a protein molecule with a 62-dimensional dihedral angle vector (blue curve; G ( θ ) on the right axis), its transient conformations along the folding pathway, and the step size of the explicit Ψ TC scheme (black curve; δ k on the left axis). From left to right, the five plotted protein backbone chain conformations correspond to iteration numbers k = 100 , k = 300 , k = 500 , k = 1000 , and k = 1400 , respectively.
Robotics 13 00150 g006
Figure 7. Simulation results associated with the Ψ TC integrator. The free energy of the backbone chain of a protein molecule with a 102-dimensional dihedral angle vector (blue curve; G ( θ ) on the right axis), its transient conformations along the folding pathway, and the step size of the explicit Ψ TC scheme (black curve; δ k on the left axis). From left to right, the five plotted protein backbone chain conformations correspond to iteration numbers k = 100 , k = 300 , k = 500 , k = 1000 , and k = 1400 , respectively.
Figure 7. Simulation results associated with the Ψ TC integrator. The free energy of the backbone chain of a protein molecule with a 102-dimensional dihedral angle vector (blue curve; G ( θ ) on the right axis), its transient conformations along the folding pathway, and the step size of the explicit Ψ TC scheme (black curve; δ k on the left axis). From left to right, the five plotted protein backbone chain conformations correspond to iteration numbers k = 100 , k = 300 , k = 500 , k = 1000 , and k = 1400 , respectively.
Robotics 13 00150 g007
Figure 8. Comparison of the results obtained from our proposed Ψ TC scheme against the explicit Euler integrator method with a fixed step size. The free energy of the backbone chain of protein molecules obtained from the explicit Ψ TC scheme (dashed curve), explicit Euler scheme with step size satisfying κ 0 δ 0 = 1 (continuous curve), and explicit Euler scheme with step size satisfying κ 0 δ 0 = 0.5 (dash-dot-dashed curve). In the case of κ 0 δ 0 = 1 , the oscillations and instability in the protein free energy profile computed from the explicit Euler scheme (the continuous curves) are due to the large step size chosen for this method. In the case of κ 0 δ 0 = 0.5 (the dash-dot-dashed curves), the protein free energy does not suffer such oscillations, but struggles to converge to its local minimum even after 1800 iterations.
Figure 8. Comparison of the results obtained from our proposed Ψ TC scheme against the explicit Euler integrator method with a fixed step size. The free energy of the backbone chain of protein molecules obtained from the explicit Ψ TC scheme (dashed curve), explicit Euler scheme with step size satisfying κ 0 δ 0 = 1 (continuous curve), and explicit Euler scheme with step size satisfying κ 0 δ 0 = 0.5 (dash-dot-dashed curve). In the case of κ 0 δ 0 = 1 , the oscillations and instability in the protein free energy profile computed from the explicit Euler scheme (the continuous curves) are due to the large step size chosen for this method. In the case of κ 0 δ 0 = 0.5 (the dash-dot-dashed curves), the protein free energy does not suffer such oscillations, but struggles to converge to its local minimum even after 1800 iterations.
Robotics 13 00150 g008
Table 1. Sample overview of numerical integration schemes utilized for diverse robotic systems/applications.
Table 1. Sample overview of numerical integration schemes utilized for diverse robotic systems/applications.
ReferenceNumerical IntegratorApplication Area
[27] Testard et al. (2023)Implicit Euler integratorElastic tendon-driven robots
[28] Gibson & Murphey (2010)Variational integratorOrbital docking of Canadarm and the ISS
[29] Pekarek & Marsden (2008)Variational collision integratorLegged robotic locomotion
[30] Fan et al. (2018)Higher-order variational integratorsRobot trajectory optimization
[31] Braun & Goldfarb (2012)Explicit DAE integratorConstrained mechanical systems
[32] Fang et al. (2023)Half-implicit integratorIndex three DAEs in multibody dynamics
[33] Till et al. (2019)Runge–Kutta (RK4) integratorContinuum and soft robots
[34] Nordvik & Sanyal (2010)Lie group variational integratorUnderwater autonomous vehicles
[12] Mohammadi & Spong (2022)Explicit Euler integratorProtein molecules modeled as nano-mechanisms with hyper degrees of freedom
Table 2. Parameters of the integration schemes associated with the kinetostatic protein folding simulations.
Table 2. Parameters of the integration schemes associated with the kinetostatic protein folding simulations.
dim Q Ψ TC ( δ 0 ) Ψ TC ( δ max δ 0 ) Ψ TC ( δ 0 ϵ )Euler-1 ( κ 0 δ 0 )Euler-2 ( κ 0 δ 0 )
32 2 × 10 4 5 2 × 10 4 10.5
62 1 × 10 4 5 1 × 10 4 10.5
102 1 × 10 4 5 1 × 10 4 10.5
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kacem, A.; Zbiss, K.; Mohammadi, A. A Numerical Integrator for Kinetostatic Folding of Protein Molecules Modeled as Robots with Hyper Degrees of Freedom. Robotics 2024, 13, 150. https://doi.org/10.3390/robotics13100150

AMA Style

Kacem A, Zbiss K, Mohammadi A. A Numerical Integrator for Kinetostatic Folding of Protein Molecules Modeled as Robots with Hyper Degrees of Freedom. Robotics. 2024; 13(10):150. https://doi.org/10.3390/robotics13100150

Chicago/Turabian Style

Kacem, Amal, Khalil Zbiss, and Alireza Mohammadi. 2024. "A Numerical Integrator for Kinetostatic Folding of Protein Molecules Modeled as Robots with Hyper Degrees of Freedom" Robotics 13, no. 10: 150. https://doi.org/10.3390/robotics13100150

APA Style

Kacem, A., Zbiss, K., & Mohammadi, A. (2024). A Numerical Integrator for Kinetostatic Folding of Protein Molecules Modeled as Robots with Hyper Degrees of Freedom. Robotics, 13(10), 150. https://doi.org/10.3390/robotics13100150

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop