Article

Extrapolation of Physics-Inspired Deep Networks in Learning Robot Inverse Dynamics

1 College of Automation, Beijing Information Science and Technology University, Beijing 100192, China
2 Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
* Authors to whom correspondence should be addressed.
Mathematics 2024, 12(16), 2527; https://doi.org/10.3390/math12162527
Submission received: 25 July 2024 / Revised: 5 August 2024 / Accepted: 7 August 2024 / Published: 15 August 2024

Abstract

Accurate robot dynamics models are crucial for safe and stable control as well as for generalization to new conditions. Data-driven methods are increasingly used in robotics dynamics modeling for their superior approximation, with extrapolation performance being a critical efficacy indicator. While deep learning is widely used, it often overlooks essential physical principles, leading to weaker extrapolation capabilities. Recent innovations have introduced physics-inspired deep networks that integrate deep learning with physics, leading to improved extrapolation due to their informed structure, but potentially to underfitting in real-world scenarios due to the presence of unmodeled phenomena. This paper presents an experimental framework to assess the extrapolation capabilities of data-driven methods. Using this framework, physics-inspired deep networks are applied to learn the inverse dynamics models of a simulated robotic manipulator and two real physical systems. The results show that under ideal observation conditions physics-inspired models can learn the system’s underlying structure and demonstrate strong extrapolation capabilities, indicating a promising direction in robotics by offering more accurate and interpretable models. However, in real systems their extrapolation often falls short because the physical priors do not capture all dynamic phenomena, indicating room for improvement in practical applications.

1. Introduction

Accurate inverse dynamics models, which map joint movements to the applied torques and forces, are crucial for controlling robotic arms. Multi-degree-of-freedom robotic arms pose modeling challenges due to their nonlinearity and strong coupling [1]. Traditional methods relying on manually derived dynamics equations are labor-intensive and only accurate under ideal assumptions. Parameter uncertainties, omission of dynamics such as complex friction, and structural changes all reduce their effectiveness [2].
Black-box data-driven approaches such as Gaussian process regression, recurrent neural networks, and feed-forward neural networks (FF-NN) are popular in robotic arm dynamics modeling for their approximation capabilities. These models derive the inverse dynamics directly from experimental data, bypassing the need for extensive physical system knowledge. For data-driven approaches, a reliable dynamics modeling method should not only possess precise predictive capabilities within the training domain but also exhibit good predictive capabilities outside the training domain (i.e., extrapolation) [3], ensuring that the robot maintains sufficient precision even in novel operational conditions. However, black-box data-driven approaches struggle to learn kinematic constraints and energy conservation laws from the data, leading to weak extrapolation, low data utilization, and a lack of physical plausibility [4,5].
To address the drawbacks of black box models, physically inspired data-driven learning methods are increasingly used in robotic dynamics modeling [6]. These approaches integrate existing knowledge with deep networks, leveraging deep learning to extract hidden structures and abstract features while preserving traditional knowledge-based benefits. These methods are known as physics-inspired deep networks [7] or physics-informed neural networks [8,9]; in this paper, these models are referred to as physics-inspired deep networks. Classic examples include deep Lagrangian networks (DeLaN) [10] and Hamiltonian neural networks [11], which integrate Lagrangian and Hamiltonian mechanics into deep learning frameworks. Lagrangian-based methods [1,4,7,8,10,12,13,14,15] design neural network models for physical systems using the Euler–Lagrange differential equations of motion, expressed in terms of generalized coordinates q, their velocity q ˙ , and a Lagrangian function L ( q , q ˙ ) . The Lagrangian function, representing the difference between kinetic and potential energies, is modeled by neural networks. These energy components can be modeled either separately [7,10,15] or together [12]. This enhances physical plausibility and interpretability, allowing for more precise dynamics models and better extrapolation in rigid systems under ideal conditions [7,8,10,11,12,13]. However, the use of physical priors can lead to less accuracy compared to black box methods [14], as they often fail to capture complex nonlinear phenomena such as friction, hysteresis, and contact [16]. Thus, while physics-inspired networks outperform black box models for rigid-body dynamics, their modeling and extrapolation capabilities are limited in real robotic systems with uncertain forces due to the constraints of physical priors [1,8,14,17].
In addressing the limitations of physics-inspired networks in modeling friction due to conservative dynamics, research has focused on integrating key friction effects from robotic manipulators’ actuators into model learning [14,15]. To handle uncertainties such as nonlinear friction and flexibility, the augmented deep Lagrangian network, DeLaN-FFNN [1], was developed. By incorporating a feed-forward neural network, this enhanced model can represent uncertainties beyond rigid body dynamics, improving its effectiveness for real systems.
Existing research has not systematically experimented with or analyzed the extrapolation capabilities of physics-inspired networks. This paper introduces four different structures of DeLaN, examines their characteristics, and conducts a comprehensive evaluation of their extrapolation capabilities across various scenarios, including simulations of real systems, different control policies, and varying end-effector loads. The contributions of this paper are as follows: (i) an experimental framework for assessing data-driven methods is introduced and used to systematically analyze how physics-inspired networks can improve performance in real systems; (ii) it is shown that under ideal conditions, physics-inspired networks can learn underlying system structures and outperform standard deep learning methods in extrapolation tasks; (iii) we demonstrate that for real systems, adding incomplete physical priors can paradoxically reduce the extrapolation abilities of physics-inspired networks compared to black box models.
In the following sections, the mathematical formulation for inverse dynamics modeling is introduced (Section 2.1), followed by a summary of the theory behind Lagrangian mechanics (Section 2.2). The incorporation of rigid-body dynamics in deep learning is then explained (Section 2.3.1), along with the methods for introducing friction (Section 2.3.2) or uncertain forces (Section 2.3.3) into the learning of models. Section 3 introduces an experimental framework for assessing extrapolation. The extrapolation capabilities of physics-inspired networks and black box models are compared in both simulated (Section 4.2) and real (Section 4.3) robotic systems, with an analysis of the results presented in Section 4.4.

2. Preliminaries

The modeling of inverse dynamics and the derivation of rigid body dynamics equations through Lagrangian mechanics are outlined in this section, and four different structures of DeLaN are introduced.

2.1. Inverse Dynamics Modeling

The goal of inverse dynamics modeling is to find the function f mapping the system's motion state to the control input, i.e.,
f(q, \dot{q}, \ddot{q}) = \tau,    (1)
where q, q̇, and q̈ represent the vectors of joint positions, joint velocities, and joint accelerations, respectively. For a robot with n degrees of freedom (DoF), these vectors have dimension R^{n×1}, with τ ∈ R^{n×1} denoting the unknown joint torques that need to be learned. Coupling the control input τ to the system state q is essential for model-based control approaches in order to ensure precise control and responsiveness to changes in the system state. Using these definitions, the general inverse dynamics model of a robot is given by
H(q)\ddot{q} + c(q, \dot{q}) + g(q) + \varepsilon(q, \dot{q}, \ddot{q}) = \tau,    (2)
where H(q) ∈ R^{n×n} is the symmetric and positive definite mass matrix, c(q, q̇) ∈ R^{n×1} represents the centrifugal and Coriolis forces, g(q) ∈ R^{n×1} represents the gravity vector, and ε(q, q̇, q̈) ∈ R^{n×1} stands for the uncertain torque/force effects of those robot elements that are not considered elsewhere in the dynamics model (e.g., elasticities in the mechanical design, model parameter inaccuracies in the masses or inertias, vibrational effects, Stribeck friction, couplings, sensor noise) [1,2,18]. Nonlinear friction is one of the main factors affecting the robot's high-precision motion [19]. If only friction forces are considered in ε(q, q̇, q̈), then the inverse dynamics model can be written as follows [20]:
H(q)\ddot{q} + c(q, \dot{q}) + g(q) + f_{r}(\dot{q}) = \tau,    (3)
where f r ( q ˙ ) R n × 1 represents the component of joint friction forces. There exist various methods for deriving an inverse dynamics model from the above equation of motion when completely neglecting ε ( q , q ˙ , q ¨ ) [21]. The most commonly used methods for modeling robot dynamics are based on Lagrangian mechanics derived from the principles of rigid body dynamics. Robot rigid-body dynamics ( τ R B D ) can be defined by the specific mathematical formulation
H(q)\ddot{q} + c(q, \dot{q}) + g(q) = \tau_{\mathrm{RBD}}.    (4)
Therefore, because models derived from Lagrangian mechanics assume rigid-body dynamics, they tend to overlook terms such as ε(q, q̇, q̈) in the robot inverse dynamics. As a result, they fail to accurately describe complex nonlinear phenomena such as friction, hysteresis, contact, and forces generated by flexibility and other uncertainties [16]. For ideal rigid-body dynamics data, as described by Equation (4), the data can be collected in a simulated environment. However, in a real-world environment, dynamic phenomena are not limited to rigid-body dynamics, as real-world data include the uncertain forces described in Equation (2). For a more detailed analysis of these uncertain forces, please refer to [1].

2.2. Lagrangian Mechanics

The Lagrangian L in Lagrangian mechanics is a function of the generalized coordinates q and velocities q̇ that describes the dynamics of a system. The Lagrangian is generally chosen to be
L(q, \dot{q}) = T(q, \dot{q}) - V(q) = \tfrac{1}{2}\dot{q}^{T} H(q)\dot{q} - V(q),    (5)
where T ( q , q ˙ ) is the kinetic energy and V ( q ) is the potential energy. Utilizing the calculus of variations, the Euler–Lagrange equation with non-conservative forces is derived by
\frac{d}{dt}\frac{\partial L(q, \dot{q})}{\partial \dot{q}} - \frac{\partial L(q, \dot{q})}{\partial q} = \tau,    (6)
\frac{\partial^{2} L(q, \dot{q})}{\partial \dot{q}^{2}}\ddot{q} + \frac{\partial^{2} L(q, \dot{q})}{\partial q\,\partial \dot{q}}\dot{q} - \frac{\partial L(q, \dot{q})}{\partial q} = \tau,    (7)
where τ stands for the generalized forces. Substituting the Lagrangian of Equation (5) into Equation (7) yields the following second-order ordinary differential equation (ODE):
H(q)\ddot{q} + \underbrace{\dot{H}(q)\dot{q} - \tfrac{1}{2}\Big(\dot{q}^{T}\frac{\partial H(q)}{\partial q}\dot{q}\Big)^{T}}_{=\,c(q,\dot{q})} + \underbrace{\frac{\partial V(q)}{\partial q}}_{=\,g(q)} = \tau.    (8)
Simplifying this expression yields Equation (4). The Lagrangian method is a commonly used approach for inverse dynamics analysis, as it allows for the simplest form of inverse dynamics models to be established when ignoring the uncertain forces ε ( q , q ˙ , q ¨ ) .
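As a minimal illustration of Equations (6)–(8), the following JAX sketch recovers the generalized forces τ from a hand-written Lagrangian by automatic differentiation. The two-link parameters and simplified energies are hypothetical and serve only to demonstrate the computation; any differentiable L(q, q̇) could be substituted.

```python
import jax
import jax.numpy as jnp

def lagrangian(q, qd):
    """Hypothetical Lagrangian of a simple 2-DoF arm in a vertical plane
    (illustrative only; the mass matrix here is a constant diagonal)."""
    m, l, g = 1.0, 1.0, 9.81
    H = jnp.diag(jnp.array([m * l**2, m * l**2]))
    T = 0.5 * qd @ H @ qd                                              # kinetic energy
    V = m * g * l * (2.0 - jnp.cos(q[0]) - jnp.cos(q[0] + q[1]))       # potential energy
    return T - V

def inverse_dynamics(q, qd, qdd):
    """Eq. (7): tau = (d2L/dqd2) qdd + (d2L/dqd dq) qd - dL/dq, via autodiff."""
    dL_dq = jax.grad(lagrangian, argnums=0)(q, qd)
    d2L_dqd2 = jax.hessian(lagrangian, argnums=1)(q, qd)
    d2L_dqd_dq = jax.jacfwd(jax.grad(lagrangian, argnums=1), argnums=0)(q, qd)
    return d2L_dqd2 @ qdd + d2L_dqd_dq @ qd - dL_dq

q = jnp.array([0.3, -0.2])
qd = jnp.array([0.1, 0.4])
qdd = jnp.array([0.0, -0.5])
print(inverse_dynamics(q, qd, qdd))
```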

2.3. Physics-Inspired Deep Networks

This subsection provides an introduction on incorporating rigid-body dynamics into deep learning, based on which methods for introducing friction or uncertain forces into model learning are discussed.

2.3.1. Deep Lagrangian Networks

The DeLaN structure can be seen in Figure 1a. A DeLaN parameterizes the mass matrix H and potential energy V as two separate deep networks. Therefore, the approximate Lagrangian L is described by
\hat{L}(q, \dot{q}; \theta, \phi) = \tfrac{1}{2}\dot{q}^{T}\hat{H}(q; \theta)\dot{q} - \hat{V}(q; \phi),    (9)
where θ and ϕ are the respective network parameters. Then, the inverse model τ = f ( q , q ˙ , q ¨ ; θ , ϕ ) is approximated by
\hat{f}(q, \dot{q}, \ddot{q}; \theta, \phi) = \hat{H}(q; \theta)\ddot{q} + \hat{\dot{H}}(q; \theta)\dot{q} - \tfrac{1}{2}\Big(\dot{q}^{T}\frac{\partial \hat{H}(q; \theta)}{\partial q}\dot{q}\Big)^{T} + \frac{\partial \hat{V}(q; \phi)}{\partial q}.    (10)
The inputs of the network are the generalized positions, velocities, and accelerations. The output of the network is the approximated torque τ given by Equation (10). The optimization problem is described as follows:
(\theta^{*}, \phi^{*}) = \arg\min_{\theta,\phi}\,\big\lVert \hat{f}(q, \dot{q}, \ddot{q}; \theta, \phi) - \tau_{R}\big\rVert^{2}_{W_{\tau_{R}}},    (11)
where τ_R represents the real torque, which is used to compute the loss against the corresponding torque τ̂ output by the network, ‖·‖_W denotes the Mahalanobis norm, and W_{τ_R} is the diagonal covariance matrix of the generalized forces. It is beneficial to normalize the loss using the covariance matrix, as the magnitude of the residual might vary between different joints.
Previously, it was noted that the Lagrangian L can be parameterized using two networks for the mass matrix H ( q ) and potential energy V ( q ) . This is referred to as ‘structured’, and is shown in Figure 1a. An alternative approach employs a single network for both tasks, bypassing the need to learn the mass matrix H ( q ) , kinetic energy T ( q , q ˙ ) , and potential energy V ( q ) , resulting in what is termed the black-box Lagrangian, shown in Figure 1b.
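The following JAX sketch shows one way the structured parameterization of Figure 1a can be realized: an MLP predicts a lower-triangular Cholesky factor whose product gives a positive-definite mass matrix Ĥ(q; θ), and a second MLP predicts the potential energy V̂(q; φ). The layer widths, initialization scale, and diagonal offset are illustrative assumptions rather than the exact architecture used in the experiments; the predicted torque of Equation (10) then follows by differentiating L̂ with automatic differentiation, exactly as in the sketch at the end of Section 2.2.

```python
import jax
import jax.numpy as jnp

N_DOF = 3      # illustrative: matches the simulated 3-DoF manipulator
HIDDEN = 64    # illustrative layer width

def init_mlp(key, sizes):
    """Initialize a small MLP as a list of (W, b) pairs."""
    params = []
    for din, dout in zip(sizes[:-1], sizes[1:]):
        key, sub = jax.random.split(key)
        params.append((0.1 * jax.random.normal(sub, (din, dout)), jnp.zeros(dout)))
    return params

def mlp(params, x):
    for W, b in params[:-1]:
        x = jnp.tanh(x @ W + b)
    W, b = params[-1]
    return x @ W + b

def mass_matrix(theta, q):
    """Predict a positive-definite mass matrix H(q; theta) via a Cholesky factor."""
    out = mlp(theta, q)                                      # raw lower-triangular entries
    L = jnp.zeros((N_DOF, N_DOF)).at[jnp.tril_indices(N_DOF)].set(out)
    diag = jax.nn.softplus(jnp.diag(L)) + 1e-3               # positive diagonal (offset assumed)
    L = L.at[jnp.diag_indices(N_DOF)].set(diag)
    return L @ L.T

def lagrangian(params, q, qd):
    """Structured Lagrangian of Eq. (9): kinetic minus potential energy."""
    theta, phi = params
    return 0.5 * qd @ mass_matrix(theta, q) @ qd - mlp(phi, q)[0]

key_h, key_v = jax.random.split(jax.random.PRNGKey(0))
theta = init_mlp(key_h, [N_DOF, HIDDEN, HIDDEN, N_DOF * (N_DOF + 1) // 2])
phi = init_mlp(key_v, [N_DOF, HIDDEN, HIDDEN, 1])
q, qd = 0.2 * jnp.ones(N_DOF), 0.1 * jnp.ones(N_DOF)
print(mass_matrix(theta, q))
print(lagrangian((theta, phi), q, qd))
```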

2.3.2. Introducing Friction to Model Learning

DeLaN cannot model friction directly, as the learned dynamics are conservative. Referencing [15], we utilized a model assuming that the motor friction depends solely on the joint velocity q ˙ i of the i-th joint, and is independent of other joints. Depending on the model’s complexity, a combination of static, viscous, or Stribeck friction is assumed as a model prior, with their superposition described by
\tau_{f_i} = \left(\tau_{C_v} + \tau_{C_s}\exp\!\left(-\frac{\dot{q}_i^{2}}{v}\right)\right)\operatorname{sign}(\dot{q}_i) + d\,\dot{q}_i,    (12)
where τ_{C_v} is the coefficient of static friction, d is the coefficient of viscous friction, and τ_{C_s} and v are the coefficients of Stribeck friction. In the following, the friction coefficients are abbreviated as φ = {τ_{C_v}, τ_{C_s}, v, d}. Because the friction model is a function of the generalized velocities, the friction force τ_{f_i} is a nonconservative generalized force that can be added to the dynamics model derived using Lagrangian mechanics. Adding it to Equation (8) gives
H(q)\ddot{q} + \underbrace{\dot{H}(q)\dot{q} - \tfrac{1}{2}\Big(\dot{q}^{T}\frac{\partial H(q)}{\partial q}\dot{q}\Big)^{T}}_{=\,c(q,\dot{q})} + \underbrace{\frac{\partial V(q)}{\partial q}}_{=\,g(q)} = \tau + \tau_{f}.    (13)
The simplified form of this expression is Equation (3). Given the model prior of Equation (12), the friction coefficients φ can be learned by treating the coefficients as network weights. In this paper, friction is introduced into model learning within DeLaN, as depicted in Figure 1c. This approach is named DeLaN-Friction.
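A minimal sketch of the friction prior in Equation (12) is given below; the coefficients are stored as ordinary parameters so that they can be optimized jointly with the DeLaN weights. The sign convention of the viscous term and the numerical values are illustrative assumptions.

```python
import jax.numpy as jnp

def friction_torque(phi, qd):
    """Joint-wise friction prior of Eq. (12); phi holds the learnable coefficients
    tau_cv (static), tau_cs and v (Stribeck), and d (viscous)."""
    stribeck = (phi["tau_cv"] + phi["tau_cs"] * jnp.exp(-qd**2 / phi["v"])) * jnp.sign(qd)
    viscous = phi["d"] * qd
    return stribeck + viscous

# The coefficients are treated like any other network weight during training.
phi = {"tau_cv": jnp.array([0.5, 0.4, 0.3]),
       "tau_cs": jnp.array([0.2, 0.2, 0.2]),
       "v":      jnp.array([0.1, 0.1, 0.1]),
       "d":      jnp.array([0.05, 0.05, 0.05])}
print(friction_torque(phi, jnp.array([0.3, -0.2, 0.0])))
```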

2.3.3. Introducing Uncertain Forces to Model Learning

To address the limitation of DeLaN, which encompasses only rigid-body dynamics priors, DeLaN was combined with a standard deep network to ensure that the model can incorporate all of the force decomposition structures outlined in Equation (2). The structure of DeLaN-FFNN is shown in Figure 1d. The inverse model τ = f ^ ( q , q ˙ , q ¨ ; θ , ϕ , ψ ) can be described by
\hat{f}(q, \dot{q}, \ddot{q}; \theta, \phi, \psi) = \hat{H}(q; \theta)\ddot{q} + \hat{c}(q, \dot{q}; \theta) + \hat{g}(q; \phi) + \hat{\varepsilon}(q, \dot{q}, \ddot{q}; \psi),    (14)
where ψ are the network parameters corresponding to the torque ε(q, q̇, q̈). To prevent the black-box component from dominating the predicted dynamics, which would cause the potential and kinetic energies to be overlooked, the optimization objective is defined as follows:
(\theta^{*}, \phi^{*}, \psi^{*}) = \arg\min_{\theta,\phi,\psi}\left(\big\lVert \hat{f}(q, \dot{q}, \ddot{q}; \theta, \phi, \psi) - \tau_{R}\big\rVert^{2}_{W_{\tau_{R}}} + \lambda\,\big\lVert \hat{\varepsilon}(q, \dot{q}, \ddot{q}; \psi)\big\rVert^{2}\right),    (15)
where λ is a positive value used to adjust the proportion of the second loss term in the overall loss value.
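A sketch of the composite objective of Equation (15) follows; the inverse torque covariance w_inv and the value of λ are illustrative placeholders rather than the tuned settings used in the experiments.

```python
import jax.numpy as jnp

def delan_ffnn_loss(tau_pred, eps_pred, tau_real, w_inv, lam=1e-2):
    """Composite loss of Eq. (15): Mahalanobis-weighted torque error plus a penalty
    that keeps the black-box residual network eps from dominating the prediction."""
    residual = tau_pred - tau_real
    mahalanobis = jnp.sum(residual**2 * w_inv, axis=-1)   # per-sample weighted torque error
    regularizer = lam * jnp.sum(eps_pred**2, axis=-1)     # shrinks the uncertain-force output
    return jnp.mean(mahalanobis + regularizer)
```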

3. Extrapolation

The inverse dynamics modeling problem can be addressed using data-driven methods, avoiding the tedious derivation of mathematical formulas [22]. The extrapolation capability of deep learning is crucial, and is being actively investigated [23]. For data-driven methods, models that are only applicable under the specific conditions of their training data collection are of limited use; it is crucial that these models, learned under a specific source distribution, maintain their effectiveness across different target distributions [24]. Evaluating a data-driven model’s stability and reliability hinges on its ability to predict inverse dynamics in unexplored configuration spaces [3]. The iid test error (accuracy under the training data distribution) is often a poor indicator of the extrapolation error (accuracy under a different distribution) [25].
Therefore, for a physical system, training trajectories are collected using certain initial conditions π . Based on these data, a dynamics model f ^ can be learned. This model is then evaluated under a new condition π * , with this new condition used to collect a dataset that has a distribution different from the training set. For a robotic manipulator, two different conditions π and π * can represent two different control policies, end effector holding different loads, or any other two distinct conditions; hence, it is not clear that a model f ^ learned under π will also be accurate under π * .
Inspired by the experimental setup in [25] (illustrated in Figure 2), where dynamics models were learned using trajectories sampled from a condition π and their extrapolation performance was assessed on trajectories sampled from some unseen conditions π * , our inverse dynamics model f ^ learned on the training set is then tested on two types of test sets in subsequent experiments. First, the accuracy of the model’s predictions on an iid test set collected under the same conditions π as the training set is assessed. Second, the extrapolation capabilities of the data-driven model on non-iid test sets collected under different conditions π * from the training set are evaluated.
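A short sketch of this protocol is provided below; fit_model and predict are hypothetical placeholders standing in for any of the data-driven learners considered in this paper.

```python
import jax.numpy as jnp

def evaluate_extrapolation(fit_model, predict, train_set, iid_test, non_iid_tests):
    """Protocol of Figure 2: fit under condition pi, then score the model on an iid
    test set (same pi) and on non-iid test sets collected under new conditions pi*."""
    model = fit_model(train_set["X"], train_set["Y"])      # X = (q, qd, qdd), Y = tau
    def mse(split):
        return float(jnp.mean((predict(model, split["X"]) - split["Y"])**2))
    report = {"iid": mse(iid_test)}
    for name, split in non_iid_tests.items():
        report[name] = mse(split)                          # extrapolation error per condition
    return report
```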

4. Experiment

In the experiments, physics-inspired networks are compared to black-box methods for learning the nonlinear dynamics of both simulated and real systems, and their modeling effectiveness and extrapolation capabilities are evaluated. For the simulations, a 3-DoF robotic manipulator is constructed to follow rigid-body dynamics and provide ideal observation data for model learning. In the experiments with real systems, these models are applied to a 3-DoF robot arm and a KUKA LWR, where the dynamics exceed the rigid-body priors due to uncertain forces. These experiments aim to answer the following questions:
  • Q1: Under ideal observation data, can physics-inspired networks learn the underlying structure of physical systems?
  • Q2: When applied to real physical systems, will the extrapolation capabilities of physics-inspired networks be limited by physical priors?
  • Q3: What are the main factors that affect the extrapolation performance of physics-inspired networks in real systems?

4.1. Experiment Setup

In our experiments, a deep Lagrangian network using a single network to represent the Lagrangian (Figure 1b) is defined as black-box DeLaN. When two separate networks represent the mass matrix and potential energy (Figure 1a), this is referred to as structured DeLaN. Various physics-inspired networks were used to assess the inverse dynamics modeling, including structured/black-box DeLaN, DeLaN-Friction, DeLaN-FFNN, and a black-box feedforward network (FF-NN) without priors.

4.1.1. Evaluation Metrics

In the experiments, the mean square error (MSE), normalized mean square error (nMSE), and coefficient of determination ( R 2 ) are used to evaluate the performance of all the learned models. The MSE and nMSE metrics are described by
\mathrm{MSE} = \frac{1}{N}\sum_{i=1}^{N}\big\lVert \tau_{i} - \hat{\tau}_{i}\big\rVert^{2},    (16)
\mathrm{nMSE} = \frac{\sum_{i=1}^{N}\big\lVert \tau_{i} - \hat{\tau}_{i}\big\rVert^{2}}{\sum_{i=1}^{N}\big\lVert \tau_{i} + \delta\big\rVert^{2}},    (17)
where τ_i and τ̂_i are the actual and predicted torque values, respectively, N is the total amount of data, and δ is a small constant to ensure numerical stability. Lower MSE and nMSE values indicate better prediction performance. Additionally, the coefficient of determination R² is utilized, defined as follows:
R^{2} = 1 - \frac{\sum_{i=1}^{N}\left(\tau_{i} - \hat{\tau}_{i}\right)^{2}}{\sum_{i=1}^{N}\left(\tau_{i} - \bar{\tau}\right)^{2}},    (18)
where τ̄ is the mean torque value of the dataset. R² generally takes values ranging from 0 to 1 and is used to evaluate and compare the overall performance of torque predictions for all joints of the robot, with higher values indicating better predictions. A value of 1 represents perfect accuracy, while values outside the 0–1 range suggest poor torque prediction performance [2].
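For concreteness, the three metrics of Equations (16)–(18) can be computed as in the following sketch, where tau and tau_hat are arrays of measured and predicted torques and delta follows the definition above.

```python
import jax.numpy as jnp

def mse(tau, tau_hat):
    """Eq. (16): mean squared error over all samples."""
    return jnp.mean((tau - tau_hat)**2)

def nmse(tau, tau_hat, delta=1e-8):
    """Eq. (17): squared error normalized by the torque magnitude; delta stabilizes the ratio."""
    return jnp.sum((tau - tau_hat)**2) / jnp.sum((tau + delta)**2)

def r2(tau, tau_hat):
    """Eq. (18): coefficient of determination; 1 indicates a perfect prediction."""
    ss_res = jnp.sum((tau - tau_hat)**2)
    ss_tot = jnp.sum((tau - jnp.mean(tau))**2)
    return 1.0 - ss_res / ss_tot
```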

4.1.2. Neural Network Training Details

In each experiment, the dynamics models were trained to convergence using a fixed dataset. The evaluations focused on two types of test datasets: an iid test set, which shows model performance under familiar conditions, and several non-iid test sets designed to challenge extrapolation capabilities (Figure 2). To reduce randomness from single random seed training, the results were averaged over five seeds. Neural networks were constructed using the JAX deep learning framework, which employs automatic differentiation to calculate partial derivatives within the dynamics model.
The ADAM optimizer was employed for the simulated 3-DoF manipulator as well as for the real 3-DoF robot arm and KUKA LWR physical systems. The learning rate was set to 10^{-4} and the weight decay to 10^{-5}. The distinct hyperparameters of the dynamics models for each system are outlined in Table 1. The selection of these parameters was based on experience and experimentation, representing the optimal identified values.
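A training-step skeleton matching these settings is sketched below with Optax (Appendix A notes that AdamW from the Optax package was used); model_loss, params, and batch are placeholders for any of the dynamics models described above.

```python
import jax
import optax

# Reported settings: ADAM/AdamW with learning rate 1e-4 and weight decay 1e-5.
optimizer = optax.adamw(learning_rate=1e-4, weight_decay=1e-5)

def make_update_step(model_loss):
    """Build a jitted gradient step; call opt_state = optimizer.init(params) before the first step."""
    @jax.jit
    def update(params, opt_state, batch):
        loss, grads = jax.value_and_grad(model_loss)(params, batch)
        updates, opt_state = optimizer.update(grads, opt_state, params)
        params = optax.apply_updates(params, updates)
        return params, opt_state, loss
    return update
```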

4.2. Simulated Robot Experiments

For the simulation experiments, a 3-DoF robotic manipulator was used to collect ideal observation data under rigid-body dynamics conditions. The goal was to assess whether physics-inspired networks could learn the underlying system dynamics and demonstrate superior extrapolation capabilities compared to black box methods.

4.2.1. Dataset Construction

The 3-DoF robotic manipulator features three continuous revolute joints and operates in the vertical x-z plane under the influence of gravity. In order to collect ideal data, no non-conservative forces were present during motion. The manipulator was simulated using the Bullet physics engine.
To smoothly change velocity and direction without causing abrupt reactions in the robotic manipulator, the trajectory planning was based on sinusoidal functions. This ensures a smooth transition from the initial angle ( q 0 ) to the final angle ( q f ). For the i-th joint, the trajectory is defined by
q_{i}(t) = q_{0_i} + \frac{q_{f_i} - q_{0_i}}{T}\left(t - \frac{T}{2\pi}\sin\!\left(\frac{2\pi}{T}t\right)\right).    (19)
In this definition, T represents the total movement time and t denotes the current time. The angle of each joint of the robotic manipulator is constrained within [−3, 3], and 200 sets of initial and final angles for the three joints are randomly generated. A total of 200 trajectories were collected, each comprising 400 data points, with a total movement time T of 8 s per trajectory and a sampling frequency of 50 Hz. This process generated a total of 80,000 data samples, which served as the training dataset, encompassing joint positions q, joint velocities q̇, joint accelerations q̈, and corresponding joint torques τ. The inverse dynamics were calculated in PyBullet using the Newton–Euler formulation.
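A sketch of the trajectory generation in Equation (19) is given below; treating the joint limits as radians and the specific random keys are assumptions for illustration.

```python
import jax
import jax.numpy as jnp

def joint_trajectory(q0, qf, T=8.0, freq=50.0):
    """Smooth profile of Eq. (19) from q0 to qf over T seconds, sampled at freq Hz."""
    t = jnp.arange(0.0, T, 1.0 / freq)[:, None]                          # (T*freq, 1) time stamps
    s = (t - (T / (2.0 * jnp.pi)) * jnp.sin(2.0 * jnp.pi * t / T)) / T   # progress in [0, 1]
    return q0 + (qf - q0) * s                                            # (T*freq, n_dof) joint angles

# One of the 200 randomly drawn start/end configurations in [-3, 3].
q0 = jax.random.uniform(jax.random.PRNGKey(0), (3,), minval=-3.0, maxval=3.0)
qf = jax.random.uniform(jax.random.PRNGKey(1), (3,), minval=-3.0, maxval=3.0)
q_traj = joint_trajectory(q0, qf)    # 400 samples per trajectory at 50 Hz over 8 s
```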
Following the same method used to collect the training set, data from three additional trajectories were collected as the iid test set (Figure 3a). The manipulator then executed the same trajectories, except with the total time T reduced to 1/2.5 of its original duration, effectively increasing the velocities by a factor of 2.5. This resulted in non-iid test set 0, which was used for evaluating velocity extrapolation. Additionally, with the total time unchanged, the initial and final angle constraints were expanded to [−10, 10] and three trajectories were collected to form non-iid test set 1 (Figure 3b), in which each joint experiences unseen positions and higher velocities.

4.2.2. Modeling Experiments

The outcomes of the simulated robot inverse model experiments are summarized in Table 2. The table includes the nMSE for the torques learned through supervised learning and for the torque decompositions learned through unsupervised learning across the three joints for each dynamics model, with the torque decomposition encompassing the inertial torque τ I , Coriolis torque τ c , and gravitational torque τ g . The MSE serves as the evaluation metric for the torque predictions of each joint. Additionally, the coefficient of determination R 2 is presented on the far right as the overall modeling accuracy for inverse dynamics modeling.
Using nMSE as the evaluation metric, the dynamics models for structured DeLaN, DeLaN-Friction, and DeLaN-FFNN achieve consistent results across all test sets, accurately predicting the inertial, Coriolis, and gravitational forces, indicating their ability to learn the system’s underlying structure. Their prediction accuracy surpasses that of black-box DeLaN and FF-NN. Visualization results for the iid test set are shown in Figure 4, with non-iid test set results included in the appendix (Appendix B). For the non-iid test sets, all models experience varying degrees of accuracy deterioration, with FF-NN and black-box DeLaN showing significant declines compared to the slight deterioration in structured DeLaN, DeLaN-Friction, and DeLaN-FFNN. Notably, all models effectively learn gravitational decomposition without deterioration, even in extrapolation tests, demonstrating their ability to extract gravitational features from the data.
For each joint’s prediction error, structured DeLaN, DeLaN-Friction, and DeLaN-FFNN consistently achieve superior accuracy compared to black-box DeLaN and FF-NN on both iid and non-iid test sets. The coefficient of determination ( R 2 ) offers an intuitive comparison of each network’s extrapolation capabilities. On the iid test set, the R 2 for all networks is close to 1, indicating near-perfect predictive performance. For non-iid test sets, the R 2 for structured DeLaN, DeLaN-Friction, and DeLaN-FFNN remains very close to 1, showing continued effectiveness, while the R 2 for black-box DeLaN and FF-NN drops significantly, reaching around 0.5 on non-iid test set 1. Based on this analysis, structured DeLaN, DeLaN-Friction, and DeLaN-FFNN have superior extrapolation capabilities within physics-inspired networks. Notably, DeLaN-Friction and DeLaN-FFNN perform well even with data containing only conservative forces. This is because in DeLaN-Friction the friction coefficients are trained as network weights, while in DeLaN-FFNN the loss function (Equation (15)) reduces the output of the uncertainty force network when there are no uncertainties to learn.
Figure 5 illustrates each model's performance under increased velocities to assess their ability to handle new conditions. All models are initially trained on trajectories at a velocity scale of 1×, then tested at increased velocities. Data collected at 2.5× the velocity correspond to non-iid test set 0. Structured DeLaN, DeLaN-Friction, and DeLaN-FFNN show slight linear deterioration with increasing velocity. In contrast, black-box DeLaN and FF-NN experience severe deterioration, especially before reaching 2× velocity. At 3× velocity, structured DeLaN, DeLaN-Friction, and DeLaN-FFNN outperform black-box DeLaN and FF-NN at 1× velocity. This advantage arises because structured DeLaN's input domain consists solely of q, making it unaffected by changes in velocities or accelerations. In contrast, FF-NN's input domain includes (q, q̇, q̈) and black-box DeLaN's includes (q, q̇). DeLaN-Friction and DeLaN-FFNN, which build on DeLaN, do not show improved extrapolation because complex friction and uncertain forces were not incorporated into the simulated robotic manipulator's setup.
The experiments show that structured DeLaN, DeLaN-Friction, and DeLaN-FFNN, which incorporate the mass matrix H ( q ) , potential energy V ( q ) , and kinetic energy T ( q , q ˙ ) as physical priors, are able to effectively learn the system’s structure from ideal observation data. These models demonstrate superior modeling accuracy and more reliable extrapolation capabilities compared to FF-NN, which lacks any priors. In contrast, black-box DeLaN, which skips learning the mass matrix, kinetic energy, and potential energy by directly training the Lagrangian L ( q , q ˙ ) through a single network, performs similarly to FF-NN.

4.3. Real Robot Experiments

In the real robot experiments, a 3-DoF robotic arm was used to evaluate the extrapolation capabilities of each dynamics model with data collected under different control policies. Additionally, data collected using a KUKA LWR robot pushing flasks of different volumes were utilized to assess the extrapolation capabilities of the dynamics models with data collected under varying end-effector loads.

4.3.1. Dataset Construction

The datasets for the 3-DoF robot arm and KUKA LWR were sourced from publicly available real-system datasets. Detailed descriptions of these datasets are provided below.
3-DoF Robot Arm Dataset (available at https://edmond.mpg.de/dataset.xhtml?persistentId=doi:10.17617/3.ZT6K7P, accessed on 15 April 2024). Data used to evaluate extrapolation to different control policies were recorded on a real 3-DoF robotic system (shown in Figure 6a). Dynamics data from various distributions were collected by tracking trajectories with different frequency settings and joint ranges. The sine trajectories X_t in joint space were generated according to
X_{t} = C + A \odot \sum_{i=1}^{k} B_{i}\sin(\omega_{i} t + \varphi_{i}),    (20)
where C, A, B i , ω i , φ i are all three-dimensional vectors, k denotes the number of sine components that are superimposed, and ⊙ denotes the element-wise product. By changing the angular frequency ω i , different frequency settings are controlled, while the joint range reachable by the manipulator is altered by adjusting C and A. The frequency is categorized into high or low, while the joint constraints are divided into the end-effector moving only in the left part of the task space or in the entire range of safe movements.
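The sketch below evaluates this reference trajectory for a single time instant; the numerical values of C, A, B_i, ω_i, and φ_i are hypothetical and only indicate how the frequency setting and joint range are controlled.

```python
import jax.numpy as jnp

def sine_reference(t, C, A, B, omega, phase):
    """Eq. (20): per-joint offset C plus A (element-wise) times the sum of k
    superimposed sine components. B, omega, phase have shape (k, 3); C, A have shape (3,)."""
    components = B * jnp.sin(omega * t + phase)   # (k, 3) sine terms at time t
    return C + A * jnp.sum(components, axis=0)    # element-wise product with the superposition

# Hypothetical low-frequency setting with k = 2 superimposed components.
C = jnp.array([0.5, -0.3, 0.2])
A = jnp.array([0.4, 0.4, 0.4])
B = jnp.ones((2, 3))
omega = jnp.array([[0.5, 0.7, 0.9], [0.2, 0.3, 0.4]])   # rad/s, controls the frequency setting
phase = jnp.zeros((2, 3))
print(sine_reference(1.0, C, A, B, omega, phase))
```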
The training set included 532,000 samples of low-frequency data with joints constrained to the left side. The iid test set retained the same settings. Non-iid test set 0 included high-frequency trajectories with left-side joint constraints, non-iid test set 1 featured low-frequency trajectories spanning the entire joint space, and non-iid test set 2 contained high-frequency trajectories across the entire joint space. Each test set contained 126,000 samples. The trajectory settings are summarized in Figure 6b; for more details, please refer to [25].
This dataset provides joint positions q, velocities q ˙ , and corresponding torques τ for each sample, but does not include acceleration data q ¨ . In real-world systems, q ¨ is often unobserved and is estimated using finite differences [7,14,26], which can amplify high-frequency noise and cause inaccuracies. To mitigate this, low-pass filters are used for more accurate acceleration estimates. Following the approach in [7,26], the accelerations were computed using finite differences and low-pass filters.
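A sketch of this preprocessing step is provided below; the specific filter (a first-order exponential low-pass) and its smoothing factor are illustrative choices, not the exact settings used in [7,26].

```python
import jax
import jax.numpy as jnp

def estimate_acceleration(qd, dt, alpha=0.9):
    """Estimate joint accelerations from sampled joint velocities qd of shape (T, n_dof):
    finite differences followed by a first-order low-pass filter to suppress noise spikes."""
    fwd = (qd[1:] - qd[:-1]) / dt                                          # forward differences
    qdd_raw = jnp.concatenate([fwd[:1], 0.5 * (fwd[1:] + fwd[:-1]), fwd[-1:]], axis=0)
    def smooth(prev, x):
        y = alpha * prev + (1.0 - alpha) * x                               # exponential low-pass filter
        return y, y
    _, qdd_filtered = jax.lax.scan(smooth, qdd_raw[0], qdd_raw)
    return qdd_filtered

# Example: velocities sampled at 100 Hz (dt = 0.01 s) for a 3-DoF arm.
qdd = estimate_acceleration(jnp.zeros((500, 3)), dt=0.01)
```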
KUKA LWR Dataset (available at https://cloud.cps.unileoben.ac.at/index.php/s/6aG9gynxDYMmW8M, accessed on 15 April 2024). In the scenarios involving different end-effector loads, the task involved the KUKA LWR robot pushing flasks filled with varying volumes of liquid to a designated goal location along a curved trajectory. The dataset included two sets. The first set consisted of data collected by the robot while pushing flasks filled with either 200 mL or 400 mL of water, including 112,761 data samples. The second set consisted of data collected by the robot while pushing flasks filled with 300 mL of water, including 16,940 data samples. Each sample included joint positions q, joint velocities q̇, joint accelerations q̈, and corresponding torques τ for the first five joints of the robotic manipulator. For more information on the data, please refer to [18,27].
The first set was divided into two parts, randomly extracting 12,761 data samples as the iid test set and using the remaining 100,000 samples as the training set. The second set of data was used as the non-iid test set to evaluate the model’s extrapolation capabilities for different loads.

4.3.2. Extrapolation across Different Policies

The purpose of modeling robots is to use these models to synthesize new control policies. Therefore, it is crucial to assess whether dynamics models can effectively extrapolate to changes in data distribution caused by different control policies.
The performance of each model in predicting joint torques for each joint is summarized in Table 3. Across all test sets for each joint, better torque prediction results are concentrated among black-box DeLaN, FF-NN, and DeLaN-FFNN, with a particular focus on FF-NN and DeLaN-FFNN. For evaluation using the coefficient of determination R², it is noteworthy that structured DeLaN and DeLaN-Friction score beyond the range of 0–1 on every test set, indicating poor prediction performance across all groups of test sets. For the iid test set and non-iid test set 0, the highest torque prediction scores are achieved by FF-NN and DeLaN-FFNN. For non-iid test sets 1 and 2, the highest scores are achieved by FF-NN. Notably, black-box DeLaN scores beyond the range of 0–1 on non-iid test set 1, indicating poor extrapolation capabilities.
Figure 7 presents the overall torque prediction performance, allowing for a direct comparison. Structured DeLaN and DeLaN-Friction exhibit the poorest accuracy across all test sets, struggling even on the iid test set. Better accuracy is achieved by FF-NN, black-box DeLaN, and DeLaN-FFNN. For the iid test set and non-iid test set 0, FF-NN and DeLaN-FFNN provide the best torque predictions. However, for the more complex non-iid test sets 1 and 2, DeLaN-FFNN’s accuracy is slightly inferior to FF-NN.
From this analysis, structured DeLaN and DeLaN-Friction lack accuracy and extrapolation capabilities, with the former limited by rigid-body dynamics priors and the latter adding actuator friction. Black-box DeLaN performs better due to fewer physical priors. FF-NN, without physical priors, achieves the best results, indicating that inadequate priors compromise performance in physics-inspired networks. DeLaN-FFNN matches the accuracy of FF-NN on the iid tests by capturing all of the dynamics in Equation (2). However, DeLaN-FFNN has inferior extrapolation in non-iid tests due to introducing uncertain forces in a black-box manner, reducing transparency and the benefits of physical principles.

4.3.3. Extrapolation across Different Loads

A robotic manipulator often needs to handle different end-effector loads, making it essential to evaluate a dynamics model’s extrapolation capabilities for varying loads.
Table 4 summarizes the torque prediction errors for each joint and the overall R² for the KUKA LWR achieved by each dynamics model. On the iid test set, black-box DeLaN, FF-NN, and DeLaN-FFNN achieve the best accuracy, while structured DeLaN and DeLaN-Friction perform worse. All models score close to 1 for R², indicating nearly perfect predictions. For the non-iid test sets, the torque errors for black-box DeLaN, FF-NN, and DeLaN-FFNN range from 10^{1} to 10^{2}, while the errors for structured DeLaN and DeLaN-Friction range from 10^{2} to 10^{3}, demonstrating superior extrapolation for the former models. However, all models score poorly on R² for the non-iid test sets.
Figure 8 visualizes the overall prediction performance of each model for the two test sets. For the iid test set, structured DeLaN and DeLaN-Friction have slightly higher prediction errors than black-box DeLaN, FF-NN, and DeLaN-FFNN, though all achieve nearly ideal predictions. In the non-iid test sets, all models show some deterioration, with structured DeLaN and DeLaN-Friction degrading significantly more than the others.
Despite the varying degrees of physical constraints, all physics-inspired networks and FF-NN achieve ideal predictions on the iid test set. However, structured DeLaN and DeLaN-Friction are constrained by physical priors, and lack extrapolation ability for different loads. In contrast, DeLaN-FFNN and black-box DeLaN show extrapolation capabilities similar to FF-NN.

4.4. Results Analysis and Discussion

Returning to the initial questions (Section 4), the experimental results show the following:
(1)
Under ideal observation data, structured DeLaN, DeLaN-Friction, and DeLaN-FFNN are capable of learning the underlying structure of physical systems and possess strong extrapolation capabilities. However, black-box DeLaN, which is also a physics-inspired network, cannot learn the underlying structure of the system from the data due to its lack of learning ability for the mass matrix H ( q ) , kinetic energy T ( q , q ˙ ) , and potential energy V ( q ) .
(2)
When applied to real systems, structured DeLaN and DeLaN-Friction show weaker extrapolation than black-box DeLaN, with DeLaN-FFNN slightly behind the black-box model. The failure of physics-inspired networks to capture all dynamics phenomena indicates that excessive physical constraints can limit extrapolation abilities. Improved extrapolation in physics-inspired networks requires more comprehensive capture of the real system dynamics.
(3)
In order for physics-inspired networks to achieve better extrapolation on real systems, they rely on two key factors: high-fidelity dynamics data that accurately reflects system phenomena, and the integration of comprehensive dynamics priors. These priors should transparently capture the real system dynamics instead of using black-box methods such as the uncertain forces in DeLaN-FFNN.
For Result 1, it is evident that physics-inspired deep networks are a promising direction in robotics, offering more accurate and interpretable models. These networks enhance physical plausibility and interpretability, resulting in more precise dynamics models and better extrapolation in rigid systems under ideal conditions. Similar experimental results can be found in the following papers: [1,7,8,10,11,12,13].
For Result 2, although the black-box networks achieved better results in real-world scenarios with uncertainties due to their lack of prior constraints, they can approximate any mapping relationship without restrictions, even if the relationship is spurious. Therefore, this result does not prove that they have good extrapolation capabilities; rather, it merely indicates that when the physical priors integrated into physics-inspired networks fail to capture all the dynamics phenomena of real systems, this hinders their extrapolation performance.
For Result 3, two key factors are highlighted for better extrapolation in physics-inspired networks: high-fidelity dynamics data, and comprehensive dynamics priors. For high-fidelity dynamics data, the challenge in real systems stems from often unobserved or unknown generalized coordinates q, q ˙ , compounded by the difficulty of inferring complete system states from partial observations, making high-fidelity data acquisition difficult. This issue does not affect black-box models that do not depend on specific observations [7]. Additionally, in experiments, inverse dynamics modeling requires the state derivative (angular accelerations), which cannot be directly observed. This extra computation step, combined with measurement noise, introduces noisy spikes in the computed accelerations, limiting the learning capabilities of physics-inspired networks [28]. For incorporating more comprehensive dynamics priors, in structured DeLaN theory, non-conservative forces are assumed to be fully known and equal to the actuation forces directly acting on the Lagrangian coordinates q. However, this approach lacks an energy dissipation mechanism [8]. For DeLaN-Friction, the findings reveal that merely adding friction is inadequate. For DeLaN-FFNN, introducing uncertain forces through a black-box approach improves performance but sacrifices interpretability. Current attempts to embed contact forces into physics-inspired networks often target specific scenarios with restrictive assumptions, and consequently lack widespread applicability [13,29]. Several recent works have introduced an energy dissipation mechanism by incorporating the learning of a dissipation matrix in physics-inspired networks, resulting in improved modeling performance. However, their extrapolation capabilities remain to be demonstrated [8,30]. Therefore, improving the extrapolation of physics-inspired networks in real systems continues to pose significant challenges.

5. Conclusions

This work introduces four physics-inspired networks and an experimental design to evaluate the extrapolation capabilities of data-driven models in robotics inverse dynamics modeling. Experiments were conducted to assess these networks’ extrapolation ability from simulated to real systems. The results indicate that physics-inspired networks can recover the system’s underlying structure from ideal simulated data, demonstrating strong extrapolation capabilities. It is evident that physics-inspired deep networks are a promising direction in robotics, offering more accurate and interpretable models. However, in real systems these networks often fall short compared to black-box models due to insufficient physical priors and their inability to capture all dynamics phenomena, highlighting the need for further improvements. To improve extrapolation in real systems, high-fidelity data and more comprehensive physical priors are needed. Future studies should focus on these areas in order to enhance the performance of physics-inspired networks.

Author Contributions

Conceptualization, Z.L. and S.W.; software, Z.L. and S.W.; data curation, Z.L.; writing—original draft, Z.L.; validation, S.W. and W.C.; writing—reviewing and editing, S.W., W.C., and F.S.; supervision, W.C. and F.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Research and Development Program of China (No. 2021ZD0114505), the National Natural Science Foundation of China (62276028), the Major Research Plan of the National Natural Science Foundation of China (92267110), the Beijing Municipal Natural Science Foundation–Xiaomi Innovation Joint Fund (L233006), the Qin Xin Talents Cultivation Program at Beijing Information Science & Technology University (QXTCP A202102), and the Beijing Information Science and Technology University School Research Fund (No. 2023XJJ12).

Data Availability Statement

The public datasets used in this study are referenced within the article with links provided for access. Additional data and materials related to this study will be made available upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

In this section, additional details on the four physics-inspired networks (Section 2.3) used in this paper are provided. Their structural diagrams can be found in Figure 1. All network architectures are based on multilayer perceptrons (MLP).
The black-box DeLaN model directly trains a Lagrangian L(q, q̇; θ) using an MLP (Figure 1b), while the structured DeLaN model uses two MLPs, one to train the mass matrix H(q; θ) and another to train the potential energy V(q; φ). The kinetic energy is derived from the mass matrix, and the Lagrangian L(q, q̇; θ, φ) is obtained by subtracting the potential energy from the kinetic energy (Figure 1a).
DeLaN-Friction is based on the structured DeLaN architecture; therefore, it also uses two MLPs to train the mass matrix H ( q , θ ) and potential energy V ( q , ϕ ) . The friction coefficients φ = τ C v , τ C s , v , d are learned by treating them as network weights, rather than being trained using a single MLP (Figure 1c).
DeLaN-FFNN is also based on the structured DeLaN architecture, using three MLPs to train the mass matrix H ( q , θ ) , potential energy V ( q , ϕ ) , and uncertain forces ε ( q , q ˙ , q ¨ , ψ ) (Figure 1d).
In this project, all neural networks were built using the JAX and dm-Haiku packages in Python. Specifically, the JAX Autodiff system was employed to compute partial derivatives and the Hessian within the loss function. Model parameter optimization was performed with AdamW from the Optax package, which inherently includes regularization terms, removing the need for extra explicit regularization terms in the loss function.

Appendix B

In the simulation modeling experiments (Section 4.2.2), the results are visualized separately in Figure A1 and Figure A2 in order to more clearly compare the prediction results of each dynamics network on non-iid test set 0 and non-iid test set 1.
Figure A1. The learned inverse model using the training dataset, including three non-iid test set 0 trajectories not included in the training set, with results averaged over five seeds (a). The subsequent columns provide the predicted force decomposition: (b) inertial force H ( q ) q ¨ , (c) Coriolis and centrifugal forces c ( q , q ˙ ) , and (d) gravitational force g ( q ) .
Figure A2. The learned inverse model using the training dataset, including three non-iid test set 1 trajectories not included in the training set, with results averaged over five seeds (a). The subsequent columns provide the predicted force decomposition: (b) inertial force H ( q ) q ¨ , (c) Coriolis and centrifugal forces c ( q , q ˙ ) , and (d) gravitational force g ( q ) .

References

  1. Wu, S.; Li, Z.; Chen, W.; Sun, F. Dynamic Modeling of Robotic Manipulator via an Augmented Deep Lagrangian Network. Tsinghua Sci. Technol. 2024, 29, 1604–1614. [Google Scholar] [CrossRef]
  2. Valencia-Vidal, B.; Ros, E.; Abadía, I.; Luque, N.R. Bidirectional recurrent learning of inverse dynamic models for robots with elastic joints: A real-time real-world implementation. Front. Neurorobot. 2023, 17, 1166911. [Google Scholar] [CrossRef] [PubMed]
  3. Ulbrich, S.; Bechtel, M.; Asfour, T.; Dillmann, R. Learning robot dynamics with kinematic bezier maps. In Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, Algarve, Portugal, 7–12 October 2012; pp. 3598–3604. [Google Scholar]
  4. Giacomuzzo, G.; Dalla Libera, A.; Carli, R. Embedding the Physics in Black-box Inverse Dynamics Identification: A Comparison Between Gaussian Processes and Neural Networks. IFAC-PapersOnLine 2023, 56, 1584–1590. [Google Scholar] [CrossRef]
  5. Abdulkadirov, R.; Lyakhov, P.; Nagornov, N. Survey of optimization algorithms in modern neural networks. Mathematics 2023, 11, 2466. [Google Scholar] [CrossRef]
  6. Cunha, B.Z.; Droz, C.; Zine, A.M.; Foulard, S.; Ichchou, M. A review of machine learning methods applied to structural dynamics and vibroacoustic. Mech. Syst. Signal Process. 2023, 200, 110535. [Google Scholar] [CrossRef]
  7. Lutter, M.; Peters, J. Combining physics and deep learning to learn continuous-time dynamics models. Int. J. Robot. Res. 2023, 42, 83–107. [Google Scholar] [CrossRef]
  8. Liu, J.; Borja, P.; Della Santina, C. Physics-Informed Neural Networks to Model and Control Robots: A Theoretical and Experimental Investigation. Adv. Intell. Syst. 2024, 6, 2300385. [Google Scholar] [CrossRef]
  9. Schiassi, E.; De Florio, M.; D’Ambrosio, A.; Mortari, D.; Furfaro, R. Physics-informed neural networks and functional interpolation for data-driven parameters discovery of epidemiological compartmental models. Mathematics 2021, 9, 2069. [Google Scholar] [CrossRef]
  10. Lutter, M.; Ritter, C.; Peters, J. Deep Lagrangian Networks: Using Physics as Model Prior for Deep Learning. In Proceedings of the International Conference on Learning Representations, Vancouver, BC, Canada, 30 April–3 May 2018. [Google Scholar]
  11. Greydanus, S.; Dzamba, M.; Yosinski, J. Hamiltonian neural networks. In Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada, 8–14 December 2019; Volume 32. [Google Scholar]
  12. Cranmer, M.; Greydanus, S.; Hoyer, S.; Battaglia, P.W.; Spergel, D.N.; Ho, S. Lagrangian Neural Networks. In Proceedings of the International Conference on Learning Representations 2020, Addis Ababa, Ethiopia, 26–30 April 2020. [Google Scholar]
  13. Zhong, Y.D.; Dey, B.; Chakraborty, A. Extending lagrangian and hamiltonian neural networks with differentiable contact models. Adv. Neural Inf. Process. Syst. 2021, 34, 21910–21922. [Google Scholar]
  14. Liu, L.; Zuo, G.; Li, J.; Li, J. Dynamics modeling with realistic constraints for trajectory tracking control of manipulator. In Proceedings of the 2022 IEEE International Conference on Real-time Computing and Robotics (RCAR), Guiyang, China, 17–22 July 2022; pp. 99–104. [Google Scholar]
  15. Lutter, M.; Listmann, K.; Peters, J. Deep lagrangian networks for end-to-end learning of energy-based control for under-actuated systems. In Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macao, China, 4–8 November 2019; pp. 7718–7725. [Google Scholar]
  16. Lutter, M.; Silberbauer, J.; Watson, J.; Peters, J. A differentiable newton euler algorithm for multi-body model learning. arXiv 2020, arXiv:2010.09802. [Google Scholar]
  17. Lutter, M.; Silberbauer, J.; Watson, J.; Peters, J. Differentiable physics models for real-world offline model-based reinforcement learning. In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, China, 30 May–5 June 2021; pp. 4163–4170. [Google Scholar]
  18. Rueckert, E.; Nakatenus, M.; Tosatto, S.; Peters, J. Learning inverse dynamics models in o (n) time with lstm networks. In Proceedings of the 2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids), Birmingham, UK, 15–17 November 2017; pp. 811–816. [Google Scholar]
  19. Wang, S.; Shao, X.; Yang, L.; Liu, N. Deep learning aided dynamic parameter identification of 6-DOF robot manipulators. IEEE Access 2020, 8, 138102–138116. [Google Scholar] [CrossRef]
  20. Bittencourt, A.C.; Gunnarsson, S. Static friction in a robot joint—Modeling and identification of load and temperature effects. J. Dyn. Sys. Meas. Control 2012, 134, 051013. [Google Scholar] [CrossRef]
  21. Hitzler, K.; Meier, F.; Schaal, S.; Asfour, T. Learning and adaptation of inverse dynamics models: A comparison. In Proceedings of the 2019 IEEE-RAS 19th International Conference on Humanoid Robots (Humanoids), Toronto, ON, Canada, 15–17 October 2019; pp. 491–498. [Google Scholar]
  22. Song, L.; Wang, A.; Zhong, J. Inverse dynamics modeling and analysis of healthy human data for lower limb rehabilitation robots. Electronics 2022, 11, 3848. [Google Scholar] [CrossRef]
  23. Packer, C.; Gao, K.; Kos, J.; Krähenbühl, P.; Koltun, V.; Song, D. Assessing generalization in deep reinforcement learning. arXiv 2018, arXiv:1810.12282. [Google Scholar]
  24. Sahoo, S.; Lampert, C.; Martius, G. Learning equations for extrapolation and control. In Proceedings of the International Conference on Machine Learning, Stockholm, Sweden, 10–15 July 2018; pp. 4442–4450. [Google Scholar]
  25. Agudelo-Espana, D.; Zadaianchuk, A.; Wenk, P.; Garg, A.; Akpo, J.; Grimminger, F.; Viereck, J.; Naveau, M.; Righetti, L.; Martius, G.; et al. A real-robot dataset for assessing transferability of learned dynamics models. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020; pp. 8151–8157. [Google Scholar]
  26. Reuss, M.; Duijkeren, N.; Krug, R.; Becker, P.; Shaj, V.; Neumann, G. End-to-end learning of hybrid inverse dynamics models for precise and compliant impedance control. arXiv 2022, arXiv:2205.13804. [Google Scholar]
  27. Jorge, D.; Pizzuto, G.; Mistry, M. Efficient learning of inverse dynamics models for adaptive computed torque control. In Proceedings of the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 23–27 October 2022; pp. 11203–11208. [Google Scholar]
  28. Saviolo, A.; Loianno, G. Learning quadrotor dynamics for precise, safe, and agile flight control. Annu. Rev. Control 2023, 55, 45–60. [Google Scholar] [CrossRef]
  29. Hochlehnert, A.; Terenin, A.; Sæmundsson, S.; Deisenroth, M. Learning contact dynamics using physically structured neural networks. In Proceedings of the International Conference on Artificial Intelligence and Statistics, Virtual, 13–15 April 2021; pp. 2152–2160. [Google Scholar]
  30. Duong, T.; Altawaitan, A.; Stanley, J.; Atanasov, N. Port-Hamiltonian Neural ODE Networks on Lie Groups for Robot Dynamics Learning and Control. arXiv 2024, arXiv:2401.09520. [Google Scholar] [CrossRef]
Figure 1. The inverse model using DeLaNs predicts the system’s Lagrangian L and calculates changes using Euler–Lagrange equations; the structured Lagrangian method (a) uses two networks for the kinetic T and potential V energies, computing L = T V , while the black-box approach (b) directly predicts L . The structure diagram of DeLaN-Friction, which introduces friction into the model learning, is illustrated in (c). The structure diagram of DeLaN-FFNN, which introduces uncertain forces into the model learning, is shown in (d).
Figure 2. The experimental setup, where X represents ( q , q ˙ , q ¨ ) and Y represents τ .
Figure 3. The end-effector trajectories of the three test trajectories in the iid test set (a) and of the three trajectories in non-iid test set 1 (b). The black dots represent the three free joints of the manipulator, while the black squares represent the end-effector. Taking the base of the manipulator as the origin, the horizontal axis represents the position of the trajectory on the x-axis of the xz plane, while the vertical axis represents the position on the z-axis.
Figure 4. Predictions of (a) the learned inverse model on the training dataset (the three iid test trajectories were not included in the training set; the results are averaged over five seeds), with the subsequent columns showing the predicted force decomposition: (b) the inertial force H(q)q̈, (c) the Coriolis and centrifugal forces c(q, q̇), and (d) the gravitational force g(q).
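The decomposition in panels (b)-(d) can be read off a structured model once the inertia matrix and potential energy are learned: the gravitational torque is the gradient of the potential, and the Coriolis/centrifugal torque follows from the derivatives of H(q). The sketch below assumes callables H_fn(q) and V_fn(q) taken from a trained structured Lagrangian model; it is an illustration, not the authors' code.

```python
import jax
import jax.numpy as jnp

def decompose(H_fn, V_fn, q, qd, qdd):
    H = H_fn(q)
    tau_inertial = H @ qdd                       # H(q) qdd
    tau_gravity = jax.grad(V_fn)(q)              # g(q) = dV/dq
    dH_dq = jax.jacfwd(H_fn)(q)                  # dH_dq[i, j, k] = dH_ij / dq_k
    # c(q, qd) = (dH/dt) qd - 1/2 * d/dq (qd^T H(q) qd)
    tau_coriolis = (jnp.einsum('ijk,j,k->i', dH_dq, qd, qd)
                    - 0.5 * jnp.einsum('jki,j,k->i', dH_dq, qd, qd))
    return tau_inertial, tau_coriolis, tau_gravity
```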
Figure 5. Performance of each dynamics model when applied to increased velocities, assessing their capability to extrapolate to new velocity conditions; the gray areas represent test data where the velocity was increased.
Figure 6. Data were recorded on a real 3-DoF robot arm (a) to evaluate the extrapolation capabilities of the dynamics models under different policies. In the trajectory settings for the training and test sets (b), ‘low’ and ‘high’ represent frequency settings, while ‘left’ and ‘full’ indicate joint ranges limited to the left space and the entire space. The iid test set settings matched those of the training set.
Figure 7. The MSE of torque predictions obtained by each dynamics model on various test sets, collected under different control policies, reflects the overall modeling accuracy. The black line in the figure represents the corresponding confidence interval.
Figure 8. The MSE of torque predictions obtained by each dynamics model on various test sets, collected with different end loads, reflects the overall modeling accuracy. The black line in the figure represents the corresponding confidence interval.
Table 1. Hyperparameters used for the dynamics models across different dynamical systems.

Hyperparameter | 3-DoF Manipulator | 3-DoF Robot Arm | KUKA LWR
Batch size | 512 | 1024 | 512
Activation | Tanh | Sigmoid | Tanh
Network dimension | [2 × 64] | [2 × 128] | [2 × 128]
Table 2. The nMSE for torque predictions and torque decomposition predictions across the three joints, the MSE for each joint's torque predictions, and the coefficient of determination (R²) for overall modeling effectiveness. The results are averaged over five random seeds, with corresponding confidence intervals.

Model | Torque τ (nMSE) | Inertial torque τ_I (nMSE) | Coriolis torque τ_c (nMSE) | Gravitational torque τ_g (nMSE) | Joint 0 (MSE) | Joint 1 (MSE) | Joint 2 (MSE) | R²

Iid test set:
DeLaN (Structured Lagrangian) | 8.1 × 10⁻⁶ ± 4.7 × 10⁻⁶ * | 1.6 × 10⁻⁴ ± 1.3 × 10⁻⁴ * | 1.1 × 10⁻⁴ ± 5.8 × 10⁻⁵ * | 7.8 × 10⁻⁶ ± 5.0 × 10⁻⁶ * | 2.2 × 10⁻³ ± 9.7 × 10⁻⁴ * | 1.3 × 10⁻³ ± 1.5 × 10⁻³ * | 8.3 × 10⁻⁴ ± 6.2 × 10⁻⁴ * | 1.0 ± 4.8 × 10⁻⁶ *
DeLaN (Black-Box Lagrangian) | 3.2 × 10⁻⁴ ± 2.8 × 10⁻⁴ | 3.5 × 10⁻³ ± 8.7 × 10⁻⁴ | 1.4 × 10⁻² ± 1.2 × 10⁻² | 1.1 × 10⁻⁵ ± 1.0 × 10⁻⁵ * | 1.4 × 10⁻¹ ± 1.5 × 10⁻¹ | 2.5 × 10⁻² ± 1.6 × 10⁻² | 9.4 × 10⁻³ ± 3.2 × 10⁻³ | 1.0 ± 2.8 × 10⁻⁴ *
FF-NN (Feed-Forward Network) | 3.7 × 10⁻³ ± 1.9 × 10⁻³ | 3.7 × 10⁻² ± 2.1 × 10⁻² | 2.3 × 10⁻¹ ± 9.7 × 10⁻² | 9.8 × 10⁻⁵ ± 6.7 × 10⁻⁵ * | 1.8 × 10⁰ ± 9.6 × 10⁻¹ | 1.2 × 10⁻¹ ± 6.6 × 10⁻² | 6.6 × 10⁻² ± 5.4 × 10⁻² | 1.0 ± 1.9 × 10⁻³ *
DeLaN-Friction (Introducing Friction) | 8.3 × 10⁻⁶ ± 6.2 × 10⁻⁶ * | 3.6 × 10⁻⁴ ± 3.9 × 10⁻⁴ * | 1.6 × 10⁻⁴ ± 1.4 × 10⁻⁴ * | 6.6 × 10⁻⁶ ± 5.0 × 10⁻⁶ * | 2.0 × 10⁻³ ± 1.8 × 10⁻³ * | 1.5 × 10⁻³ ± 8.5 × 10⁻⁴ * | 9.5 × 10⁻⁴ ± 1.2 × 10⁻³ * | 1.0 ± 6.3 × 10⁻⁶ *
DeLaN-FFNN (Augmented DeLaN) | 6.2 × 10⁻⁶ ± 3.0 × 10⁻⁶ * | 1.6 × 10⁻⁴ ± 1.2 × 10⁻⁴ * | 1.1 × 10⁻⁴ ± 5.3 × 10⁻⁵ * | 5.4 × 10⁻⁶ ± 4.7 × 10⁻⁶ * | 1.6 × 10⁻³ ± 1.7 × 10⁻³ * | 1.0 × 10⁻³ ± 6.8 × 10⁻⁴ * | 6.5 × 10⁻⁴ ± 4.1 × 10⁻⁴ * | 1.0 ± 3.0 × 10⁻⁶ *

Non-iid test set 0:
DeLaN (Structured Lagrangian) | 2.7 × 10⁻⁵ ± 1.5 × 10⁻⁵ * | 1.7 × 10⁻⁴ ± 1.3 × 10⁻⁴ * | 1.1 × 10⁻⁴ ± 5.8 × 10⁻⁵ * | 7.7 × 10⁻⁶ ± 4.9 × 10⁻⁶ * | 2.2 × 10⁻² ± 1.4 × 10⁻² * | 1.2 × 10⁻² ± 9.5 × 10⁻³ * | 8.2 × 10⁻³ ± 1.8 × 10⁻³ * | 1.0 ± 1.5 × 10⁻⁵ *
DeLaN (Black-Box Lagrangian) | 2.4 × 10⁻¹ ± 4.9 × 10⁻² | 3.7 × 10⁻³ ± 8.6 × 10⁻⁴ | 4.9 × 10⁻¹ ± 1.5 × 10⁻¹ | 1.1 × 10⁻⁵ ± 1.0 × 10⁻⁵ * | 3.4 × 10² ± 6.9 × 10¹ | 3.4 × 10¹ ± 6.5 × 10⁰ | 2.0 × 10⁰ ± 4.9 × 10⁻¹ | 7.6 × 10⁻¹ ± 4.9 × 10⁻²
FF-NN (Feed-Forward Network) | 2.2 × 10⁻¹ ± 3.7 × 10⁻² | 3.6 × 10⁻¹ ± 7.9 × 10⁻² | 7.9 × 10⁻¹ ± 6.9 × 10⁻² | 9.6 × 10⁻⁵ ± 6.5 × 10⁻⁵ * | 3.2 × 10² ± 5.6 × 10¹ | 1.7 × 10¹ ± 3.6 × 10⁰ | 9.2 × 10⁰ ± 2.1 × 10⁰ | 7.8 × 10⁻¹ ± 3.7 × 10⁻²
DeLaN-Friction (Introducing Friction) | 3.7 × 10⁻⁵ ± 2.7 × 10⁻⁵ * | 3.8 × 10⁻⁴ ± 4.2 × 10⁻⁴ * | 1.6 × 10⁻⁴ ± 1.4 × 10⁻⁴ * | 6.5 × 10⁻⁶ ± 5.0 × 10⁻⁶ * | 2.7 × 10⁻² ± 2.9 × 10⁻² * | 1.7 × 10⁻² ± 9.1 × 10⁻³ * | 1.4 × 10⁻² ± 1.2 × 10⁻² * | 1.0 ± 2.7 × 10⁻⁵ *
DeLaN-FFNN (Augmented DeLaN) | 2.7 × 10⁻⁵ ± 9.9 × 10⁻⁶ * | 1.7 × 10⁻⁴ ± 1.3 × 10⁻⁴ * | 1.1 × 10⁻⁴ ± 5.4 × 10⁻⁵ * | 5.3 × 10⁻⁶ ± 4.6 × 10⁻⁶ * | 2.0 × 10⁻² ± 7.5 × 10⁻³ * | 1.3 × 10⁻² ± 6.1 × 10⁻³ * | 8.9 × 10⁻³ ± 3.6 × 10⁻³ * | 1.0 ± 9.9 × 10⁻⁶ *

Non-iid test set 1:
DeLaN (Structured Lagrangian) | 5.9 × 10⁻⁴ ± 3.1 × 10⁻⁴ * | 1.0 × 10⁻⁴ ± 2.9 × 10⁻⁵ * | 1.1 × 10⁻³ ± 5.8 × 10⁻⁴ * | 3.3 × 10⁻⁶ ± 1.6 × 10⁻⁶ * | 3.9 × 10⁻¹ ± 3.1 × 10⁻¹ * | 3.7 × 10⁻¹ ± 3.1 × 10⁻¹ * | 2.0 × 10⁻¹ ± 5.7 × 10⁻² * | 1.0 ± 3.1 × 10⁻⁴ *
DeLaN (Black-Box Lagrangian) | 4.6 × 10⁻¹ ± 4.2 × 10⁻² | 2.7 × 10⁻³ ± 1.2 × 10⁻³ | 7.9 × 10⁻¹ ± 7.9 × 10⁻² | 8.3 × 10⁻⁶ ± 5.1 × 10⁻⁶ * | 6.8 × 10² ± 5.9 × 10¹ | 4.9 × 10¹ ± 9.1 × 10⁰ | 2.7 × 10¹ ± 5.7 × 10⁰ | 5.4 × 10⁻¹ ± 4.2 × 10⁻²
FF-NN (Feed-Forward Network) | 5.3 × 10⁻¹ ± 4.4 × 10⁻² | 3.4 × 10⁻¹ ± 1.5 × 10⁻¹ | 9.0 × 10⁻¹ ± 6.2 × 10⁻² | 6.9 × 10⁻⁵ ± 4.2 × 10⁻⁵ * | 7.6 × 10² ± 7.0 × 10¹ | 4.3 × 10¹ ± 1.9 × 10¹ | 5.5 × 10¹ ± 8.9 × 10⁰ | 4.7 × 10⁻¹ ± 4.5 × 10⁻²
DeLaN-Friction (Introducing Friction) | 7.5 × 10⁻⁴ ± 4.3 × 10⁻⁴ * | 1.2 × 10⁻⁴ ± 7.5 × 10⁻⁵ * | 1.4 × 10⁻³ ± 7.7 × 10⁻⁴ * | 3.2 × 10⁻⁶ ± 1.9 × 10⁻⁶ * | 5.9 × 10⁻¹ ± 5.8 × 10⁻¹ * | 3.4 × 10⁻¹ ± 1.0 × 10⁻¹ * | 2.9 × 10⁻¹ ± 1.7 × 10⁻¹ * | 1.0 ± 4.3 × 10⁻⁴ *
DeLaN-FFNN (Augmented DeLaN) | 4.8 × 10⁻⁴ ± 1.5 × 10⁻⁴ * | 8.8 × 10⁻⁵ ± 3.4 × 10⁻⁵ * | 8.8 × 10⁻⁴ ± 2.9 × 10⁻⁴ * | 2.8 × 10⁻⁶ ± 1.6 × 10⁻⁶ * | 3.5 × 10⁻¹ ± 2.7 × 10⁻¹ * | 2.5 × 10⁻¹ ± 1.3 × 10⁻¹ * | 1.8 × 10⁻¹ ± 6.8 × 10⁻² * | 1.0 ± 1.5 × 10⁻⁴ *

Note: * indicates better results.
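For reference, the quantities reported in Tables 2-4 can be computed as in the sketch below; treating the nMSE as the squared torque error normalized by the variance of the measured torques is our reading of the tables, not a definition quoted from the paper.

```python
import jax.numpy as jnp

def mse_per_joint(tau_true, tau_pred):
    # Arrays have shape (samples, joints).
    return jnp.mean((tau_true - tau_pred) ** 2, axis=0)

def nmse(tau_true, tau_pred):
    ss_res = jnp.sum((tau_true - tau_pred) ** 2)
    ss_tot = jnp.sum((tau_true - tau_true.mean(axis=0)) ** 2)
    return ss_res / ss_tot

def r_squared(tau_true, tau_pred):
    # R^2 = 1 - nMSE; values below 0 (model worse than predicting the mean torque)
    # fall outside the 0-1 range and appear as "-" in Tables 3 and 4.
    return 1.0 - nmse(tau_true, tau_pred)
```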
Table 3. The MSE for torque predictions of each joint and the coefficient of determination (R²) for overall modeling effectiveness. The results are averaged over five random seeds, with corresponding confidence intervals.

Model | Joint 0 (MSE) | Joint 1 (MSE) | Joint 2 (MSE) | R²

Iid test set:
DeLaN (Structured Lagrangian) | 1.2 × 10⁻¹ ± 5.9 × 10⁻⁴ | 3.0 × 10⁻² ± 4.4 × 10⁻⁴ | 1.2 × 10⁻¹ ± 9.9 × 10⁻⁴ | −
DeLaN (Black-Box Lagrangian) | 1.8 × 10⁻² ± 6.2 × 10⁻⁵ | 2.6 × 10⁻³ ± 3.1 × 10⁻⁵ | 2.6 × 10⁻³ ± 2.8 × 10⁻⁵ | 8.159 × 10⁻¹ ± 7.200 × 10⁻⁴
FF-NN (Feed-Forward Network) | 1.0 × 10⁻² ± 3.4 × 10⁻⁴ * | 7.7 × 10⁻⁴ ± 6.3 × 10⁻⁵ * | 6.6 × 10⁻⁴ ± 2.0 × 10⁻⁵ * | 9.092 × 10⁻¹ ± 2.783 × 10⁻³ *
DeLaN-Friction (Introducing Friction) | 1.1 × 10⁻¹ ± 8.2 × 10⁻⁴ | 2.9 × 10⁻² ± 3.9 × 10⁻⁴ | 1.1 × 10⁻¹ ± 9.6 × 10⁻⁴ | −
DeLaN-FFNN (Augmented DeLaN) | 1.0 × 10⁻² ± 5.3 × 10⁻⁴ * | 7.6 × 10⁻⁴ ± 1.3 × 10⁻⁴ * | 7.1 × 10⁻⁴ ± 3.1 × 10⁻⁵ * | 9.107 × 10⁻¹ ± 4.870 × 10⁻³ *

Non-iid test set 0:
DeLaN (Structured Lagrangian) | 1.1 × 10⁻¹ ± 3.8 × 10⁻² | 5.0 × 10⁻² ± 7.8 × 10⁻³ | 8.1 × 10⁻¹ ± 3.9 × 10⁻² | −
DeLaN (Black-Box Lagrangian) | 4.0 × 10⁻² ± 4.2 × 10⁻⁴ * | 8.8 × 10⁻³ ± 2.7 × 10⁻³ | 3.0 × 10⁻² ± 4.7 × 10⁻³ | 5.170 × 10⁻¹ ± 3.568 × 10⁻²
FF-NN (Feed-Forward Network) | 4.6 × 10⁻² ± 1.8 × 10⁻³ * | 4.7 × 10⁻³ ± 3.9 × 10⁻⁴ * | 1.4 × 10⁻² ± 5.5 × 10⁻⁴ * | 6.070 × 10⁻¹ ± 1.444 × 10⁻² *
DeLaN-Friction (Introducing Friction) | 1.2 × 10⁻¹ ± 3.6 × 10⁻² | 5.0 × 10⁻² ± 3.4 × 10⁻³ | 8.1 × 10⁻¹ ± 4.0 × 10⁻² | −
DeLaN-FFNN (Augmented DeLaN) | 4.8 × 10⁻² ± 1.9 × 10⁻³ * | 5.0 × 10⁻³ ± 8.4 × 10⁻⁴ * | 1.2 × 10⁻² ± 1.3 × 10⁻³ * | 6.020 × 10⁻¹ ± 9.690 × 10⁻³ *

Non-iid test set 1:
DeLaN (Structured Lagrangian) | 7.1 × 10⁻¹ ± 1.7 × 10⁻¹ | 3.8 × 10⁻¹ ± 1.5 × 10⁻¹ | 3.0 × 10⁰ ± 1.1 × 10⁰ | −
DeLaN (Black-Box Lagrangian) | 3.5 × 10⁻¹ ± 7.3 × 10⁻² | 4.6 × 10⁻² ± 1.7 × 10⁻² | 4.9 × 10⁻² ± 1.3 × 10⁻² | −
FF-NN (Feed-Forward Network) | 1.1 × 10⁻¹ ± 1.8 × 10⁻² * | 7.3 × 10⁻³ ± 1.4 × 10⁻³ * | 1.7 × 10⁻³ ± 1.1 × 10⁻⁴ * | 5.914 × 10⁻¹ ± 6.576 × 10⁻² *
DeLaN-Friction (Introducing Friction) | 8.1 × 10⁻¹ ± 2.9 × 10⁻¹ | 4.6 × 10⁻¹ ± 2.4 × 10⁻¹ | 3.9 × 10⁰ ± 2.2 × 10⁰ | −
DeLaN-FFNN (Augmented DeLaN) | 1.0 × 10⁻¹ ± 2.8 × 10⁻² * | 2.4 × 10⁻² ± 1.9 × 10⁻² | 6.3 × 10⁻² ± 7.8 × 10⁻² | 3.519 × 10⁻¹ ± 2.594 × 10⁻¹

Non-iid test set 2:
DeLaN (Structured Lagrangian) | 1.4 × 10⁰ ± 6.5 × 10⁻¹ | 3.8 × 10⁻¹ ± 2.8 × 10⁻¹ | 4.0 × 10⁰ ± 1.5 × 10⁰ | −
DeLaN (Black-Box Lagrangian) | 1.8 × 10⁻¹ ± 2.5 × 10⁻² * | 6.4 × 10⁻² ± 2.5 × 10⁻² | 1.1 × 10⁻¹ ± 1.8 × 10⁻² | 2.965 × 10⁻¹ ± 1.245 × 10⁻¹
FF-NN (Feed-Forward Network) | 2.4 × 10⁻¹ ± 4.7 × 10⁻² * | 2.2 × 10⁻² ± 2.5 × 10⁻³ * | 6.4 × 10⁻² ± 2.5 × 10⁻³ * | 3.501 × 10⁻¹ ± 1.002 × 10⁻¹ *
DeLaN-Friction (Introducing Friction) | 1.4 × 10⁰ ± 7.0 × 10⁻¹ | 5.0 × 10⁻¹ ± 5.3 × 10⁻¹ | 4.5 × 10⁰ ± 2.2 × 10⁰ | −
DeLaN-FFNN (Augmented DeLaN) | 2.8 × 10⁻¹ ± 5.3 × 10⁻² * | 3.5 × 10⁻² ± 1.7 × 10⁻² * | 1.2 × 10⁻¹ ± 4.3 × 10⁻² | 1.453 × 10⁻¹ ± 7.404 × 10⁻²

Note: * indicates better results; − indicates a coefficient of determination R² outside the 0–1 range.
Table 4. The MSE for each joint and the coefficient of determination (R²) for each dynamics model on each set of test data, along with the average and confidence interval calculated over five random seeds.

Model | Joint 0 (MSE) | Joint 1 (MSE) | Joint 2 (MSE) | Joint 3 (MSE) | Joint 4 (MSE) | R²

Iid test set:
DeLaN (Structured Lagrangian) | 6.703 × 10⁻⁴ ± 6.984 × 10⁻⁵ | 2.579 × 10⁻⁴ ± 7.915 × 10⁻⁵ | 1.939 × 10⁻⁴ ± 4.009 × 10⁻⁵ | 1.008 × 10⁻³ ± 4.744 × 10⁻⁴ | 4.378 × 10⁻⁴ ± 1.355 × 10⁻⁴ | 9.787 × 10⁻¹ ± 6.316 × 10⁻³ *
DeLaN (Black-Box Lagrangian) | 2.337 × 10⁻⁴ ± 1.312 × 10⁻⁵ * | 6.853 × 10⁻⁵ ± 4.182 × 10⁻⁶ * | 5.890 × 10⁻⁵ ± 6.225 × 10⁻⁶ * | 1.689 × 10⁻⁴ ± 4.673 × 10⁻⁶ * | 1.390 × 10⁻⁴ ± 1.238 × 10⁻⁵ * | 9.945 × 10⁻¹ ± 2.411 × 10⁻⁴ *
FF-NN (Feed-Forward Network) | 2.735 × 10⁻⁴ ± 3.426 × 10⁻⁵ * | 8.273 × 10⁻⁵ ± 1.074 × 10⁻⁵ * | 6.578 × 10⁻⁵ ± 9.738 × 10⁻⁶ * | 1.915 × 10⁻⁴ ± 1.823 × 10⁻⁵ * | 1.611 × 10⁻⁴ ± 1.659 × 10⁻⁵ * | 9.936 × 10⁻¹ ± 6.993 × 10⁻⁴ *
DeLaN-Friction (Introducing Friction) | 6.685 × 10⁻⁴ ± 1.044 × 10⁻⁴ | 2.568 × 10⁻⁴ ± 7.258 × 10⁻⁵ | 2.021 × 10⁻⁴ ± 5.867 × 10⁻⁵ | 9.732 × 10⁻⁴ ± 4.707 × 10⁻⁴ | 4.413 × 10⁻⁴ ± 8.665 × 10⁻⁵ | 9.789 × 10⁻¹ ± 6.276 × 10⁻³ *
DeLaN-FFNN (Augmented DeLaN) | 2.362 × 10⁻⁴ ± 4.264 × 10⁻⁵ * | 8.470 × 10⁻⁵ ± 1.571 × 10⁻⁵ * | 6.816 × 10⁻⁵ ± 6.304 × 10⁻⁶ * | 1.861 × 10⁻⁴ ± 3.278 × 10⁻⁵ * | 1.660 × 10⁻⁴ ± 2.211 × 10⁻⁵ * | 9.939 × 10⁻¹ ± 8.587 × 10⁻⁴ *
Non-iid test set:
DeLaN (Structured Lagrangian) | 2.728 × 10² ± 2.981 × 10² | 1.948 × 10³ ± 2.788 × 10³ | 3.054 × 10³ ± 3.370 × 10³ | 1.779 × 10³ ± 2.983 × 10³ | 2.212 × 10³ ± 1.770 × 10³ | −
DeLaN (Black-Box Lagrangian) | 1.172 × 10⁻¹ ± 4.657 × 10⁻² * | 1.006 × 10⁻¹ ± 9.268 × 10⁻² * | 9.031 × 10⁻² ± 7.375 × 10⁻² * | 1.169 × 10⁻¹ ± 2.503 × 10⁻² * | 1.105 × 10⁻¹ ± 1.928 × 10⁻² * | −
FF-NN (Feed-Forward Network) | 2.319 × 10⁻¹ ± 2.948 × 10⁻² * | 1.220 × 10⁻¹ ± 4.778 × 10⁻² * | 7.723 × 10⁻² ± 3.886 × 10⁻² * | 1.996 × 10⁻¹ ± 5.962 × 10⁻² * | 2.661 × 10⁻¹ ± 1.213 × 10⁻¹ * | −
DeLaN-Friction (Introducing Friction) | 3.139 × 10² ± 3.369 × 10² | 1.019 × 10³ ± 1.540 × 10³ | 2.755 × 10³ ± 1.507 × 10³ | 5.593 × 10² ± 6.002 × 10² | 3.693 × 10³ ± 4.022 × 10³ | −
DeLaN-FFNN (Augmented DeLaN) | 4.361 × 10⁻¹ ± 6.790 × 10⁻¹ * | 1.373 × 10⁻¹ ± 8.175 × 10⁻² * | 1.060 × 10⁻¹ ± 7.969 × 10⁻² | 2.760 × 10⁻¹ ± 2.681 × 10⁻¹ * | 3.950 × 10⁻¹ ± 3.146 × 10⁻¹ * | −

Note: * indicates better results; − indicates a coefficient of determination R² outside the 0–1 range.