Extrapolation of Physics-Inspired Deep Networks in Learning Robot Inverse Dynamics
Abstract
:1. Introduction
2. Preliminaries
2.1. Inverse Dynamics Modeling
2.2. Lagrangian Mechanics
2.3. Physics-Inspired Deep Networks
2.3.1. Deep Lagrangian Networks
2.3.2. Introducing Friction to Model Learning
2.3.3. Introducing Uncertain Forces to Model Learning
3. Extrapolation
4. Experiment
- Q1: Under ideal observation data, can physics-inspired networks learn the underlying structure of physical systems?
- Q2: When applied to real physical systems, will the extrapolation capabilities of physics-inspired networks be limited by physical priors?
- Q3: What are the main factors that affect the extrapolation performance of physics-inspired networks in real systems?
4.1. Experiment Setup
4.1.1. Evaluation Metrics
4.1.2. Neural Network Training Details
4.2. Simulated Robot Experiments
4.2.1. Dataset Construction
4.2.2. Modeling Experiments
4.3. Real Robot Experiments
4.3.1. Dataset Construction
4.3.2. Extrapolation across Different Policies
4.3.3. Extrapolation across Different Loads
4.4. Results Analysis and Discussion
- (1)
- Under ideal observation data, structured DeLaN, DeLaN-Friction, and DeLaN-FFNN are capable of learning the underlying structure of physical systems and possess strong extrapolation capabilities. However, black-box DeLaN, which is also a physics-inspired network, cannot learn the underlying structure of the system from the data due to its lack of learning ability for the mass matrix , kinetic energy , and potential energy .
- (2)
- When applied to real systems, structured DeLaN and DeLaN-Friction show weaker extrapolation than black-box DeLaN, with DeLaN-FFNN slightly behind the black-box model. The failure of physics-inspired networks to capture all dynamics phenomena indicates that excessive physical constraints can limit extrapolation abilities. Improved extrapolation in physics-inspired networks requires more comprehensive capture of the real system dynamics.
- (3)
- In order for physics-inspired networks to achieve better extrapolation on real systems, they rely on two key factors: high-fidelity dynamics data that accurately reflects system phenomena, and the integration of comprehensive dynamics priors. These priors should transparently capture the real system dynamics instead of using black-box methods such as the uncertain forces in DeLaN-FFNN.
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A
Appendix B
References
- Wu, S.; Li, Z.; Chen, W.; Sun, F. Dynamic Modeling of Robotic Manipulator via an Augmented Deep Lagrangian Network. Tsinghua Sci. Technol. 2024, 29, 1604–1614. [Google Scholar] [CrossRef]
- Valencia-Vidal, B.; Ros, E.; Abadía, I.; Luque, N.R. Bidirectional recurrent learning of inverse dynamic models for robots with elastic joints: A real-time real-world implementation. Front. Neurorobot. 2023, 17, 1166911. [Google Scholar] [CrossRef] [PubMed]
- Ulbrich, S.; Bechtel, M.; Asfour, T.; Dillmann, R. Learning robot dynamics with kinematic bezier maps. In Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, Algarve Portugal, 7–12 October 2012; pp. 3598–3604. [Google Scholar]
- Giacomuzzo, G.; Dalla Libera, A.; Carli, R. Embedding the Physics in Black-box Inverse Dynamics Identification: A Comparison Between Gaussian Processes and Neural Networks. IFAC-PapersOnLine 2023, 56, 1584–1590. [Google Scholar] [CrossRef]
- Abdulkadirov, R.; Lyakhov, P.; Nagornov, N. Survey of optimization algorithms in modern neural networks. Mathematics 2023, 11, 2466. [Google Scholar] [CrossRef]
- Cunha, B.Z.; Droz, C.; Zine, A.M.; Foulard, S.; Ichchou, M. A review of machine learning methods applied to structural dynamics and vibroacoustic. Mech. Syst. Signal Process. 2023, 200, 110535. [Google Scholar] [CrossRef]
- Lutter, M.; Peters, J. Combining physics and deep learning to learn continuous-time dynamics models. Int. J. Robot. Res. 2023, 42, 83–107. [Google Scholar] [CrossRef]
- Liu, J.; Borja, P.; Della Santina, C. Physics-Informed Neural Networks to Model and Control Robots: A Theoretical and Experimental Investigation. Adv. Intell. Syst. 2024, 6, 2300385. [Google Scholar] [CrossRef]
- Schiassi, E.; De Florio, M.; D’Ambrosio, A.; Mortari, D.; Furfaro, R. Physics-informed neural networks and functional interpolation for data-driven parameters discovery of epidemiological compartmental models. Mathematics 2021, 9, 2069. [Google Scholar] [CrossRef]
- Lutter, M.; Ritter, C.; Peters, J. Deep Lagrangian Networks: Using Physics as Model Prior for Deep Learning. In Proceedings of the International Conference on Learning Representations, Vancouver, BC, Canada, 30 April–3 May 2018. [Google Scholar]
- Greydanus, S.; Dzamba, M.; Yosinski, J. Hamiltonian neural networks. In Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada, 8–14 December 2019; Volume 32. [Google Scholar]
- Cranmer, M.; Greydanus, S.; Hoyer, S.; Battaglia, P.W.; Spergel, D.N.; Ho, S. Lagrangian Neural Networks. In Proceedings of the International Conference on Learning Representations 2020, Addis Ababa, Ethiopia, 26–30 April 2020. [Google Scholar]
- Zhong, Y.D.; Dey, B.; Chakraborty, A. Extending lagrangian and hamiltonian neural networks with differentiable contact models. Adv. Neural Inf. Process. Syst. 2021, 34, 21910–21922. [Google Scholar]
- Liu, L.; Zuo, G.; Li, J.; Li, J. Dynamics modeling with realistic constraints for trajectory tracking control of manipulator. In Proceedings of the 2022 IEEE International Conference on Real-time Computing and Robotics (RCAR), Guiyang, China, 17–22 July 2022; pp. 99–104. [Google Scholar]
- Lutter, M.; Listmann, K.; Peters, J. Deep lagrangian networks for end-to-end learning of energy-based control for under-actuated systems. In Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macao, China, 4–8 November 2019; pp. 7718–7725. [Google Scholar]
- Lutter, M.; Silberbauer, J.; Watson, J.; Peters, J. A differentiable newton euler algorithm for multi-body model learning. arXiv 2020, arXiv:2010.09802. [Google Scholar]
- Lutter, M.; Silberbauer, J.; Watson, J.; Peters, J. Differentiable physics models for real-world offline model-based reinforcement learning. In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an China, 30 May–5 June 2021; pp. 4163–4170. [Google Scholar]
- Rueckert, E.; Nakatenus, M.; Tosatto, S.; Peters, J. Learning inverse dynamics models in o (n) time with lstm networks. In Proceedings of the 2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids), Birmingham, UK, 15–17 November 2017; pp. 811–816. [Google Scholar]
- Wang, S.; Shao, X.; Yang, L.; Liu, N. Deep learning aided dynamic parameter identification of 6-DOF robot manipulators. IEEE Access 2020, 8, 138102–138116. [Google Scholar] [CrossRef]
- Bittencourt, A.C.; Gunnarsson, S. Static friction in a robot joint—Modeling and identification of load and temperature effects. J. Dyn. Sys. Meas. Control 2012, 134, 051013. [Google Scholar] [CrossRef]
- Hitzler, K.; Meier, F.; Schaal, S.; Asfour, T. Learning and adaptation of inverse dynamics models: A comparison. In Proceedings of the 2019 IEEE-RAS 19th International Conference on Humanoid Robots (Humanoids), Toronto, ON, Canada, 15–17 October 2019; pp. 491–498. [Google Scholar]
- Song, L.; Wang, A.; Zhong, J. Inverse dynamics modeling and analysis of healthy human data for lower limb rehabilitation robots. Electronics 2022, 11, 3848. [Google Scholar] [CrossRef]
- Packer, C.; Gao, K.; Kos, J.; Krähenbühl, P.; Koltun, V.; Song, D. Assessing generalization in deep reinforcement learning. arXiv 2018, arXiv:1810.12282. [Google Scholar]
- Sahoo, S.; Lampert, C.; Martius, G. Learning equations for extrapolation and control. In Proceedings of the International Conference on Machine Learning, Stockholm, Sweden, 10–15 July 2018; pp. 4442–4450. [Google Scholar]
- Agudelo-Espana, D.; Zadaianchuk, A.; Wenk, P.; Garg, A.; Akpo, J.; Grimminger, F.; Viereck, J.; Naveau, M.; Righetti, L.; Martius, G.; et al. A real-robot dataset for assessing transferability of learned dynamics models. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020; pp. 8151–8157. [Google Scholar]
- Reuss, M.; Duijkeren, N.; Krug, R.; Becker, P.; Shaj, V.; Neumann, G. End-to-end learning of hybrid inverse dynamics models for precise and compliant impedance control. arXiv 2022, arXiv:2205.13804. [Google Scholar]
- Jorge, D.; Pizzuto, G.; Mistry, M. Efficient learning of inverse dynamics models for adaptive computed torque control. In Proceedings of the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 23–27 October 2022; p. 1120311208. [Google Scholar]
- Saviolo, A.; Loianno, G. Learning quadrotor dynamics for precise, safe, and agile flight control. Annu. Rev. Control 2023, 55, 45–60. [Google Scholar] [CrossRef]
- Hochlehnert, A.; Terenin, A.; Sæmundsson, S.; Deisenroth, M. Learning contact dynamics using physically structured neural networks. In Proceedings of the International Conference on Artificial Intelligence and Statistics, Virtual, 13–15 April 2021; pp. 2152–2160. [Google Scholar]
- Duong, T.; Altawaitan, A.; Stanley, J.; Atanasov, N. Port-Hamiltonian Neural ODE Networks on Lie Groups for Robot Dynamics Learning and Control. arXiv 2024, arXiv:2401.09520. [Google Scholar] [CrossRef]
3-DoF Manipulator | 3-DoF Robot Arm | KUKA LWR | |
---|---|---|---|
Batch Size | 512 | 1024 | 512 |
Activation | Tanh | Sigmoid | Tanh |
Network Dimension | [2 × 64] | [2 × 128] | [2 × 128] |
Inverse Model (nMSE) | Inverse Model (MSE) | ||||||||
---|---|---|---|---|---|---|---|---|---|
Iid Test Set | Torque | Inertial Torque | Coriolis Torque | Gravitational Torque | Joint 0 | Joint 1 | Joint 2 | ||
DeLaN | Structured Lagrangian | ||||||||
DeLaN | Black-Box Lagrangian | ||||||||
FF-NN | Feed-Forward Network | ||||||||
DeLaN-Friction | Introducing Friction | ||||||||
DeLaN-FFNN | An Augmented DeLaN | ||||||||
Non-iid Test Set 0 | Torque | Inertial Torque | Coriolis Torque | Gravitational Torque | Joint 0 | Joint 1 | Joint 2 | ||
DeLaN | Structured Lagrangian | ||||||||
DeLaN | Black-Box Lagrangian | ||||||||
FF-NN | Feed-Forward Network | ||||||||
DeLaN-Friction | Introducing Friction | ||||||||
DeLaN-FFNN | An Augmented DeLaN | ||||||||
Non-iid Test Set 1 | Torque | Inertial Torque | Coriolis Torque | Gravitational Torque | Joint 0 | Joint 1 | Joint 2 | ||
DeLaN | Structured Lagrangian | ||||||||
DeLaN | Black-Box Lagrangian | ||||||||
FF-NN | Feed-Forward Network | ||||||||
DeLaN-Friction | Introducing Friction | ||||||||
DeLaN-FFNN | An Augmented DeLaN |
Inverse Model (MSE) | |||||
---|---|---|---|---|---|
Iid Test Set | Joint 0 | Joint 1 | Joint 2 | ||
DeLaN | Structured Lagrangian | ± | ± | ± | − |
DeLaN | Black-Box Lagrangian | ± | ± | ± | ± |
FF-NN | Feed-Forward Network | ||||
DeLaN-Friction | Introducing Friction | ± | ± | ± | − |
DeLaN-FFNN | An Augmented DeLaN | ||||
Non-iid Test Set 0 | Joint 0 | Joint 1 | Joint 2 | ||
DeLaN | Structured Lagrangian | ± | ± | ± | − |
DeLaN | Black-Box Lagrangian | ± | ± | ± | |
FF-NN | Feed-Forward Network | ||||
DeLaN-Friction | Introducing Friction | ± | ± | ± | − |
DeLaN-FFNN | An Augmented DeLaN | ||||
Non-iid Test Set 1 | Joint 0 | Joint 1 | Joint 2 | ||
DeLaN | Structured Lagrangian | ± | ± | ± | − |
DeLaN | Black-Box Lagrangian | ± | ± | ± | − |
FF-NN | Feed-Forward Network | ||||
DeLaN-Friction | Introducing Friction | ± | ± | ± | − |
DeLaN-FFNN | An Augmented DeLaN | ± | ± | ± | |
Non-iid Test Set 2 | Joint 0 | Joint 1 | Joint 2 | ||
DeLaN | Structured Lagrangian | ± | ± | ± | − |
DeLaN | Black-Box Lagrangian | ± | ± | ± | |
FF-NN | Feed-Forward Network | ||||
DeLaN-Friction | Introducing Friction | ± | ± | ± | − |
DeLaN-FFNN | An Augmented DeLaN | ± | ± |
Inverse Model (MSE) | |||||||
---|---|---|---|---|---|---|---|
Iid Test Set | Joint 0 | Joint 1 | Joint 2 | Joint 3 | Joint 4 | ||
DeLaN | Structured Lagrangian | ± | ± | ± | ± | ± | |
DeLaN | Black-Box Lagrangian | ||||||
FF-NN | Feed-Forward Network | ||||||
DeLaN-Friction | Introducing Friction | ± | ± | ± | ± | ± | |
DeLaN-FFNN | An Augmented DeLaN | ||||||
Non-iid Test Set | Joint 0 | Joint 1 | Joint 2 | Joint 3 | Joint 4 | ||
DeLaN | Structured Lagrangian | ± | ± | ± | ± | ± | − |
DeLaN | Black-Box Lagrangian | − | |||||
FF-NN | Feed-Forward Network | − | |||||
DeLaN-Friction | Introducing Friction | ± | ± | ± | ± | ± | − |
DeLaN-FFNN | An Augmented DeLaN | ± | − |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Li, Z.; Wu, S.; Chen, W.; Sun, F. Extrapolation of Physics-Inspired Deep Networks in Learning Robot Inverse Dynamics. Mathematics 2024, 12, 2527. https://doi.org/10.3390/math12162527
Li Z, Wu S, Chen W, Sun F. Extrapolation of Physics-Inspired Deep Networks in Learning Robot Inverse Dynamics. Mathematics. 2024; 12(16):2527. https://doi.org/10.3390/math12162527
Chicago/Turabian StyleLi, Zhiming, Shuangshuang Wu, Wenbai Chen, and Fuchun Sun. 2024. "Extrapolation of Physics-Inspired Deep Networks in Learning Robot Inverse Dynamics" Mathematics 12, no. 16: 2527. https://doi.org/10.3390/math12162527
APA StyleLi, Z., Wu, S., Chen, W., & Sun, F. (2024). Extrapolation of Physics-Inspired Deep Networks in Learning Robot Inverse Dynamics. Mathematics, 12(16), 2527. https://doi.org/10.3390/math12162527