Next Article in Journal
Design and Force/Angle Independent Control of a Bionic Mechanical Ankle Based on an Artificial Muscle Matrix
Next Article in Special Issue
Design of Low-Cost Modular Bio-Inspired Electric–Pneumatic Actuator (EPA)-Driven Legged Robots
Previous Article in Journal
The Aerodynamic Effect of Biomimetic Pigeon Feathered Wing on a 1-DoF Flapping Mechanism
Previous Article in Special Issue
Bionic Design and Optimization on the Flow Channel of a Legged Robot Joint Hydraulic Drive Unit Based on Additive Manufacturing
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Learning Quadrupedal High-Speed Running on Uneven Terrain

1
Department of Automation, Tsinghua University, Beijing 100084, China
2
Beijing Innovation Center for Future Chips, Tsinghua University, Beijing 100084, China
*
Author to whom correspondence should be addressed.
Biomimetics 2024, 9(1), 37; https://doi.org/10.3390/biomimetics9010037
Submission received: 25 September 2023 / Revised: 20 November 2023 / Accepted: 8 December 2023 / Published: 5 January 2024
(This article belongs to the Special Issue Bio-Inspired Locomotion and Manipulation of Legged Robot)

Abstract

Reinforcement learning (RL)-based controllers have been applied to the high-speed movement of quadruped robots on uneven terrains. The external disturbances increase as the robot moves faster on such terrains, affecting the stability of the robot. Many existing RL-based methods adopt higher control frequencies to respond quickly to the disturbance, which requires a significant computational cost. We propose a control framework that consists of an RL-based control policy updating at a low frequency and a model-based joint controller updating at a high frequency. Unlike previous methods, our policy outputs the control law for each joint, executed by the corresponding high-frequency joint controller to reduce the impact of external disturbances on the robot. We evaluated our method on various simulated terrains with height differences of up to 6 cm. We achieved a running motion of 1.8 m/s in the simulation using the Unitree A1 quadruped. The RL-based control policy updates at 50 Hz with a latency of 20 ms, while the model-based joint controller runs at 1000 Hz. The experimental results show that the proposed framework can overcome the latency caused by low-frequency updates, making it applicable for real-robot deployment.
Keywords: reinforcement learning; quadrupedal robot; high-speed locomotion reinforcement learning; quadrupedal robot; high-speed locomotion

Share and Cite

MDPI and ACS Style

Han, X.; Zhao, M. Learning Quadrupedal High-Speed Running on Uneven Terrain. Biomimetics 2024, 9, 37. https://doi.org/10.3390/biomimetics9010037

AMA Style

Han X, Zhao M. Learning Quadrupedal High-Speed Running on Uneven Terrain. Biomimetics. 2024; 9(1):37. https://doi.org/10.3390/biomimetics9010037

Chicago/Turabian Style

Han, Xinyu, and Mingguo Zhao. 2024. "Learning Quadrupedal High-Speed Running on Uneven Terrain" Biomimetics 9, no. 1: 37. https://doi.org/10.3390/biomimetics9010037

APA Style

Han, X., & Zhao, M. (2024). Learning Quadrupedal High-Speed Running on Uneven Terrain. Biomimetics, 9(1), 37. https://doi.org/10.3390/biomimetics9010037

Article Metrics

Back to TopTop