Quick Search Adv. Search

Journal of Bionic Engineering ›› 2025, Vol. 22 ›› Issue (1): 107-117.doi: 10.1007/s42235-024-00618-z

Previous Articles     Next Articles

Robust Walking and Sim-to-Real Optimization for Quadruped Robots via Reinforcement Learning

Chao Ji1,2; Diyuan Liu2; Wei Gao1; Shiwu Zhang1

  

  1. 1 Department of Precision Machinery and Precision Instrumentation, School of Engineering Science, University of Science and Technology of China, Hefei 230026, China
    2 iFLYTEK Co., Ltd., Hefei 230088, China
  • Online:2025-01-03 Published:2025-02-19
  • Contact: Wei Gao; Shiwu Zhang E-mail:weigao@ustc.edu.cn; swzhang@ustc.edu.cn
  • About author:Chao Ji1,2; Diyuan Liu2; Wei Gao1; Shiwu Zhang1

Abstract: Achieving robust walking for different stairs is one of the most challenging tasks for quadruped robots in real world. Traditional model-based methods heavily rely on environmental factors, are burdened by intricate modelling complexities, and lack generalizability. The potential for advancements in adaptive locomotion control, often impeded by complex modelling processes, can be substantially enhanced through the application of Reinforcement Learning (RL). In this paper, a learning-based method is proposed to directionally enhance the stair-climbing skill of quadruped robots under different stair conditions. First, the general policy model based on proprioceptive perception is trained as a pre-training model. Then, the pre-training model was initialized, and different terrain information from the stairs was introduced for customized training to enhance the stair-climbing skill without affecting the existing locomotion performance. Finally, the customized control policy is deployed to the real robot to realize motion control in real environments. The experimental results demonstrate that the customized control policy can significantly improve the motion performance of quadruped robots when facing complex stair terrains and has certain generalizability in other complex terrains. The proposed algorithm can be extended to various terrestrial environments.

Key words: Quadruped robot, Learning-based, Skill augmentation, Customized control policy, Sim-to-Real