Learning Robust Locomotion for Bipedal Robot via Embedded Mechanics Properties

doi:10.1007/s42235-023-00452-9

Journal of Bionic Engineering, 2024, Vol. 21, Issue (3): 1278-1289. doi: 10.1007/s42235-023-00452-9


Yuanxi Zhang1 · Xuechao Chen1,2 · Fei Meng1,2 · Zhangguo Yu1,2 · Yidong Du1 · Junyao Gao1,2 · Qiang Huang1,2

1. School of Mechatronical Engineering, Beijing Institute of Technology, Beijing 100081, China
2. Key Laboratory of Biomimetic Robots and Systems, Ministry of Education, Beijing 100081, China

Online: 2024-05-20 Published: 2024-06-08
Contact: Fei Meng, Yuanxi Zhang, Xuechao Chen, Zhangguo Yu, Yidong Du, Junyao Gao, Qiang Huang E-mail: mfly0208@bit.edu.cn; zhangyuanxi@bit.edu.cn; chenxuechao@bit.edu.cn; yuzg@bit.edu.cn; duyidong@bit.edu.cn; gaojunyao@bit.edu.cn; qhuang@bit.edu.cn

Abstract

Reinforcement learning (RL) holds considerable potential for legged-robot locomotion, but the gap between simulation and the real world makes sim-to-real transfer challenging. A large support polygon helps to overcome some of these challenges. Quadruped robots have a considerable support polygon, followed by bipedal robots with actuated feet, while point-footed bipedal robots have the smallest. Consequently, despite the sim-to-real gap, most recent RL approaches have been deployed on real quadruped robots, which are inherently more stable, whereas RL-based bipedal locomotion remains challenged by the zero-shot sim-to-real task. This is especially true for point-footed bipeds: they offer better dynamic performance, but their inevitable tumbles add further barriers to sim-to-real transfer. The crux of the problem is the difference in mechanics properties between the physical robot and the simulated one, which makes it difficult to reproduce the learned skills on the physical bipedal robot. In this paper, we introduce embedded mechanics properties (EMP), based on optimization with Gaussian processes, into RL training. This makes sim-to-real transfer possible on the BRS1-P robot used in this work, so that the trained policy can be deployed on the BRS1-P without difficulty. We validate the learning-based BRS1-P under disturbances and on terrains not encountered during training, demonstrating its bipedal locomotion and disturbance-resistance performance.

Key words: Bipedal robot · Reinforcement learning · Sim-to-real · Mechanics properties
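
The abstract states only that the EMP are based on "optimization with Gaussian processes" and gives no further detail. As a rough illustration of that general idea, and not the authors' method, the sketch below uses a Gaussian-process surrogate with a lower-confidence-bound rule to search for a single simulator parameter that minimizes a sim-versus-hardware discrepancy. Everything in it is an assumption made for illustration: the parameter (a joint damping value in [0, 1]), the function names, the kernel settings, and the toy `sim_real_mismatch` function, which stands in for an actual comparison of simulated and measured trajectories.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=0.2, variance=1.0):
    """Squared-exponential kernel between two 1-D arrays of inputs."""
    diff = a[:, None] - b[None, :]
    return variance * np.exp(-0.5 * (diff / length_scale) ** 2)


def gp_posterior(x_train, y_train, x_query, noise=1e-4):
    """Posterior mean and standard deviation of a zero-mean GP (unit prior variance)."""
    k_tt = rbf_kernel(x_train, x_train) + noise * np.eye(len(x_train))
    k_tq = rbf_kernel(x_train, x_query)
    chol = np.linalg.cholesky(k_tt)
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, y_train))
    mean = k_tq.T @ alpha
    v = np.linalg.solve(chol, k_tq)
    var = np.clip(1.0 - np.sum(v ** 2, axis=0), 0.0, None)
    return mean, np.sqrt(var)


def sim_real_mismatch(damping, true_damping=0.37):
    """Hypothetical stand-in for the sim-vs-hardware discrepancy.

    In practice this would run the simulator with `damping` and compare the
    result against logged hardware data; here it is a toy quadratic.
    """
    return (damping - true_damping) ** 2


def identify_parameter(n_iter=20, kappa=2.0):
    """GP-based search for the parameter value with the smallest mismatch."""
    candidates = np.linspace(0.0, 1.0, 201)
    x = np.array([0.0, 0.5, 1.0])  # fixed initial design
    y = np.array([sim_real_mismatch(v) for v in x])
    for _ in range(n_iter):
        # Standardize observations so the unit-variance prior is reasonable.
        y_norm = (y - y.mean()) / (y.std() + 1e-12)
        mean, std = gp_posterior(x, y_norm, candidates)
        # Lower confidence bound: prefer low predicted mismatch or high uncertainty.
        next_x = candidates[np.argmin(mean - kappa * std)]
        x = np.append(x, next_x)
        y = np.append(y, sim_real_mismatch(next_x))
    return x[np.argmin(y)]


if __name__ == "__main__":
    print("estimated damping:", identify_parameter())
```

In a sim-to-real setting the identified value would then be written back into the simulator before policy training; how the paper embeds such properties into RL training is not described in the abstract.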

Yuanxi Zhang, Xuechao Chen, Fei Meng, Zhangguo Yu, Yidong Du, Junyao Gao, Qiang Huang. Learning Robust Locomotion for Bipedal Robot via Embedded Mechanics Properties[J]. Journal of Bionic Engineering, 2024, 21(3): 1278-1289.
