Journal of Bionic Engineering ›› 2023, Vol. 20 ›› Issue (3): 953-966.doi: 10.1007/s42235-022-00309-7

• • 上一篇    下一篇

Design of Deep Reinforcement Learning Controller Through Data-assisted Model for Robotic Fish Speed Tracking

Palmani Duraisamy1; Manigandan Nagarajan Santhanakrishnan1; Amirtharajan Rengarajan2   

  1. 1 Robotics Lab, School of EEE, SASTRA Deemed University, Thanjavur, Tamil Nadu 613401, India  2 Department of ECE, School of EEE, SASTRA Deemed University, Thanjavur, Tamil Nadu 613401, India
  • 出版日期:2023-05-10 发布日期:2023-05-10
  • 通讯作者: Manigandan Nagarajan Santhanakrishnan E-mail:manigandanns@eie.sastra.edu
  • 作者简介:Palmani Duraisamy1; Manigandan Nagarajan Santhanakrishnan1; Amirtharajan Rengarajan2

Design of Deep Reinforcement Learning Controller Through Data-assisted Model for Robotic Fish Speed Tracking

Palmani Duraisamy1; Manigandan Nagarajan Santhanakrishnan1; Amirtharajan Rengarajan2   

  1. 1 Robotics Lab, School of EEE, SASTRA Deemed University, Thanjavur, Tamil Nadu 613401, India  2 Department of ECE, School of EEE, SASTRA Deemed University, Thanjavur, Tamil Nadu 613401, India
  • Online:2023-05-10 Published:2023-05-10
  • Contact: Manigandan Nagarajan Santhanakrishnan E-mail:manigandanns@eie.sastra.edu
  • About author:Palmani Duraisamy1; Manigandan Nagarajan Santhanakrishnan1; Amirtharajan Rengarajan2

摘要: It is common for robotic fish to generate thrust using reactive force generated by the tail’s physical motion, which interacts with the surrounding fluid. The coupling effect of the body strongly correlates with this thrust. However, hydrodynamics cannot be wholly modeled in analytical form. Therefore, data-assisted modeling is necessary for robotic fish. This work presents the first method of its kind using Genetic Algorithm (GA)-based optimization methods for data-assistive modeling for robotic fish applications. To begin, experimental data are collected in real time with the robotic fish that has been designed and fabricated using 3D printing. Then, the model’s influential parameters are estimated using an optimization problem. Further, a model-based deep reinforcement learning (DRL) controller is proposed to track the desired speed through extensive simulation work. In addition to a deep deterministic policy gradient (DDPG), a twin delayed DDPG (TD3) is employed in the training of the RL agent. Unfortunately, due to its local optimization problem, the RL-DDPG controller failed to perform well during training. In contrast, the RL-TD3 controller effectively learns the control policies and overcomes the local optima problem. As a final step, controller performance is evaluated under different disturbance conditions. In contrast to DDPG and GA-tuned proportional-integral controllers, the proposed model with RL-TD3 controller significantly improves the performance.

关键词: Bioinspired robot , · Data-assisted model , · Genetic algorithm , · Reinforcement learning control , · Robotic fish

Abstract: It is common for robotic fish to generate thrust using reactive force generated by the tail’s physical motion, which interacts with the surrounding fluid. The coupling effect of the body strongly correlates with this thrust. However, hydrodynamics cannot be wholly modeled in analytical form. Therefore, data-assisted modeling is necessary for robotic fish. This work presents the first method of its kind using Genetic Algorithm (GA)-based optimization methods for data-assistive modeling for robotic fish applications. To begin, experimental data are collected in real time with the robotic fish that has been designed and fabricated using 3D printing. Then, the model’s influential parameters are estimated using an optimization problem. Further, a model-based deep reinforcement learning (DRL) controller is proposed to track the desired speed through extensive simulation work. In addition to a deep deterministic policy gradient (DDPG), a twin delayed DDPG (TD3) is employed in the training of the RL agent. Unfortunately, due to its local optimization problem, the RL-DDPG controller failed to perform well during training. In contrast, the RL-TD3 controller effectively learns the control policies and overcomes the local optima problem. As a final step, controller performance is evaluated under different disturbance conditions. In contrast to DDPG and GA-tuned proportional-integral controllers, the proposed model with RL-TD3 controller significantly improves the performance.

Key words: Bioinspired robot , · Data-assisted model , · Genetic algorithm , · Reinforcement learning control , · Robotic fish