CuriousRL: Curiosity-Driven Reinforcement Learning for Adaptive Locomotion in Quadruped Robots

Sushil Bohara, Muhammad Abdullah Hanif, Muhammad Shafique

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Though Proximal Policy Optimization (PPO) has emerged as a dominant algorithm for quadruped locomotion due to its stability and ease of implementation, its learning efficiency is affected by a limited exploration ability of the algorithm. We combine PPO with the Intrinsic Curiosity Module (ICM) to form CuriousRL, which enhances the exploration aspect of PPO, making the quadruped locomotion autonomous and adaptive in dynamic environments. ICM provides intrinsic rewards to the robot in addition to the external environmental rewards from PPO, fostering exploration. We use CuriousRL to teach quadruped robots learn to walk themselves autonomously. We simulate the experiments in Isaac Gym using the ANYmal quadrupeds and measure the performances in dynamic test environments with obstacles and uneven terrains using various environment sensor data including positions, velocities, forces, and torques in the legs and joints. We illustrate that CuriousRL performs better in terms of exploring effective policies and avoiding risk-averse stationary policy adaptation and ehancing sample efficiency.

Original languageEnglish (US)
Title of host publication2024 International Joint Conference on Neural Networks, IJCNN 2024 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350359312
DOIs
StatePublished - 2024
Event2024 International Joint Conference on Neural Networks, IJCNN 2024 - Yokohama, Japan
Duration: Jun 30 2024Jul 5 2024

Publication series

NameProceedings of the International Joint Conference on Neural Networks

Conference

Conference2024 International Joint Conference on Neural Networks, IJCNN 2024
Country/TerritoryJapan
CityYokohama
Period6/30/247/5/24

Keywords

  • CuriousRL
  • Exploration
  • Intrinsic Rewards
  • Proximal Policy Optimization
  • Quadrupeds
  • Reinforcement Learning

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'CuriousRL: Curiosity-Driven Reinforcement Learning for Adaptive Locomotion in Quadruped Robots'. Together they form a unique fingerprint.

Cite this