Daily Arxiv

This page curates AI-related papers published worldwide.
All content here is summarized using Google Gemini, and the page is operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Diversifying Robot Locomotion Behaviors with Extrinsic Behavioral Curiosity

Created by
  • Haebom

Authors

Zhenglin Wan, Xingrui Yu, David Mark Bossens, Yueming Lyu, Qing Guo, Flint Xiaofeng Fan, Yew Soon Ong, Ivor Tsang

Outline

This paper presents Quality Diversity Inverse Reinforcement Learning (QD-IRL), a framework that addresses the limitations of learning a single expert policy and enables diverse, robust robot locomotion. It integrates quality diversity optimization with IRL techniques to learn diverse behaviors from limited demonstration data. To improve exploration of diverse locomotion behaviors, the authors introduce Extrinsic Behavioral Curiosity (EBC), in which an external evaluator gives the agent an additional curiosity reward based on how novel its behavior is relative to the behavioral archive. EBC is evaluated with GAIL, VAIL, and DiffAIL on several robot locomotion tasks, where it improves performance and exceeds expert performance by up to 20% in a humanoid environment. The authors also show that EBC can be applied to Gradient-Arborescence-based QD-RL algorithms.
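The core EBC idea described above, an extra reward for behavior that is new to the archive, can be illustrated with a small sketch. This is not the authors' implementation. The grid-shaped archive, the class `BehaviorArchive`, the function `shaped_rewards`, the `bonus_scale` parameter, and the rule "grant a bonus when the episode's behavior measure lands in an unoccupied cell" are all assumptions chosen for illustration, and the imitation rewards stand in for whatever a GAIL-style discriminator would produce.

```python
import numpy as np


class BehaviorArchive:
    """Hypothetical grid archive over a bounded behavior-measure space."""

    def __init__(self, low, high, cells_per_dim):
        self.low = np.asarray(low, dtype=float)
        self.high = np.asarray(high, dtype=float)
        self.cells_per_dim = int(cells_per_dim)
        self.occupied = set()  # grid cells that already hold a behavior

    def cell_of(self, measure):
        # Map a measure vector to integer grid coordinates, clipped to the bounds.
        frac = (np.asarray(measure, dtype=float) - self.low) / (self.high - self.low)
        idx = np.clip((frac * self.cells_per_dim).astype(int), 0, self.cells_per_dim - 1)
        return tuple(int(i) for i in idx)

    def is_novel(self, measure):
        return self.cell_of(measure) not in self.occupied

    def add(self, measure):
        self.occupied.add(self.cell_of(measure))


def shaped_rewards(imitation_rewards, measure, archive, bonus_scale=1.0):
    """Add a constant curiosity bonus to each step of an episode whose
    behavior measure falls in an unoccupied archive cell (assumed rule)."""
    rewards = np.asarray(imitation_rewards, dtype=float)
    bonus = bonus_scale if archive.is_novel(measure) else 0.0
    archive.add(measure)  # the cell counts as visited from now on
    return rewards + bonus


if __name__ == "__main__":
    archive = BehaviorArchive(low=[0.0, 0.0], high=[1.0, 1.0], cells_per_dim=10)
    # First visit to a cell earns the bonus; a repeat visit does not.
    print(shaped_rewards([0.1, 0.2], [0.35, 0.8], archive, bonus_scale=0.5))
    print(shaped_rewards([0.1, 0.2], [0.35, 0.8], archive, bonus_scale=0.5))
```

In this sketch the bonus comes from outside the policy (the archive plays the role of the external evaluator), so the same shaping could in principle wrap any imitation reward. How the paper scales and distributes the bonus is not stated in this summary.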

Takeaways, Limitations

Takeaways:
Robust and diverse robot gaits can be learned from limited demonstration data.
EBC enables diverse behavioral exploration and improves performance (confirmed with GAIL, VAIL, DiffAIL, and QD-RL algorithms).
Performance exceeds expert level in some environments.
The open-source code makes the results easy to reproduce.
Limitations:
The effectiveness of EBC may be limited to specific environments and algorithms.
Further research is needed on generalization across different robot platforms and tasks.
The design and reliability of the external evaluators need careful consideration.