Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

QuadKAN: KAN-Enhanced Quadruped Motion Control via End-to-End Reinforcement Learning

Created by
  • Haebom

Author

Yinuo Wang, Gavin Tao

Outline

This paper addresses vision-guided quadruped robot control using reinforcement learning (RL), emphasizing the essential integration of proprioception and vision for robust control. We propose QuadKAN, a spline-parameterized cross-modal policy using Kolmogorov-Arnold Networks (KANs). QuadKAN integrates a spline encoder for proprioception and a spline fusion head for proprioceptive-visual information. This structured class of functions aligns the state-action mapping with the piecewise smoothness of gait, improving sample efficiency, reducing action tremor and energy consumption, and providing interpretable pose-action sensitivity. We employ Multimodal Delay Randomization (MMDR) and perform end-to-end learning with Proximal Policy Optimization (PPO). Evaluation results on a variety of terrains, including uniform and uneven surfaces and scenarios with static and dynamic obstacles, demonstrate that QuadKAN consistently achieves higher returns, longer travel distances, and fewer collisions than state-of-the-art (SOTA) baseline models. These results demonstrate that spline parameterized policies are a simple, effective, and interpretable alternative for robust vision-guided walking.

Takeaways, Limitations

Takeaways:
We experimentally demonstrate that a spline parameterized cross-modal policy (QuadKAN) improves the robustness and efficiency of vision-guided quadruped robot control.
We confirm that the combination of proprioceptive and visual information is important for robust control of quadruped walking robots.
Spline parameterization achieves increased sample efficiency, reduced behavioral tremor, reduced energy consumption, and improved interpretability.
Achieving cutting-edge performance in a variety of terrains.
Limitations:
Further research is needed on the generalization performance of the proposed method.
Experimental results on real robots are not yet presented (repository to be made public).
In-depth analysis of performance in specific environments is required.
👍