Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification

Created by
  • Haebom

Author

Geonwoo Cho, Jaemoon Lee, Jaegyun Im, Subi Lee, Jihwan Lee, Sundong Kim

Outline

Skill-based reinforcement learning (SBRL) enables rapid adaptation in sparse reward environments through pre-trained skill-conditional policies. Effective skill learning requires maximizing both exploration and skill diversity. In this paper, we propose Adaptive Multi-objective Projection for balancing Exploration and Skill Diversification (AMPED) to simultaneously address both exploration and skill diversity. AMPED uses gradient surgical projection to balance exploration and skill diversity gradients during pre-training, and a skill selector during fine-tuning to select appropriate skills for downstream tasks. AMPED outperforms the SBRL baseline on various benchmarks, and component-wise role analysis confirms the performance contributions of each component in AMPED. Furthermore, we present theoretical and empirical evidence that using a greedy skill selector reduces fine-tuning sample complexity as skill diversity increases.

Takeaways, Limitations

Takeaways:
Emphasizes the importance of explicitly balancing exploration and technological diversity.
AMPED enables robust and generalizable skill learning.
The proposed AMPED methodology outperforms the SBRL baseline on various benchmarks.
Theoretical and empirical evidence demonstrates that technology diversity reduces fine-tuning sample complexity.
Limitations:
There is no Limitations specified in the paper.
👍