Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning

Created by
  • Haebom

Author

Pusen Dong, Tianchen Zhu, Yue Qiu, Haoyi Zhou, Jianxin Li

Outline

This paper presents a method for performing secure reinforcement learning under constraints expressed in natural language. Existing methods have the limitation of requiring manual design of cost functions for each constraint. In this paper, we propose the Trajectory-level Textual Constraints Translator (TTCT), which automatically generates cost functions using natural language constraints. TTCT learns by combining natural language constraints with trajectories, and experimental results demonstrate that it learns policies with lower violation rates than existing manually designed cost functions. Furthermore, we demonstrate zero-shot transfer capability, which can be applied to environments with changing constraints.

Takeaways, Limitations

Takeaways:
We present a novel method for effectively understanding constraints in natural language and training safe reinforcement learning agents.
Learning is possible with only natural language constraints, without the need to manually design a cost function.
Demonstrating applicability to various environments through zero-shot transfer capability.
Achieve lower violation rates than existing methods.
Limitations:
Further research is needed on the generalization performance of TTCT.
Need to evaluate the ability to handle complex or ambiguous natural language constraints.
Additional validation is needed for real-world application.
👍