Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

EvoCurr: Self-evolving Curriculum with Behavior Code Generation for Complex Decision-making

Created by
  • Haebom

Author

Yang Cheng, Zilai Wang, Weiyu Ma, Wenhui Zhu, Yue Deng, Jian Zhao

Outline

This paper proposes EvoCurr, a self-evolving curriculum learning framework for complex problem solving. EvoCurr adapts the learning progress of the solver by generating a sequence of problem instances with increasing difficulty. When the solver encounters difficulty, the difficulty is lowered, and when the solver successfully solves, the difficulty is increased, maintaining an optimal learning path. The solver, implemented as a code-generation model that generates Python decision tree scripts, gradually acquires the skills necessary for complex decision-making tasks. Experimental results demonstrate that the proposed method significantly improves task success rates and solution efficiency compared to existing direct solution methods.

Takeaways, Limitations

Takeaways:
We demonstrate that LLM-based curriculum learning has great potential for improving automated reasoning in complex real-world problems.
EvoCurr presents an effective method to significantly improve the performance of LLM in solving complex problems.
We empirically confirmed that curriculum learning through gradual difficulty adjustment can improve the learning efficiency of LLM.
Limitations:
Currently, it is specialized for generating Python decision tree scripts and may be difficult to apply to solving other types of problems.
Since the performance of the Curriculum Generation LLM affects the performance of the entire system, it is necessary to improve the performance of the Curriculum Generation LLM.
Further research is needed on generalization performance for problems of varying complexity.
👍