Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Memp: Exploring Agent Procedural Memory

Created by
  • Haebom

Author

Runnan Fang, Yuan Liang, Xiaobin Wang, Jialong Wu, Shuofei Qiao, Pengjun Xie, Fei Huang, Huajun Chen, Ningyu Zhang

Outline

This paper explores strategies for imbuing learnable, updatable, and lifelong procedural memory to address the fragile procedural memory problem of large-scale language model (LLM)-based agents. We propose a novel method, Memp, which extracts the agent's past trajectories into fine-grained step-by-step instructions and high-level script-like abstractions. We explore the impact of various strategies for building, retrieving, and updating procedural memory, and construct a memory repository that evolves with new experiences through a dynamic system that continuously updates, modifies, and discards its contents. Experimental results on TravelPlanner and ALFWorld demonstrate that as the memory repository is refined, the agent's success rate and efficiency on similar tasks steadily improve. Moreover, procedural memory built on a strong model retains its value, leading to significant performance improvements even when migrating to a weaker model.

Takeaways, Limitations

Takeaways:
Presenting an effective solution to the procedural memory problem for LLM-based agents.
Demonstrating the feasibility of implementing learnable, updatable, and lifelong procedural memories.
Suggesting the possibility of improving the performance of weak models through transfer of procedural memory learned from strong models.
Improving agent success rate and efficiency through the Memp method.
Limitations:
Further research is needed on the generalization performance of the proposed method.
Applicability to various tasks and environments needs to be evaluated.
Efficiency analysis of the size and management of memory storage is required.
Performance evaluation and robustness analysis in real complex environments are required.
👍