Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

RoboMemory: A Brain-inspired Multi-memory Agentic Framework for Lifelong Learning in Physical Embodied Systems

Created by
  • Haebom

Author

Mingcong Lei, Honghao Cai, Binbin Que, Zezhou Cui, Liangchen Tan, Junkun Hong, Gehan Hu, Shuangyu Zhu, Yimou Wu, Shaohan Jiang, Ge Wang, Zhen Li, Shuguang Cui, Yiming Zhao, Yatong Han

Outline

RoboMemory is a brain-inspired multi-memory framework for lifelong learning in physical systems. It addresses critical challenges such as continuous learning in real-world environments, multi-module memory latency, capturing task correlations, and mitigating infinite loops in closed-loop planning. Drawing on cognitive neuroscience, it integrates four core modules: an information preprocessor (thalamus-like), a lifelong embodied memory system (hippocampus-like), a closed-loop planning module (prefrontal cortex-like), and a low-level executor (cerebellum-like), enabling long-term planning and cumulative learning. The lifelong embodied memory system, at the heart of the framework, mitigates the inference speed issues of complex memory frameworks by parallelizing updates and retrievals across spatial, temporal, episodic, and semantic submodules. It integrates a dynamic knowledge graph (KG) and a consistent architectural design to enhance memory consistency and scalability. Evaluation results on EmbodiedBench demonstrate that RoboMemory achieves a new state-of-the-art (SOTA) benchmark, outperforming the open-source benchmark (Qwen2.5-VL-72B-Ins) by 25% on average and the closed-source state-of-the-art (SOTA) benchmark (Claude3.5-Sonnet) by 5%. Elimination studies validate core components (criticism, spatial memory, and long-term memory), and real-world deployments demonstrate significant improvements in success rates for repeated tasks, confirming its lifelong learning capabilities. RoboMemory mitigates high latency issues through scalability and serves as a baseline for integrating multi-mode memory systems into physical robots.

Takeaways, Limitations

Takeaways:
Solving lifelong learning problems in real-world environments with a brain-inspired multi-memory framework.
Alleviating inference speed issues in complex memory frameworks through parallelized memory updates/searches.
Improving memory consistency and scalability using dynamic knowledge graphs (KGs).
Achieved remarkable performance improvement compared to existing best-in-class performance benchmarks in EmbodiedBench evaluations (25% compared to open source, 5% compared to closed-source SOTA).
Validating lifelong learning capabilities and improving success rates through real-world deployments.
Provides a basic reference for integrating multimodal memory systems in physical robots.
Limitations:
The paper does not explicitly mention the specific Limitations. Further research is needed to analyze potential issues that may arise in real-world applications (e.g., handling unexpected situations, the robot's physical limitations, etc.).
Since only performance evaluation results in a specific environment (EmbodiedBench) are presented, further verification of generalizability to other environments or tasks is required.
Lack of detailed information on the closed-source model used in the comparison with SOTA.
👍