Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

EmbodiedAgent: A Scalable Hierarchical Approach to Overcome Practical Challenge in Multi-Robot Control

Created by
  • Haebom

Author

Hanwen Wan, Yifei Chen, Yixuan Deng, Zeyu Wei, Dongrui Li, Zexin Lin, Donghao Wu, Jiu Cheng, Xiaoqiang Ji

Outline

This paper introduces EmbodiedAgent, a hierarchical framework for heterogeneous multi-robot control. To address the hallucination problem arising from unrealistic tasks, EmbodiedAgent integrates a next-action prediction paradigm and a structured memory system to decompose tasks into executable robot actions and dynamically validate actions based on environmental constraints. Furthermore, we present the MultiPlan+ dataset, which contains over 18,000 annotated planning instances across 100 scenarios, including a subset of unrealistic cases to mitigate the hallucination problem. To evaluate performance, we propose the Robot Planning Assessment Schema (RPAS), which combines automated metrics with LLM-assisted expert evaluation. Experimental results demonstrate that EmbodiedAgent outperforms state-of-the-art models, achieving an RPAS score of 71.85%. Real-world validation on an office service task highlights the ability of EmbodiedAgent to coordinate heterogeneous robots toward long-term goals.

Takeaways, Limitations

Takeaways:
An Effective Hierarchical Framework for Heterogeneous Multi-Robot Control
A novel approach to alleviate hallucination problems (utilizing predictive and structured memory systems)
Release of MultiPlan+, a large-scale dataset covering a variety of scenarios.
Proposing a new evaluation standard, RPAS, combining automation and expert evaluation.
Validation of practicality through performance verification in real-world tasks
Limitations:
Further research is needed to determine the versatility and generalizability of the MultiPlan+ dataset.
Further review is needed on the objectivity and reliability of RPAS evaluation criteria.
Performance evaluation in more complex and diverse environments is needed.
Further research is needed to address unexpected issues that may arise in real-world applications.
👍