Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Communicating Plans, Not Percepts: Scalable Multi-Agent Coordination with Embodied World Models

Created by
  • Haebom

Author

Brennen A. Hill, Mant Koh En Wei, Thangavel Jishnuanandh

Outline

This paper explores robust coordination for effective decision-making in multi-agent systems in partially observed environments. Specifically, we address the question of whether to directly engineer communication protocols or to learn them end-to-end. We compare two communication strategies for a collaborative task assignment problem. The first is Learned Direct Communication (LDC), an end-to-end learning approach where agents simultaneously generate messages and actions. The second is Imagined Trajectory Generation Module (ITGM), an intention communication approach that uses a compact learned world model to simulate future states and summarize them for communication. Experiments on goal-directed interactions in a grid-world environment demonstrate that while LDC is feasible in simple environments, a world-model-based approach demonstrates superior performance, sample efficiency, and scalability as complexity increases.

Takeaways, Limitations

Takeaways:
Structured predictive models can be integrated into MARL agents to enable proactive and goal-oriented adjustments.
An engineered, world model-based approach outperforms as complexity increases.
Limitations:
Only experiments were conducted in a simple grid world environment.
Details on the design and training of a world model such as ITGM are not provided.
Further research is needed to determine generalizability to other complex collaborative tasks.
👍