Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation

Created by
  • Haebom

Author

Zhengyang Li

Outline

LLM-MARL is an integrated framework that integrates large-scale language models (LLMs) with multi-agent reinforcement learning (MARL) to enhance coordination, communication, and generalization in simulated game environments. It features three modular components: a coordinator that dynamically generates subgoals, a communicator that facilitates symbolic inter-agent messaging, and a memory that supports episodic memory. Training combines PPO with language-conditional loss and LLM query gating. LLM-MARL has been evaluated on Google Research Football, MAgent Battle, and StarCraft II, consistently outperforming MAPPO and QMIX in win rate, coordination score, and zero-shot generalization. Ablation studies demonstrate that subgoal generation and language-based messaging significantly improve performance. Qualitative analysis reveals emergent behaviors such as role specialization and communication-based tactics. This research bridges language modeling and policy learning to design intelligent and cooperative agents in interactive simulations. This presents how LLM can be leveraged in multi-agent systems used for training, gaming, and human-AI collaboration.

Takeaways, Limitations

Takeaways:
We demonstrate that integrating LLM into MARL can improve the coordination, communication, and generalization performance of agents.
We found that subgoal generation and language-based messaging played a significant role in improving performance.
Improved zero-shot generalization performance suggests applicability in various environments.
Google Research has proven its effectiveness in various gaming environments, including Football, MAgent Battle, and StarCraft II.
Presenting new possibilities for human-AI collaboration and multi-agent system design.
Limitations:
The current research results are limited to a simulation environment. Scalability to real-world environments is required.
Consideration should be given to the increased computational cost and memory usage of LLM.
Further research is needed on the generalization ability of the LLM-MARL framework.
Analysis of parameters and structures optimized for specific game environments is required.
👍