Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

CooT: Learning to Coordinate In-Context with Coordination Transformers

Created by
  • Haebom

Author

Huai-Chih Wang, Hsiang-Chun Chuang, Hsi-Chun Cheng, Dai-Jie Wu, Shao-Hua Sun

Outline

In this paper, we propose Coordination Transformers (CooT), a novel framework for effective cooperation among multiple artificial agents in dynamic and uncertain environments. Unlike existing self-matching learning or group-based methods, CooT is a context-based cooperation framework that rapidly adapts to new partners by leveraging recent interaction history. Using interaction paths of diverse agent pairs as training data, it rapidly learns effective cooperation strategies without explicit supervision or fine-tuning. Evaluation results on the overcooked benchmark show that CooT significantly outperforms existing methods in cooperation tasks with previously unseen partners. Human evaluations also confirm CooT as the most effective cooperation partner, and extensive ablation studies highlight the robustness, flexibility, and context-sensitivity of CooT in multi-agent scenarios.

Takeaways, Limitations

Takeaways:
We overcome the limitations of existing methods by presenting CooT, an in-context collaboration framework that quickly adapts to new partners.
Effective collaborative strategies can be learned without explicit supervision or fine-tuning.
Overcooked benchmarks and human evaluations demonstrate superior performance compared to existing methods.
Demonstrates robustness, flexibility, and context sensitivity in multi-agent scenarios.
Limitations:
Additional validation of generalization performance in environments other than the Overcooked benchmark presented in this paper is needed.
Lack of detailed description of how interaction trajectory data of different agent pairs used in CooT's training process were collected and organized.
Further research is needed on the applicability and performance limitations of CooT to very complex environments or interactions.
👍