Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Automatic Curriculum Design for Zero-Shot Human-AI Coordination

Created by
  • Haebom

Author

Won-Sang You, Tae-Gwan Ha, Seo-Young Lee, Kyung-Joong Kim

Outline

This paper addresses the problem of zero-shot human-AI coordination. Unlike previous studies that focused on improving the cooperative ability of ego-agents in specific environments, this paper aims to address the problem of generalization to unknown environments by considering unpredictable environmental changes and differences in collaborator abilities across environments. We extend the multi-agent Unsupervised Environment Design (UED) approach to zero-shot human-AI cooperation, proposing a novel utility function and collaborator sampling technique. Evaluation results using human proxy agents and real humans in an overcooked-AI environment demonstrate that the proposed method outperforms existing models and achieves high human-AI cooperation performance even in unknown environments.

Takeaways, Limitations

Takeaways:
Presenting a novel approach to the zero-shot human-AI collaboration problem and achieving superior performance over existing methods.
Improved generalization ability across diverse environments and varying collaborator capabilities.
Demonstrating that effective human-AI collaboration is possible even in unknown environments.
Presentation of experimental results using actual human participation.
Ensuring reproducibility through open source code disclosure.
Limitations:
Experimental results limited to the Overcooked-AI environment. Further research is needed to determine generalizability to other environments.
Limitations of human proxy agents. They may not fully reflect the complexities of real-world human interactions.
Further research is needed on optimization of utility functions and collaborator sampling techniques.
👍