Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Enabling Rapid Shared Human-AI Mental Model Alignment via the After-Action Review

Created by
  • Haebom

Author

Edward Gu, Ho Chit Siu, Melanie Platt, Isabelle Hurley, Jaime Pe na, Rohan Paleja

Outline

In this paper, we present two novel contributions to improve human-machine collaboration (HMT) research: 1) a Minecraft testbed to accelerate the testing and deployment of collaborative AI agents, and 2) a tool for replaying and analyzing behavior within HMT episodes to facilitate the development of shared mental models. The browser-based Minecraft testbed enables rapid testing of collaborative agents in a continuous spatial, real-time, and partially observable environment without the cumbersome setup typically associated with human-AI interaction user research. Furthermore, Minecraft's extensive user base and rich ecosystem of pre-built AI agents can rapidly facilitate the design of novel collaborative agents and research into understanding various human factors within HMT. The mental model alignment tool supports user-directed post-mortem analysis, including video displays of first-person perspectives of team members (i.e., humans and AI), and provides a chat interface leveraging GPT-4 to answer various queries about the AI's experience and model details.

Takeaways, Limitations

Takeaways:
Providing a fast and easy testing environment (Minecraft testbed) for research on collaborative AI agents.
Support for the development of shared mental models through a tool for HMT episode analysis (mental model alignment tool).
Facilitating research by leveraging a large user base and rich AI agent ecosystem.
Provide answers to AI experience and model-related queries through a chat interface leveraging GPT-4.
Limitations:
The specific Limitations is not stated in the paper.
👍