Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining

Created by
  • Haebom

Author

Crystal Qian, Kehang Zhu, John Horton, Benjamin S. Manning, Vivian Tsai, James Wexler, Nithum Thain

Outline

This study emphasizes the importance of evaluating LLM's performance and interactions in dynamic, multi-agent environments, given its application in diverse human activities such as business negotiation and group collaboration. Unlike existing statistical agents that excel in well-defined environments, LLM can generalize across a variety of real-world scenarios. This study compares and analyzes the outcomes and behavioral dynamics of humans (N=216), LLM (GPT-4o, Gemini 1.5 Pro), and Bayesian agents in a dynamic negotiation environment. Bayesian agents generate high surplus value through aggressive optimization but suffer from frequent deal rejections. While humans and LLMs achieve similar total surplus value, LLMs prefer conservative and concessional trades, while humans exhibit more strategic, risk-taking, and fairness-seeking behavior. Therefore, performance equivalence can mask fundamental differences in process and alignment, which is crucial for applying LLM to real-world collaborative tasks. This study establishes a baseline behavioral baseline under consistent conditions, laying the foundation for future research.

Takeaways, Limitations

Takeaways:
LLM can achieve total surplus value similar to humans.
Performance equivalence may overlook differences in the behavioral characteristics of LLMs.
LLMs tend to be conservative and prefer compromise deals.
This study provides a baseline for action research in LLM.
Limitations:
This study is limited to a specific negotiation environment.
It does not reflect all the variability in the real environment.
It is aimed at a limited number of LLMs and human groups.
👍