Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Position: Simulating Society Requires Simulating Thought

Created by
  • Haebom

Author

Chance Jiajie Li, Jiayi Wu, Zhenze Mo, Ao Qu, Yuhan Tang, Kaiya Ivy Zhao, Yulu Gan, Jie Fan, Jiangbo Yu, Jinhua Zhao, Paul Liang, Luis Alonso, Kent Larson

Outline

Social simulations utilizing Large-Scale Language Models (LLMs) require more than just plausible behavior generation; they also require structured, modifiable, and traceable cognitive reasoning. LLM-based agents are often used to simulate individual and collective behaviors through prompting and guided fine-tuning. However, they lack internal consistency, causal inference, and belief traceability, making them unreliable in simulating how people reason, deliberate, and respond to interventions. To address this, this paper presents Generative Minds (GenMinds), a conceptual modeling paradigm inspired by cognitive science that supports structured belief representations in generative agents. Furthermore, to evaluate these agents, we introduce the REconstructing CAusal Paths (RECAP) framework, which assesses inference fidelity through causal traceability, demographic evidence, and intervention consistency. This study presents a broad shift from simple imitation to generative agents that simulate not only language but also thought for social simulation.

Takeaways, Limitations

Takeaways:
A new conceptual modeling paradigm (GenMinds) for improving the reliability of social simulations of LLM-based agents is presented.
Introducing a new benchmark (RECAP) to assess causality, demographic basis, and intervention consistency.
In social simulation, we present a paradigm shift from superficial imitation to thought simulation.
Limitations:
Lack of detailed information about the specific GenMinds implementation and performance.
Limitations of the RECAP framework's broad application to real-world social simulations.
Lack of research on the generalizability of the model and comparison with other LLM-based agents.
👍