Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

CortexDebate: Debating Sparsely and Equally for Multi-Agent Debate

Created by
  • Haebom

Author

Yiliu Sun, Zicheng Zhao, Sheng Wan, Chen Gong

Outline

In this paper, we propose CortexDebate, an improved version of the multi-agent debate (MAD) method to address the problems of hallucination and inference insufficiency of a single large-scale language model (LLM). To address the excessive input context and overconfidence problems of the existing MAD, CortexDebate uses the McKinsey-based Debate Matter (MDM) module, which acts like the white matter of the brain, to build a sparse and dynamically optimized debate graph among LLM agents. MDM integrates the McKinsey trust formula, a trustworthiness measure in sociology, to guide graph optimization through reliable evaluation. We demonstrate the effectiveness of CortexDebate through extensive experiments on eight datasets and four types of tasks.

Takeaways, Limitations

Takeaways:
We present an effective multi-agent argumentation (MAD) method to overcome the limitations of single LLM.
Solving the problems of excessive input context and overconfidence in existing MADs.
Efficient argument graph construction and optimization via MDM module mimicking white matter of the brain.
Trustworthy agent evaluation and graph management using the McKinsey Trust Formula.
Experimental validation of the effectiveness on various datasets and task types.
Limitations:
Potential sociological bias due to MDM module's dependence on McKinsey's trust formula.
Further research is needed on the scalability of the proposed method and its generalization performance to complex tasks.
The diversity and representativeness of experimental datasets needs to be reviewed.
👍