Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Collaborative Chain-of-Agents for Parametric-Retrieved Knowledge Synergy

Created by
  • Haebom

Author

Yi Jiang, Sendong Zhao, Jianbo Li, Haochun Wang, Lizhe Zhang, Yan Liu, Bing Qin

Outline

This paper proposes the Collaborative Chain-of-Agents (CoCoA) framework to address the Limitations challenge of Retrieval Augmented Generation (RAG), a promising framework for improving the performance of large-scale language models (LLMs) in knowledge-intensive tasks. Existing RAG methods fail to fully leverage the synergy between the model's internal parameter knowledge and external retrieval knowledge. CoCoA overcomes this challenge through a multi-agent approach. First, we present CoCoA-zero, which performs inference after conditional knowledge induction. Based on this, we develop CoCoA, which fine-tunes the LLM by synthesizing an extended multi-agent inference path. Experimental results demonstrate that CoCoA-zero and CoCoA achieve superior performance on open-domain and multi-step question-answering tasks.

Takeaways, Limitations

Takeaways:
We present a novel RAG framework (CoCoA) that explicitly enhances the synergy between internal and external search knowledge in LLM.
A multi-agent approach presents the potential for more accurate and efficient knowledge utilization.
It outperforms existing RAG methods in open-domain and multi-step question-answering tasks.
Presenting an effective model learning strategy through the step-by-step development of CoCoA-zero and CoCoA.
Limitations:
Further research is needed to evaluate the generalization performance of the proposed framework and its applicability to various tasks.
Lack of analysis of CoCoA's computational cost and training time.
Only performance evaluation results for a specific dataset are presented, so generalizability to other datasets needs to be verified.
👍