[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Understanding the Design Decisions of Retrieval-Augmented Generation Systems

Created by
  • Haebom

Author

Shengming Zhao, Yuchen Shao, Yuheng Huang, Jiayang Song, Zhijie Wang, Chengcheng Wan, Lei Ma

Outline

This paper presents the first comprehensive study of three key decisions for effective deployment of Retrieval-Augmented Generation (RAG): whether to deploy RAG, how much information to retrieve, and how to integrate the retrieved knowledge. Through systematic experiments on three LLMs and six datasets, we find that RAG deployment should be selective, that the optimal retrieval amount varies across tasks (5-10 documents for QA, while code generation requires scenario-specific optimization), and that knowledge integration effectiveness varies across task and model characteristics (code generation benefits greatly from prompting, while question answering sees only a small improvement). Therefore, we argue that a general-purpose RAG strategy is inadequate, and that context-aware design decisions that take into account task characteristics and model capabilities are needed.

Takeaways, Limitations

Takeaways:
RAG distribution should be selective, and shows that simply retrieving a lot of information is not effective.
The optimal amount of information to search depends on the type of task (QA vs. code generation).
Knowledge integration methods should be optimized according to task and model characteristics.
Emphasizes the importance of situation-aware RAG design considering task characteristics and model capabilities.
Provides practitioners with evidence-based RAG deployment guidelines.
Limitations:
Further research on generalization is needed due to limitations in the LLM, dataset, and task types used in this study.
Further research is needed to find RAG strategies that are optimized for specific tasks and models.
Research on more diverse knowledge integration methodologies is needed.
👍