Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

CoT-RAG: Integrating Chain of Thought and Retrieval-Augmented Generation to Enhance Reasoning in Large Language Models

Created by
  • Haebom

Author

Feiyang Li, Peng Fang, Zhan Shi, Arijit Khan, Fang Wang, Weihao Wang, Xin Zhang, Yongjian Cui

Outline

CoT-RAG is a novel inference framework that improves the performance of complex tasks for large-scale language models (LLMs). To address the limitations of existing Chain-of-Thought (CoT) inference, such as the unreliability of the inference and lower inference performance under code prompts than under natural language prompts, we present three key design features. First, we construct a CoT based on a knowledge graph, which enhances inference reliability by leveraging the knowledge graph to regulate the LLM's inference chain generation. Second, we integrate a learnable knowledge case-aware RAG into the knowledge graph, which retrieves relevant subcases and subexplanations to provide learnable information to the LLM. Third, we implement pseudoprogram prompting, which encourages the LLM to execute inference tasks as pseudoprograms, thereby enhancing logical rigor. Evaluation results on nine public datasets across three inference tasks demonstrate significant accuracy improvements of 4.0% to 44.3% over state-of-the-art methods. Testing on four domain-specific datasets also demonstrates superior accuracy and efficient execution, highlighting its practicality and scalability.

Takeaways, Limitations

Takeaways:
We improved the inference reliability and performance of LLM by leveraging knowledge graphs and RAG.
Increased logical rigor of reasoning through physician program prompting.
It has demonstrated practicality and scalability by achieving state-of-the-art performance on diverse datasets.
Limitations:
Further research may be needed to determine the generalization performance of the proposed method.
There may be bias towards certain domains.
The quality and size of the knowledge graph can affect performance.
👍