Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

From Query to Explanation: Uni-RAG for Multi-Modal Retrieval-Augmented Learning in STEM

Created by
  • Haebom

Author

Xinyi Wu, Yanhao Jia, Luwei Xiao, Shuai Zhao, Fengkuang Chiang, Erik Cambria

Outline

This paper focuses on providing effective and accessible learning experiences by interpreting various question types in AI-based education. To overcome the limitations of existing systems, this paper presents Uni-Retrieval, a lightweight multi-modal retrieval module. Uni-Retrieval extracts question type prototypes and dynamically matches them with tokens in a continuously updated Prompt Bank. The Prompt Bank, which encrypts and stores domain-specific knowledge using the MoE-LoRA module, is used to enhance the ability of Uni-Retrieval to adapt to unseen question types at test time. By integrating Uni-Retrieval with an instruction-tuned language model, we build a complete retrieval-augmented generation pipeline called Uni-RAG, which retrieves relevant training materials and generates explanations, feedback, or training content that aligns with learning objectives given style-conditional questions. Experimental results on SER and other multi-modal benchmarks demonstrate that Uni-RAG outperforms baseline retrieval and RAG systems in both retrieval accuracy and generation quality while maintaining low computational cost. This study provides a scalable, pedagogically informed, intelligent teaching system that links discovery and creation to provide personalized, explainable, and efficient learning support in a variety of STEM scenarios.

Takeaways, Limitations

Takeaways:
We present Uni-RAG, a lightweight multi-modal search and generation pipeline that effectively handles a wide range of question types.
Improved adaptability to unseen query types through Prompt Bank using MoE-LoRA.
Achieve improved search accuracy and generation quality while reducing computational costs.
Presenting the possibility of implementing an intelligent education system that is personalized, explainable, and efficient.
Limitations:
Absence of a concrete strategy for ongoing updates and management of Prompt Bank.
Further research is needed on the applicability and generalization performance to other fields outside of various STEM scenarios.
Further analysis is needed on the efficiency and scalability of MoE-LoRA modules.
Lack of long-term performance evaluation and user feedback analysis in real-world training environments.
👍