Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Hybrid AI for Responsive Multi-Turn Online Conversations with Novel Dynamic Routing and Feedback Adaptation

Created by
  • Haebom

Author

Priyaranjan Pattnayak, Amit Agarwal, Hansa Meghwani, Hitesh Laxmichand Patel, Srikant Panda

Outline

In this paper, we present a novel hybrid framework that integrates Retrieval-Augmented Generation (RAG) and intent-based structured answers to address the challenges of enterprise-scale deployment of large-scale language model (LLM)-based chatbots, such as diverse user questions, high latency, hallucinations, and difficulties in integrating frequently updated domain-specific knowledge. The framework dynamically routes complex or ambiguous questions to the RAG pipeline while leveraging predefined high-confidence answers for efficiency. It uses a conversational context manager to maintain consistency across turn-heavy interactions, integrates feedback loops to improve intent, dynamically adjusts confidence thresholds, and expands response ranges over time. Experimental results show that the proposed framework achieves high accuracy (95%) and low latency (180ms), outperforming RAG and intent-based systems across a wide range of question types, and is positioned as a scalable and adaptable solution for enterprise-scale conversational AI applications.

Takeaways, Limitations

Takeaways:
We present a novel hybrid framework that combines the advantages of RAG and intent-based structured answers to simultaneously improve accuracy and efficiency.
Providing scalable and adaptable solutions for enterprise-grade conversational AI applications.
Achieve high accuracy (95%) and low latency (180ms) for a variety of question types.
Presents the potential for continuous performance improvement through a conversation context manager and feedback loop.
Limitations:
The performance evaluation of the proposed framework may be limited to a specific dataset.
Additional research is needed on application and performance evaluation in real enterprise environments.
Further analysis is needed on the accuracy of intent recognition and confidence threshold adjustment.
Further validation of generalizability across different domains and languages is needed.
👍