Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

End-to-End Text-to-SQL with Dataset Selection: Leveraging LLMs for Adaptive Query Generation

Created by
  • Haebom

Author

Anurag Tripathi, Vaibhav Patle, Abhinav Jain, Ayush Pundir, Sairam Menon, Ajeet Kumar Singh, Dorien Herremans

Outline

This paper proposes a three-stage, end-to-end text-to-SQL framework that first identifies databases that match a user's query intent in a large-scale database environment. While existing text-to-SQL approaches directly translate natural language queries into SQL commands based on a specific database, our framework leverages LLM and prompt engineering to extract a set of rules from natural language queries. Based on these rules, we train a large-scale db_id prediction model, which incorporates a fine-tuned RoBERTa-based encoder, to predict the correct database identifier (db_id). Finally, we use a critique agent to correct errors in the generated SQL. Experimental results demonstrate that the proposed framework outperforms state-of-the-art models in terms of database intent prediction and SQL generation accuracy.

Takeaways, Limitations

Takeaways:
Provides an effective solution to text-to-SQL problems in environments containing multiple databases.
Improving the Accuracy of Database Identification and SQL Generation Using LLM and Prompt Engineering
Improving the accuracy of SQL generation results through the critique agent.
Addresses the need for database pre-specification in the existing text-to-SQL model, Limitations
Limitations:
The performance of the proposed framework may be affected by the characteristics of the LLM and database used.
Additional generalization performance evaluations are needed for various types of databases and complex queries.
Lack of detailed description of the design and learning process of the critique agent.
Further research is needed on scalability and efficiency in real-world environments.
👍