Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Feather-SQL: A Lightweight NL2SQL Framework with Dual-Model Collaboration Paradigm for Small Language Models

Created by
  • Haebom

Author

Wenqi Pei, Hailing Xu, Hengyuan Zhao, Shizheng Hou, Han Chen, Zining Zhang, Pingyi Luo, Bingsheng He

Outline

While large-scale language models (LLMs) have made significant progress in natural language to SQL (NL2SQL) translation, their reliance on closed systems and high computational resources poses challenges for data privacy and distribution. Small-scale language models (SLMs), on the other hand, struggle with NL2SQL tasks due to poor performance and incompatibility with existing frameworks. In this paper, we present Feather-SQL, a lightweight framework tailored for SLM. Feather-SQL improves SQL executable feasibility and accuracy through 1) schema pruning and concatenation, and 2) multi-path and multi-candidate generation. Furthermore, we propose a 1+1 model collaboration paradigm that pairs a powerful general-purpose chat model with a fine-tuned SQL expert, combining powerful analytic inference with high-precision SQL generation. BIRD experiments demonstrate that Feather-SQL improves the NL2SQL performance of SLM by approximately 10% without any fine-tuning. The proposed paradigm demonstrates its effectiveness by increasing the accuracy bound of SLM by 54.76%.

Takeaways, Limitations

Takeaways:
Feather-SQL, a lightweight framework for improving the performance of NL2SQL operations using SLM, is presented.
Improved SQL executable and accuracy through schema pruning and concatenation, multi-path, and multi-candidate generation.
Improving SLM accuracy through the 1+1 model collaboration paradigm (54.76% achieved)
Contribute to solving data privacy and distribution issues
Limitations:
The generalization performance of the proposed framework and its performance evaluation on other datasets are needed.
Further analysis is needed on the computational costs and efficiency of the 1+1 model collaboration paradigm.
Performance verification on datasets other than the BIRD dataset is required.
👍