Daily Arxiv

This page curates AI-related papers published worldwide.
All summaries here are generated with Google Gemini, and the page is operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing

Created by
  • Haebom

Authors

Yizhang Zhu, Runzhi Jiang, Boyan Li, Nan Tang, Yuyu Luo

Outline

To address the high computational cost of LLM-based Text-to-SQL approaches, this paper proposes EllieSQL, a complexity-aware routing framework. EllieSQL estimates the complexity of each query and routes it to a suitable SQL generation pipeline, so that simple queries are handled by lightweight pipelines and only complex queries go to computationally intensive ones. To evaluate cost-effectiveness, the authors introduce the Token Elasticity of Performance (TEP) metric, which measures how much performance improves relative to the tokens invested. Experimental results show that EllieSQL reduces token usage by over 40% compared to existing approaches and more than doubles TEP, without degrading performance. The work contributes to cost-efficient Text-to-SQL development and highlights the importance of resource efficiency for sustainable Text-to-SQL research.
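The routing idea described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the three tier names, the keyword-based `estimate_complexity` heuristic, the `Pipeline` class, and the per-query token costs are all hypothetical placeholders chosen for the example. The summary does not say how the router works, and a real router would more likely be a trained classifier.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Pipeline:
    """A stand-in for one SQL generation pipeline (hypothetical)."""
    name: str
    tokens_per_query: int  # illustrative cost only, not a measured value

    def generate(self, question: str) -> str:
        # Placeholder: a real pipeline would call an LLM here.
        return f"-- [{self.name}] SQL for: {question}"


# Hypothetical tiers, ordered from cheapest to most expensive.
PIPELINES = {
    "basic": Pipeline("basic", 500),
    "intermediate": Pipeline("intermediate", 2000),
    "advanced": Pipeline("advanced", 8000),
}

# Toy keyword cues standing in for a learned complexity estimator.
HARD_CUES = {"each", "percentage", "ratio", "rank", "versus"}
MEDIUM_CUES = {"average", "total", "most", "least", "per", "group"}


def estimate_complexity(question: str) -> str:
    """Return the name of the tier a question is assigned to."""
    words = set(re.findall(r"[a-z]+", question.lower()))
    if words & HARD_CUES:
        return "advanced"
    if words & MEDIUM_CUES:
        return "intermediate"
    return "basic"


def route(question: str) -> Pipeline:
    """Pick the pipeline that matches the estimated complexity."""
    return PIPELINES[estimate_complexity(question)]


def total_tokens(questions, always_advanced=False):
    """Sum the illustrative token cost with or without routing."""
    if always_advanced:
        return len(questions) * PIPELINES["advanced"].tokens_per_query
    return sum(route(q).tokens_per_query for q in questions)
```

Calling `route(question).generate(question)` sends each question to the cheapest tier the heuristic considers sufficient, and `total_tokens` compares that against sending every question to the most expensive pipeline. The saving depends entirely on the made-up costs above and says nothing about the figures reported in the paper.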

Takeaways, Limitations

Takeaways:
Presents the EllieSQL framework, which addresses the high computational cost of LLM-based Text-to-SQL.
Demonstrates that complexity-aware routing can improve cost-effectiveness without sacrificing performance.
Introduces the TEP metric as a quantitative way to evaluate the cost-effectiveness of Text-to-SQL models.
Offers a new perspective for sustainable Text-to-SQL research.
Limitations:
EllieSQL's performance depends on the router it uses and on how that router estimates query complexity; further research may be needed in this area.
Experimental results are currently reported only for the BIRD development set, so generalizability to other datasets requires further verification.
Because the accuracy of complexity estimation directly affects EllieSQL's efficiency, more sophisticated complexity estimation techniques may be necessary.
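
To make the TEP metric mentioned above more concrete, the sketch below computes an elasticity-style ratio. The summary does not give the paper's exact formula, so this assumes the standard economic form of elasticity: the relative change in performance divided by the relative change in token usage, both measured against a baseline. The function name, its parameters, and the numbers in the example are hypothetical.

```python
def token_elasticity_of_performance(base_score, base_tokens,
                                    new_score, new_tokens):
    """Relative performance gain per unit of relative token increase.

    Assumed form (not confirmed by the source):
        ((new_score - base_score) / base_score)
        / ((new_tokens - base_tokens) / base_tokens)
    """
    if base_score == 0 or base_tokens == 0:
        raise ValueError("baseline score and token count must be non-zero")
    relative_gain = (new_score - base_score) / base_score
    relative_cost = (new_tokens - base_tokens) / base_tokens
    if relative_cost == 0:
        raise ValueError("token usage did not change; elasticity is undefined")
    return relative_gain / relative_cost


# Made-up numbers: accuracy rises from 0.50 to 0.60 (+20%)
# while token usage rises from 1000 to 3000 (+200%).
tep = token_elasticity_of_performance(0.50, 1000, 0.60, 3000)
print(round(tep, 3))  # about 0.1
```

Under this assumed definition, a higher value means each additional token buys more performance, which is the sense in which the summary reports TEP as improved.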