Confidence Estimation for Text-to-SQL in Large Language Models

Sepideh Entezari Maleki; Mohammadreza Pourreza; Davood Rafiei

doi:10.1609/aaai.v40i38.40523

Confidence Estimation for Text-to-SQL in Large Language Models

Authors

Sepideh Entezari Maleki University of Alberta
Mohammadreza Pourreza University of Alberta
Davood Rafiei University of Alberta

DOI:

https://doi.org/10.1609/aaai.v40i38.40523

Abstract

Confidence estimation for text-to-SQL aims to assess the reliability of model-generated SQL queries without having access to gold answers. We study this problem in the context of large language models (LLMs), where access to model weights and gradients is often constrained. We explore both black-box and white-box confidence estimation strategies, evaluating their effectiveness on cross-domain text-to-SQL benchmarks. Our evaluation highlights the superior performance of consistency-based methods among black-box models and the advantage of SQL-syntax-aware approaches for interpreting LLM logits in white-box settings. Furthermore, we show that execution-based grounding of queries provides a valuable supplementary signal, improving the effectiveness of both approaches.

AAAI-26 / IAAI-26 / EAAI-26 Proceedings Cover

Downloads

Published

2026-03-14

How to Cite

Maleki, S. E., Pourreza, M., & Rafiei, D. (2026). Confidence Estimation for Text-to-SQL in Large Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 40(38), 32474–32482. https://doi.org/10.1609/aaai.v40i38.40523

Download Citation

Issue

Vol. 40 No. 38: AAAI-26 Technical Tracks 38

Section

AAAI Technical Track on Natural Language Processing III

Confidence Estimation for Text-to-SQL in Large Language Models

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information