Zhou, T., Medina, J., & Chawla, S. (2026). Can LLMs Detect Their Confabulations? Estimating Reliability in Uncertainty-Aware Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 40(44), 38164–38172. https://doi.org/10.1609/aaai.v40i44.41155