Lyu, Qing, Kumar Shridhar, Chaitanya Malaviya, Li Zhang, Yanai Elazar, Niket Tandon, Marianna Apidianaki, Mrinmaya Sachan, and Chris Callison-Burch. “Calibrating Large Language Models With Sample Consistency”. Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 18 (April 11, 2025): 19260–19268. Accessed May 25, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/34120.