[1]
D. Jiang, J. Zhang, O. Weller, N. Weir, B. Van Durme, and D. Khashabi, “SELF-[IN]CORRECT: LLMs Struggle with Discriminating Self-Generated Responses”, AAAI, vol. 39, no. 23, pp. 24266–24275, Apr. 2025.