(1)
Jiang, D.; Zhang, J.; Weller, O.; Weir, N.; Van Durme, B.; Khashabi, D. SELF-[IN]CORRECT: LLMs Struggle With Discriminating Self-Generated Responses. AAAI 2025, 39, 24266-24275.