1.
Murugadoss B, Poelitz C, Drosos I, Le V, McKenna N, Negreanu CS, et al. Evaluating the Evaluator: Measuring LLMs’ Adherence to Task Evaluation Instructions. AAAI [Internet]. 2025 Apr. 11 [cited 2026 May 30];39(18):19589-97. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/34157