Murugadoss, B. (2025) “Evaluating the Evaluator: Measuring LLMs’ Adherence to Task Evaluation Instructions”, Proceedings of the AAAI Conference on Artificial Intelligence, 39(18), pp. 19589–19597. doi: 10.1609/aaai.v39i18.34157.