Cavalin, P., Domingues, P. H. and Pinhanez, C. (2025) “Sentence-level Aggregation of Lexical Metrics Correlates Stronger with Human Judgements than Corpus-level Aggregation”, Proceedings of the AAAI Conference on Artificial Intelligence, 39(22), pp. 23532–23540. doi: 10.1609/aaai.v39i22.34522.