Schmer-Galunder, Sonja, Ruta Wheelock, Zaria Jalan, Alyssa Chvasta, Scott Friedman, and Emily Saltz. “Annotator in the Loop: A Case Study of In-Depth Rater Engagement to Create a Prosocial Benchmark Dataset.” Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society 7, no. 1 (October 16, 2024): 1319–1328. Accessed May 14, 2026. https://ojs.aaai.org/index.php/AIES/article/view/31726.