Kumbam, Pranath Reddy, Sohaib Uddin Syed, Prashanth Thamminedi, Suhas Harish, Ian Perera, and Bonnie J Dorr. “Exploiting Explainability to Design Adversarial Attacks and Evaluate Attack Resilience in Hate-Speech Detection Models”. Proceedings of the International AAAI Conference on Web and Social Media 19, no. 1 (June 7, 2025): 1038–1050. Accessed May 29, 2026. https://ojs.aaai.org/index.php/ICWSM/article/view/35859.