[1]

Kumbam, P.R. et al. 2025. Exploiting Explainability to Design Adversarial Attacks and Evaluate Attack Resilience in Hate-Speech Detection Models. Proceedings of the International AAAI Conference on Web and Social Media. 19, 1 (Jun. 2025), 1038–1050. DOI:https://doi.org/10.1609/icwsm.v19i1.35859.