NÖTHER, Jonathan; SINGLA, Adish; RADANOVIC, Goran. Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 39, n. 26, p. 27547–27555, 2025. DOI: 10.1609/aaai.v39i26.34967. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/34967. Acesso em: 11 may. 2026.