[1]

Banerjee, S. et al. 2025. How (Un)ethical Are Instruction-Centric Responses of LLMs? Unveiling the Vulnerabilities of Safety Guardrails to Harmful Queries. Proceedings of the International AAAI Conference on Web and Social Media. 19, 1 (Jun. 2025), 193–205. DOI:https://doi.org/10.1609/icwsm.v19i1.35811.