CHASE: Contextual History for Adaptive and Simple Exploitation in Large Language Model Jailbreaking
DOI:
https://doi.org/10.1609/aaai.v40i1.36996Abstract
We propose Contextual History for Adaptive and Simple Exploitation (CHASE), a novel multi-turn method for Large Language Model (LLM) jailbreaking. Rather than directly attack an LLM that may be difficult to jailbreak, CHASE first collects jailbroken histories from an easy-to-jailbreak LLM and then transfers them to the target LLM. Through this history transfer process, CHASE misleads the target LLM into thinking that it is responsible for producing the jailbroken histories and increases the chances of successful jailbreaking by prompting it to continue the conversation. Extensive evaluations on mainstream LLMs show that CHASE consistently achieves higher attack success rates and demands fewer computational resources compared to existing methods.Downloads
Published
2026-03-14
How to Cite
Hao, Z., Li, C., Fan, Y., Cai, J., Fu, X., Wang, S., Shen, H., Yin, J., Ge, J., Luo, B., & Ng, V. (2026). CHASE: Contextual History for Adaptive and Simple Exploitation in Large Language Model Jailbreaking. Proceedings of the AAAI Conference on Artificial Intelligence, 40(1), 345-353. https://doi.org/10.1609/aaai.v40i1.36996
Issue
Section
AAAI Technical Track on Application Domains I