Zhu, L., Ning, R., Li, J., Xin, C., & Wu, H. (2024). SEER: Backdoor Detection for Vision-Language Models through Searching Target Text and Image Trigger Jointly. Proceedings of the AAAI Conference on Artificial Intelligence, 38(7), 7766–7774. https://doi.org/10.1609/aaai.v38i7.28611