[1]

Wang, Z. et al. 2026. ConfGuard: A Simple and Effective Backdoor Detection for Large Language Models. Proceedings of the AAAI Conference on Artificial Intelligence. 40, 42 (Mar. 2026), 35829–35837. DOI:https://doi.org/10.1609/aaai.v40i42.40897.