(1)
Wang, Z.; Zhang, R.; Li, H.; Fan, W.; Jiang, W.; Zhao, Q.; Xu, G. ConfGuard: A Simple and Effective Backdoor Detection for Large Language Models. AAAI 2026, 40, 35829-35837.