CL-Guard: Defending DNNs Against Backdoors via Fine-Grained Neuron Analysis and Collaborative Dual-Network Learning

Authors

  • Jie Xiao College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China Zhejiang Key Laboratory of Visual Information Intelligent Processing, Hangzhou 310023, China
  • Yuhao Huang College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China Zhejiang Key Laboratory of Visual Information Intelligent Processing, Hangzhou 310023, China
  • Yanjiao Gao College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China Zhejiang Key Laboratory of Visual Information Intelligent Processing, Hangzhou 310023, China
  • Aizhu Liu College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China Zhejiang Key Laboratory of Visual Information Intelligent Processing, Hangzhou 310023, China
  • Zhezhao Yang College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China Zhejiang Key Laboratory of Visual Information Intelligent Processing, Hangzhou 310023, China
  • Xinyue Yu College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China Zhejiang Key Laboratory of Visual Information Intelligent Processing, Hangzhou 310023, China
  • Qianwei Zhou College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China
  • Fan Terry Zhang College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China

DOI:

https://doi.org/10.1609/aaai.v40i42.40904

Abstract

Backdoor attacks on deep neural networks (DNNs) have garnered significant attention, particularly in edge computing applications. Given the complexity and opacity of DNNs, defending against backdoor attacks remains a formidable challenge. To address this, we propose CL-Guard, a dual-network-based defense framework designed to effectively eliminate potential backdoors in models. First, it leverages an inter-layer backpropagation algorithm to quantify each neuron's contribution to model prediction. Next, it constructs a critical neuron set through a recursive hierarchical partitioning method and an adaptive search strategy, identifying neurons critical to model prediction while minimizing the inclusion of backdoor-related neurons. Then, we perform sparse training on the non-critical neuron set, effectively strengthening the weights of critical neurons while disrupting the association between trigger features and backdoor-related neurons. Finally, we design a dual-network architecture that incorporates a fine-grained gradient backpropagation mechanism and dynamic collaborative learning, enabling the model to retain its original accuracy while preventing backdoor reactivation. The experimental results indicate that CL-Guard achieves an average Security Effectiveness Index (SEI) of approximately 95.42%, representing a 21.23% improvement over the state-of-the-art FT-SAM method.

Downloads

Published

2026-03-14

How to Cite

Xiao, J., Huang, Y., Gao, Y., Liu, A., Yang, Z., Yu, X., … Zhang, F. T. (2026). CL-Guard: Defending DNNs Against Backdoors via Fine-Grained Neuron Analysis and Collaborative Dual-Network Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(42), 35894–35902. https://doi.org/10.1609/aaai.v40i42.40904

Issue

Section

AAAI Technical Track on Philosophy and Ethics of AI