CL-Guard: Defending DNNs Against Backdoors via Fine-Grained Neuron Analysis and Collaborative Dual-Network Learning

Jie Xiao; Yuhao Huang; Yanjiao Gao; Aizhu Liu; Zhezhao Yang; Xinyue Yu; Qianwei Zhou; Fan Terry Zhang

doi:10.1609/aaai.v40i42.40904

Authors

Jie Xiao College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China Zhejiang Key Laboratory of Visual Information Intelligent Processing, Hangzhou 310023, China
Yuhao Huang College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China Zhejiang Key Laboratory of Visual Information Intelligent Processing, Hangzhou 310023, China
Yanjiao Gao College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China Zhejiang Key Laboratory of Visual Information Intelligent Processing, Hangzhou 310023, China
Aizhu Liu College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China Zhejiang Key Laboratory of Visual Information Intelligent Processing, Hangzhou 310023, China
Zhezhao Yang College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China Zhejiang Key Laboratory of Visual Information Intelligent Processing, Hangzhou 310023, China
Xinyue Yu College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China Zhejiang Key Laboratory of Visual Information Intelligent Processing, Hangzhou 310023, China
Qianwei Zhou College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China
Fan Terry Zhang College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China

DOI:

https://doi.org/10.1609/aaai.v40i42.40904

Abstract

Backdoor attacks on deep neural networks (DNNs) have garnered significant attention, particularly in edge computing applications. Given the complexity and opacity of DNNs, defending against backdoor attacks remains a formidable challenge. To address this, we propose CL-Guard, a dual-network-based defense framework designed to effectively eliminate potential backdoors in models. First, it leverages an inter-layer backpropagation algorithm to quantify each neuron's contribution to model prediction. Next, it constructs a critical neuron set through a recursive hierarchical partitioning method and an adaptive search strategy, identifying neurons critical to model prediction while minimizing the inclusion of backdoor-related neurons. Then, we perform sparse training on the non-critical neuron set, effectively strengthening the weights of critical neurons while disrupting the association between trigger features and backdoor-related neurons. Finally, we design a dual-network architecture that incorporates a fine-grained gradient backpropagation mechanism and dynamic collaborative learning, enabling the model to retain its original accuracy while preventing backdoor reactivation. The experimental results indicate that CL-Guard achieves an average Security Effectiveness Index (SEI) of approximately 95.42%, representing a 21.23% improvement over the state-of-the-art FT-SAM method.

CL-Guard: Defending DNNs Against Backdoors via Fine-Grained Neuron Analysis and Collaborative Dual-Network Learning

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information