Fusing Pruned and Backdoored Models: Optimal Transport-based Data-free Backdoor Mitigation

Authors

  • Weilin Lin, The Hong Kong University of Science and Technology (Guangzhou)
  • Li Liu, The Hong Kong University of Science and Technology (Guangzhou)
  • Jianze Li, Shenzhen Research Institute of Big Data; The Chinese University of Hong Kong, Shenzhen
  • Hui Xiong, The Hong Kong University of Science and Technology (Guangzhou)

DOI:

https://doi.org/10.1609/aaai.v39i25.34828

Abstract

Backdoor attacks present a serious security threat to deep neural networks (DNNs). Although numerous effective defense techniques have been proposed in recent years, they inevitably rely on the availability of either clean or poisoned data. In contrast, data-free defense techniques have evolved slowly and still lag significantly in performance. To address this gap, and in contrast to the traditional approach of pruning followed by fine-tuning, we propose a novel data-free defense method named Optimal Transport-based Backdoor Repairing (OTBR). This method, based on our findings on the neuron weight changes (NWCs) of random unlearning, uses optimal transport (OT)-based model fusion to combine the advantages of both pruned and backdoored models. Specifically, we first show that the NWCs of random unlearning are positively correlated with those of poison unlearning. Based on this observation, we propose a random-unlearning NWC pruning technique to eliminate the backdoor effect and obtain a backdoor-free pruned model. Then, motivated by OT-based model fusion, we propose a pruned-to-backdoored OT-based fusion technique, which fuses the pruned and backdoored models to combine the advantages of both, resulting in a model with high clean accuracy and a low attack success rate. To our knowledge, this is the first work to apply OT and model fusion techniques to backdoor defense. Extensive experiments show that our method successfully defends against all seven backdoor attacks across three benchmark datasets, outperforming both state-of-the-art (SOTA) data-free and data-dependent methods.

Published

2025-04-11

How to Cite

Lin, W., Liu, L., Li, J., & Xiong, H. (2025). Fusing Pruned and Backdoored Models: Optimal Transport-based Data-free Backdoor Mitigation. Proceedings of the AAAI Conference on Artificial Intelligence, 39(25), 26299–26307. https://doi.org/10.1609/aaai.v39i25.34828

Section

AAAI Technical Track on Philosophy and Ethics of AI