Authority Backdoor: A Certifiable Backdoor Mechanism for Authoring DNNs

Han Yang; Shaofeng Li; Tian Dong; Xiangyu Xu; Guangchi Liu; Zhen Ling

doi:10.1609/aaai.v40i2.37117

Authors

Han Yang Southeast University
Shaofeng Li Southeast University
Tian Dong University of Hong Kong
Xiangyu Xu Southeast University
Guangchi Liu Southeast University
Zhen Ling Southeast University

DOI:

https://doi.org/10.1609/aaai.v40i2.37117

Abstract

Deep Neural Networks (DNNs), as valuable intellectual property, face unauthorized use. Existing protections, such as digital watermarking, are largely passive; they provide only post-hoc ownership verification and cannot actively prevent the illicit use of a stolen model. This work proposes a proactive protection scheme, dubbed ``Authority Backdoor," which embeds access constraints directly into the model. In particular, the scheme utilizes a backdoor learning framework to intrinsically lock a model's utility, such that it performs normally only in the presence of a specific trigger (e.g., a hardware fingerprint). But in its absence, the DNN's performance degrades to be useless. To further enhance the security of the proposed authority scheme, the certifiable robustness is integrated to prevent an adaptive attacker from removing the implanted backdoor. The resulting framework establishes a secure authority mechanism for DNNs, combining access control with certifiable robustness against adversarial attacks. Extensive experiments on diverse architectures and datasets validate the effectiveness and certifiable robustness of the proposed framework.

Authority Backdoor: A Certifiable Backdoor Mechanism for Authoring DNNs

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information