Revocable Backdoor for Deep Model Trading
Yiran Xu, Nan Zhong, Zhenxing Qian, Xinpeng Zhang
TL;DR
The paper addresses the risk of backdoors in deployed deep models by reframing backdoors as a controllable asset within a model-trading workflow. It introduces a revocable backdoor mechanism based on trainable mask matrices that gate backdoor behavior at interior feature maps, while a coordinated trigger and loss design preserves clean-task fidelity. The key contributions include a practical withdrawal method via masks, trigger-fine-tuning to balance imperceptibility and robustness, and empirical validation across multiple datasets and architectures, showing feasibility and resilience against purification defenses. This approach offers a novel, risk-managed path for exchanging deep models as tradable digital products, with a built-in detoxification pathway upon final payment.
Abstract
Deep models are being applied in numerous fields and have become a new important digital product. Meanwhile, previous studies have shown that deep models are vulnerable to backdoor attacks, in which compromised models return attacker-desired results when a trigger appears. Backdoor attacks severely break the trust-worthiness of deep models. In this paper, we turn this weakness of deep models into a strength, and propose a novel revocable backdoor and deep model trading scenario. Specifically, we aim to compromise deep models without degrading their performance, meanwhile, we can easily detoxify poisoned models without re-training the models. We design specific mask matrices to manage the internal feature maps of the models. These mask matrices can be used to deactivate the backdoors. The revocable backdoor can be adopted in the deep model trading scenario. Sellers train models with revocable backdoors as a trial version. Buyers pay a deposit to sellers and obtain a trial version of the deep model. If buyers are satisfied with the trial version, they pay a final payment to sellers and sellers send mask matrices to buyers to withdraw revocable backdoors. We demonstrate the feasibility and robustness of our revocable backdoor by various datasets and network architectures.
