On the Adversarial Vulnerabilities of Transfer Learning in Remote Sensing
Tao Bai, Xingjian Tian, Yonghao Xu, Bihan Wen
TL;DR
This work addresses the vulnerability of transfer learning in remote sensing by showing that publicly available pretrained models can be exploited to attack downstream tasks through Adversarial Neuron Manipulation (ANM). It introduces two attack variants, ANM-S (single-neuron) and ANM-M (multiple-neuron), under a grey-box threat model where perturbations are crafted from a pretrained model and transferred to a finetuned victim model without access to its data. A Mutual Information Maximization Search (MIMS) procedure selects influential neurons for ANM-M, achieving stronger and more consistent attacks than ANM-S across diverse datasets and architectures. The findings highlight a significant security risk in transfer learning for safety-critical remote sensing tasks and motivate defense strategies such as adversarial training, ensemble methods, and real-time anomaly detection to improve robustness.
Abstract
The use of pretrained models from general computer vision tasks is widespread in remote sensing, significantly reducing training costs and improving performance. However, this practice also introduces vulnerabilities to downstream tasks, where publicly available pretrained models can be used as a proxy to compromise downstream models. This paper presents a novel Adversarial Neuron Manipulation method, which generates transferable perturbations by selectively manipulating single or multiple neurons in pretrained models. Unlike existing attacks, this method eliminates the need for domain-specific information, making it more broadly applicable and efficient. By targeting multiple fragile neurons, the perturbations achieve superior attack performance, revealing critical vulnerabilities in deep learning models. Experiments on diverse models and remote sensing datasets validate the effectiveness of the proposed method. This low-access adversarial neuron manipulation technique highlights a significant security risk in transfer learning models, emphasizing the urgent need for more robust defenses in their design when addressing the safety-critical remote sensing tasks.
