Cross-Task Attack: A Self-Supervision Generative Framework Based on Attention Shift
Qingyuan Zeng, Yunpeng Gong, Min Jiang
TL;DR
This work tackles the robustness challenge of multi-task AI systems by proposing Cross-Task Attack (CTA), a self-supervised framework that uses co-attention and anti-attention maps to guide adversarial perturbations across multiple vision tasks. CTA operates in two stages: extracting per-task attention via Grad-CAM to form a co-attention map and its complement anti-attention, then training a generator to shift input attention away from co-attention toward anti-attention without ground-truth labels. The method demonstrates strong cross-task attacks on image classification, object detection, and semantic segmentation, and maintains effectiveness against some adversarially trained defenses, outperforming prior cross-task approaches in many settings. The results imply CTA offers a practical and scalable way to stress-test and improve the robustness of multi-task AI systems, with clear insights into attention dynamics during adversarial perturbation.
Abstract
Studying adversarial attacks on artificial intelligence (AI) systems helps discover model shortcomings, enabling the construction of a more robust system. Most existing adversarial attack methods only concentrate on single-task single-model or single-task cross-model scenarios, overlooking the multi-task characteristic of artificial intelligence systems. As a result, most of the existing attacks do not pose a practical threat to a comprehensive and collaborative AI system. However, implementing cross-task attacks is highly demanding and challenging due to the difficulty in obtaining the real labels of different tasks for the same picture and harmonizing the loss functions across different tasks. To address this issue, we propose a self-supervised Cross-Task Attack framework (CTA), which utilizes co-attention and anti-attention maps to generate cross-task adversarial perturbation. Specifically, the co-attention map reflects the area to which different visual task models pay attention, while the anti-attention map reflects the area that different visual task models neglect. CTA generates cross-task perturbations by shifting the attention area of samples away from the co-attention map and closer to the anti-attention map. We conduct extensive experiments on multiple vision tasks and the experimental results confirm the effectiveness of the proposed design for adversarial attacks.
