$\textrm{A}^{\textrm{2}}$RNet: Adversarial Attack Resilient Network for Robust Infrared and Visible Image Fusion
Jiawei Li, Hongwei Yu, Jiansheng Chen, Xinlong Ding, Jinlong Wang, Jinyuan Liu, Bochao Zou, Huimin Ma
TL;DR
This work tackles the vulnerability of infrared-visible image fusion (IVIF) to adversarial perturbations by introducing the adversarial attack resilient network, $A^{2}$RNet, and a novel anti-attack loss $\mathcal{L}_{a}$. The method combines a Unet-based fusion backbone with a transformer-based Defensive Refinement Module (DRM) and Adversarial-Resistant Blocks (ARBs) that employ Mercer-based self-attention to maintain robust feature fusion under attack; adversarial training uses PGD-generated examples with pseudo-labels to guide the min-max optimization of $\mathcal{L}_{a}$. Experimental results on MFNet and $M^{3}$FD demonstrate improved fusion quality and stronger resilience in downstream tasks (detection and segmentation) under adversarial perturbations, outperforming seven SOTA methods. The work provides a practical framework for robust IVIF with potential for broader application in safety-critical vision tasks and sets the ground for future data-centric robustness in fusion systems.
Abstract
Infrared and visible image fusion (IVIF) is a crucial technique for enhancing visual performance by integrating unique information from different modalities into one fused image. Exiting methods pay more attention to conducting fusion with undisturbed data, while overlooking the impact of deliberate interference on the effectiveness of fusion results. To investigate the robustness of fusion models, in this paper, we propose a novel adversarial attack resilient network, called $\textrm{A}^{\textrm{2}}$RNet. Specifically, we develop an adversarial paradigm with an anti-attack loss function to implement adversarial attacks and training. It is constructed based on the intrinsic nature of IVIF and provide a robust foundation for future research advancements. We adopt a Unet as the pipeline with a transformer-based defensive refinement module (DRM) under this paradigm, which guarantees fused image quality in a robust coarse-to-fine manner. Compared to previous works, our method mitigates the adverse effects of adversarial perturbations, consistently maintaining high-fidelity fusion results. Furthermore, the performance of downstream tasks can also be well maintained under adversarial attacks. Code is available at https://github.com/lok-18/A2RNet.
