Perceptual Region-Driven Infrared-Visible Co-Fusion for Extreme Scene Enhancement
Jing Tao, Yonghong Zong, Banglei Guana, Pengju Sun, Taihang Lei, Yang Shanga, Qifeng Yu
TL;DR
The paper tackles the challenge of fusing infrared and visible imagery under extreme conditions while preserving visible geometric features and incorporating thermal data. It presents a region-perception-driven framework that jointly handles multi-exposure and multi-modal data using a space-variant exposure (SVE) camera, with region-guided registration and adaptive, SSIM-informed fusion. Key contributions include a space-variant hardware design, a regional perception map with BI/WC/CF/V features, an adaptive feature-merging strategy, a region-aware multi-exposure fusion, and an infrared-visible complementary fusion module. Extensive experiments on rocket-launch and public datasets demonstrate superior image quality, robustness, and applicability to additional multi-modal scenarios compared with state-of-the-art methods.
Abstract
In photogrammetry, accurately fusing infrared (IR) and visible (VIS) spectra while preserving the geometric fidelity of visible features and incorporating thermal radiation is a significant challenge, particularly under extreme conditions. Existing methods often compromise visible imagery quality, impacting measurement accuracy. To solve this, we propose a region perception-based fusion framework that combines multi-exposure and multi-modal imaging using a spatially varying exposure (SVE) camera. This framework co-fuses multi-modal and multi-exposure data, overcoming single-exposure method limitations in extreme environments. The framework begins with region perception-based feature fusion to ensure precise multi-modal registration, followed by adaptive fusion with contrast enhancement. A structural similarity compensation mechanism, guided by regional saliency maps, optimizes IR-VIS spectral integration. Moreover, the framework adapts to single-exposure scenarios for robust fusion across different conditions. Experiments conducted on both synthetic and real-world data demonstrate superior image clarity and improved performance compared to state-of-the-art methods, as evidenced by both quantitative and visual evaluations.
