Table of Contents
Fetching ...

Toward Real-world Infrared Image Super-Resolution: A Unified Autoregressive Framework and Benchmark Dataset

Yang Zou, Jun Ma, Zhidong Jiao, Xingyuan Li, Zhiying Jiang, Jinyuan Liu

Abstract

Infrared image super-resolution (IISR) under real-world conditions is a practically significant yet rarely addressed task. Pioneering works are often trained and evaluated on simulated datasets or neglect the intrinsic differences between infrared and visible imaging. In practice, however, real infrared images are affected by coupled optical and sensing degradations that jointly deteriorate both structural sharpness and thermal fidelity. To address these challenges, we propose Real-IISR, a unified autoregressive framework for real-world IISR that progressively reconstructs fine-grained thermal structures and clear backgrounds in a scale-by-scale manner via thermal-structural guided visual autoregression. Specifically, a Thermal-Structural Guidance module encodes thermal priors to mitigate the mismatch between thermal radiation and structural edges. Since non-uniform degradations typically induce quantization bias, Real-IISR adopts a Condition-Adaptive Codebook that dynamically modulates discrete representations based on degradation-aware thermal priors. Also, a Thermal Order Consistency Loss enforces a monotonic relation between temperature and pixel intensity, ensuring relative brightness order rather than absolute values to maintain physical consistency under spatial misalignment and thermal drift. We build FLIR-IISR, a real-world IISR dataset with paired LR-HR infrared images acquired via automated focus variation and motion-induced blur. Extensive experiments demonstrate the promising performance of Real-IISR, providing a unified foundation for real-world IISR and benchmarking. The dataset and code are available at: https://github.com/JZD151/Real-IISR.

Toward Real-world Infrared Image Super-Resolution: A Unified Autoregressive Framework and Benchmark Dataset

Abstract

Infrared image super-resolution (IISR) under real-world conditions is a practically significant yet rarely addressed task. Pioneering works are often trained and evaluated on simulated datasets or neglect the intrinsic differences between infrared and visible imaging. In practice, however, real infrared images are affected by coupled optical and sensing degradations that jointly deteriorate both structural sharpness and thermal fidelity. To address these challenges, we propose Real-IISR, a unified autoregressive framework for real-world IISR that progressively reconstructs fine-grained thermal structures and clear backgrounds in a scale-by-scale manner via thermal-structural guided visual autoregression. Specifically, a Thermal-Structural Guidance module encodes thermal priors to mitigate the mismatch between thermal radiation and structural edges. Since non-uniform degradations typically induce quantization bias, Real-IISR adopts a Condition-Adaptive Codebook that dynamically modulates discrete representations based on degradation-aware thermal priors. Also, a Thermal Order Consistency Loss enforces a monotonic relation between temperature and pixel intensity, ensuring relative brightness order rather than absolute values to maintain physical consistency under spatial misalignment and thermal drift. We build FLIR-IISR, a real-world IISR dataset with paired LR-HR infrared images acquired via automated focus variation and motion-induced blur. Extensive experiments demonstrate the promising performance of Real-IISR, providing a unified foundation for real-world IISR and benchmarking. The dataset and code are available at: https://github.com/JZD151/Real-IISR.
Paper Structure (16 sections, 4 equations, 9 figures, 2 tables)

This paper contains 16 sections, 4 equations, 9 figures, 2 tables.

Figures (9)

  • Figure 1: Overview of the constructed FLIR-IISR dataset. To bridge the gap between synthetic and real-world infrared image super-resolution (IISR), we construct the FLIR-IISR dataset using a FLIR T1050sc camera at a 1024×768 resolution, which contains 1,457 paired LR–HR images captured across 6 cities, 3 seasons, and 2 real blur patterns of optical and motion blur across 12 scene categories.
  • Figure 2: Overview of Real-IISR. The Thermal-Structural Guidance (TSG) module fuses thermal priors for degradation-aware encoding. The VAR backbone performs scale-by-scale generation via next-scale prediction, while the Condition-Adaptive Codebook (CAC) dynamically adjusts quantized embeddings based on degradation-aware priors for thermal fidelity. Finally, the Thermal Order Consistency Loss $\mathcal{L}_{\text{TOC}}$ preserves physically consistent thermal ordering.
  • Figure 3: Data collection pipeline of FLIR-IISR.
  • Figure 4: Efficiency comparison in terms of perceptual MUSIQ and FPS; circle diameter indicates model parameters.
  • Figure 5: Qualitative comparison of IISR with SOTA methods on FLIR-IISR and $\text{M}^3\text{FD}$ datasets. The graph illustrates grayscale fluctuations along the blue-marked sampling line, and red-marked sampling line denotes the HR.
  • ...and 4 more figures