QueryCDR: Query-Based Controllable Distortion Rectification Network for Fisheye Images
Pengbo Guo, Chengxu Liu, Xingsong Hou, Xueming Qian
TL;DR
QueryCDR tackles the challenge of rectifying fisheye distortions across varying distortion degrees without retraining. It introduces a Distortion-aware Learnable Query Mechanism (DLQM) to generate per-layer, position-aware control signals from a set of learnable queries, and pairs it with two controllable blocks—Controllable Convolution Modulating Block ($\text{CCMB}$) and Controllable Attention Modulating Block ($\text{CAMB}$)—to guide rectification in a U-shaped, multi-scale framework. The method is trained in two stages (coarse pre-training on a single distortion degree and fine-tuning across multiple degrees) to achieve broad generalization, validated by quantitative PSNR/SSIM gains and qualitative texture preservation on synthetic and real fisheye data. The results demonstrate superior performance across nine distortion degrees without retraining, enabling practical, controllable fisheye rectification with reduced data collection costs. Overall, DLQM and the dual-modulation architecture offer a robust, scalable solution for real-world fisheye correction with varying distortion patterns.
Abstract
Fisheye image rectification aims to correct distortions in images taken with fisheye cameras. Although current models show promising results on images with a similar degree of distortion as the training data, they will produce sub-optimal results when the degree of distortion changes and without retraining. The lack of generalization ability for dealing with varying degrees of distortion limits their practical application. In this paper, we take one step further to enable effective distortion rectification for images with varying degrees of distortion without retraining. We propose a novel Query-Based Controllable Distortion Rectification network for fisheye images (QueryCDR). In particular, we first present the Distortion-aware Learnable Query Mechanism (DLQM), which defines the latent spatial relationships for different distortion degrees as a series of learnable queries. Each query can be learned to obtain position-dependent rectification control conditions, providing control over the rectification process. Then, we propose two kinds of controllable modulating blocks to enable the control conditions to guide the modulation of the distortion features better. These core components cooperate with each other to effectively boost the generalization ability of the model at varying degrees of distortion. Extensive experiments on fisheye image datasets with different distortion degrees demonstrate our approach achieves high-quality and controllable distortion rectification.
