Spatially-Attentive Patch-Hierarchical Network with Adaptive Sampling for Motion Deblurring
Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan
TL;DR
This work tackles motion deblurring in dynamic scenes by introducing a pixel-adaptive, content-aware architecture that jointly leverages global attention and local, pixel-dependent filtering in a three-stage, patch-based hierarchy. A non-uniform, learnable sampling strategy concentrates computation on hard-to-restore regions, guided by reinforcement learning to select informative pixels, and GT supervision is applied progressively to refine restoration. The method integrates cross- and self-attention with a pixel-dependent filtering module to realize a robust global-local fusion, achieving state-of-the-art or competitive results on GoPro, HIDE, and RealBlur benchmarks while maintaining efficiency. Overall, the approach provides interpretable, adaptive restoration where blur varies across spatial locations, with potential applicability to broader image restoration tasks.
Abstract
This paper tackles the problem of motion deblurring of dynamic scenes. Although end-to-end fully convolutional designs have recently advanced the state-of-the-art in non-uniform motion deblurring, their performance-complexity trade-off is still sub-optimal. Most existing approaches achieve a large receptive field by increasing the number of generic convolution layers and kernel size. In this work, we propose a pixel adaptive and feature attentive design for handling large blur variations across different spatial locations and process each test image adaptively. We design a content-aware global-local filtering module that significantly improves performance by considering not only global dependencies but also by dynamically exploiting neighboring pixel information. We further introduce a pixel-adaptive non-uniform sampling strategy that implicitly discovers the difficult-to-restore regions present in the image and, in turn, performs fine-grained refinement in a progressive manner. Extensive qualitative and quantitative comparisons with prior art on deblurring benchmarks demonstrate that our approach performs favorably against the state-of-the-art deblurring algorithms.
