Single Image Rolling Shutter Removal with Diffusion Models
Zhanglei Yang, Haipeng Li, Mingbo Hong, Chen-Lin Zhang, Jiajun Li, Shuaicheng Liu
TL;DR
RS-Diffusion presents the first diffusion-model approach to single-image rolling shutter removal by formulating an image-to-motion task conditioned on the RS frame and augmented with a Patch-Attention module. The Intra Gyro Field (IGF) pipeline enables accurate GS ground-truth labeling via synchronized IMU data, culminating in the RS-Real dataset that combines realism with precise motion labels. Empirical results demonstrate state-of-the-art performance for single-frame RS correction and real-time inference on GPUs, along with robust ablations validating the IGF-driven supervision and patch-attention design. The work advances practical RS correction and provides a valuable dataset for future diffusion-based restoration research.
Abstract
We present RS-Diffusion, the first Diffusion Models-based method for single-frame Rolling Shutter (RS) correction. RS artifacts compromise visual quality of frames due to the row-wise exposure of CMOS sensors. Most previous methods have focused on multi-frame approaches, using temporal information from consecutive frames for the motion rectification. However, few approaches address the more challenging but important single frame RS correction. In this work, we present an ``image-to-motion" framework via diffusion techniques, with a designed patch-attention module. In addition, we present the RS-Real dataset, comprised of captured RS frames alongside their corresponding Global Shutter (GS) ground-truth pairs. The GS frames are corrected from the RS ones, guided by the corresponding Inertial Measurement Unit (IMU) gyroscope data acquired during capture. Experiments show that RS-Diffusion surpasses previous single-frame RS methods, demonstrates the potential of diffusion-based approaches, and provides a valuable dataset for further research.
