Event-Enhanced Blurry Video Super-Resolution
Dachun Kai, Yueyi Zhang, Jin Wang, Zeyu Xiao, Zhiwei Xiong, Xiaoyan Sun
TL;DR
This work addresses blurry video super-resolution (BVSR) by introducing event signals to augment frame information, mitigating deconvolution ambiguities and high-frequency detail loss. The proposed Ev-DeblurVSR jointly fuses intra-frame and inter-frame events with RGB frames through two key modules: Reciprocal Feature Deblurring (RFD), which refines frame features using motion cues from intra-frame events while enriching event features with scene context, and Hybrid Deformable Alignment (HDA), which blends inter-frame event trajectories with optical flow to improve motion estimation for deformable alignment. An edge-enhanced loss further emphasizes high-frequency regions during training, and extensive experiments on synthetic and real-world datasets show state-of-the-art performance with superior spatial recovery and temporal consistency, including a notable real-world gain of +2.59 dB over a strong BVSR baseline and significantly faster inference. The work demonstrates the practical potential of event data to enhance BVSR in challenging scenes, offering a new direction for joint frame-event video restoration methods.
Abstract
In this paper, we tackle the task of blurry video super-resolution (BVSR), aiming to generate high-resolution (HR) videos from low-resolution (LR) and blurry inputs. Current BVSR methods often fail to restore sharp details at high resolutions, resulting in noticeable artifacts and jitter due to insufficient motion information for deconvolution and the lack of high-frequency details in LR frames. To address these challenges, we introduce event signals into BVSR and propose a novel event-enhanced network, Ev-DeblurVSR. To effectively fuse information from frames and events for feature deblurring, we introduce a reciprocal feature deblurring module that leverages motion information from intra-frame events to deblur frame features while reciprocally using global scene context from the frames to enhance event features. Furthermore, to enhance temporal consistency, we propose a hybrid deformable alignment module that fully exploits the complementary motion information from inter-frame events and optical flow to improve motion estimation in the deformable alignment process. Extensive evaluations demonstrate that Ev-DeblurVSR establishes a new state-of-the-art performance on both synthetic and real-world datasets. Notably, on real data, our method is +2.59 dB more accurate and 7.28$\times$ faster than the recent best BVSR baseline FMA-Net. Code: https://github.com/DachunKai/Ev-DeblurVSR.
