DeLiVR: Differential Spatiotemporal Lie Bias for Efficient Video Deraining
Shuning Sun, Jialang Lu, Xiang Chen, Jichao Wang, Dianjie Lu, Guijuan Zhang, Guangwei Gao, Zhuoran Zheng
TL;DR
DeLiVR tackles rain-induced degradation in outdoor videos by enforcing geometry-aware spatiotemporal consistency. It injects spatiotemporal Lie-group differential biases directly into attention, combining a rotation-bounded SO(2) head with differential group displacement to align frames and model motion without relying on unreliable optical flow. The approach achieves state-of-the-art or competitive results on synthetic and real benchmarks, notably WeatherBench, while reducing artifacts and improving downstream task performance such as object detection and semantic segmentation. This work demonstrates that principled geometric priors integrated into attention offer robust, efficient video restoration with practical impact on real-world perception systems.
Abstract
Videos captured in the wild often suffer from rain streaks, blur, and noise. In addition, even slight changes in camera pose can amplify cross-frame mismatches and temporal artifacts. Existing methods rely on optical flow or heuristic alignment, which are computationally expensive and less robust. To address these challenges, Lie groups provide a principled way to represent continuous geometric transformations, making them well-suited for enforcing spatial and temporal consistency in video modeling. Building on this insight, we propose DeLiVR, an efficient video deraining method that injects spatiotemporal Lie-group differential biases directly into attention scores of the network. Specifically, the method introduces two complementary components. First, a rotation-bounded Lie relative bias predicts the in-plane angle of each frame using a compact prediction module, where normalized coordinates are rotated and compared with base coordinates to achieve geometry-consistent alignment before feature aggregation. Second, a differential group displacement computes angular differences between adjacent frames to estimate a velocity. This bias computation combines temporal decay and attention masks to focus on inter-frame relationships while precisely matching the direction of rain streaks. Extensive experimental results demonstrate the effectiveness of our method on publicly available benchmarks. The code is publicly available at https://github.com/Shuning0312/ICLR-DeLiVR.
