DAVIDE: Depth-Aware Video Deblurring

German F. Torres; Jussi Kalliola; Soumya Tripathy; Erman Acar; Joni-Kristian Kämäräinen

DAVIDE: Depth-Aware Video Deblurring

German F. Torres, Jussi Kalliola, Soumya Tripathy, Erman Acar, Joni-Kristian Kämäräinen

TL;DR

Video deblurring benefits from depth information, but the advantage is context-dependent. The authors introduce the DAVIDE RGB-D video deblurring dataset and develop a depth-aware extension of Shift-Net, incorporating a Depth-aware Transformer Block (DaT) and Depth fusion through GSTS and GSS mechanisms. Experimental results show depth cues boost performance for small temporal contexts, with gains diminishing as the context window increases, while depth reliability and scene attributes modulate the benefits. The work provides practical insights for leveraging depth sensors in real-world video restoration and establishes a valuable benchmark for depth-aware video deblurring research.

Abstract

Video deblurring aims at recovering sharp details from a sequence of blurry frames. Despite the proliferation of depth sensors in mobile phones and the potential of depth information to guide deblurring, depth-aware deblurring has received only limited attention. In this work, we introduce the 'Depth-Aware VIdeo DEblurring' (DAVIDE) dataset to study the impact of depth information in video deblurring. The dataset comprises synchronized blurred, sharp, and depth videos. We investigate how the depth information should be injected into the existing deep RGB video deblurring models, and propose a strong baseline for depth-aware video deblurring. Our findings reveal the significance of depth information in video deblurring and provide insights into the use cases where depth cues are beneficial. In addition, our results demonstrate that while the depth improves deblurring performance, this effect diminishes when models are provided with a longer temporal context. Project page: https://germanftv.github.io/DAVIDE.github.io/ .

DAVIDE: Depth-Aware Video Deblurring

TL;DR

Abstract

Paper Structure (37 sections, 9 equations, 10 figures, 5 tables)

This paper contains 37 sections, 9 equations, 10 figures, 5 tables.

Introduction
Related Work
Video deblurring.
Depth-Aware Deblurring.
Deblurring benchmark datasets.
Depth-Aware VIdeo DEblurring dataset (DAVIDE)
Background
Data recording
Blur synthesis
Dataset details
Method
Overview of Shift-Net
Grouped Spatial-Temporal Shift (GSTS).
Depth injection
Depth-aware Transformer Block (DaT).
...and 22 more sections

Figures (10)

Figure 1: Examples of depth-aware video deblurring (RGBD) with increasing temporal length $T$ of the context window (see \ref{['sec:exp_depth_impact']} for details).
Figure 2: The DAVIDE blur synthesis pipeline.
Figure 3: Overview of Shift-Net.
Figure 4: Depth fusion block.
Figure 5: Impact of the depth cue with varying $T$ (temporal length of the context window); (a) 95% confidence ($\pm$2 std) plot between the RGB and RGBD performance; (b) avg. results over three independent runs.
...and 5 more figures

DAVIDE: Depth-Aware Video Deblurring

TL;DR

Abstract

DAVIDE: Depth-Aware Video Deblurring

Authors

TL;DR

Abstract

Table of Contents

Figures (10)