360VFI: A Dataset and Benchmark for Omnidirectional Video Frame Interpolation
Wenxuan Lu, Mengshun Hu, Yansheng Qiu, Liang Liao, Zheng Wang
TL;DR
This work tackles the challenge of interpolation for omnidirectional 360° videos by introducing 360VFI, the first dataset and benchmark for Omnidirectional Video Frame Interpolation (Omni-VFI), and a distortion-aware network that leverages ERP distortion priors. It fuses an ERP-aware feature extractor (DistortionGuard) with a distortion-conditioned frame generator (OmniFTB) to reconstruct intermediate frames, using a distortion map based on latitude to modulate processing. The four motion settings in the dataset enable robust evaluation of interpolation under latitude-dependent distortion, and experiments show state-of-the-art performance, especially for large vertical motions. The contributions offer a practical path toward higher-frame-rate immersive 360° video with improved temporal coherence and reduced distortion artifacts.
Abstract
Head-mounted 360° displays and portable 360° cameras have significantly progressed, providing viewers a realistic and immersive experience. However, many omnidirectional videos have low frame rates that can lead to visual fatigue, and the prevailing plane frame interpolation methodologies are unsuitable for omnidirectional video interpolation because they are designed solely for traditional videos. This paper introduces the benchmark dataset, 360VFI, for Omnidirectional Video Frame Interpolation. We present a practical implementation that introduces a distortion prior from omnidirectional video into the network to modulate distortions. Specifically, we propose a pyramid distortion-sensitive feature extractor that uses the unique characteristics of equirectangular projection (ERP) format as prior information. Moreover, we devise a decoder that uses an affine transformation to further facilitate the synthesis of intermediate frames. 360VFI is the first dataset and benchmark that explores the challenge of Omnidirectional Video Frame Interpolation. Through our benchmark analysis, we present four different distortion condition scenes in the proposed 360VFI dataset to evaluate the challenges triggered by distortion during interpolation. Besides, experimental results demonstrate that Omnidirectional Video Interpolation can be effectively improved by modeling for omnidirectional distortion.
