Color Mismatches in Stereoscopic Video: Real-World Dataset and Deep Correction Method
Egor Chistov, Nikita Alutis, Dmitriy Vatolin
TL;DR
This paper addresses color mismatches between stereoscopic views, which can cause viewer discomfort, by introducing a real-world beam-splitter dataset and a deep multiscale network that leverages stereo correspondences for color transfer. The method uses an optical-flow-based correspondence mechanism, Efficient feature extraction, and a four-layer U-Net to fuse matched and neighboring information, with training under both deterministic and probabilistic distortion models. Experiments show strong performance on artificial distortions but reveal a domain shift when evaluated on real-world data, where simple global color transfers can outperform more complex, non-global methods. The work emphasizes the need for more realistic color-distortion models to improve generalization to real-world stereoscopic color mismatches and provides a publicly available dataset to drive future progress.
Abstract
Stereoscopic videos can contain color mismatches between the left and right views due to minor variations in camera settings, lenses, and even object reflections captured from different positions. The presence of color mismatches can lead to viewer discomfort and headaches. This problem can be solved by transferring color between stereoscopic views, but traditional methods often lack quality, while neural-network-based methods can easily overfit on artificial data. The scarcity of stereoscopic videos with real-world color mismatches hinders the evaluation of different methods' performance. Therefore, we filmed a video dataset, which includes both distorted frames with color mismatches and ground-truth data, using a beam-splitter. Our second contribution is a deep multiscale neural network that solves the color-mismatch-correction task by leveraging stereo correspondences. The experimental results demonstrate the effectiveness of the proposed method on a conventional dataset, but there remains room for improvement on challenging real-world data.
