Multi-view Disparity Estimation Using a Novel Gradient Consistency Model
James L. Gray, Aous T. Naman, David S. Taubman
TL;DR
This work tackles variational disparity estimation by addressing the limited validity range of the local brightness constancy linearisation. It introduces Gradient Consistency, a data-driven weighting scheme that self‑schedules the data term across multiple views and scales by incorporating gradient and scale reliability estimates, including a physical coupling between views. The Gradient Consistency Model (GCM) derives weights from an Euler–Lagrange solution and augments them with gradient‑ and scale‑inconsistency terms, yielding a robust, self‑adjusting energy that uses an $L^1$ data term and Total Variation regularisation. Empirical results on synthetic 4D Lightfield data and real Middlebury 2006 data show that GCM outperforms coarse‑to‑fine and progressive view inclusion in both convergence rate and accuracy, with clear improvements near object boundaries and strong insensitivity to the regularisation parameter.
Abstract
Variational approaches to disparity estimation typically use a linearised brightness constancy constraint, which only applies in smooth regions and over small distances. Accordingly, current variational approaches rely on a schedule to progressively include image data. This paper proposes the use of Gradient Consistency information to assess the validity of the linearisation; this information is used to determine the weights applied to the data term as part of an analytically inspired Gradient Consistency Model. The Gradient Consistency Model penalises the data term for view pairs that have a mismatch between the spatial gradients in the source view and the spatial gradients in the target view. Instead of relying on a tuned or learned schedule, the Gradient Consistency Model is self-scheduling, since the weights evolve as the algorithm progresses. We show that the Gradient Consistency Model outperforms standard coarse-to-fine schemes and the recently proposed progressive inclusion of views approach in both rate of convergence and accuracy.
