Gaussian Splatting Lucas-Kanade
Liuyue Xie, Joel Julin, Koichiro Niinuma, Laszlo A. Jeni
TL;DR
This work tackles dynamic 3D reconstruction from monocular video with limited camera motion by replacing data-driven warp priors with an analytical scene-flow regularization for Gaussian Splatting. It derives an instantaneous velocity field from the forward warp field via $v(oldsymbol{G}; t) = J^{+}_{oldsymbol{G}} rac{ullmid oldsymbol{ ext{W}_{c\rightarrow}(oldsymbol{G}; t)}_{ ext{}}}{ ext{d} t}$ and integrates it in time (using a Runge–Kutta solver) to obtain coherent Gaussian trajectories, enforcing motion and depth consistency through a loss $ ext{L}_{motion} = ext{L}_{flow} + ext{L}_{depth}$ with weights $oldsymbol{ humb{ rac{`,}}}$ and $oldsymbol{m}$-ranking depths. The approach yields improved geometric fidelity and motion separation on synthetic and real dynamic scenes, outperforming prior Gaussian Splatting variants and approaching NeRF-like quality under challenging camera motion. By delivering continuous-time warp-field regularization and reducing bias from purely data-driven priors, the method offers scalable, accurate dynamic scene reconstruction with minimal camera movement and opens avenues for 3D tracking and robust in-the-wild rendering. The results demonstrate the value of integrating analytical scene flow into deformable Gaussians for dynamic 3D vision tasks.
Abstract
Gaussian Splatting and its dynamic extensions are effective for reconstructing 3D scenes from 2D images when there is significant camera movement to facilitate motion parallax and when scene objects remain relatively static. However, in many real-world scenarios, these conditions are not met. As a consequence, data-driven semantic and geometric priors have been favored as regularizers, despite their bias toward training data and their neglect of broader movement dynamics. Departing from this practice, we propose a novel analytical approach that adapts the classical Lucas-Kanade method to dynamic Gaussian splatting. By leveraging the intrinsic properties of the forward warp field network, we derive an analytical velocity field that, through time integration, facilitates accurate scene flow computation. This enables the precise enforcement of motion constraints on warp fields, thus constraining both 2D motion and 3D positions of the Gaussians. Our method excels in reconstructing highly dynamic scenes with minimal camera movement, as demonstrated through experiments on both synthetic and real-world scenes.
