Regularizing Dynamic Radiance Fields with Kinematic Fields
Woobin Im, Geonho Cha, Sebin Lee, Jumin Lee, Juhyeong Seon, Dongyoon Wee, Sung-Eui Yoon
TL;DR
The paper tackles dynamic scene reconstruction from monocular video by introducing a kinematic field that outputs velocity $\mathbf{v}$, acceleration $\mathbf{a}$, and jerk $\mathbf{j}$ and learns it jointly with dynamic and static radiance fields $\mathcal{F}_{\text{DY}}$ and $\mathcal{F}_{\text{ST}}$. It couples these fields through photometric losses and physics-informed regularizers, including transport and rigidity terms, to enforce physically plausible motion and trajectories. Empirical results on the NDVS dataset demonstrate improved rendering quality and motion consistency, with faster training times compared to prior methods like NSFF, DynamicNeRF, and HyperNeRF, while also providing rich kinematic estimates. This physics-grounded framework advances monocular dynamic 3D reconstruction, enabling more accurate 4D representations and motion-aware scene understanding for real-world applications.
Abstract
This paper presents a novel approach for reconstructing dynamic radiance fields from monocular videos. We integrate kinematics with dynamic radiance fields, bridging the gap between the sparse nature of monocular videos and the real-world physics. Our method introduces the kinematic field, capturing motion through kinematic quantities: velocity, acceleration, and jerk. The kinematic field is jointly learned with the dynamic radiance field by minimizing the photometric loss without motion ground truth. We further augment our method with physics-driven regularizers grounded in kinematics. We propose physics-driven regularizers that ensure the physical validity of predicted kinematic quantities, including advective acceleration and jerk. Additionally, we control the motion trajectory based on rigidity equations formed with the predicted kinematic quantities. In experiments, our method outperforms the state-of-the-arts by capturing physical motion patterns within challenging real-world monocular videos.
