ParticleGS: Learning Neural Gaussian Particle Dynamics from Videos for Prior-free Physical Motion Extrapolation
Jinsheng Quan, Qiaowei Miao, Yichao Xu, Zizhuo Lin, Ying Li, Wei Yang, Zhihui Li, Yawei Luo
TL;DR
ParticleGS addresses long-horizon extrapolation of dynamic 3D scenes by learning neural physical dynamics directly from video. It introduces an Encoder–Evolver–Decoder pipeline that represents scenes as Gaussian particles with static properties and shared dynamic fields, evolved by a Neural ODE to predict future motion. A factorized encoding and a physically grounded decoder (using Rodrigues’ rotation) enable high-order, continuous-time dynamics and physically plausible extrapolation without predefined physics priors. Across multiple synthetic and real datasets, ParticleGS achieves state-of-the-art extrapolation while maintaining rendering quality comparable to leading dynamic 3D reconstruction methods, demonstrating effective physics-guided, prior-free dynamic learning.
Abstract
The ability to extrapolate dynamic 3D scenes beyond the observed timeframe is fundamental to advancing physical world understanding and predictive modeling. Existing dynamic 3D reconstruction methods have achieved high-fidelity rendering of temporal interpolation, but typically lack physical consistency in predicting the future. To overcome this issue, we propose ParticleGS, a physics-based framework that reformulates dynamic 3D scenes as physically grounded systems. ParticleGS comprises three key components: 1) an encoder that decomposes the scene into static properties and initial dynamic physical fields; 2) an evolver based on Neural Ordinary Differential Equations (Neural ODEs) that learns continuous-time dynamics for motion extrapolation; and 3) a decoder that reconstructs 3D Gaussians from evolved particle states for rendering. Through this design, ParticleGS integrates physical reasoning into dynamic 3D representations, enabling accurate and consistent prediction of the future. Experiments show that ParticleGS achieves state-of-the-art performance in extrapolation while maintaining rendering quality comparable to leading dynamic 3D reconstruction methods.
