Neural Material Adaptor for Visual Grounding of Intrinsic Dynamics

Junyi Cao; Shanyan Guan; Yanhao Ge; Wei Li; Xiaokang Yang; Chao Ma

TL;DR

This work tackles visual grounding of intrinsic dynamics from videos by introducing NeuMA, a Neural Material Adaptor that learns a residual correction $\Delta\mathcal{M}_\theta$ to an expert prior $\mathcal{M}_0$ within a differentiable physics engine. The method couples a low-rank adaptation of constitutive models with an elastodynamic solver based on the Material Point Method (MPM) and a differentiable 3D Gaussian Splatting renderer (Particle-GS), and it minimizes image-based losses $\mathcal{L}_v$ so that motion can be grounded end to end from observations. Empirical results on synthetic and real data show that NeuMA improves object-dynamics grounding and dynamic rendering while generalizing to unseen shapes and multi-object interactions, which demonstrates the value of combining physical priors with data-driven corrections. The approach offers a principled, interpretable path toward accurate, generalizable modeling of intrinsic dynamics for visual understanding and simulation.
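One way to write down the objective summarized above is given here as a hedged formalization, not the paper's exact notation. The symbols $x_t$ (particle state at frame $t$), $\mathrm{MPM}$ (one solver step), $\mathcal{R}$ (the Particle-GS renderer), and $I_t$ (the observed video frame) are introduced here purely for illustration.

$$
\mathcal{M}_\theta = \mathcal{M}_0 + \Delta\mathcal{M}_\theta, \qquad
x_{t+1} = \mathrm{MPM}(x_t;\, \mathcal{M}_\theta), \qquad
\theta^\ast = \arg\min_\theta \sum_t \mathcal{L}_v\big(\mathcal{R}(x_t),\, I_t\big)
$$

In this reading only $\theta$ is optimized. Because both the solver and the renderer are differentiable, the gradient of $\mathcal{L}_v$ can flow from the rendered images back to the residual correction.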

Abstract

While humans effortlessly discern intrinsic dynamics and adapt to new scenarios, modern AI systems often struggle. Current methods for visual grounding of dynamics either use pure neural-network-based simulators (black box), which may violate physical laws, or traditional physical simulators (white box), which rely on expert-defined equations that may not fully capture actual dynamics. We propose the Neural Material Adaptor (NeuMA), which integrates existing physical laws with learned corrections, facilitating accurate learning of actual dynamics while maintaining the generalizability and interpretability of physical priors. Additionally, we propose Particle-GS, a particle-driven 3D Gaussian Splatting variant that bridges simulation and observed images, allowing image gradients to be back-propagated to optimize the simulator. Comprehensive experiments on various dynamics, evaluated in terms of grounded particle accuracy, dynamic rendering quality, and generalization ability, demonstrate that NeuMA can accurately capture intrinsic dynamics.
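The idea of a learned correction on top of a fixed prior can be pictured with a minimal NumPy sketch of a low-rank residual on a single linear layer. This is an illustration under stated assumptions, not the authors' implementation: the class name `LowRankAdaptedLinear`, the 9-dimensional input and output (a flattened 3×3 tensor), the rank, and the zero initialization of one factor are all hypothetical choices made here.

```python
import numpy as np


class LowRankAdaptedLinear:
    """Frozen prior weight w0 plus a trainable low-rank residual b @ a.

    Hypothetical stand-in for "prior + learned correction":
    w0 plays the role of the expert prior, b @ a the residual.
    """

    def __init__(self, w0, rank, seed=0):
        rng = np.random.default_rng(seed)
        out_dim, in_dim = w0.shape
        self.w0 = w0  # frozen prior weights, never updated
        # Low-rank factors; b starts at zero so the residual is initially 0
        # and the adapted layer reproduces the prior exactly.
        self.a = rng.normal(scale=0.01, size=(rank, in_dim))
        self.b = np.zeros((out_dim, rank))

    def delta(self):
        """Residual weight matrix; its rank is at most `rank`."""
        return self.b @ self.a

    def __call__(self, x):
        # x has shape (batch, in_dim); output has shape (batch, out_dim).
        return x @ (self.w0 + self.delta()).T


# Assumed setup: 9 inputs and 9 outputs, e.g. a flattened 3x3 tensor.
rng = np.random.default_rng(1)
w0 = rng.normal(size=(9, 9))
layer = LowRankAdaptedLinear(w0, rank=2)
x = rng.normal(size=(4, 9))

y_prior = x @ w0.T   # prediction of the prior alone
y_init = layer(x)    # adapted layer before any training
```

With the zero-initialized factor, `y_init` matches `y_prior`, so training starts from the prior and only moves away from it as the low-rank factors are updated. The number of trainable values is `rank * (in_dim + out_dim)` instead of `in_dim * out_dim`.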

Paper Structure

Table of Contents

Figures (15)