Unlocking Efficient Vehicle Dynamics Modeling via Analytic World Models
Asen Nachkov, Danda Pani Paudel, Jan-Nico Zaech, Davide Scaramuzza, Luc Van Gool
TL;DR
This work extends differentiable simulation from policy learning to world modeling by introducing Analytic World Models (AWMs) that predict, prescribe, and counterfact actions in autonomous driving. By embedding three predictor tasks—relative odometry, optimal planners, and inverse optimal state estimation—within an end-to-end differentiable graph built on Waymax, the approach enables efficient learning with backpropagation through the environment dynamics. The AWMs, trained alongside a policy under Analytic Policy Gradients, achieve stronger reactive performance, accurate imagined futures, and useful confidence signals, and they enable model-based action selection via MPC. Overall, the method demonstrates that DiffSim can substantially enhance decision-making beyond reactive control in autonomous driving, with potential for broader, real-time, end-to-end world modeling.
Abstract
Differentiable simulators represent an environment's dynamics as a differentiable function. Within robotics and autonomous driving, this property is used in Analytic Policy Gradients (APG), which relies on backpropagating through the dynamics to train accurate policies for diverse tasks. Here we show that differentiable simulation also has an important role in world modeling, where it can impart predictive, prescriptive, and counterfactual capabilities to an agent. Specifically, we design three novel task setups in which the differentiable dynamics are combined within an end-to-end computation graph not with a policy, but a state predictor. This allows us to learn relative odometry, optimal planners, and optimal inverse states. We collectively call these predictors Analytic World Models (AWMs) and demonstrate how differentiable simulation enables their efficient, end-to-end learning. In autonomous driving scenarios, they have broad applicability and can augment an agent's decision-making beyond reactive control.
