Dynamic Sparsity: Challenging Common Sparsity Assumptions for Learning World Models in Robotic Reinforcement Learning Benchmarks
Muthukumar Pandaram, Jakob Hollenstein, David Drexel, Samuele Tosatto, Antonio Rodríguez-Sánchez, Justus Piater
TL;DR
This work critically evaluates the widely used sparsity priors in learning world-model dynamics for robotic reinforcement learning by analyzing ground-truth transitions in MuJoCo Playground. It leverages Jacobians $J_s$ and $J_a$ to characterize causal structure, state-dependence, and temporal sparsity, revealing that global sparsity is rare while local, state-dependent sparsity emerges in temporally localized blocks (e.g., during contacts). Empirical results show sparsity is not easily captured by naive MLPs, even with Jacobian-guided losses, underscoring the need for models that adapt their causal structure to the current state and time. The findings argue for grounded inductive biases that reflect real-world sparsity patterns to improve generalization and planning in model-based reinforcement learning. Overall, the work highlights the nuanced role of sparsity in world models and points toward architectures capable of dynamic, context-sensitive sparsity control.
Abstract
The use of learned dynamics models, also known as world models, can improve the sample efficiency of reinforcement learning. Recent work suggests that the underlying causal graphs of such dynamics models are sparsely connected, with each of the future state variables depending only on a small subset of the current state variables, and that learning may therefore benefit from sparsity priors. Similarly, temporal sparsity, i.e. sparsely and abruptly changing local dynamics, has also been proposed as a useful inductive bias. In this work, we critically examine these assumptions by analyzing ground-truth dynamics from a set of robotic reinforcement learning environments in the MuJoCo Playground benchmark suite, aiming to determine whether the proposed notions of state and temporal sparsity actually tend to hold in typical reinforcement learning tasks. We study (i) whether the causal graphs of environment dynamics are sparse, (ii) whether such sparsity is state-dependent, and (iii) whether local system dynamics change sparsely. Our results indicate that global sparsity is rare, but instead the tasks show local, state-dependent sparsity in their dynamics and this sparsity exhibits distinct structures, appearing in temporally localized clusters (e.g., during contact events) and affecting specific subsets of state dimensions. These findings challenge common sparsity prior assumptions in dynamics learning, emphasizing the need for grounded inductive biases that reflect the state-dependent sparsity structure of real-world dynamics.
