HyperDynamics: Meta-Learning Object and Agent Dynamics with Hypernetworks
Zhou Xian, Shamit Lal, Hsiao-Yu Tung, Emmanouil Antonios Platanios, Katerina Fragkiadaki
TL;DR
HyperDynamics introduces a dynamics meta-learning framework that generates system-conditioned forward dynamics models on the fly by encoding interaction history and optional visual observations. A hypernetwork maps latent system properties to the weights of a dedicated forward model, enabling per-environment experts without maintaining a large ensemble or online gradient-based adaptation. The approach is demonstrated on object pushing and locomotion, where the generated experts rival standalone ensembles on seen environments and generalize to unseen properties with few-shot information, outperforming fixed-global models and gradient-based meta-learning baselines. The work highlights a multiplicative interaction between inferred system properties and low-dimensional state representations as a key driver of effective adaptation, with implications for scalable, transferable model-based RL.
Abstract
We propose HyperDynamics, a dynamics meta-learning framework that conditions on an agent's interactions with the environment and optionally its visual observations, and generates the parameters of neural dynamics models based on inferred properties of the dynamical system. Physical and visual properties of the environment that are not part of the low-dimensional state yet affect its temporal dynamics are inferred from the interaction history and visual observations, and are implicitly captured in the generated parameters. We test HyperDynamics on a set of object pushing and locomotion tasks. It outperforms existing dynamics models in the literature that adapt to environment variations by learning dynamics over high dimensional visual observations, capturing the interactions of the agent in recurrent state representations, or using gradient-based meta-optimization. We also show our method matches the performance of an ensemble of separately trained experts, while also being able to generalize well to unseen environment variations at test time. We attribute its good performance to the multiplicative interactions between the inferred system properties -- captured in the generated parameters -- and the low-dimensional state representation of the dynamical system.
