Disentanglement by means of action-induced representations
Gorka Muñoz-Gil, Hendrik Poulsen Nautrup, Arunava Majumder, Paulin de Schoulepnikoff, Florian Fürrutter, Marius Krumm, Hans J. Briegel
TL;DR
The paper tackles the challenge of learning disentangled representations in VAEs by introducing action-induced representations (AIR) and the stronger minimal AIR (minAIR) framework, which exploits experimental actions that couple to subsets of latent factors. It proves a disentanglement theorem showing that latent components shared across actions become disentangled, and presents VAIR, a dual-encoder VAE designed to approximate minAIR and realize action-dependent disentanglement. Across abstract, classical, and quantum physics experiments, VAIR yields interpretable latent factors (e.g., $z=(m, q/m)$ in classical dynamics and Bloch-like coordinates in quantum tomography) and consistently outperforms baselines on disentanglement metrics. By linking actions to specific degrees of freedom, the approach enables interpretable representations and paves the way for integrating active experimentation or reinforcement learning to guide discovery.
Abstract
Learning interpretable representations with variational autoencoders (VAEs) is a major goal of representation learning. The main challenge lies in obtaining disentangled representations, where each latent dimension corresponds to a distinct generative factor. This difficulty is fundamentally tied to the inability to perform nonlinear independent component analysis. Here, we introduce the framework of action-induced representations (AIRs) which models representations of physical systems given experiments (or actions) that can be performed on them. We show that, in this framework, we can provably disentangle degrees of freedom w.r.t. their action dependence. We further introduce a variational AIR architecture (VAIR) that can extract AIRs and therefore achieve provable disentanglement where standard VAEs fail. Beyond state representation, VAIR also captures the action dependence of the underlying generative factors, directly linking experiments to the degrees of freedom they influence.
