Implicit Causal Representation Learning via Switchable Mechanisms
Shayan Shirahmad Gale Bagi, Zahra Gharaee, Oliver Schulte, Mark Crowley
TL;DR
This work tackles learning implicit causal representations when ground-truth graphs are unavailable, focusing on soft interventions as a realistic setting. It introduces Augmented Implicit Causal Models with a causal mechanism switch variable $V$ and solution functions that model soft intervention effects via $h_i(v)$, enabling identifiability up to reparameterization under suitable assumptions. A formal identifiability theorem shows equivalence classes collapse when soft interventions are observed and decoders are diffeomorphisms, while training relies on a variational ELBO that jointly infers exogenous variables and the intervention switch. Empirically, ICRL-SM demonstrates improved causal disentanglement and action inference on synthetic data and causal-triplet benchmarks (Epic-Kitchens, ProcTHOR) compared to baselines, with stronger gains in sparser graphs and moderate intervention strengths. The results suggest soft-intervention modeling via a switch mechanism is a promising direction for robust, identifiable causal representation learning in real-world settings.
Abstract
Learning causal representations from observational and interventional data in the absence of known ground-truth graph structures necessitates implicit latent causal representation learning. Implicit learning of causal mechanisms typically involves two categories of interventional data: hard and soft interventions. In real-world scenarios, soft interventions are often more realistic than hard interventions, as the latter require fully controlled environments. Unlike hard interventions, which directly force changes in a causal variable, soft interventions exert influence indirectly by affecting the causal mechanism. However, the subtlety of soft interventions impose several challenges for learning causal models. One challenge is that soft intervention's effects are ambiguous, since parental relations remain intact. In this paper, we tackle the challenges of learning causal models using soft interventions while retaining implicit modelling. We propose ICLR-SM, which models the effects of soft interventions by employing a causal mechanism switch variable designed to toggle between different causal mechanisms. In our experiments, we consistently observe improved learning of identifiable, causal representations, compared to baseline approaches.
