Standardizing Structural Causal Models
Weronika Ormaniec, Scott Sussex, Lars Lorch, Bernhard Schölkopf, Andreas Krause
TL;DR
The paper tackles the problem that synthetic data from structural causal models (SCMs) used for benchmarking structure learning contain artifacts, notably variance buildup and increasing R^2 correlations along causal order. It introduces internally-standardized SCMs (iSCMs), which standardize variables during the generative process, proving they avoid Var-sortability and largely avoid R^2-sortability, and analyzing their identifiability properties. The authors establish theoretical results showing that linear iSCMs do not deterministically collapse with graph depth, while standardized SCMs can be partially identifiable from observational data under certain weight assumptions; they contrast these with non-identifiability in linear Gaussian iSCMs for forests. Empirically, iSCMs remove exploitable covariance artifacts and still permit nontrivial structure learning by standard algorithms, suggesting iSCMs as a robust benchmarking tool and a promising modeling framework beyond benchmarking. The work provides code for reproducibility and situates iSCMs as stable, scale-free, unit-consistent models with potential broader applicability in causal inference.
Abstract
Synthetic datasets generated by structural causal models (SCMs) are commonly used for benchmarking causal structure learning algorithms. However, the variances and pairwise correlations in SCM data tend to increase along the causal ordering. Several popular algorithms exploit these artifacts, possibly leading to conclusions that do not generalize to real-world settings. Existing metrics like $\operatorname{Var}$-sortability and $\operatorname{R^2}$-sortability quantify these patterns, but they do not provide tools to remedy them. To address this, we propose internally-standardized structural causal models (iSCMs), a modification of SCMs that introduces a standardization operation at each variable during the generative process. By construction, iSCMs are not $\operatorname{Var}$-sortable. We also find empirical evidence that they are mostly not $\operatorname{R^2}$-sortable for commonly-used graph families. Moreover, contrary to the post-hoc standardization of data generated by standard SCMs, we prove that linear iSCMs are less identifiable from prior knowledge on the weights and do not collapse to deterministic relationships in large systems, which may make iSCMs a useful model in causal inference beyond the benchmarking problem studied here. Our code is publicly available at: https://github.com/werkaaa/iscm.
