Boosting Causal Additive Models
Maximilian Kertel, Nadja Klein
TL;DR
The paper tackles causal discovery under additive noise models by learning a CAM-based DAG through boosting. It introduces a regression-based score over topological orderings and proves that $L^2$-boosting with early stopping yields a consistent ordering, even under misspecification. A high-dimensional extension uses component-wise boosting with an AIC-driven stopping rule and pruning to scale to large graphs; this variant remains competitive with state-of-the-art methods in simulations. Overall, the work provides a principled, tunable framework that links regression-consistency, variance estimation, and graph identifiability to robust causal-order discovery in both low- and high-dimensional regimes.
Abstract
We present a boosting-based method to learn additive Structural Equation Models (SEMs) from observational data, with a focus on the theoretical aspects of determining the causal order among variables. We introduce a family of score functions based on arbitrary regression techniques, for which we establish necessary conditions to consistently favor the true causal ordering. Our analysis reveals that boosting with early stopping meets these criteria and thus offers a consistent score function for causal orderings. To address the challenges posed by high-dimensional data sets, we adapt our approach through a component-wise gradient descent in the space of additive SEMs. Our simulation study underlines our theoretical results for lower dimensions and demonstrates that our high-dimensional adaptation is competitive with state-of-the-art methods. In addition, it exhibits robustness with respect to the choice of the hyperparameters making the procedure easy to tune.
