Enabling Causal Discovery in Post-Nonlinear Models with Normalizing Flows
Nu Hoang, Bao Duong, Thin Nguyen
TL;DR
This work tackles causal discovery under Post-Nonlinear (PNL) models by enforcing the crucial invertibility constraint through normalizing flows. It introduces CAF-PoNo, which uses a CDF flow to model the invertible function $g^{-1}$ and neural nets for $h$, enabling maximum-likelihood estimation that recovers the noise and facilitates cause-effect identification via HSIC-based independence tests. The framework extends naturally to multivariate causal discovery through a two-stage process of causal ordering identification and edge pruning, with LCIT-based conditional independence tests providing scalable scalability. Empirical results on synthetic and real data show CAF-PoNo achieving state-of-the-art performance in both bivariate and multivariate settings, including faster runtimes and robust performance under model misspecification. Overall, CAF-PoNo offers a principled, scalable, and invertibility-aware approach to causal discovery in PNL models with practical impact for complex causal systems.
Abstract
Post-nonlinear (PNL) causal models stand out as a versatile and adaptable framework for modeling intricate causal relationships. However, accurately capturing the invertibility constraint required in PNL models remains challenging in existing studies. To address this problem, we introduce CAF-PoNo (Causal discovery via Normalizing Flows for Post-Nonlinear models), harnessing the power of the normalizing flows architecture to enforce the crucial invertibility constraint in PNL models. Through normalizing flows, our method precisely reconstructs the hidden noise, which plays a vital role in cause-effect identification through statistical independence testing. Furthermore, the proposed approach exhibits remarkable extensibility, as it can be seamlessly expanded to facilitate multivariate causal discovery via causal order identification, empowering us to efficiently unravel complex causal relationships. Extensive experimental evaluations on both simulated and real datasets consistently demonstrate that the proposed method outperforms several state-of-the-art approaches in both bivariate and multivariate causal discovery tasks.
