A Skewness-Based Criterion for Addressing Heteroscedastic Noise in Causal Discovery
Yingyu Lin, Yuxing Huang, Wenqin Liu, Haoran Deng, Ignavier Ng, Kun Zhang, Mingming Gong, Yi-An Ma, Biwei Huang
TL;DR
The paper addresses causal discovery under heteroscedastic symmetric noise by introducing a score-based skewness criterion, SkewScore, that distinguishes causal from anti-causal directions without requiring exogenous-noise extraction. The approach extends to multivariate settings and leverages a two-phase, order-based DAG search to reduce conditional-independence tests to a polynomial count. Theoretical identifiability results show the score component along the cause is skewed while the effect component is not, and empirical studies demonstrate superior or competitive performance against state-of-the-art baselines, including in the presence of latent confounding. This work offers a scalable, noise-agnostic tool for causal direction inference with heteroscedastic noise, with promising applicability to real-world data and avenues for extension to higher dimensions and more complex latent structures.
Abstract
Real-world data often violates the equal-variance assumption (homoscedasticity), making it essential to account for heteroscedastic noise in causal discovery. In this work, we explore heteroscedastic symmetric noise models (HSNMs), where the effect $Y$ is modeled as $Y = f(X) + σ(X)N$, with $X$ as the cause and $N$ as independent noise following a symmetric distribution. We introduce a novel criterion for identifying HSNMs based on the skewness of the score (i.e., the gradient of the log density) of the data distribution. This criterion establishes a computationally tractable measurement that is zero in the causal direction but nonzero in the anticausal direction, enabling the causal direction discovery. We extend this skewness-based criterion to the multivariate setting and propose SkewScore, an algorithm that handles heteroscedastic noise without requiring the extraction of exogenous noise. We also conduct a case study on the robustness of SkewScore in a bivariate model with a latent confounder, providing theoretical insights into its performance. Empirical studies further validate the effectiveness of the proposed method.
