A Skewness-Based Criterion for Addressing Heteroscedastic Noise in Causal Discovery

Yingyu Lin; Yuxing Huang; Wenqin Liu; Haoran Deng; Ignavier Ng; Kun Zhang; Mingming Gong; Yi-An Ma; Biwei Huang

A Skewness-Based Criterion for Addressing Heteroscedastic Noise in Causal Discovery

Yingyu Lin, Yuxing Huang, Wenqin Liu, Haoran Deng, Ignavier Ng, Kun Zhang, Mingming Gong, Yi-An Ma, Biwei Huang

TL;DR

The paper addresses causal discovery under heteroscedastic symmetric noise by introducing a score-based skewness criterion, SkewScore, that distinguishes causal from anti-causal directions without requiring exogenous-noise extraction. The approach extends to multivariate settings and leverages a two-phase, order-based DAG search to reduce conditional-independence tests to a polynomial count. Theoretical identifiability results show the score component along the cause is skewed while the effect component is not, and empirical studies demonstrate superior or competitive performance against state-of-the-art baselines, including in the presence of latent confounding. This work offers a scalable, noise-agnostic tool for causal direction inference with heteroscedastic noise, with promising applicability to real-world data and avenues for extension to higher dimensions and more complex latent structures.

Abstract

Real-world data often violates the equal-variance assumption (homoscedasticity), making it essential to account for heteroscedastic noise in causal discovery. In this work, we explore heteroscedastic symmetric noise models (HSNMs), where the effect $Y$ is modeled as $Y = f(X) + σ(X)N$, with $X$ as the cause and $N$ as independent noise following a symmetric distribution. We introduce a novel criterion for identifying HSNMs based on the skewness of the score (i.e., the gradient of the log density) of the data distribution. This criterion establishes a computationally tractable measurement that is zero in the causal direction but nonzero in the anticausal direction, enabling the causal direction discovery. We extend this skewness-based criterion to the multivariate setting and propose SkewScore, an algorithm that handles heteroscedastic noise without requiring the extraction of exogenous noise. We also conduct a case study on the robustness of SkewScore in a bivariate model with a latent confounder, providing theoretical insights into its performance. Empirical studies further validate the effectiveness of the proposed method.

A Skewness-Based Criterion for Addressing Heteroscedastic Noise in Causal Discovery

TL;DR

Abstract

is modeled as

, with

as the cause and

as independent noise following a symmetric distribution. We introduce a novel criterion for identifying HSNMs based on the skewness of the score (i.e., the gradient of the log density) of the data distribution. This criterion establishes a computationally tractable measurement that is zero in the causal direction but nonzero in the anticausal direction, enabling the causal direction discovery. We extend this skewness-based criterion to the multivariate setting and propose SkewScore, an algorithm that handles heteroscedastic noise without requiring the extraction of exogenous noise. We also conduct a case study on the robustness of SkewScore in a bivariate model with a latent confounder, providing theoretical insights into its performance. Empirical studies further validate the effectiveness of the proposed method.

Paper Structure (22 sections, 7 theorems, 27 equations, 6 figures, 8 tables, 1 algorithm)

This paper contains 22 sections, 7 theorems, 27 equations, 6 figures, 8 tables, 1 algorithm.

Introduction
Related Work
Problem Setup and Preliminaries
Heteroscedastic Symmetric Noise Models (HSNM)
Score Function, Score Matching, and Skewness
Facts
Causal Discovery with the Skewness-Based Criterion
Identification with the Skewness-Based Criterion
Algorithm
Case Study: Robustness to the Latent Confounder in a Bivariate Setting
Experiments
Conclusion
Additional Discussion on the $\textit{SkewScore}$ meausure
Discussion on Model Assumption \ref{['assmp:E_partial_x_neq_0']}
Discussion on the Skewness Definition
...and 7 more sections

Key Result

Theorem 1

Let $(X,Y)$ follows the heteroscedastic symmetric noise model given by Eq. eq:HSNM. Let $p(x,y)$ denote the joint distribution of $(X,Y)$. The following asymmetry holds.

Figures (6)

Figure 1: Identifiability of the heteroscedastic symmetric noise model (HSNM). For the causal direction $X \rightarrow Y$ (left), the conditional distribution $p(y|x)$ is symmetric for any $x$ (top right), and the variance of $p(y|x)$ varies with different values of $x$ due to heteroscedasticity. In contrast, $p(x|y)$ is asymmetric for some $y$ (bottom right).
Figure 2: Three-variable HSNM model with latent confounder given by Eq. \ref{['eq:latent_HSNM']}.
Figure 3: Accuracy of causal direction estimation across different data generation processes for (a) bivariate heteroscedastic noise models, and (b) latent-confounded triangular heteroscedastic noise models. Results are averaged over 100 independent runs.
Figure 4: Topological order divergence and structural Hamming distance (SHD) across different number of variables (the dimension $d$). Lower values indicate better performance for both metrics.
Figure 5: Score function (a vector field) visualized on data points with projections on the $x$-axis and $y$-axis. For clarity, only the projections for one data point are shown (highlighted in orange and blue). Theorem \ref{['thm:identifiability']} examines the skewness of these projections across the data distribution.
...and 1 more figures

Theorems & Definitions (15)

Definition 1
Theorem 1
Corollary 2
Proposition 3
Example 4
Proposition 5
Remark 6
Example 7
Example 8
Example 9
...and 5 more

A Skewness-Based Criterion for Addressing Heteroscedastic Noise in Causal Discovery

TL;DR

Abstract

A Skewness-Based Criterion for Addressing Heteroscedastic Noise in Causal Discovery

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (15)