On the Parameter Identifiability of Partially Observed Linear Causal Models

Xinshuai Dong; Ignavier Ng; Biwei Huang; Yuewen Sun; Songyao Jin; Roberto Legaspi; Peter Spirtes; Kun Zhang

On the Parameter Identifiability of Partially Observed Linear Causal Models

Xinshuai Dong, Ignavier Ng, Biwei Huang, Yuewen Sun, Songyao Jin, Roberto Legaspi, Peter Spirtes, Kun Zhang

TL;DR

This work tackles parameter identifiability in partially observed linear causal models that include latent variables, addressing identification of edge coefficients beyond observed-variable edges. It develops a theory identifying three indeterminacies, provides graphical sufficiency conditions for structure and parameter identifiability (including atomic covers and trek-based criteria), and proposes a likelihood-based estimation method with a trek-rule covariance parameterization to handle variance indeterminacy. Empirical results on synthetic data (GS and OT regimes) and real-world data (Big Five) demonstrate accurate recovery of parameters up to the stated indeterminacies and robust performance under mild misspecification. The paper advances practical causal modeling with latent variables by delivering both identifiability guarantees and a scalable estimation approach suitable for real datasets.

Abstract

Linear causal models are important tools for modeling causal dependencies and yet in practice, only a subset of the variables can be observed. In this paper, we examine the parameter identifiability of these models by investigating whether the edge coefficients can be recovered given the causal structure and partially observed data. Our setting is more general than that of prior research - we allow all variables, including both observed and latent ones, to be flexibly related, and we consider the coefficients of all edges, whereas most existing works focus only on the edges between observed variables. Theoretically, we identify three types of indeterminacy for the parameters in partially observed linear causal models. We then provide graphical conditions that are sufficient for all parameters to be identifiable and show that some of them are provably necessary. Methodologically, we propose a novel likelihood-based parameter estimation method that addresses the variance indeterminacy of latent variables in a specific way and can asymptotically recover the underlying parameters up to trivial indeterminacy. Empirical studies on both synthetic and real-world datasets validate our identifiability theory and the effectiveness of the proposed method in the finite-sample regime. Code: https://github.com/dongxinshuai/scm-identify.

On the Parameter Identifiability of Partially Observed Linear Causal Models

TL;DR

Abstract

Paper Structure (40 sections, 19 theorems, 37 equations, 11 figures, 3 tables)

This paper contains 40 sections, 19 theorems, 37 equations, 11 figures, 3 tables.

Introduction and Related Work
Preliminaries
Problem Setting
Framework Comparison
Identifiability Theory
Definition of Parameter Identifiability and Indeterminacy
Graphical Condition for Structure Identifiability
Identifiability of Parameters
Parameter Estimation Method
Objective
Parameterization Trick of Covariance Matrix
Experiments
Setting and Evaluation Metric
Synthetic Data Performance
Misspecification Behavior
...and 25 more sections

Key Result

Theorem 1

Consider a model that follows Def. definition:polcm with number of latent variables $m\geq1$ and $\theta=(F_{\mathbf{L}\mathbf{L}},F_{\mathbf{L}\mathbf{X}},F_{\mathbf{X}\mathbf{L}},F_{\mathbf{X}\mathbf{X}},\Omega_{\epsilon_\mathbf{L}},\Omega_{\epsilon_\mathbf{X}})$. Let $\Lambda$ be any invertible d Then, $\tilde{\theta}$ and $\theta$ entail the same observations, i.e., $\tilde{\Sigma}_{\mathbf{X}

Figures (11)

Figure 1: Illustrations of the advantage of our framework. Within our framework, it can be shown that $\mathcal{G}_{1}$'s parameters can be identified (up to sign) while $\mathcal{G}_{2}$'s cannot. In contrast, the latent projection framework cannot even differentiate $\mathcal{G}_{1}$ from $\mathcal{G}_{2}$ as they share the same ADMG (c) after projection. Furthermore, with ADMG, any edge coefficient that involves a latent variable cannot be considered.
Figure 2: Illustrative examples to show that the graphical condition for structure-identifiability and parameter-identifiability could be very different.
Figure 3: An illustrative graph that satisfies the conditions for structure-identifiability. At the same time, it also satisfies the condition for parameter identifiability - given the structure and $\Sigma_{\mathbf{X}}$, all the parameters are identifiable only up to group sign indeterminacy.
Figure 4: Estimated edge coefficients by the proposed method for Big Five human personality dataset. Variables whose name starts with "L" are latent variables while the others are observed variables.
Figure 5: A simple graph that satisfies conditions in Theorem \ref{['thm:identifiability']}, as for each atomic cover with one latent variable, it has no observed variable.
...and 6 more figures

Theorems & Definitions (51)

Definition 1: Partially Observed Linear Causal Models
Definition 2: Identifiability of Parameters of Partially Observed Linear Causal Models
Theorem 1: Indeterminacy of Scaling of $\Omega_{\epsilon_\set{L}}$
Remark 1: Implication of Theorem \ref{['thm:indeterminacy_scaling_variance']}
Theorem 2: Group Sign Indeterminacy
Remark 2: Remark on Theorem \ref{['thm:indeterminacy_group_sign']}
Example 1: Example for Group Sign Indeterminacy and Generic Identifiability
Definition 3: Orthogonal Transformation Indeterminacy
Definition 4: Atomic Cover dong2023versatile
Example 2: Example of Atomic Cover
...and 41 more

On the Parameter Identifiability of Partially Observed Linear Causal Models

TL;DR

Abstract

On the Parameter Identifiability of Partially Observed Linear Causal Models

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (11)

Theorems & Definitions (51)