Table of Contents
Fetching ...

On the Parameter Identifiability of Partially Observed Linear Causal Models

Xinshuai Dong, Ignavier Ng, Biwei Huang, Yuewen Sun, Songyao Jin, Roberto Legaspi, Peter Spirtes, Kun Zhang

TL;DR

This work tackles parameter identifiability in partially observed linear causal models that include latent variables, addressing identification of edge coefficients beyond observed-variable edges. It develops a theory identifying three indeterminacies, provides graphical sufficiency conditions for structure and parameter identifiability (including atomic covers and trek-based criteria), and proposes a likelihood-based estimation method with a trek-rule covariance parameterization to handle variance indeterminacy. Empirical results on synthetic data (GS and OT regimes) and real-world data (Big Five) demonstrate accurate recovery of parameters up to the stated indeterminacies and robust performance under mild misspecification. The paper advances practical causal modeling with latent variables by delivering both identifiability guarantees and a scalable estimation approach suitable for real datasets.

Abstract

Linear causal models are important tools for modeling causal dependencies and yet in practice, only a subset of the variables can be observed. In this paper, we examine the parameter identifiability of these models by investigating whether the edge coefficients can be recovered given the causal structure and partially observed data. Our setting is more general than that of prior research - we allow all variables, including both observed and latent ones, to be flexibly related, and we consider the coefficients of all edges, whereas most existing works focus only on the edges between observed variables. Theoretically, we identify three types of indeterminacy for the parameters in partially observed linear causal models. We then provide graphical conditions that are sufficient for all parameters to be identifiable and show that some of them are provably necessary. Methodologically, we propose a novel likelihood-based parameter estimation method that addresses the variance indeterminacy of latent variables in a specific way and can asymptotically recover the underlying parameters up to trivial indeterminacy. Empirical studies on both synthetic and real-world datasets validate our identifiability theory and the effectiveness of the proposed method in the finite-sample regime. Code: https://github.com/dongxinshuai/scm-identify.

On the Parameter Identifiability of Partially Observed Linear Causal Models

TL;DR

This work tackles parameter identifiability in partially observed linear causal models that include latent variables, addressing identification of edge coefficients beyond observed-variable edges. It develops a theory identifying three indeterminacies, provides graphical sufficiency conditions for structure and parameter identifiability (including atomic covers and trek-based criteria), and proposes a likelihood-based estimation method with a trek-rule covariance parameterization to handle variance indeterminacy. Empirical results on synthetic data (GS and OT regimes) and real-world data (Big Five) demonstrate accurate recovery of parameters up to the stated indeterminacies and robust performance under mild misspecification. The paper advances practical causal modeling with latent variables by delivering both identifiability guarantees and a scalable estimation approach suitable for real datasets.

Abstract

Linear causal models are important tools for modeling causal dependencies and yet in practice, only a subset of the variables can be observed. In this paper, we examine the parameter identifiability of these models by investigating whether the edge coefficients can be recovered given the causal structure and partially observed data. Our setting is more general than that of prior research - we allow all variables, including both observed and latent ones, to be flexibly related, and we consider the coefficients of all edges, whereas most existing works focus only on the edges between observed variables. Theoretically, we identify three types of indeterminacy for the parameters in partially observed linear causal models. We then provide graphical conditions that are sufficient for all parameters to be identifiable and show that some of them are provably necessary. Methodologically, we propose a novel likelihood-based parameter estimation method that addresses the variance indeterminacy of latent variables in a specific way and can asymptotically recover the underlying parameters up to trivial indeterminacy. Empirical studies on both synthetic and real-world datasets validate our identifiability theory and the effectiveness of the proposed method in the finite-sample regime. Code: https://github.com/dongxinshuai/scm-identify.
Paper Structure (40 sections, 19 theorems, 37 equations, 11 figures, 3 tables)

This paper contains 40 sections, 19 theorems, 37 equations, 11 figures, 3 tables.

Key Result

Theorem 1

Consider a model that follows Def. definition:polcm with number of latent variables $m\geq1$ and $\theta=(F_{\mathbf{L}\mathbf{L}},F_{\mathbf{L}\mathbf{X}},F_{\mathbf{X}\mathbf{L}},F_{\mathbf{X}\mathbf{X}},\Omega_{\epsilon_\mathbf{L}},\Omega_{\epsilon_\mathbf{X}})$. Let $\Lambda$ be any invertible d Then, $\tilde{\theta}$ and $\theta$ entail the same observations, i.e., $\tilde{\Sigma}_{\mathbf{X}

Figures (11)

  • Figure 1: Illustrations of the advantage of our framework. Within our framework, it can be shown that $\mathcal{G}_{1}$'s parameters can be identified (up to sign) while $\mathcal{G}_{2}$'s cannot. In contrast, the latent projection framework cannot even differentiate $\mathcal{G}_{1}$ from $\mathcal{G}_{2}$ as they share the same ADMG (c) after projection. Furthermore, with ADMG, any edge coefficient that involves a latent variable cannot be considered.
  • Figure 2: Illustrative examples to show that the graphical condition for structure-identifiability and parameter-identifiability could be very different.
  • Figure 3: An illustrative graph that satisfies the conditions for structure-identifiability. At the same time, it also satisfies the condition for parameter identifiability - given the structure and $\Sigma_{\mathbf{X}}$, all the parameters are identifiable only up to group sign indeterminacy.
  • Figure 4: Estimated edge coefficients by the proposed method for Big Five human personality dataset. Variables whose name starts with "L" are latent variables while the others are observed variables.
  • Figure 5: A simple graph that satisfies conditions in Theorem \ref{['thm:identifiability']}, as for each atomic cover with one latent variable, it has no observed variable.
  • ...and 6 more figures

Theorems & Definitions (51)

  • Definition 1: Partially Observed Linear Causal Models
  • Definition 2: Identifiability of Parameters of Partially Observed Linear Causal Models
  • Theorem 1: Indeterminacy of Scaling of $\Omega_{\epsilon_\set{L}}$
  • Remark 1: Implication of Theorem \ref{['thm:indeterminacy_scaling_variance']}
  • Theorem 2: Group Sign Indeterminacy
  • Remark 2: Remark on Theorem \ref{['thm:indeterminacy_group_sign']}
  • Example 1: Example for Group Sign Indeterminacy and Generic Identifiability
  • Definition 3: Orthogonal Transformation Indeterminacy
  • Definition 4: Atomic Cover dong2023versatile
  • Example 2: Example of Atomic Cover
  • ...and 41 more