On the Complexity of Identification in Linear Structural Causal Models

Julian Dörfler; Benito van der Zander; Markus Bläser; Maciej Liskiewicz

On the Complexity of Identification in Linear Structural Causal Models

Julian Dörfler, Benito van der Zander, Markus Bläser, Maciej Liskiewicz

TL;DR

This work studies the computational complexity of identifying parameters in linear structural causal models (linear SCMs). It presents a sound, complete polynomial-space algorithm for generic identifiability, yielding an exponential-time algorithm that improves on previous double-exponential Gröbner-basis approaches, and shows that generic identifiability lies in the $\exists\forall\mathbb{R}$ (and $\forall\exists\mathbb{R}$) regime. It further proves that numerical identifiability is $\forall\mathbb{R}$-hard (in particular $co\mathsf{NP}$-hard) while remaining decidable in polynomial space, and extends the results to cyclic graphs. The paper thus sharpens the complexity landscape for identifiability in linear SCMs, providing both upper bounds via real-algebraic techniques and hardness results, with potential impact on algorithmic causal inference and related symbolic approaches.

Abstract

Learning the unknown causal parameters of a linear structural causal model is a fundamental task in causal analysis. The task, known as the problem of identification, asks to estimate the parameters of the model from a combination of assumptions on the graphical structure of the model and observational data, represented as a non-causal covariance matrix. In this paper, we give a new sound and complete algorithm for generic identification which runs in polynomial space. By standard simulation results, this algorithm has exponential running time which vastly improves the state-of-the-art double exponential time method using a Gröbner basis approach. The paper also presents evidence that parameter identification is computationally hard in general. In particular, we prove, that the task asking whether, for a given feasible correlation matrix, there are exactly one or two or more parameter sets explaining the observed matrix, is hard for $\forall R$, the co-class of the existential theory of the reals. In particular, this problem is $coNP$-hard. To our best knowledge, this is the first hardness result for some notion of identifiability.

On the Complexity of Identification in Linear Structural Causal Models

TL;DR

(and

) regime. It further proves that numerical identifiability is

-hard (in particular

-hard) while remaining decidable in polynomial space, and extends the results to cyclic graphs. The paper thus sharpens the complexity landscape for identifiability in linear SCMs, providing both upper bounds via real-algebraic techniques and hardness results, with potential impact on algorithmic causal inference and related symbolic approaches.

Abstract

, the co-class of the existential theory of the reals. In particular, this problem is

-hard. To our best knowledge, this is the first hardness result for some notion of identifiability.

Paper Structure (19 sections, 19 theorems, 21 equations, 4 figures)

This paper contains 19 sections, 19 theorems, 21 equations, 4 figures.

Introduction
Preliminaries
The Problems of Identification in Linear Causal Models
The (Existential) Theory of the Reals
Universal Quantification.
Complexity of $\mathsf{\exists\mathbb{R}}$ and $\mathsf{\forall\mathbb{R}}$.
Finding another solution
Hardness of Numerical Identifiability
Upper Bound for Numerical Identifiability
Generic Identifiability is in PSPACE
A note on cyclic graphs
Conclusion
Edge Identifiability
Hardness of Numerical Edge Identifiability
Upper Bound for Numerical Edge Identifiability
...and 4 more sections

Key Result

Lemma 1

The system above has the following solutions: In particular, the system always has a solution. It has more than one solution iff the original $\mathrm{QUAD}$-instance is satisfiable.

Figures (4)

Figure 1: An IV example.
Figure 2: Methods for generic identification in linear SCMs. An arrow from methods $A \to B$ means $B$ subsume methods $A$, i.e., any instance that can be identified by any of methods $A$ can be identified by method $B$ and this inclusion is proper. Green boxes mean there exist polynomial-time algorithms to apply the method, a red box means no such algorithm is known or the method has been proven to be NP-hard. The blue box includes the complete methods.
Figure 3: Left: A single missing edge on the top layer. Right: The gadget storing the value of each variable. $\lambda_{i,r}$ corresponds to $x_i$.
Figure 4: Left: Gadget for affine linear constraints. Right: Gadget for multiplicative constraints

Theorems & Definitions (33)

Definition 1: Generic Identifiability, foygel2012half
Definition 2: Numerical Identifiability
Lemma 1
proof
Corollary 1
Theorem 2
Lemma 3
proof : Proof of Theorem \ref{['thm:num_ident_hardness']}
Lemma 5
proof
...and 23 more

On the Complexity of Identification in Linear Structural Causal Models

TL;DR

Abstract

On the Complexity of Identification in Linear Structural Causal Models

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (33)