Root Cause Analysis of Outliers in Unknown Cyclic Graphs

Daniela Schkoda; Dominik Janzing

Root Cause Analysis of Outliers in Unknown Cyclic Graphs

Daniela Schkoda, Dominik Janzing

TL;DR

We address root cause analysis in unknown cyclic causal graphs under linear SEMs by leveraging an invariant precision-matrix transformation. The key idea is to apply the precision matrix $\Theta_{XX}$ to the anomalous observation $\tilde{x}$ to reveal the root causes and their cycle-parents, enabling a shortlisting that remains valid without a known graph. The framework extends to latent variables via projection and zig-zag latent paths, and is implemented with false discovery rate control (FDRC) using e-values for robust selection. Empirical results on simulations and real cloud data (e.g., PetShop) demonstrate improved accuracy and scalability over prior DAG-based approaches, highlighting practical applicability for microservice RCA with cycles.

Abstract

We study the propagation of outliers in cyclic causal graphs with linear structural equations, tracing them back to one or several "root cause" nodes. We show that it is possible to identify a short list of potential root causes provided that the perturbation is sufficiently strong and propagates according to the same structural equations as in the normal mode. This shortlist consists of the true root causes together with those of its parents lying on a cycle with the root cause. Notably, our method does not require prior knowledge of the causal graph.

Root Cause Analysis of Outliers in Unknown Cyclic Graphs

TL;DR

We address root cause analysis in unknown cyclic causal graphs under linear SEMs by leveraging an invariant precision-matrix transformation. The key idea is to apply the precision matrix

to the anomalous observation

to reveal the root causes and their cycle-parents, enabling a shortlisting that remains valid without a known graph. The framework extends to latent variables via projection and zig-zag latent paths, and is implemented with false discovery rate control (FDRC) using e-values for robust selection. Empirical results on simulations and real cloud data (e.g., PetShop) demonstrate improved accuracy and scalability over prior DAG-based approaches, highlighting practical applicability for microservice RCA with cycles.

Root Cause Analysis of Outliers in Unknown Cyclic Graphs

TL;DR

Abstract

Root Cause Analysis of Outliers in Unknown Cyclic Graphs

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (12)

Theorems & Definitions (12)