Robust Model Selection of Gaussian Graphical Models
Abrar Zahin, Rajasekhar Anguluri, Lalitha Sankar, Oliver Kosut, Gautam Dasarathy
TL;DR
This work tackles robust model selection for Gaussian graphical models under unknown additive noise, showing that exact graph recovery is impossible while identifiability can be achieved up to an equivalence class captured by an articulated set tree (AST). It introduces NoMAD, an ancestor-discovery algorithm that reconstructs the AST and recovers the equivalence class [G], with rigorous population and finite-sample guarantees. The approach leverages a Test Identical Ancestor (TIA), distance extensions to latent ancestors, and clustering of leaf/internal blocks to reveal local and global graph structure in noisy settings. Experimental results on synthetic and IEEE 33-bus networks demonstrate NoMAD’s robustness to noise and its advantage over traditional methods like GLASSO in preserving the correct equivalence class. The findings enable meaningful partial structure recovery useful for applications in power grids, neuroscience, and complex networks where measurement noise is inevitable.
Abstract
In Gaussian graphical model selection, noise-corrupted samples present significant challenges. It is known that even minimal amounts of noise can obscure the underlying structure, leading to fundamental identifiability issues. A recent line of work addressing this "robust model selection" problem narrows its focus to tree-structured graphical models. Even within this specific class of models, exact structure recovery is shown to be impossible. However, several algorithms have been developed that are known to provably recover the underlying tree-structure up to an (unavoidable) equivalence class. In this paper, we extend these results beyond tree-structured graphs. We first characterize the equivalence class up to which general graphs can be recovered in the presence of noise. Despite the inherent ambiguity (which we prove is unavoidable), the structure that can be recovered reveals local clustering information and global connectivity patterns in the underlying model. Such information is useful in a range of real-world problems, including power grids, social networks, protein-protein interactions, and neural structures. We then propose an algorithm which provably recovers the underlying graph up to the identified ambiguity. We further provide finite sample guarantees in the high-dimensional regime for our algorithm and validate our results through numerical simulations.
