Methods for Recovering Conditional Independence Graphs: A Survey
Harsh Shrivastava, Urszula Chajewska
TL;DR
This paper addresses recovering conditional-independence graphs from data, focusing on undirected graphs whose edges encode partial correlations between features. It surveys two core formulations: directly estimating partial correlations via regression and via sparse precision matrices with Graphical Lasso and its variants, including deep unfolding models GLAD/uGLAD and tensorized approaches such as TeraLasso and SyGlasso. To support heterogeneous data, it reviews covariance constructions for mixed datatypes and their impact on edge inference, and it discusses applications across life sciences, medical informatics, finance, and time-series, highlighting the potential for integration with deep learning and tensor methods. The work provides a consolidated taxonomy, practical implementation guidance, and a roadmap to promote wider adoption of CI-graph recovery as a mainstream data-exploration tool.
Abstract
Conditional Independence (CI) graphs are a type of probabilistic graphical models that are primarily used to gain insights about feature relationships. Each edge represents the partial correlation between the connected features which gives information about their direct dependence. In this survey, we list out different methods and study the advances in techniques developed to recover CI graphs. We cover traditional optimization methods as well as recently developed deep learning architectures along with their recommended implementations. To facilitate wider adoption, we include preliminaries that consolidate associated operations, for example techniques to obtain covariance matrix for mixed datatypes.
