Regression graphs and sparsity-inducing reparametrizations
Jakub Rybak, Heather Battey, Karthik Bharath
TL;DR
The paper shows that population-level sparsity can arise from sparsity-inducing reparametrizations of covariance structures that respect positive definiteness, linking four parametrizations to the Iwasawa decomposition and chain-graph models. It develops exact and approximate-zero characterizations across $ ext{Sigma}_{pd}$, $ ext{Sigma}_{o}$, $ ext{Sigma}_{lt}$, and $ ext{Sigma}_{ltu}$, and provides a unified, graph-based interpretation via causal ordering and path weights. It then devises estimation strategies, proving spectral-norm consistency for sparse-logarithmic estimators under high-dimensional scaling, and demonstrates practical benefits through simulations and a leukemia data example, where reparametrization improves classification performance. Overall, the work clarifies when and why sparsity on transformed scales yields improved interpretability and estimation accuracy in covariance modeling, with implications for chain graphs and high-dimensional inference.$
Abstract
That parametrization and sparsity are inherently linked raises the possibility that relevant models, not obviously sparse in their natural formulation, exhibit a population-level sparsity after reparametrization. In covariance models, positive-definiteness enforces additional constraints on how sparsity can legitimately manifest. It is therefore natural to consider reparametrization maps in which sparsity respects positive definiteness. The main purpose of this paper is to provide insight into structures on the physically-natural scale that induce and are induced by sparsity after reparametrization. The richest of the four structures initially uncovered can be generated, under a causal ordering, by the joint-response graphs studied by Wermuth & Cox (2004), while the most restrictive is that induced by sparsity on the scale of the matrix logarithm, studied by Battey (2017). The Iwasawa decomposition of the general linear group, combined with the graphical-models interpretation, points to a class of reparametrizations for the chain-graph models (Andersson et al. 2001), with undirected and directed acyclic graphs as special cases. An important insight is the interpretation of approximate zeros, explaining the modelling implications of enforcing sparsity after reparameterization: in effect, the relation between two variables would be declared null if relatively direct regression effects were negligible and others manifested through long paths. The insights have a bearing on methodology, some aspects of which are presented. A detailed simulation uses the theoretical insights to further explore regimes under which reparametrization is beneficial.
