Implicit Differentiation for Hyperparameter Tuning the Weighted Graphical Lasso
Can Pouliquen, Paulo Gonçalves, Mathurin Massias, Titouan Vayer
TL;DR
This work addresses tuning the Graphical Lasso hyperparameters by casting it as a bilevel optimization problem and deriving the hypergradient via implicit differentiation. The core contribution is a closed-form Jacobian for the GLASSO solution with respect to scalar and matrix regularization parameters, obtained through a fixed-point differentiation of the proximal update and a careful handling of non-smoothness. The authors extend the scalar case to a matrix of hyperparameters, yielding a fourth-order Jacobian tensor and showing how to reuse a Kronecker-inverse to reduce computation. Empirical results on synthetic data demonstrate that the proposed first-order approach can match grid-search in the scalar case and that matrix regularization offers substantial performance gains, albeit with non-convexity challenges that motivate further optimization refinements.
Abstract
We provide a framework and algorithm for tuning the hyperparameters of the Graphical Lasso via a bilevel optimization problem solved with a first-order method. In particular, we derive the Jacobian of the Graphical Lasso solution with respect to its regularization hyperparameters.
