On the Degrees of Freedom of some Lasso procedures
Mauro Bernardi, Antonio Canale, Marco Stefanucci
TL;DR
This work derives unbiased estimators of the effective degrees of freedom for Adaptive Lasso, Group Lasso, and Adaptive Group Lasso within Stein's unbiased risk estimation, valid for both orthogonal and non-orthogonal designs. The resulting $\hat{df}_\gamma$ expressions reveal how adaptive weights and coefficient signs inflate or deflate model complexity and show a piecewise-linear dependence on the regularization parameter $\gamma$ for adaptive methods. Empirical validation on synthetic and Diabetes data demonstrates that using the correct $\hat{df}_\gamma$ improves model selection and risk estimation, while naive active-set-based df can mislead criteria like BIC or cross-validation. Overall, the paper provides a rigorous framework for complexity-aware inference in adaptive penalized regression, bridging theory and practice and enabling more reliable model comparison in high-dimensional settings.
Abstract
The effective degrees of freedom of penalized regression models quantify the actual amount of information used to generate predictions, playing a pivotal role in model evaluation and selection. Although a closed-form estimator is available for the Lasso penalty, adaptive extensions of widely used penalized approaches, including the Adaptive Lasso and Adaptive Group Lasso, have remained without analogous theoretical characterization. This paper presents the first unbiased estimator of the effective degrees of freedom for these methods, along with their main theoretical properties, for both orthogonal and non-orthogonal designs, derived within Stein's unbiased risk estimation framework. The resulting expressions feature inflation terms influenced by the regularization parameter, coefficient signs, and least-squares estimates. These advances enable more accurate model selection criteria and unbiased prediction error estimates, illustrated through synthetic and real data. These contributions offer a rigorous theoretical foundation for understanding model complexity in adaptive regression, bridging a critical gap between theory and practice.
