Twice Epi-Differentiability of Spectral Functions and its applications
Chao Ding, Ebrahim Sarabi, Shiwei Wang
TL;DR
The paper develops a unified second-order variational framework for spectral functions by showing that twice epi-differentiability of a spectral function g(X)=(\theta\circ\lambda)(X) is equivalent to the same property for its symmetric part θ, even without convexity. It derives an explicit formula for the second subderivative d^2g(X,Y)(H) in terms of the symmetric part and eigenstructure, and uses this to characterize proto-differentiability of subgradients and the directional differentiability of proximal mappings. These results are then applied to leading eigenvalue functions and practical spectral regularizers (e.g., MCP penalties, largest eigenvalue gaps) to obtain concrete second-order properties and algorithmic implications. The findings enable robust analysis and efficient optimization strategies for a broad class of nonconvex spectral problems in statistics and machine learning, including robust PCA.
Abstract
Second-order variational properties have been shown to play important theoretical and numerical roles for different classes of optimization problems. Among such properties, twice epi-differentiability has a special place because of its ubiquitous presence in various classes of extended-real-valued functions that are important for optimization problems. We provide a useful characterization of this property for spectral functions by demonstrating that it can be characterized via the same property of the symmetric part of the spectral representation of an eigenvalue function. Our approach allows us to bypass the rather restrictive convexity assumption, used in many recent works that targeted second-order variational properties of spectral functions. By this theoretical tool, several applications on the proto-differentiability of subgradient mappings, the directional differentiability of the proximal mapping of spectral functions are achieved. We finally use our established theory to study twice epi-differentiability of leading eigenvalue functions and practical regularization terms that have important applications in statistics and the robust PCA.
