Generalization Error of $f$-Divergence Stabilized Algorithms via Duality
Francisco Daunas, Iñaki Esnaola, Samir M. Perlaza, Gholamali Aminian
TL;DR
The paper addresses generalization in learning algorithms stabilized by $f$-divergence regularization (ERM-$f$DR) and extends the framework to constrained optimization. It develops a dual formulation using the Legendre-Fenchel transform and the implicit-function theorem, yielding an explicit normalization function $N_{Q,\boldsymbol{z}}(\lambda)$ and zero duality gap with the primal problem. It then derives an exact characterization of the generalization error $\bar{\bar{\mathsf{G}}}$ in terms of the dual solution and the conjugate $f^*$, with simplifications in key cases such as Gibbs-type algorithms where $f(x)=x\log x$. Together, these results provide concrete tools to analyze and quantify generalization under $f$-divergence regularization and to identify when ERM-$f$DR solutions coincide with constrained-optimal solutions.
Abstract
The solution to empirical risk minimization with $f$-divergence regularization (ERM-$f$DR) is extended to constrained optimization problems, establishing conditions for equivalence between the solution and constraints. A dual formulation of ERM-$f$DR is introduced, providing a computationally efficient method to derive the normalization function of the ERM-$f$DR solution. This dual approach leverages the Legendre-Fenchel transform and the implicit function theorem, enabling explicit characterizations of the generalization error for general algorithms under mild conditions, and another for ERM-$f$DR solutions.
