Which Algorithms Have Tight Generalization Bounds?
Michael Gastpar, Ido Nachum, Jonathan Shafer, Thomas Weinberger
TL;DR
The paper investigates when algorithm-dependent tight generalization bounds exist by formalizing estimability and studying overparameterized settings. It proves inestimability results for inductive biases toward VC classes and toward nearly-orthogonal function families, showing that distribution-free estimators can fail to approximate population loss in these regimes. It then identifies sufficient conditions for estimability via algorithm stability and provides a simple, necessary-and-sufficient variance-based characterization of estimability. The work clarifies why many classical generalization bounds are vacuous for modern models and offers principled paths to derive tight, algorithm-dependent bounds grounded in stability and loss-variance properties.
Abstract
We study which machine learning algorithms have tight generalization bounds. First, we present conditions that preclude the existence of tight generalization bounds. Specifically, we show that algorithms that have certain inductive biases that cause them to be unstable do not admit tight generalization bounds. Next, we show that algorithms that are sufficiently stable do have tight generalization bounds. We conclude with a simple characterization that relates the existence of tight generalization bounds to the conditional variance of the algorithm's loss.
