A Characterization of List Regression
Chirag Pabbaraju, Sahasrajit Sarmasarkar
TL;DR
This work provides a complete PAC characterization of list regression, identifying when a real-valued hypothesis class is learnable with a short list of predictions under the absolute loss. It introduces two central dimensions: the $k$-OIG dimension for realizable list regression and the $k$-fat-shattering dimension for agnostic list regression, proving finiteness of these quantities is necessary and sufficient for learnability, with upper and lower bounds matching up to polylog factors. The authors develop a unified algorithmic framework based on discretization to a partial multiclass class, weak list learners via the one-inclusion graph, minimax boosting, and sample compression to achieve generalization guarantees for both realizable and agnostic settings. They also establish a sophisticated lower-bound calculus using higher-order packing and strong-fat-shattering concepts, linking these dimensions to packing numbers and discretization to relate fat-shattering with strong-fat-shattering. The work advances understanding of how list predictions extend regression theory beyond the standard $k=1$ setting and highlights open questions, such as whether ERM-based approaches can realize the agnostic list-learning guarantees via a carefully constructed list hypothesis class and its flattening.
Abstract
There has been a recent interest in understanding and characterizing the sample complexity of list learning tasks, where the learning algorithm is allowed to make a short list of $k$ predictions, and we simply require one of the predictions to be correct. This includes recent works characterizing the PAC sample complexity of standard list classification and online list classification. Adding to this theme, in this work, we provide a complete characterization of list PAC regression. We propose two combinatorial dimensions, namely the $k$-OIG dimension and the $k$-fat-shattering dimension, and show that they characterize realizable and agnostic $k$-list regression respectively. These quantities generalize known dimensions for standard regression. Our work thus extends existing list learning characterizations from classification to regression.
