Probably Approximately Precision and Recall Learning
Lee Cohen, Yishay Mansour, Shay Moran, Han Shao
TL;DR
The paper addresses learning set-valued predictions under partial feedback, where only a single positive label per input is observed during training. It extends PAC-style guarantees to a framework with precision and recall losses, introducing both scalar and Pareto-loss objectives and contrasting realizable, agnostic, and semi-realizable settings. The authors propose two core algorithms—Maximum Likelihood and a Surrogate-Loss method—that achieve near-optimal sample complexities and Pareto-front guarantees, including multiplicative rather than additive errors in the agnostic case. They establish lower bounds showing the impossibility of vanishing additive error under partial feedback and discuss semi-realizable scenarios where zero-precision learning is possible when target output sizes are bounded. The work advances understanding of learning from positive-only data with set-valued outputs and lays groundwork for future exploration of optimal factors and potential complexity measures for precision-recall learning.
Abstract
Precision and Recall are fundamental metrics in machine learning tasks where both accurate predictions and comprehensive coverage are essential, such as in multi-label learning, language generation, medical studies, and recommender systems. A key challenge in these settings is the prevalence of one-sided feedback, where only positive examples are observed during training--e.g., in multi-label tasks like tagging people in Facebook photos, we may observe only a few tagged individuals, without knowing who else appears in the image. To address learning under such partial feedback, we introduce a Probably Approximately Correct (PAC) framework in which hypotheses are set functions that map each input to a set of labels, extending beyond single-label predictions and generalizing classical binary, multi-class, and multi-label models. Our results reveal sharp statistical and algorithmic separations from standard settings: classical methods such as Empirical Risk Minimization provably fail, even for simple hypothesis classes. We develop new algorithms that learn from positive data alone, achieving optimal sample complexity in the realizable case, and establishing multiplicative--rather than additive-approximation guarantees in the agnostic case, where achieving additive regret is impossible.
