Monotonic Learning in the PAC Framework: A New Perspective
Ming Li, Chenyi Zhang, Qin Li
TL;DR
This work reframes generalization in agnostic PAC learning as a distributional problem by deriving a family of upper-bound risk distributions $F_m(\epsilon)$ that evolve with the training size $m$. It proves monotonicity for deterministic ERM learners in two standard settings: finite hypothesis spaces and finite VC-dimension, through corresponding $F_m(\epsilon)$ constructions and sample-complexity bounds. The authors validate the theory with experiments on three tasks (conjunctions, threshold functions, and Iris), showing empirical error distributions $P_m$ are bounded above by the PAC-derived $Q_m$, both shifting toward lower error as $m$ grows and eventually merging when $m$ is large. The findings provide a distribution-centered link between PAC theory and practical algorithmic behavior, offering conservative bounds and insights for stability under finite-sample conditions.
Abstract
Monotone learning describes learning processes in which expected performance consistently improves as the amount of training data increases. However, recent studies challenge this conventional wisdom, revealing significant gaps in the understanding of generalization in machine learning. Addressing these gaps is crucial for advancing the theoretical foundations of the field. In this work, we utilize Probably Approximately Correct (PAC) learning theory to construct a theoretical risk distribution that approximates a learning algorithm's actual performance. We rigorously prove that this theoretical distribution exhibits monotonicity as sample sizes increase. We identify two scenarios under which deterministic algorithms based on Empirical Risk Minimization (ERM) are monotone: (1) the hypothesis space is finite, or (2) the hypothesis space has finite VC-dimension. Experiments on two classical learning problems validate our findings by demonstrating that the monotonicity of the algorithms' generalization error is guaranteed, as its theoretical risk upper bound monotonically converges to 0.
