The Probably Approximately Correct Learning Model in Computational Learning Theory
Rocco A. Servedio
TL;DR
Valiant's PAC learning framework formalizes when efficient learning is possible by balancing statistical accuracy and computational tractability under arbitrary data distributions. The survey organizes the theory around the distribution-free model, its variants, and corresponding algorithms, including the elimination approach, feature expansion, and linear-programming methods that yield poly-time learning for several Boolean function classes, such as conjunctions, k-CNFs, and DNFs, while also detailing the hardness landscape. It covers robustness to noise via malicious, agnostic, and random classification noise models, and discusses the impact of membership queries through the PAC$+$MQ and related exact-learning connections. Finally, it surveys representation-dependent and -independent hardness results rooted in cryptography and average-case complexity, illustrating fundamental limits that persist even under favorable modeling assumptions, and it emphasizes the interdisciplinary influence of PAC ideas on boosting, SQ-learning, privacy, and beyond.
Abstract
This survey paper gives an overview of various known results on learning classes of Boolean functions in Valiant's Probably Approximately Correct (PAC) learning model and its commonly studied variants.
