Probably Approximately Correct Constrained Learning
Luiz F. O. Chamon, Alejandro Ribeiro
TL;DR
This work extends PAC learning to constrained settings by introducing PAC constrained (PACC) learnability, showing that any PAC learnable class is also PACC learnable via a constrained ERM rule and that feasibility can be enforced without increasing statistical hardness. To address the practical challenge of non-convex constrained ERMs, the authors derive a representation-independent empirical dual formulation and a primal–dual algorithm that attains a near-PACC solution with quantifiable approximation error tied to the parametrization richness and constraint difficulty. They provide a rigorous generalization analysis and demonstrate the approach on fairness and robustness problems, illustrating how dual variables offer insight into constraint tightness and bias interactions. The methodology yields a principled, scalable framework for learning under requirements in high-stakes domains, with potential extensions to reinforcement learning and non-convex loss settings.
Abstract
As learning solutions reach critical applications in social, industrial, and medical domains, the need to curtail their behavior has become paramount. There is now ample evidence that without explicit tailoring, learning can lead to biased, unsafe, and prejudiced solutions. To tackle these problems, we develop a generalization theory of constrained learning based on the probably approximately correct (PAC) learning framework. In particular, we show that imposing requirements does not make a learning problem harder in the sense that any PAC learnable class is also PAC constrained learnable using a constrained counterpart of the empirical risk minimization (ERM) rule. For typical parametrized models, however, this learner involves solving a constrained non-convex optimization program for which even obtaining a feasible solution is challenging. To overcome this issue, we prove that under mild conditions the empirical dual problem of constrained learning is also a PAC constrained learner that now leads to a practical constrained learning algorithm based solely on solving unconstrained problems. We analyze the generalization properties of this solution and use it to illustrate how constrained learning can address problems in fair and robust classification.
