A Unified and Stable Risk Minimization Framework for Weakly Supervised Learning with Theoretical Guarantees
Miao Zhang, Junpeng Li, Changchun Hua, Yana Yang
TL;DR
Weakly supervised learning often yields unstable risk estimators across diverse supervision patterns. The authors propose EoERM, an Extension of ERM, which builds a stable surrogate risk that unifies PU, UU, multi-UU, CLL, PLL, and tuple-based supervision under a single objective using a symmetric loss and a linear operator model. They establish non-asymptotic generalization guarantees based on Rademacher complexity, quantify the impact of class-prior misspecification, and provide identifiability conditions for UU. Empirical results across MNIST, Fashion-MNIST, CIFAR-10, SVHN, and KMNIST show consistent gains and robustness without heuristic stabilization, demonstrating practical applicability.
Abstract
Weakly supervised learning has emerged as a practical alternative to fully supervised learning when complete and accurate labels are costly or infeasible to acquire. However, many existing methods are tailored to specific supervision patterns -- such as positive-unlabeled (PU), unlabeled-unlabeled (UU), complementary-label (CLL), partial-label (PLL), or similarity-unlabeled annotations -- and rely on post-hoc corrections to mitigate instability induced by indirect supervision. We propose a principled, unified framework that bypasses such post-hoc adjustments by directly formulating a stable surrogate risk grounded in the structure of weakly supervised data. The formulation naturally subsumes diverse settings -- including PU, UU, CLL, PLL, multi-class unlabeled, and tuple-based learning -- under a single optimization objective. We further establish a non-asymptotic generalization bound via Rademacher complexity that clarifies how supervision structure, model capacity, and sample size jointly govern performance. Beyond this, we analyze the effect of class-prior misspecification on the bound, deriving explicit terms that quantify its impact, and we study identifiability, giving sufficient conditions -- most notably via supervision stratification across groups -- under which the target risk is recoverable. Extensive experiments show consistent gains across class priors, dataset scales, and class counts -- without heuristic stabilization -- while exhibiting robustness to overfitting.
