Robust Online Learning
Sajad Ashkezari
TL;DR
The paper studies robust online learning under adversarial perturbations where both inputs and clean labels may be chosen adversarially. It introduces the $L_\\mathcal{U}(\\mathcal{H})$ dimension, a Littlestone-like measure defined via $\\mathcal{U}$-adversarial trees, and shows it tightly governs learning performance: $\\mathbf{M}^*=L_\\mathcal{U}(\\mathcal{H})$ in the realizable setting and $\\tilde{O}(\\sqrt{L_\\mathcal{U}(\\mathcal{H})\\,T})$ regret in the agnostic setting, with extensions to multiclass scenarios. The analysis leverages an orientation game and reductions to prediction with expert advice to obtain tight upper and matching lower bounds, and it further addresses uncertain perturbation sets by introducing a finite family $\\mathcal{G}$ of perturbations with logarithmic dependence on $|\\mathcal{G}|$. Overall, the work provides a principled framework and tight bounds for robust online learnability under adversarial perturbations, highlighting avenues for future work on infinite perturbation families and partial feedback.
Abstract
We study the problem of learning robust classifiers where the classifier will receive a perturbed input. Unlike robust PAC learning studied in prior work, here the clean data and its label are also adversarially chosen. We formulate this setting as an online learning problem and consider both the realizable and agnostic learnability of hypothesis classes. We define a new dimension of classes and show it controls the mistake bounds in the realizable setting and the regret bounds in the agnostic setting. In contrast to the dimension that characterizes learnability in the PAC setting, our dimension is rather simple and resembles the Littlestone dimension. We generalize our dimension to multiclass hypothesis classes and prove similar results in the realizable case. Finally, we study the case where the learner does not know the set of allowed perturbations for each point and only has some prior on them.
