Lower Bounds on Adversarial Robustness for Multiclass Classification with General Loss Functions
Camilo Andrés García Trillos, Nicolás García Trillos
TL;DR
This work addresses sharp lower bounds on adversarial risk for multiclass classification under general losses by formulating a learner-adversary minmax problem and deriving dual and generalized-barycenter reformulations. The main contributions include explicit dual representations and optimal robust classifiers for cross-entropy, $\alpha$-logarithmic, and quadratic losses, plus a unifying barycenter view with KL or Tsallis-entropy penalties that links adversarial robustness to α-fair packing. The theoretical framework reveals connections to optimal transport and generalized barycenters, enabling computational advantages and flexible relaxations. Empirical results on synthetic data and MNIST demonstrate tighter lower bounds than 0-1 baselines and confirm the practicality and scalability of the dual/barycenter approach for multiclass adversarial robustness.
Abstract
We consider adversarially robust classification in a multiclass setting under arbitrary loss functions and derive dual and barycentric reformulations of the corresponding learner-agnostic robust risk minimization problem. We provide explicit characterizations for important cases such as the cross-entropy loss, loss functions with a power form, and the quadratic loss, extending in this way available results for the 0-1 loss. These reformulations enable efficient computation of sharp lower bounds for adversarial risks and facilitate the design of robust classifiers beyond the 0-1 loss setting. Our paper uncovers interesting connections between adversarial robustness, $α$-fair packing problems, and generalized barycenter problems for arbitrary positive measures where Kullback-Leibler and Tsallis entropies are used as penalties. Our theoretical results are accompanied with illustrative numerical experiments where we obtain tighter lower bounds for adversarial risks with the cross-entropy loss function.
