Robust support vector machines via conic optimization
Valentina Cepeda, Andrés Gómez, Shaoning Han
TL;DR
The paper tackles robust SVM learning under label uncertainty by deriving a conic-optimization-based convexification of the $0$-$1$ loss. It introduces a strong convex relaxation through a convex hull construction that yields a separable, non-convex loss $\mathcal{L}^*(u;\gamma)$ and formulates an SDP-based training problem whose outer relaxation optimizes the loss strength via $\gamma$. Computational results show the proposed conic loss matches hinge in clean data but outperforms it in the presence of outliers, with reduced variance and practical runtimes on datasets with up to thousands of samples. The approach scales to moderate feature dimensions, can be extended to kernels, and provides a robust alternative to standard hinge-based SVMs in noisy environments.
Abstract
We consider the problem of learning support vector machines robust to uncertainty. It has been established in the literature that typical loss functions, including the hinge loss, are sensible to data perturbations and outliers, thus performing poorly in the setting considered. In contrast, using the 0-1 loss or a suitable non-convex approximation results in robust estimators, at the expense of large computational costs. In this paper we use mixed-integer optimization techniques to derive a new loss function that better approximates the 0-1 loss compared with existing alternatives, while preserving the convexity of the learning problem. In our computational results, we show that the proposed estimator is competitive with the standard SVMs with the hinge loss in outlier-free regimes and better in the presence of outliers.
