Multiplicative Reweighting for Robust Neural Network Optimization
Noga Bar, Tomer Koren, Raja Giryes
TL;DR
This work introduces Multiplicative Reweighting (MR), a plug-in optimization technique that uses multiplicative weights to reweight training examples during neural network optimization. By treating each example as an expert and updating a distribution $p\in\Delta_N$ over examples based on observed losses, MR downweights noisy data while updating model parameters $\theta$ with a weighted empirical loss $F(\theta,p)=\sum_i p_i\ell_i(\theta)$. The authors prove convergence of MR with gradient-based methods and provide 1d label-noise guarantees, then demonstrate empirical gains on CIFAR-10/100 and Clothing1M under synthetic and real label noise, as well as improved adversarial robustness when MR is combined with established defenses. MR incurs modest training-time overhead and integrates easily with common optimizers like SGD and Adam, offering a practical toolkit addition for improving robustness to label noise and adversarial perturbations.
Abstract
Neural networks are widespread due to their powerful performance. Yet, they degrade in the presence of noisy labels at training time. Inspired by the setting of learning with expert advice, where multiplicative weights (MW) updates were recently shown to be robust to moderate data corruptions in expert advice, we propose to use MW for reweighting examples during neural networks optimization. We theoretically establish the convergence of our method when used with gradient descent and prove its advantages in 1d cases. We then validate empirically our findings for the general case by showing that MW improves neural networks' accuracy in the presence of label noise on CIFAR-10, CIFAR-100 and Clothing1M. We also show the impact of our approach on adversarial robustness.
