Variational Learning Induces Adaptive Label Smoothing
Sin-Han Yang, Zhedong Liu, Gian Maria Marconi, Mohammad Emtiyaz Khan
TL;DR
This work demonstrates that variational learning inherently produces adaptive label smoothing by shaping a per-example label-noise term through the posterior over parameters. By deriving exact noise forms in logistic regression, GLMs, and neural networks via IVON, the authors show that posterior expectations create instance-specific smoothing without manual adaptive strategies. Empirically, IVON outperforms traditional label smoothing and matches or exceeds the performance of tuned baselines like SAM across synthetic and real noisy datasets, including Clothing1M, while requiring no hyperparameter tuning. The results bridge Bayesian inference with label smoothing, offering a robust, scalable framework for handling mislabels, calibration, and distribution shifts in deep learning.
Abstract
We show that variational learning naturally induces an adaptive label smoothing where label noise is specialized for each example. Such label-smoothing is useful to handle examples with labeling errors and distribution shifts, but designing a good adaptivity strategy is not always easy. We propose to skip this step and simply use the natural adaptivity induced during the optimization of a variational objective. We show empirical results where a variational algorithm called IVON outperforms traditional label smoothing and yields adaptivity strategies similar to those of an existing approach. By connecting Bayesian methods to label smoothing, our work provides a new way to handle overconfident predictions.
