Training More Robust Classification Model via Discriminative Loss and Gaussian Noise Injection
Hai-Vy Nguyen, Fabrice Gamboa, Sixin Zhang, Reda Chhaibi, Serge Gratton, Thierry Giaccone
TL;DR
This work tackles the robustness-accuracy trade-off in deep classifiers by introducing a dual-objective training framework: a penultimate-layer loss that enforces intra-class compactness and maximizes inter-class margins with analytically defined decision boundaries, and a class-wise alignment of noisy data to clean data clusters. It further provides a theoretical connection: training with additive Gaussian noise implicitly reduces the eigenvalues of the input-space Hessian, i.e., the loss curvature, which regularizes robustness; reduced curvature also yields more stable features under perturbations. The authors demonstrate, across CIFAR-10, SVHN, and a road-image dataset, that their method improves robustness to diverse perturbations while preserving or even improving clean accuracy, and show that curvature reduction correlates with robustness and generalizes beyond training perturbations. Overall, the approach offers a principled, easily adoptable strategy that enhances noise robustness through feature-space regularization and curvature-aware analysis, with broad practical impact for reliable deployment of vision systems in noisy environments.
Abstract
Robustness of deep neural networks to input noise remains a critical challenge, as naive noise injection often degrades accuracy on clean (uncorrupted) data. We propose a novel training framework that addresses this trade-off through two complementary objectives. First, we introduce a loss function applied at the penultimate layer that explicitly enforces intra-class compactness and increases the margin to analytically defined decision boundaries. This enhances feature discriminativeness and class separability for clean data. Second, we propose a class-wise feature alignment mechanism that brings noisy data clusters closer to their clean counterparts. Furthermore, we provide a theoretical analysis demonstrating that improving feature stability under additive Gaussian noise implicitly reduces the curvature of the softmax loss landscape in input space, as measured by Hessian eigenvalues.This thus naturally enhances robustness without explicit curvature penalties. Conversely, we also theoretically show that lower curvatures lead to more robust models. We validate the effectiveness of our method on standard benchmarks and our custom dataset. Our approach significantly reinforces model robustness to various perturbations while maintaining high accuracy on clean data, advancing the understanding and practice of noise-robust deep learning.
