Table of Contents
Fetching ...

Using Pre-Training Can Improve Model Robustness and Uncertainty

Dan Hendrycks, Kimin Lee, Mantas Mazeika

TL;DR

The paper challenges the claim that pre-training only speeds up learning by showing that pre-training can substantially improve robustness to adversarial perturbations, label noise, and class imbalance, as well as uncertainty measures like OOD detection and calibration. It introduces adversarial pre-training and demonstrates large, sometimes state-of-the-art gains across CIFAR and ImageNet-derived settings, often outperforming task-specific methods. The results advocate adopting a pre-train-then-tune paradigm and suggest evaluating robustness and uncertainty techniques with pre-trained models to obtain realistic assessments. Overall, pre-training provides benefits beyond convergence speed, enhancing reliability and safety in deep learning systems.

Abstract

He et al. (2018) have called into question the utility of pre-training by showing that training from scratch can often yield similar performance to pre-training. We show that although pre-training may not improve performance on traditional classification metrics, it improves model robustness and uncertainty estimates. Through extensive experiments on adversarial examples, label corruption, class imbalance, out-of-distribution detection, and confidence calibration, we demonstrate large gains from pre-training and complementary effects with task-specific methods. We introduce adversarial pre-training and show approximately a 10% absolute improvement over the previous state-of-the-art in adversarial robustness. In some cases, using pre-training without task-specific methods also surpasses the state-of-the-art, highlighting the need for pre-training when evaluating future methods on robustness and uncertainty tasks.

Using Pre-Training Can Improve Model Robustness and Uncertainty

TL;DR

The paper challenges the claim that pre-training only speeds up learning by showing that pre-training can substantially improve robustness to adversarial perturbations, label noise, and class imbalance, as well as uncertainty measures like OOD detection and calibration. It introduces adversarial pre-training and demonstrates large, sometimes state-of-the-art gains across CIFAR and ImageNet-derived settings, often outperforming task-specific methods. The results advocate adopting a pre-train-then-tune paradigm and suggest evaluating robustness and uncertainty techniques with pre-trained models to obtain realistic assessments. Overall, pre-training provides benefits beyond convergence speed, enhancing reliability and safety in deep learning systems.

Abstract

He et al. (2018) have called into question the utility of pre-training by showing that training from scratch can often yield similar performance to pre-training. We show that although pre-training may not improve performance on traditional classification metrics, it improves model robustness and uncertainty estimates. Through extensive experiments on adversarial examples, label corruption, class imbalance, out-of-distribution detection, and confidence calibration, we demonstrate large gains from pre-training and complementary effects with task-specific methods. We introduce adversarial pre-training and show approximately a 10% absolute improvement over the previous state-of-the-art in adversarial robustness. In some cases, using pre-training without task-specific methods also surpasses the state-of-the-art, highlighting the need for pre-training when evaluating future methods on robustness and uncertainty tasks.

Paper Structure

This paper contains 13 sections, 4 figures, 6 tables.

Figures (4)

  • Figure 1: Training for longer is not a suitable strategy for label corruption. By training for longer, the network eventually begins to model and memorize label noise, which harms its overall performance. Labels are corrupted uniformly to incorrect classes with 60% probability, and the Wide Residual Network classifier has learning rate drops at epochs 80, 120, and 160.
  • Figure 2: Error curves for label noise correction methods using training from scratch and pre-training across a full range of label corruption strengths. For the No Correction baseline, using pre-training results in a visibly improved slope of degradation with a more pronounced elbow at higher corruption strengths. This also occurs in the complementary combinations of pre-training with previously proposed correction methods.
  • Figure 3: Class-wise test set error rates are lower across all classes with pre-training. Here the imbalanced dataset is a CIFAR-10 modification with imbalance ratio $\gamma=0.2$.
  • Figure 4: Root Mean Square Calibration Error values for models trained from scratch and models that are pre-trained. On all datasets, pre-training reduces the RMS error by more than half.