Table of Contents
Fetching ...

Hessian-aware Training for Enhancing DNNs Resilience to Parameter Corruptions

Tahmid Hasan Prato, Seijoon Kim, Lizhong Chen, Sanghyun Hong

TL;DR

The paper tackles the vulnerability of DNNs to parameter bit-flips by introducing Hessian-aware training, which minimizes the Hessian trace $Tr(H)$ to flatten the loss surface and reduce sensitivity to parameter perturbations. It employs Hutchinson's method to approximate the trace and uses Top-$p$ eigenvalues (with $p$ around 50) to stabilize training, incorporating regularization into SGD-based optimization. Across MNIST, CIFAR-10, and ImageNet, the method reduces the fraction of erratic parameters by 6–12% and increases the number of bit-flips required for adversaries to induce large accuracy drops by 2–3×, while preserving baseline accuracy. The approach shows strong synergy with hardware defenses (e.g., NeuroPot, RADAR) and remains practical for large models via layer-sampling strategies that keep overhead reasonable, enabling more robust deployment on error-prone hardware.

Abstract

Deep neural networks are not resilient to parameter corruptions: even a single-bitwise error in their parameters in memory can cause an accuracy drop of over 10%, and in the worst cases, up to 99%. This susceptibility poses great challenges in deploying models on computing platforms, where adversaries can induce bit-flips through software or bitwise corruptions may occur naturally. Most prior work addresses this issue with hardware or system-level approaches, such as integrating additional hardware components to verify a model's integrity at inference. However, these methods have not been widely deployed as they require infrastructure or platform-wide modifications. In this paper, we propose a new approach to addressing this issue: training models to be more resilient to bitwise corruptions to their parameters. Our approach, Hessian-aware training, promotes models with $flatter$ loss surfaces. We show that, while there have been training methods, designed to improve generalization through Hessian-based approaches, they do not enhance resilience to parameter corruptions. In contrast, models trained with our method demonstrate increased resilience to parameter corruptions, particularly with a 20$-$50% reduction in the number of bits whose individual flipping leads to a 90$-$100% accuracy drop. Moreover, we show the synergy between ours and existing hardware and system-level defenses.

Hessian-aware Training for Enhancing DNNs Resilience to Parameter Corruptions

TL;DR

The paper tackles the vulnerability of DNNs to parameter bit-flips by introducing Hessian-aware training, which minimizes the Hessian trace to flatten the loss surface and reduce sensitivity to parameter perturbations. It employs Hutchinson's method to approximate the trace and uses Top- eigenvalues (with around 50) to stabilize training, incorporating regularization into SGD-based optimization. Across MNIST, CIFAR-10, and ImageNet, the method reduces the fraction of erratic parameters by 6–12% and increases the number of bit-flips required for adversaries to induce large accuracy drops by 2–3×, while preserving baseline accuracy. The approach shows strong synergy with hardware defenses (e.g., NeuroPot, RADAR) and remains practical for large models via layer-sampling strategies that keep overhead reasonable, enabling more robust deployment on error-prone hardware.

Abstract

Deep neural networks are not resilient to parameter corruptions: even a single-bitwise error in their parameters in memory can cause an accuracy drop of over 10%, and in the worst cases, up to 99%. This susceptibility poses great challenges in deploying models on computing platforms, where adversaries can induce bit-flips through software or bitwise corruptions may occur naturally. Most prior work addresses this issue with hardware or system-level approaches, such as integrating additional hardware components to verify a model's integrity at inference. However, these methods have not been widely deployed as they require infrastructure or platform-wide modifications. In this paper, we propose a new approach to addressing this issue: training models to be more resilient to bitwise corruptions to their parameters. Our approach, Hessian-aware training, promotes models with loss surfaces. We show that, while there have been training methods, designed to improve generalization through Hessian-based approaches, they do not enhance resilience to parameter corruptions. In contrast, models trained with our method demonstrate increased resilience to parameter corruptions, particularly with a 2050% reduction in the number of bits whose individual flipping leads to a 90100% accuracy drop. Moreover, we show the synergy between ours and existing hardware and system-level defenses.

Paper Structure

This paper contains 21 sections, 3 equations, 8 figures, 12 tables, 1 algorithm.

Figures (8)

  • Figure 1: The distribution plots showing the number of bits in a DNN's parameters whose flipping results in RAD on the x-axis.
  • Figure 2: Visualizing LeNet's loss landscapes. From top to bottom, each row corresponds to standard training, HERO yang2022hero, and our method. From left to right, we visualize the first two convolutional layers and the three fully-conected layers.
  • Figure 3: The distribution plots computed on LeNet in MNIST (left) and ResNet18 on CIFAR10 (right).
  • Figure 4: The distribution plot computed on ResNet50 in ImageNet. Note that our fine-tuning only computes the Hessian trace from the last layer.
  • Figure 5: Comparing loss landscapes of the convolutional layers within a residual block. The left two are from the regularly-trained models, and the right ones are from those trained with our method.
  • ...and 3 more figures