DART: Input-Difficulty-AwaRe Adaptive Threshold for Early-Exit DNNs

Parth Patne; Mahdi Taheri; Christian Herglotz; Maksim Jenihhin; Milos Krstic; Michael Hübner

DART: Input-Difficulty-AwaRe Adaptive Threshold for Early-Exit DNNs

Parth Patne, Mahdi Taheri, Christian Herglotz, Maksim Jenihhin, Milos Krstic, Michael Hübner

Abstract

Early-exit deep neural networks enable adaptive inference by terminating computation when sufficient confidence is achieved, reducing cost for edge AI accelerators in resource-constrained settings. Existing methods, however, rely on suboptimal exit policies, ignore input difficulty, and optimize thresholds independently. This paper introduces DART (Input-Difficulty-Aware Adaptive Threshold), a framework that overcomes these limitations. DART introduces three key innovations: (1) a lightweight difficulty estimation module that quantifies input complexity with minimal computational overhead, (2) a joint exit policy optimization algorithm based on dynamic programming, and (3) an adaptive coefficient management system. Experiments on diverse DNN benchmarks (AlexNet, ResNet-18, VGG-16) demonstrate that DART achieves up to \textbf{3.3$\times$} speedup, \textbf{5.1$\times$} lower energy, and up to \textbf{42\%} lower average power compared to static networks, while preserving competitive accuracy. Extending DART to Vision Transformers (LeViT) yields power (5.0$\times$) and execution-time (3.6$\times$) gains but also accuracy loss (up to 17 percent), underscoring the need for transformer-specific early-exit mechanisms. We further introduce the Difficulty-Aware Efficiency Score (DAES), a novel multi-objective metric, under which DART achieves up to a 14.8 improvement over baselines, highlighting superior accuracy, efficiency, and robustness trade-offs.

DART: Input-Difficulty-AwaRe Adaptive Threshold for Early-Exit DNNs

Abstract

} speedup, \textbf{5.1

} lower energy, and up to \textbf{42\%} lower average power compared to static networks, while preserving competitive accuracy. Extending DART to Vision Transformers (LeViT) yields power (5.0

) and execution-time (3.6

) gains but also accuracy loss (up to 17 percent), underscoring the need for transformer-specific early-exit mechanisms. We further introduce the Difficulty-Aware Efficiency Score (DAES), a novel multi-objective metric, under which DART achieves up to a 14.8 improvement over baselines, highlighting superior accuracy, efficiency, and robustness trade-offs.

Paper Structure (26 sections, 20 equations, 2 figures, 2 tables, 1 algorithm)

This paper contains 26 sections, 20 equations, 2 figures, 2 tables, 1 algorithm.

Introduction
Methodology
Difficulty-Aware Input Processing
Multi-Modal Difficulty Metrics
Difficulty Score Fusion
Difficulty-Aware Efficiency Score (DAES)
Joint Exit Policy Optimization
Problem Formulation
Dynamic Programming Solution
Threshold Calibration
Adaptive Coefficient Management
Multi-Strategy Adaptation Framework
Adaptive Selection and Tracking
Framework Extensibility to Transformers
Training and Inference Pipeline
...and 11 more sections

Figures (2)

Figure 1: System-level architecture of the proposed Input-Difficulty-Aware early-exit DNN with DART integration.
Figure 2: Evolution of adaptive coefficients during training for three CIFAR-10 classes based on real experimental data.

DART: Input-Difficulty-AwaRe Adaptive Threshold for Early-Exit DNNs

Abstract

DART: Input-Difficulty-AwaRe Adaptive Threshold for Early-Exit DNNs

Authors

Abstract

Table of Contents

Figures (2)