Early Time Classification with Accumulated Accuracy Gap Control
Liran Ringel, Regev Cohen, Daniel Freedman, Michael Elad, Yaniv Romano
TL;DR
This work tackles the problem of labeling data streams as early as possible without sacrificing accuracy by introducing calibrated stopping rules for early time series classification (ETSC) with finite-sample, distribution-free guarantees. It develops two risk-control paradigms, marginal and conditional; the latter provides a stronger, halt-time-aware guarantee by controlling the accuracy gap conditionally on the accumulated halt times. The authors use a two-stage calibration framework (Stage 1 screens candidate hyperparameters, Stage 2 tests them) that builds on Learn-then-Test and fixed-sequence testing to handle large hyperparameter spaces while controlling the family-wise error rate (FWER). Empirical results on structured datasets and an NLP reading-comprehension task show that conditional risk control can substantially reduce computation while reliably maintaining the accuracy gap, with up to 94% of timesteps avoided in some settings. The framework offers practical, statistically justified early-exit mechanisms for sequential classifiers, applicable to real-time inference and resource-constrained deployment.
Abstract
Early time classification algorithms aim to label a stream of features without processing the full input stream, while maintaining accuracy comparable to that achieved by applying the classifier to the entire input. In this paper, we introduce a statistical framework that can be applied to any sequential classifier, formulating a calibrated stopping rule. This data-driven rule attains finite-sample, distribution-free control of the accuracy gap between full and early-time classification. We start by presenting a novel method that builds on the Learn-then-Test calibration framework to control this gap marginally, on average over i.i.d. instances. As this algorithm tends to yield an excessively high accuracy gap for early halt times, our main contribution is the proposal of a framework that controls a stronger notion of error, where the accuracy gap is controlled conditionally on the accumulated halt times. Numerical experiments demonstrate the effectiveness, applicability, and usefulness of our method. We show that our proposed early stopping mechanism reduces the number of timesteps used for classification by up to 94% while achieving rigorous accuracy gap control.
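To make the idea of a calibrated stopping rule concrete, the following is a minimal sketch of the marginal case only, written under several assumptions that are not taken from the paper: the stopping rule halts at the first timestep whose confidence reaches a threshold lam, the per-instance loss is binary (1 when the full-input prediction is correct but the early one is not), the p-value is an exact binomial tail, and thresholds are tested in a fixed sequence from most to least conservative. All function names (binom_cdf, halt_time, gap_losses, calibrate_threshold) are hypothetical, and the sketch covers neither the two-stage procedure nor the conditional guarantee.

```python
import math


def binom_cdf(k, n, p):
    """Exact P(Bin(n, p) <= k)."""
    return sum(math.comb(n, j) * p**j * (1 - p) ** (n - j) for j in range(k + 1))


def halt_time(conf, lam):
    """First timestep whose confidence reaches lam; the last timestep otherwise."""
    for t, c in enumerate(conf):
        if c >= lam:
            return t
    return len(conf) - 1


def gap_losses(confs, correct, lam):
    """Binary loss per instance (an assumed simplification):
    1 if the full-input prediction is right but the early-halt one is wrong."""
    losses = []
    for conf, ok in zip(confs, correct):
        t = halt_time(conf, lam)
        full_ok, early_ok = ok[-1], ok[t]
        losses.append(int(full_ok and not early_ok))
    return losses


def calibrate_threshold(confs, correct, grid, alpha, delta):
    """Fixed-sequence testing in the spirit of Learn-then-Test.

    For each lam, the null hypothesis is "risk(lam) > alpha". Thresholds are
    tested from most conservative (largest) to least conservative, each at
    level delta, stopping at the first null that cannot be rejected.
    Returns the most aggressive certified threshold, or None.
    """
    n = len(confs)
    chosen = None
    for lam in sorted(grid, reverse=True):
        k = sum(gap_losses(confs, correct, lam))
        # Valid p-value for binary losses: P(Bin(n, alpha) <= observed errors).
        p_value = binom_cdf(k, n, alpha)
        if p_value > delta:
            break
        chosen = lam
    return chosen
```

Here confs[i][t] is the classifier's confidence for calibration instance i after t+1 timesteps and correct[i][t] records whether its prediction at that point matches the label; alpha is the tolerated accuracy gap and delta the allowed failure probability. Returning None signals that no threshold could be certified on the given calibration set, in which case the safe fallback is to process the full input.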
