More Than A Shortcut: A Hyperbolic Approach To Early-Exit Networks
Swapnil Bhosale, Cosmin Frateanu, Camilla Clark, Arnoldas Jasonas, Chris Mitchell, Xiatian Zhu, Vamsi Krishna Ithapu, Giacomo Ferroni, Cagdas Bilen, Sanjeel Parekh
TL;DR
This work tackles efficient, reliable event detection on resource-limited devices by reformulating Early-Exit networks in hyperbolic space. HypEE maps each exit's representation to a Lorentz hyperboloid and enforces a hierarchical refinement across exits via an entailment loss, with the distance from the origin serving as a geometry-grounded uncertainty metric. Experiments on audio tagging and sound event detection show that HypEE substantially improves early-exit accuracy and enables a geometry-aware triggering mechanism that increases accuracy while reducing computation, outperforming traditional Euclidean EE baselines. The results highlight the practical potential of a geometry-based approach to uncertainty and hierarchical inference for on-device audio perception systems.
Abstract
Deploying accurate event detection on resource-constrained devices is challenged by the trade-off between performance and computational cost. While Early-Exit (EE) networks offer a solution through adaptive computation, they often fail to enforce a coherent hierarchical structure, limiting the reliability of their early predictions. To address this, we propose Hyperbolic Early-Exit networks (HypEE), a novel framework that learns EE representations in the hyperbolic space. Our core contribution is a hierarchical training objective with a novel entailment loss, which enforces a partial-ordering constraint to ensure that deeper network layers geometrically refine the representations of shallower ones. Experiments on multiple audio event detection tasks and backbone architectures show that HypEE significantly outperforms standard Euclidean EE baselines, especially at the earliest, most computationally-critical exits. The learned geometry also provides a principled measure of uncertainty, enabling a novel triggering mechanism that makes the overall system both more efficient and more accurate than a conventional EE and standard backbone models without early-exits.
