Leveraging Intermediate Representations for Better Out-of-Distribution Detection

Gianluca Guglielmo; Marc Masana

Leveraging Intermediate Representations for Better Out-of-Distribution Detection

Gianluca Guglielmo, Marc Masana

TL;DR

The paper addresses the challenge of reliable out-of-distribution (OoD) detection without sacrificing in-distribution (ID) performance. It investigates intermediate-layer activations and proposes two methods: Ag-EBO, which aggregates per-layer energies for layer-agnostic OoD scoring, and R-EBO, which regularizes hidden layers with an energy-based loss to promote discriminative intermediate representations. Energies are defined per layer as $E_l({\mathbf{x}}) = -T \log \sum_{i} e^{a_l^{i}({\mathbf{x}})/T}$ and combined into $\mathbf{E}({\mathbf{x}})=(E_1({\mathbf{x}}),\dots,E_L({\mathbf{x}}))$, with evaluations across architectures and datasets (OpenOOD benchmarks) showing improved OoD detection in many settings, albeit sometimes at a small cost to ID accuracy. The work highlights practical implications for real-time OoD detection and outlines limitations related to layer selection and generalization, guiding future directions such as ID-only regularization and synthetic-data augmentation.

Abstract

In real-world applications, machine learning models must reliably detect Out-of-Distribution (OoD) samples to prevent unsafe decisions. Current OoD detection methods often rely on analyzing the logits or the embeddings of the penultimate layer of a neural network. However, little work has been conducted on the exploitation of the rich information encoded in intermediate layers. To address this, we analyze the discriminative power of intermediate layers and show that they can positively be used for OoD detection. Therefore, we propose to regularize intermediate layers with an energy-based contrastive loss, and by grouping multiple layers in a single aggregated response. We demonstrate that intermediate layer activations improves OoD detection performance by running a comprehensive evaluation across multiple datasets.

Leveraging Intermediate Representations for Better Out-of-Distribution Detection

TL;DR

Abstract

Leveraging Intermediate Representations for Better Out-of-Distribution Detection

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)