Multi-layer Radial Basis Function Networks for Out-of-distribution Detection
Amol Khanna, Chenyi Ling, Derek Everett, Edward Raff, Nathan Inkawhich
TL;DR
The paper introduces Multi-layer Radial Basis Function Networks (MLRBFNs) that integrate classification and out-of-distribution (OOD) detection within a single architecture. It addresses training challenges of deep RBF stacks by adding a depression mechanism to avoid 0-mapped classes and by decoupling predictions from confidence using binary cross-entropy, while leveraging k-means initializations to accelerate training. Through experiments on moons data, MNIST, and feature-extractor pipelines with foundation-model embeddings, MLRBFNs achieve competitive OOD detection performance against benchmarks like OpenOOD v1.5 and postprocessors, with robustness observed as the number of classification layers increases. The work demonstrates a promising direction for inherently distance-aware, OOD-sensitive networks and suggests future gains from deeper, more scalable RBF architectures and integration with other modern techniques. All mathematical notation is presented with proper Delimiters, and key equations reflect the distance-based activations and the depression mechanism that governs OOD behavior within the layers.
Abstract
Existing methods for out-of-distribution (OOD) detection use various techniques to produce a score, separate from classification, that determines how ``OOD'' an input is. Our insight is that OOD detection can be simplified by using a neural network architecture which can effectively merge classification and OOD detection into a single step. Radial basis function networks (RBFNs) inherently link classification confidence and OOD detection; however, these networks have lost popularity due to the difficult of training them in a multi-layer fashion. In this work, we develop a multi-layer radial basis function network (MLRBFN) which can be easily trained. To ensure that these networks are also effective for OOD detection, we develop a novel depression mechanism. We apply MLRBFNs as standalone classifiers and as heads on top of pretrained feature extractors, and find that they are competitive with commonly used methods for OOD detection. Our MLRBFN architecture demonstrates a promising new direction for OOD detection methods.
