Deep Orthogonal Hypersphere Compression for Anomaly Detection
Yunhe Zhang, Yan Sun, Jinyu Cai, Jicong Fan
TL;DR
This work addresses anomaly detection in high-dimensional spaces, where a single hypersphere boundary is difficult to learn and is insufficiently compact because of the soap-bubble phenomenon. It introduces two end-to-end methods: DOHSC, which uses an orthogonal projection layer so that the normal region is close to hyperspherical and compact, and DO2HSC, which constrains normal data to a bi-hypersphere shell to obtain a tighter decision region. The authors also extend both methods to graph-level anomaly detection by maximizing mutual information between local and global graph representations learned with GIN, so that the same framework covers image, tabular, and graph data. Experiments on image, tabular, and graph datasets show that DOHSC and especially DO2HSC achieve state-of-the-art performance, with more compact boundaries and greater resilience to high-dimensional effects. The work includes theoretical analysis and ablation studies, and the code is available for reproducibility.
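The soap-bubble phenomenon mentioned above can be illustrated numerically. The following is a minimal sketch, not taken from the paper, that assumes standard Gaussian samples as a stand-in for latent codes; the helper name `norm_statistics` is hypothetical. It shows that as the dimension grows, sample norms concentrate near sqrt(dim), so the interior of a hyperball around the center is nearly empty.

```python
import numpy as np

def norm_statistics(dim, n_samples=2000, seed=0):
    """Return mean, std, and min of the Euclidean norms of Gaussian samples.

    Assumption: standard normal samples stand in for learned latent codes.
    """
    rng = np.random.default_rng(seed)
    x = rng.standard_normal((n_samples, dim))
    norms = np.linalg.norm(x, axis=1)
    return norms.mean(), norms.std(), norms.min()

if __name__ == "__main__":
    for dim in (2, 32, 1024):
        mean, std, lo = norm_statistics(dim)
        # In high dimensions the mean approaches sqrt(dim) while the spread
        # stays roughly constant, so no sample lies near the origin.
        print(f"dim={dim:5d} mean={mean:7.3f} std={std:.3f} "
              f"min={lo:7.3f} sqrt(dim)={np.sqrt(dim):7.3f}")
```

Under these assumptions, a ball of radius sqrt(dim) around the origin would enclose a large empty core, which is the kind of loose region a shell-shaped boundary is meant to avoid.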
Abstract
Many well-known and effective anomaly detection methods assume that a reasonable decision boundary has a hypersphere shape. Such a boundary, however, is difficult to obtain in practice and is not sufficiently compact, especially when the data are in high-dimensional spaces. In this paper, we first propose a novel deep anomaly detection model that improves the original hypersphere learning through an orthogonal projection layer, which ensures that the training data distribution is consistent with the hypersphere hypothesis, thereby increasing the true positive rate and decreasing the false negative rate. Moreover, we propose a bi-hypersphere compression method to obtain a hyperspherical shell that yields a more compact decision region than a hyperball, which is demonstrated theoretically and numerically. The proposed methods are not confined to common data types such as image and tabular data, but are also extended to a more challenging but promising scenario, graph-level anomaly detection, which learns graph representations with maximum mutual information between the substructure and global structure features while exploring orthogonal single- or bi-hypersphere anomaly decision boundaries. The numerical and visualization results on benchmark datasets demonstrate the superiority of our methods in comparison to many baselines and state-of-the-art methods.
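The two ideas in the abstract, an orthogonal projection of latent codes and a shell-shaped decision region, can be sketched as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation: the projection here is realized by an eigendecomposition of the Gram matrix of the latent codes, and the score is a simple distance to the shell. The function names `orthogonal_projection` and `shell_score`, and the parameters `k`, `center`, `r_min`, and `r_max`, are hypothetical.

```python
import numpy as np

def orthogonal_projection(z, k):
    """Map latent codes z (n x d) to k dimensions with orthonormal columns.

    Assumption: orthogonality is obtained from the eigendecomposition of
    z^T z; the paper's layer may differ in detail.
    """
    evals, evecs = np.linalg.eigh(z.T @ z)
    top = np.argsort(evals)[::-1][:k]            # indices of the k largest eigenvalues
    w = evecs[:, top] / np.sqrt(evals[top])      # scale so that (z w)^T (z w) = I_k
    return z @ w

def shell_score(z, center, r_min, r_max):
    """Distance to the shell r_min <= ||z - center|| <= r_max (0 inside it).

    Assumption: a simple hinge-style distance, used here only for illustration.
    """
    d = np.linalg.norm(z - center, axis=1)
    return np.maximum(r_min - d, 0.0) + np.maximum(d - r_max, 0.0)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Correlated latent codes, standing in for encoder outputs.
    z = rng.standard_normal((500, 8)) @ rng.standard_normal((8, 8))
    z_proj = orthogonal_projection(z, k=4)
    print(np.round(z_proj.T @ z_proj, 6))        # approximately the identity

    pts = np.array([[0.5, 0.0], [1.5, 0.0], [3.0, 0.0]])
    print(shell_score(pts, center=np.zeros(2), r_min=1.0, r_max=2.0))
```

In this sketch, points inside the shell receive a score of zero, while points that fall either too close to the center or too far from it receive a positive score, which is the property that distinguishes a shell from a single hyperball.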
