SCoNE: Spherical Consistent Neighborhoods Ensemble for Effective and Efficient Multi-View Anomaly Detection
Yang Xu, Hang Zhang, Yixiao Ma, Ye Zhu, Kai Ming Ting
TL;DR
The paper tackles multi-view anomaly detection by requiring consistency of local neighborhoods across views. It introduces SCoNE, which directly uses a small set of multi-view instances to define adaptive-radius spherical neighborhoods that are shared across all views, enabling a linear-time, learning-free approach. The method provides theoretical insights into density-adaptive neighborhood sizing and cross-view consistency, and it demonstrates superior accuracy and scalability on synthetic and real-world datasets, including large-scale MNIST-like data and social networks. Overall, SCoNE offers a practical, scalable paradigm for robust multi-view anomaly detection that outperforms existing learning-based representations.
Abstract
The core problem in multi-view anomaly detection is to represent local neighborhoods of normal instances consistently across all views. Recent approaches consider a representation of local neighborhood in each view independently, and then capture the consistent neighbors across all views via a learning process. They suffer from two key issues. First, there is no guarantee that they can capture consistent neighbors well, especially when the same neighbors are in regions of varied densities in different views, resulting in inferior detection accuracy. Second, the learning process has a high computational cost of $\mathcal{O}(N^2)$, rendering them inapplicable for large datasets. To address these issues, we propose a novel method termed \textbf{S}pherical \textbf{C}onsistent \textbf{N}eighborhoods \textbf{E}nsemble (SCoNE). It has two unique features: (a) the consistent neighborhoods are represented with multi-view instances directly, requiring no intermediate representations as used in existing approaches; and (b) the neighborhoods have data-dependent properties, which lead to large neighborhoods in sparse regions and small neighborhoods in dense regions. The data-dependent properties enable local neighborhoods in different views to be represented well as consistent neighborhoods, without learning. This leads to $\mathcal{O}(N)$ time complexity. Empirical evaluations show that SCoNE has superior detection accuracy and runs orders-of-magnitude faster in large datasets than existing approaches.
