TailedCore: Few-Shot Sampling for Unsupervised Long-Tail Noisy Anomaly Detection

Yoon Gyo Jung; Jaewoo Park; Jaeho Yoon; Kuan-Chuan Peng; Wonchul Kim; Andrew Beng Jin Teoh; Octavia Camps

TailedCore: Few-Shot Sampling for Unsupervised Long-Tail Noisy Anomaly Detection

Yoon Gyo Jung, Jaewoo Park, Jaeho Yoon, Kuan-Chuan Peng, Wonchul Kim, Andrew Beng Jin Teoh, Octavia Camps

TL;DR

This work tackles unsupervised anomaly detection when normal data are both long-tailed and contaminated with noise. It introduces TailSampler, a class-size predictor that infers tail versus head samples via a reflective symmetry between inter-class and intra-class embedding similarities, enabling exclusive tail sampling. The tail-focused patches are integrated with a noise-discriminated memory bank to form TailedCore, a memory-based detector robust to both noise and class imbalance. Extensive experiments on modified MVTecAD and VisA datasets show that TailedCore consistently outperforms state-of-the-art methods across image- and pixel-level tasks under various tail/distribution and noise conditions, highlighting its practical potential for real-world industrial anomaly detection. The approach advances anomaly detection by combining principled few-shot sampling with a clean, representative memory, yielding significant improvements in challenging long-tail noisy environments.

Abstract

We aim to solve unsupervised anomaly detection in a practical challenging environment where the normal dataset is both contaminated with defective regions and its product class distribution is tailed but unknown. We observe that existing models suffer from tail-versus-noise trade-off where if a model is robust against pixel noise, then its performance deteriorates on tail class samples, and vice versa. To mitigate the issue, we handle the tail class and noise samples independently. To this end, we propose TailSampler, a novel class size predictor that estimates the class cardinality of samples based on a symmetric assumption on the class-wise distribution of embedding similarities. TailSampler can be utilized to sample the tail class samples exclusively, allowing to handle them separately. Based on these facets, we build a memory-based anomaly detection model TailedCore, whose memory both well captures tail class information and is noise-robust. We extensively validate the effectiveness of TailedCore on the unsupervised long-tail noisy anomaly detection setting, and show that TailedCore outperforms the state-of-the-art in most settings.

TailedCore: Few-Shot Sampling for Unsupervised Long-Tail Noisy Anomaly Detection

TL;DR

Abstract

TailedCore: Few-Shot Sampling for Unsupervised Long-Tail Noisy Anomaly Detection

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (2)