AnoRefiner: Anomaly-Aware Group-Wise Refinement for Zero-Shot Industrial Anomaly Detection
Dayou Huang, Feng Xue, Xurui Li, Yu Zhou
TL;DR
This work tackles the coarse localization problem in zero-shot industrial anomaly detection by introducing AnoRefiner, a plug-and-play refinement framework. It combines an Anomaly Refinement Decoder (ARD) with a Progressive Group-wise Test-Time Training (PGT) procedure to achieve pixel-level anomaly segmentation without real anomaly labels. ARD leverages anomaly score maps via anomaly-attention and bidirectional refinement to suppress background and enhance anomalous cues, while PGT simulates production-like groupwise adaptation. Experiments on MVTec AD and VisA show consistent pixel-level gains across multiple ZSAD backbones (up to 5.2% pixel-AP) and demonstrate robustness to pseudo-normal contamination, suggesting strong practical impact for automated industrial inspection.
Abstract
Zero-shot industrial anomaly detection (ZSAD) methods typically yield coarse anomaly maps as vision transformers (ViTs) extract patch-level features only. To solve this, recent solutions attempt to predict finer anomalies using features from ZSAD, but they still struggle to recover fine-grained anomalies without missed detections, mainly due to the gap between randomly synthesized training anomalies and real ones. We observe that anomaly score maps exactly provide complementary spatial cues that are largely absent from ZSAD's image features, a fact overlooked before. Inspired by this, we propose an anomaly-aware refiner (AnoRefiner) that can be plugged into most ZSAD models and improve patch-level anomaly maps to the pixel level. First, we design an anomaly refinement decoder (ARD) that progressively enhances image features using anomaly score maps, reducing the reliance on synthetic anomaly data. Second, motivated by the mass production paradigm, we propose a progressive group-wise test-time training (PGT) strategy that trains ARD in each product group for the refinement process in the next group, while staying compatible with any ZSAD method. Experiments on the MVTec AD and VisA datasets show that AnoRefiner boosts various ZSAD models by up to a 5.2\% gain in pixel-AP metrics, which can also be directly observed in many visualizations. The code will be available at https://github.com/HUST-SLOW/AnoRefiner.
