Multi-Flow: Multi-View-Enriched Normalizing Flows for Industrial Anomaly Detection
Mathis Kruse, Bodo Rosenhahn
TL;DR
Multi-Flow introduces a multi-view normalizing flow for industrial anomaly detection, explicitly fusing information across multiple viewpoints to improve exact likelihood estimation on object patches. The method operates on features from a frozen extractor, uses background removal via MVANet, and employs cross-view st-Networks with top-view and neighbor-view connections to share information across views. Training maximizes likelihood with a noise-conditioned flow and a change-of-variables loss, achieving state-of-the-art performance on Real-IAD for both sample-wise and image-wise anomaly detection, with ablations confirming the value of cross-view fusion and background removal. The approach demonstrates strong practical impact for visual inspection in manufacturing, enabling view-agnostic anomaly detection and robust performance with scalable multi-view setups.
Abstract
With more well-performing anomaly detection methods proposed, many of the single-view tasks have been solved to a relatively good degree. However, real-world production scenarios often involve complex industrial products, whose properties may not be fully captured by one single image. While normalizing flow based approaches already work well in single-camera scenarios, they currently do not make use of the priors in multi-view data. We aim to bridge this gap by using these flow-based models as a strong foundation and propose Multi-Flow, a novel multi-view anomaly detection method. Multi-Flow makes use of a novel multi-view architecture, whose exact likelihood estimation is enhanced by fusing information across different views. For this, we propose a new cross-view message-passing scheme, letting information flow between neighboring views. We empirically validate it on the real-world multi-view data set Real-IAD and reach a new state-of-the-art, surpassing current baselines in both image-wise and sample-wise anomaly detection tasks.
