AMAD: AutoMasked Attention for Unsupervised Multivariate Time Series Anomaly Detection
Tiange Huang, Yongjun Li
TL;DR
AMAD tackles unsupervised multivariate time series anomaly detection by introducing AutoMask Attention, which models multi-scale correlations, and by fusing local and global representations through Attention Mixup. The framework is trained with a reconstruction objective plus a Max-Min strategy and a Local-Global contrastive loss to encourage specialized local and global features without labeled data. Empirical results on five public benchmarks show competitive to state-of-the-art performance, with strong recall on PSM and SMAP and robust behavior across diverse datasets. The work demonstrates a practical, label-free approach that generalizes to varied anomaly patterns and temporal scales, offering improved applicability for industrial and sensor networks.
Abstract
Unsupervised multivariate time series anomaly detection (UMTSAD) plays a critical role in various domains, including finance, networks, and sensor systems. In recent years, due to the outstanding performance of deep learning in general sequential tasks, many models have been specialized for deep UMTSAD tasks and have achieved impressive results, particularly those based on the Transformer and self-attention mechanisms. However, the sequence anomaly association assumptions underlying these models are often limited to specific predefined patterns and scenarios, such as concentrated or peak anomaly patterns. These limitations hinder their ability to generalize to diverse anomaly situations, especially where the lack of labels poses significant challenges. To address these issues, we propose AMAD, which integrates \textbf{A}uto\textbf{M}asked Attention for UMTS\textbf{AD} scenarios. AMAD introduces a novel structure based on the AutoMask mechanism and an attention mixup module, forming a simple yet generalized anomaly association representation framework. This framework is further enhanced by a Max-Min training strategy and a Local-Global contrastive learning approach. By combining multi-scale feature extraction with automatic relative association modeling, AMAD provides a robust and adaptable solution to UMTSAD challenges. Extensive experimental results demonstrate that the proposed model achieving competitive performance results compared to SOTA benchmarks across a variety of datasets.
