Autoencoder-based time series anomaly detection for ATLAS Liquid Argon calorimeter data quality monitoring
Vilius Čepaitis
TL;DR
Problem: Detect rare and unseen anomalies in ATLAS Liquid Argon calorimeter data during data quality monitoring. Approach: an unsupervised LSTM autoencoder trained on good-quality time-series data reconstructs input; anomalies are flagged when the reconstruction loss exceeds a threshold, $L_{th}=1.3$, with loss defined as $L = (1/N) * \sum_{i=1}^{N} (X_i - g(f(X_i)))^2$. Data: 16-dimensional time series derived from topo-cluster $Q$-factor and timing $\tau$, aggregated across four EM partitions and normalized. Findings: validation on noise-burst data shows the AE separates noisy events from good data and can detect additional anomalous windows beyond a reference detector, with evidence of generalization to cosmic and heavy-ion data. Significance: demonstrates a practical, automated early-warning capability for LAr calorimeter DQM and potential applicability to transient calorimeter issues.
Abstract
The ATLAS experiment at the LHC employs comprehensive data quality monitoring procedures to ensure high-quality physics data. This contribution presents a long short-term memory autoencoder-based algorithm for detecting anomalies in ATLAS Liquid Argon calorimeter data, represented as multidimensional time series of statistical moments of energy cluster properties. Trained on good-quality data, the model identifies anomalous intervals. Validation is performed using a known short-term issue of noise bursts, and the potential for broader application to transient calorimeter issues is discussed.
