Automated Anomaly Detection on European XFEL Klystrons
Antonin Sulc, Annika Eichler, Tim Wilksen
TL;DR
The paper addresses reducing downtime in European XFEL klystrons by detecting anomalies in high-power RF signals without labeled faults. It employs an unsupervised deep one-class classification using an LSTM encoder to map 205-feature sequential inputs to a latent center, minimizing $s(x) = \|f_\theta(x) - c\|_2$ for normal data and producing an anomaly score. Two real events (Mar 7, 2023 and Feb 1, 2024) demonstrate the method's ability to reveal precursors and distinct state clusters via anomaly scores and TSNE embeddings, enabling earlier diagnostics. This approach offers data-driven lead time for preventive actions, potentially increasing uptime and reducing costly downtime in the accelerator facility.
Abstract
High-power multi-beam klystrons represent a key component to amplify RF to generate the accelerating field of the superconducting radio frequency (SRF) cavities at European XFEL. Exchanging these high-power components takes time and effort, thus it is necessary to minimize maintenance and downtime and at the same time maximize the device's operation. In an attempt to explore the behavior of klystrons using machine learning, we completed a series of experiments on our klystrons to determine various operational modes and conduct feature extraction and dimensionality reduction to extract the most valuable information about a normal operation. To analyze recorded data we used state-of-the-art data-driven learning techniques and recognized the most promising components that might help us better understand klystron operational states and identify early on possible faults or anomalies.
