A Hybrid Neural Architecture: Online Attosecond X-ray Characterization
Jack Hirschman, Benjamin Mencer, Razib Obaid, Amanda Shackelford, Ryan Coffee
TL;DR
The paper addresses the need for real-time, single-shot attosecond-scale characterization of SASE sub-spikes in MRCO sinograms at high repetition rates. It introduces DCIPHR, a modular hybrid architecture that fuses CNNs, a BiLSTM-based sub-spike classifier, and ResNet-based phase regressors, trained on CookieSim synthetic data to denoise, count sub-spikes, and estimate phase delays. Key results show the denoiser achieving ~${SSIM-G}$ gains of about 29%, sub-spike counting accuracy above 96%, and a double-sub-spike phase difference RMSE of 0.04652 rad, translating to ~29.6 attoseconds at a 1.2 µm streaking wavelength; end-to-end inference latency is 168.3 µs with >10 kHz throughput on Groq hardware and potential scaling to 14 kHz with FPGA. This enables real-time streaming event selection and autonomous experimental control at MHz-class rates, supporting plans for XLEAP and digital-twin approaches in next-generation high-rate XFEL facilities like LCLS-II.
Abstract
The emergence of high-repetition-rate x-ray free-electron lasers, such as SLAC's LCLS-II, serve as our canonical example for autonomous controls that necessitate high-throughput diagnostics paired with streaming computational pipelines capable of single-shot analysis with extremely low latency. We present the Deterministic Characterization with an Integrated Parallelizable Hybrid Resolver architecture, a hybrid machine learning framework designed for fast, accurate analysis of XFEL diagnostics using angular streaking-based sinogram images. This architecture integrates convolutional neural networks and bidirectional long short-term memory models to denoise input, identify x-ray sub-spike features, and extract sub-spike relative delays with sub-30 attosecond temporal resolution. Deployed on low-latency hardware, it achieves over 10~kHz throughput with \SI{168.3}{\micro\second} inference latency, indicating scalability to 14~kHz with FPGA integration. By transforming regression tasks into classification problems and leveraging optimized error encoding, we achieve high precision with low-latency performance that is critical for real-time streaming event selection and experimental control feedback signals. This represents a key development in real-time control pipelines for next-generation autonomous science, generally, and high repetition-rate x-ray experiments in particular.
