Towards generalizable deep ptychography neural networks
Albert Vong, Steven Henke, Oliver Hoidn, Hanna Ruth, Junjing Deng, Alexander Hexemer, David Shapiro, Apurva Mehta, Arianna Gleason, Levi Hancock, Nicholas Schwarz
TL;DR
This work tackles the robustness gap in deep-learning-based ptychography by introducing a probe-centric, unsupervised training workflow that pairs experimentally measured probes with synthetic objects. The proposed PtychoPINN-torch integrates physics-informed inductive biases, overlap constraints, and a multi-probe learning regime to achieve generalization across multiple beamlines and instruments, with a synthetic data regime as small as ~28k examples achieving competitive performance. The study demonstrates single-experiment transfer, multi-probe representations, and frequency-sensitive effects of synthetic objects, showing meaningful reconstruction fidelity improvements and substantial real-time inference speedups compared to conventional iterative methods. The approach promises practical deployment of experiment-steering models capable of real-time feedback under dynamic conditions, while highlighting probe learning as a dominant factor in performance and paving the way for larger probe libraries and more expressive architectures.
Abstract
X-ray ptychography is a data-intensive imaging technique expected to become ubiquitous at next-generation light sources delivering many-fold increases in coherent flux. The need for real-time feedback under accelerated acquisition rates motivates surrogate reconstruction models like deep neural networks, which offer orders-of-magnitude speedup over conventional methods. However, existing deep learning approaches lack robustness across diverse experimental conditions. We propose an unsupervised training workflow emphasizing probe learning by combining experimentally-measured probes with synthetic, procedurally generated objects. This probe-centric approach enables a single physics-informed neural network to reconstruct unseen experiments across multiple beamlines; among the first demonstrations of multi-probe generalization. We find probe learning is equally important as in-distribution learning; models trained using this synthetic workflow achieve reconstruction fidelity comparable to those trained exclusively on experimental data, even when changing the type of synthetic training object. The proposed approach enables training of experiment-steering models that provide real-time feedback under dynamic experimental conditions.
