A Physics-Inspired Deep Learning Framework with Polar Coordinate Attention for Ptychographic Imaging
Han Yue, Jun Cheng, Yu-Xuan Ren, Chien-Chun Chen, Grant A. van Riessen, Philip Heng Wai Leong, Steve Feng Shu
TL;DR
This paper tackles the phase-retrieval challenge in ptychographic imaging by reorienting deep learning toward diffraction physics. It introduces PPN, a dual-branch architecture that combines Local Dependencies via ViT blocks with Non-Local Coherence via Polar Coordinate Attention (PoCA), aligning attention with reciprocal-space geometry. PoCA encodes radial-angular correlations and a learnable center, enabling superior high-frequency preservation and robust performance across low-overlap acquisitions, with substantial speedups over iterative methods and far fewer parameters than pure transformers. The approach yields notable gains in amplitude/phase reconstruction, demonstrates generalization to experimental data, and offers data-efficient learning for high-throughput, real-world diffraction imaging. Overall, PPN advances physics-informed DL for frequency-domain inverse problems and suggests broad applicability to Cryo-EM, X-ray, and astronomical imaging alike.
Abstract
Ptychographic imaging confronts inherent challenges in applying deep learning for phase retrieval from diffraction patterns. Conventional neural architectures, both convolutional neural networks and Transformer-based methods, are optimized for natural images with Euclidean spatial neighborhood-based inductive biases that exhibit geometric mismatch with the concentric coherent patterns characteristic of diffraction data in reciprocal space. In this paper, we present PPN, a physics-inspired deep learning network with Polar Coordinate Attention (PoCA) for ptychographic imaging, that aligns neural inductive biases with diffraction physics through a dual-branch architecture separating local feature extraction from non-local coherence modeling. It consists of a PoCA mechanism that replaces Euclidean spatial priors with physically consistent radial-angular correlations. PPN outperforms existing end-to-end models, with spectral and spatial analysis confirming its greater preservation of high-frequency details. Notably, PPN maintains robust performance compared to iterative methods even at low overlap ratios, making it well suited for high-throughput imaging in real-world acquisition scenarios for samples with consistent structural characteristics.
