Quasiparticle Interference Kernel Extraction with Variational Autoencoders via Latent Alignment
Yingshuai Ji, Haomin Zhuang, Matthew Toole, James McKenzie, Xiaolong Liu, Xiangliang Zhang
TL;DR
The paper tackles the ill-posed problem of extracting a single-scatterer QPI kernel $\mathcal{A}$ from multi-scatterer STM images $\mathcal{Y}$ by introducing a two-step variational autoencoder with latent alignment. Step 1 learns a compact kernel latent space via a kernel VAE, and Step 2 trains an observation encoder to map $(\mathcal{Y},\mathcal{M})$ into that space, enforcing $\mathbf{h}_y \approx \mathbf{h}_A$. A 100-kernel synthetic dataset (50k samples) with noise augmentation is developed and validated on real Ag and FeSe data, showing superior accuracy and generalization over a one-step baseline. Ablation studies demonstrate the value of symmetric losses, pre-trained VAEs, and noise augmentation in improving robustness. This data-driven framework provides a scalable, physically informed approach to inverse problems in quantum materials and enables efficient kernel extraction and benchmarking.
Abstract
Quasiparticle interference (QPI) imaging is a powerful tool for probing electronic structures in quantum materials, but extracting the single-scatterer QPI pattern (i.e., the kernel) from a multi-scatterer image remains a fundamentally ill-posed inverse problem, because many different kernels can combine to produce almost the same observed image, and noise or overlaps further obscure the true signal. Existing solutions to this extraction problem rely on manually zooming into small local regions with isolated single-scatterers. This is infeasible for real cases where scattering conditions are too complex. In this work, we propose the first AI-based framework for QPI kernel extraction, which models the space of physically valid kernels and uses this knowledge to guide the inverse mapping. We introduce a two-step learning strategy that decouples kernel representation learning from observation-to-kernel inference. In the first step, we train a variational autoencoder to learn a compact latent space of scattering kernels. In the second step, we align the latent representation of QPI observations with those of the pre-learned kernels using a dedicated encoder. This design enables the model to infer kernels robustly under complex, entangled scattering conditions. We construct a diverse and physically realistic QPI dataset comprising 100 unique kernels and evaluate our method against a direct one-step baseline. Experimental results demonstrate that our approach achieves significantly higher extraction accuracy, improved generalization to unseen kernels. To further validate its effectiveness, we also apply the method to real QPI data from Ag and FeSe samples, where it reliably extracts meaningful kernels under complex scattering conditions.
