DiffNMR2: NMR Guided Sampling Acquisition Through Diffusion Model Uncertainty
Etienne Goffinet, Sen Yan, Fabrizio Gabellieri, Laurence Jennings, Lydia Gkoura, Filippo Castiglione, Ryan Young, Idir Malki, Ankita Singh, Thomas Launey
TL;DR
DiffNMR2 presents a diffusion-model guided sampling framework to accelerate 2D NMR acquisition by leveraging model uncertainty to adaptively select evolution times. Trained on real protein spectra, the method uses a Repaint-style diffusion inpainting pipeline to reconstruct masked frequency-time maps, enabling row-wise guided sampling. Across a real 100-protein dataset, Guided Sampling markedly outperforms baselines, achieving a 52.9% reduction in MSE at 10% acquisition and a 55.6% reduction in hallucinated peaks, with substantial wall-time savings for longer experiments. The approach integrates non-uniform sampling with diffusion-based reconstruction to deliver faster, high-fidelity spectral analyses and suggests broad applicability to higher-dimensional NMR and spectroscopy workflows.
Abstract
Nuclear Magnetic Resonance (NMR) spectrometry uses electro-frequency pulses to probe the resonance of a compound's nucleus, which is then analyzed to determine its structure. The acquisition time of high-resolution NMR spectra remains a significant bottleneck, especially for complex biological samples such as proteins. In this study, we propose a novel and efficient sub-sampling strategy based on a diffusion model trained on protein NMR data. Our method iteratively reconstructs under-sampled spectra while using model uncertainty to guide subsequent sampling, significantly reducing acquisition time. Compared to state-of-the-art strategies, our approach improves reconstruction accuracy by 52.9\%, reduces hallucinated peaks by 55.6%, and requires 60% less time in complex NMR experiments. This advancement holds promise for many applications, from drug discovery to materials science, where rapid and high-resolution spectral analysis is critical.
