Enhancing Peak Assignment in 13C NMR Spectroscopy: A Novel Approach Using Multimodal Alignment
Hao Xu, Zhengyang Zhou, Pengyu Hong
TL;DR
This work tackles the challenge of interpreting and mapping $^{13}$C NMR spectra to molecular structures, particularly under zero-shot conditions for retrieval, isomer recognition, and peak assignment. It introduces K-M3AID, a dual-level multimodal alignment framework that links molecular graphs with NMR spectra through graph-level and node-level alignments, guided by Knowledge Span-based instance discrimination. A Knowledge-Guided Instance-Wise Discrimination (KSGL) mechanism leverages continuous domain knowledge to steer cross-modal distance learning, and a communication channel enables cross-talk between encoders, with the overall objective $L = CL_{graph} + CL_{node}$. Empirical results on large-scale pretraining and zero-shot tasks show that K-M3AID achieves strong graph- and atom-level alignment, superior molecular retrieval, complete isomer recognition, and high-precision peak assignment, highlighting its potential to improve spectral interpretation and candidate ranking in practical NMR workflows. The work also demonstrates meta-learning aspects, showing that node-level skills boost graph-level alignment, and points to future work incorporating 3D graph representations to handle highly complex molecules.
Abstract
Nuclear magnetic resonance (NMR) spectroscopy plays an essential role in deciphering molecular structure and dynamic behaviors. While AI-enhanced NMR prediction models hold promise, challenges still persist in tasks such as molecular retrieval, isomer recognition, and peak assignment. In response, this paper introduces a novel solution, Multi-Level Multimodal Alignment with Knowledge-Guided Instance-Wise Discrimination (K-M3AID), which establishes correspondences between two heterogeneous modalities: molecular graphs and NMR spectra. K-M3AID employs a dual-coordinated contrastive learning architecture with three key modules: a graph-level alignment module, a node-level alignment module, and a communication channel. Notably, K-M3AID introduces knowledge-guided instance-wise discrimination into contrastive learning within the node-level alignment module. In addition, K-M3AID demonstrates that skills acquired during node-level alignment have a positive impact on graph-level alignment, acknowledging meta-learning as an inherent property. Empirical validation underscores K-M3AID's effectiveness in multiple zero-shot tasks.
