Vib2Mol: from vibrational spectra to molecular structures-a unified deep learning framework
Xinyu Lu, Hao Ma, Hui Li, Jia Li, Yi Rong, Yuqiang Li, Tong Zhu, Guokun Liu, Bin Ren
TL;DR
Vib2Mol presents a unified deep learning framework that bridges spectrum-to-structure retrieval and generation for vibrational spectroscopy, enabling versatile tasks under varying prior knowledge. The model leverages multiphase training with alignment via contrastive learning, generation via MLM and LM, and a generate-then-rerank pipeline, augmented by coarse-to-fine retrieval and a cross-modal masking strategy to handle Raman, IR, or both inputs. It achieves state-of-the-art results on theoretical IR/Raman benchmarks and outperforms baselines on experimental data, with demonstrated capabilities in predicting reaction products and sequencing peptides, including PTM site identification. The framework shows promise for autonomous discovery workflows and in situ analysis of chemical and biological processes, with potential extensions to stereochemistry and graph-based representations for broader applicability.
Abstract
There will be a paradigm shift in chemical and biological research, to be enabled by autonomous, closed-loop, real-time self-directed decision-making experimentation. Spectrum-to-structure correlation, which is to elucidate molecular structures with spectral information, is the core step in understanding the experimental results and to close the loop. However, current approaches usually divide the task into either database-dependent retrieval and database-independent generation and neglect the inherent complementarity between them. In this study, we proposed Vib2Mol, a unified deep learning framework designed to flexibly handle diverse spectrum-to-structure tasks according to the available prior knowledge by bridging the retrieval and generation. Empowered by our coarse-to-fine retrieval and generate-then-rerank strategies, Vib2Mol not only achieves state-of-the-art performance in analyzing theoretical Infrared and Raman spectra, but also outperform previous models on experimental data. Moreover, our model demonstrates promising capabilities in predicting reaction products and sequencing peptides, enabling vibrational spectroscopy a potential guide for autonomous scientific discovery workflows.
