Crossing New Frontiers: Knowledge-Augmented Large Language Model Prompting for Zero-Shot Text-Based De Novo Molecule Design
Sakhinana Sagar Srinivas, Venkataramana Runkana
TL;DR
This work introduces FrontierX: LLM-MG, a knowledge-augmented prompting framework for zero-shot text-to-molecule generation that combines outputs from off-the-shelf LLMs with domain-tuned small LMs. Top-$R$ SMILES and explanatory rationales produced by LLMs are integrated via a two-layer hierarchical multi-head attention mechanism to form cross-modal embeddings that drive a Transformer decoder to generate accurate SMILES strings. The approach achieves state-of-the-art performance on the text2mol task (ChEBI-20) and demonstrates the value of explicit explanations and cross-modal fusion, with ablations confirming the contribution of each component. The methodology enables efficient, scalable molecule design without fine-tuning large models, highlighting a practical pathway for knowledge-infused prompting in cross-domain generative chemistry.
Abstract
Molecule design is a multifaceted approach that leverages computational methods and experiments to optimize molecular properties, fast-tracking new drug discoveries, innovative material development, and more efficient chemical processes. Recently, text-based molecule design has emerged, inspired by next-generation AI tasks analogous to foundational vision-language models. Our study explores the use of knowledge-augmented prompting of large language models (LLMs) for the zero-shot text-conditional de novo molecular generation task. Our approach uses task-specific instructions and a few demonstrations to address distributional shift challenges when constructing augmented prompts for querying LLMs to generate molecules consistent with technical descriptions. Our framework proves effective, outperforming state-of-the-art (SOTA) baseline models on benchmark datasets.
