Multi-channel learning for integrating structural hierarchies into context-dependent molecular representation
Yue Wan, Jialu Wu, Tingjun Hou, Chang-Yu Hsieh, Xiaowei Jia
TL;DR
The paper tackles data scarcity and activity cliffs in molecular property prediction by introducing a prompt-guided, multi-channel self-supervised pre-training framework. It learns separate representations through molecule-distancing, scaffold-distancing, and context-prediction channels, and combines them during fine-tuning via a task-specific prompt mechanism. A novel adaptive-margin triplet loss and scaffold-invariant perturbations improve discrimination among closely related structures while preserving chemical knowledge. Across MoleculeNet and MoleculeACE benchmarks, the approach achieves state-of-the-art or competitive performance and shows enhanced robustness to activity cliffs, with representation-space analyses demonstrating better knowledge preservation during fine-tuning and interpretable channel contributions. This framework offers a principled path toward context-dependent, chemistry-aware molecular representations with practical implications for drug discovery and beyond.
Abstract
Reliable molecular property prediction is essential for various scientific endeavors and industrial applications, such as drug discovery. However, the data scarcity, combined with the highly non-linear causal relationships between physicochemical and biological properties and conventional molecular featurization schemes, complicates the development of robust molecular machine learning models. Self-supervised learning (SSL) has emerged as a popular solution, utilizing large-scale, unannotated molecular data to learn a foundational representation of chemical space that might be advantageous for downstream tasks. Yet, existing molecular SSL methods largely overlook chemical knowledge, including molecular structure similarity, scaffold composition, and the context-dependent aspects of molecular properties when operating over the chemical space. They also struggle to learn the subtle variations in structure-activity relationship. This paper introduces a novel pre-training framework that learns robust and generalizable chemical knowledge. It leverages the structural hierarchy within the molecule, embeds them through distinct pre-training tasks across channels, and aggregates channel information in a task-specific manner during fine-tuning. Our approach demonstrates competitive performance across various molecular property benchmarks and offers strong advantages in particularly challenging yet ubiquitous scenarios like activity cliffs.
