Graph Multi-Similarity Learning for Molecular Property Prediction
Hao Xu, Zhengyang Zhou, Pengyu Hong
TL;DR
The paper addresses molecular property prediction and the limitations of contrastive learning that relies on binary positive/negative pairs. It proposes GraphMSL, which learns a generalized multi-similarity metric from self-similarity across multiple modalities (SMILES, images, NMR, fingerprints) and fuses them multimodally, without requiring predefined pairs. The authors establish a convergent similarity property and demonstrate graph-level, node-level, and bi-level variants with strong performance on MoleculeNet benchmarks, together with post-hoc explainability analyses. Overall, GraphMSL provides a flexible, interpretable framework for multimodal molecular representation learning with demonstrated predictive gains and actionable design insights for drug discovery.
Abstract
Enhancing accurate molecular property prediction relies on effective and proficient representation learning. It is crucial to incorporate diverse molecular relationships characterized by multi-similarity (self-similarity and relative similarities) between molecules. However, current molecular representation learning methods fall short in exploring multi-similarity and often underestimate the complexity of relationships between molecules. Additionally, previous multi-similarity approaches require the specification of positive and negative pairs to attribute distinct predefined weights to different relative similarities, which can introduce potential bias. In this work, we introduce Graph Multi-Similarity Learning for Molecular Property Prediction (GraphMSL) framework, along with a novel approach to formulate a generalized multi-similarity metric without the need to define positive and negative pairs. In each of the chemical modality spaces (e.g.,molecular depiction image, fingerprint, NMR, and SMILES) under consideration, we first define a self-similarity metric (i.e., similarity between an anchor molecule and another molecule), and then transform it into a generalized multi-similarity metric for the anchor through a pair weighting function. GraphMSL validates the efficacy of the multi-similarity metric across MoleculeNet datasets. Furthermore, these metrics of all modalities are integrated into a multimodal multi-similarity metric, which showcases the potential to improve the performance. Moreover, the focus of the model can be redirected or customized by altering the fusion function. Last but not least, GraphMSL proves effective in drug discovery evaluations through post-hoc analyses of the learnt representations.
