Table of Contents
Fetching ...

Equivariant Masked Position Prediction for Efficient Molecular Representation

Junyi An, Chao Qu, Yun-Fei Shi, XinHao Liu, Qianwei Tang, Fenglei Cao, Yuan Qi

TL;DR

The paper tackles limited molecular data hindering generalization in GNNs by introducing Equivariant Masked Position Prediction (EMPP), a self-supervised task that predicts masked atomic positions from neighboring structure to learn quantum-mechanical features without relying on Gaussian-mixture denoising. EMPP uses SO(3)-equivariant backbones and spherical-harmonics-based direction-radius distributions to predict a masked atom's position in a well-posed manner, enabling rich 3D learning and deterministic force-related information. It provides both a pre-training mechanism (e.g., on PCQM4Mv2) and an auxiliary-task setup to boost downstream quantum-property predictions, outperforming state-of-the-art masking and denoising methods across QM9, MD17, and GEOM-Drug benchmarks. The approach yields substantial generalization gains, leverages data-generation via masking, and opens avenues for higher-order equivariant representations (e.g., $L_{max}>3$) in molecular modeling.

Abstract

Graph neural networks (GNNs) have shown considerable promise in computational chemistry. However, the limited availability of molecular data raises concerns regarding GNNs' ability to effectively capture the fundamental principles of physics and chemistry, which constrains their generalization capabilities. To address this challenge, we introduce a novel self-supervised approach termed Equivariant Masked Position Prediction (EMPP), grounded in intramolecular potential and force theory. Unlike conventional attribute masking techniques, EMPP formulates a nuanced position prediction task that is more well-defined and enhances the learning of quantum mechanical features. EMPP also bypasses the approximation of the Gaussian mixture distribution commonly used in denoising methods, allowing for more accurate acquisition of physical properties. Experimental results indicate that EMPP significantly enhances performance of advanced molecular architectures, surpassing state-of-the-art self-supervised approaches. Our code is released in https://github.com/ajy112/EMPP

Equivariant Masked Position Prediction for Efficient Molecular Representation

TL;DR

The paper tackles limited molecular data hindering generalization in GNNs by introducing Equivariant Masked Position Prediction (EMPP), a self-supervised task that predicts masked atomic positions from neighboring structure to learn quantum-mechanical features without relying on Gaussian-mixture denoising. EMPP uses SO(3)-equivariant backbones and spherical-harmonics-based direction-radius distributions to predict a masked atom's position in a well-posed manner, enabling rich 3D learning and deterministic force-related information. It provides both a pre-training mechanism (e.g., on PCQM4Mv2) and an auxiliary-task setup to boost downstream quantum-property predictions, outperforming state-of-the-art masking and denoising methods across QM9, MD17, and GEOM-Drug benchmarks. The approach yields substantial generalization gains, leverages data-generation via masking, and opens avenues for higher-order equivariant representations (e.g., ) in molecular modeling.

Abstract

Graph neural networks (GNNs) have shown considerable promise in computational chemistry. However, the limited availability of molecular data raises concerns regarding GNNs' ability to effectively capture the fundamental principles of physics and chemistry, which constrains their generalization capabilities. To address this challenge, we introduce a novel self-supervised approach termed Equivariant Masked Position Prediction (EMPP), grounded in intramolecular potential and force theory. Unlike conventional attribute masking techniques, EMPP formulates a nuanced position prediction task that is more well-defined and enhances the learning of quantum mechanical features. EMPP also bypasses the approximation of the Gaussian mixture distribution commonly used in denoising methods, allowing for more accurate acquisition of physical properties. Experimental results indicate that EMPP significantly enhances performance of advanced molecular architectures, surpassing state-of-the-art self-supervised approaches. Our code is released in https://github.com/ajy112/EMPP

Paper Structure

This paper contains 37 sections, 30 equations, 6 figures, 16 tables.

Figures (6)

  • Figure 1: (a, b, c) Comparison of three molecular self-supervised methods using real halobenzenes (Ph-X) as an example: (a) Masking atomic attributes, such as atomic number, and reconstructing them; (b) Adding noise to atomic positions and predicting the noise; (c) Completely masking positions and inferring them based on neighboring structures. (d) Principle of the denoising methods: they utilize Gaussian mixture distributions to approximate the local minima of the PES, allowing the noise terms to estimate the forces, i.e. derivatives of the PES. The two local minima in the figure correspond to two equilibrium atoms, each requiring a different standard deviation $\sigma$ for approximation. As the shape of the PES is unknown, determining $\sigma$ requires an empirical approach.
  • Figure 2: The overall framework of EMPP. The masked position can be recounstructed by the GNNs output features of the neighboring nodes, with the position determined by the predicted directions and radius from those nodes.
  • Figure 3: The curve of performance varying with the standard deviation $\sigma$.
  • Figure 4: Determinacy of atomic positions in organic molecules. The relationship between atoms and colors is (Carbon, gray; Hydrogen, white; Oxygen, red; Nitrogen, blue). When the system is fixed, most atoms have a uniquely determined position. The translucent area represents the atom of interest, and the arrow indicates its unique position.
  • Figure 5: Atoms with uncertain positions exist. In organic molecules, the positions of some atoms have multiple possibilities, corresponding to several local minima of the potential energy surface. We represent the two possible positions of the atom with arrows.
  • ...and 1 more figures