InversionGNN: A Dual Path Network for Multi-Property Molecular Optimization
Yifan Niu, Ziqi Gao, Tingyang Xu, Yang Liu, Yatao Bian, Yu Rong, Junzhou Huang, Jia Li
TL;DR
This work tackles multi-objective molecular optimization by introducing InversionGNN, a dual-path graph neural network that learns chemical-property knowledge via a direct prediction path and exploits that knowledge by gradient-guided edits in a second inversion path. The inversion path operates on differentiable scaffolding trees and uses a gradient-based Pareto search, via a non-dominating descent direction computed through a Quadratic Program, to generate Pareto-optimal molecules in discrete chemical space. The authors provide a convergence analysis showing that their relaxation yields an approximation to the true Pareto front within a bounded number of iterations and demonstrate strong, sample-efficient performance on synthetic benchmarks and real drug-design tasks, including weight-conditioned Pareto front exploration. The approach yields high Pareto diversity and property scores with substantially fewer oracle evaluations, enabling practical guidance for designing molecules that balance potency, safety, and synthesizability in real-world drug discovery.
Abstract
Exploring chemical space to find novel molecules that simultaneously satisfy multiple properties is crucial in drug discovery. However, existing methods often struggle with trading off multiple properties due to the conflicting or correlated nature of chemical properties. To tackle this issue, we introduce InversionGNN framework, an effective yet sample-efficient dual-path graph neural network (GNN) for multi-objective drug discovery. In the direct prediction path of InversionGNN, we train the model for multi-property prediction to acquire knowledge of the optimal combination of functional groups. Then the learned chemical knowledge helps the inversion generation path to generate molecules with required properties. In order to decode the complex knowledge of multiple properties in the inversion path, we propose a gradient-based Pareto search method to balance conflicting properties and generate Pareto optimal molecules. Additionally, InversionGNN is able to search the full Pareto front approximately in discrete chemical space. Comprehensive experimental evaluations show that InversionGNN is both effective and sample-efficient in various discrete multi-objective settings including drug discovery.
