Score-based 3D molecule generation with neural fields
Matthieu Kirchmeyer, Pedro O. Pinheiro, Saeed Saremi
TL;DR
We address unconditional 3D molecule generation by representing molecules as continuous atomic occupancy fields and decoding via a shared neural field conditioned on a per-molecule code. The method, FuncMol, uses a score-based generative pipeline with Neural Empirical Bayes to learn a denoiser and perform walk-jump sampling in latent space, followed by a continuous refinement step to recover atomic coordinates. This yields an all-atom generation framework that is compact, scalable, and capable of handling large molecules such as macrocyclic peptides, with competitive quality on standard benchmarks and at least an order-of-magnitude faster sampling than voxel-based baselines. The approach offers flexibility for auto-encoding/decoding variants and potential extensions to conditional generation and broader field-based molecular design tasks.
Abstract
We introduce a new representation for 3D molecules based on their continuous atomic density fields. Using this representation, we propose a new model based on walk-jump sampling for unconditional 3D molecule generation in the continuous space using neural fields. Our model, FuncMol, encodes molecular fields into latent codes using a conditional neural field, samples noisy codes from a Gaussian-smoothed distribution with Langevin MCMC (walk), denoises these samples in a single step (jump), and finally decodes them into molecular fields. FuncMol performs all-atom generation of 3D molecules without assumptions on the molecular structure and scales well with the size of molecules, unlike most approaches. Our method achieves competitive results on drug-like molecules and easily scales to macro-cyclic peptides, with at least one order of magnitude faster sampling. The code is available at https://github.com/prescient-design/funcmol.
