Table of Contents
Fetching ...

MING: A Functional Approach to Learning Molecular Generative Models

Van Khoa Nguyen, Maciej Falkiewicz, Giangiacomo Mercatali, Alexandros Kalousis

TL;DR

MING tackles molecule generation by learning distributions directly in function space rather than graphs or sequences. It introduces a diffusion process over molecular function evaluations, coupled with an EM-based denoising objective and TwinINR denoisers to operate on irregular domains defined by graph spectral coordinates. The approach yields a lightweight, fast generator that achieves competitive or superior validity, novelty, and distribution alignment (e.g., FCD, NSPDK) across QM9, ZINC250k, and MOSES, while reducing parameter count and inference time. This functional perspective offers a scalable alternative to graph-equivariant architectures and points to broader applications of function-space generative modeling in chemistry.

Abstract

Traditional molecule generation methods often rely on sequence- or graph-based representations, which can limit their expressive power or require complex permutation-equivariant architectures. This paper introduces a novel paradigm for learning molecule generative models based on functional representations. Specifically, we propose Molecular Implicit Neural Generation (MING), a diffusion-based model that learns molecular distributions in the function space. Unlike standard diffusion processes in the data space, MING employs a novel functional denoising probabilistic process, which jointly denoises information in both the function's input and output spaces by leveraging an expectation-maximization procedure for latent implicit neural representations of data. This approach enables a simple yet effective model design that accurately captures underlying function distributions. Experimental results on molecule-related datasets demonstrate MING's superior performance and ability to generate plausible molecular samples, surpassing state-of-the-art data-space methods while offering a more streamlined architecture and significantly faster generation times. The code is available at https://github.com/v18nguye/MING.

MING: A Functional Approach to Learning Molecular Generative Models

TL;DR

MING tackles molecule generation by learning distributions directly in function space rather than graphs or sequences. It introduces a diffusion process over molecular function evaluations, coupled with an EM-based denoising objective and TwinINR denoisers to operate on irregular domains defined by graph spectral coordinates. The approach yields a lightweight, fast generator that achieves competitive or superior validity, novelty, and distribution alignment (e.g., FCD, NSPDK) across QM9, ZINC250k, and MOSES, while reducing parameter count and inference time. This functional perspective offers a scalable alternative to graph-equivariant architectures and points to broader applications of function-space generative modeling in chemistry.

Abstract

Traditional molecule generation methods often rely on sequence- or graph-based representations, which can limit their expressive power or require complex permutation-equivariant architectures. This paper introduces a novel paradigm for learning molecule generative models based on functional representations. Specifically, we propose Molecular Implicit Neural Generation (MING), a diffusion-based model that learns molecular distributions in the function space. Unlike standard diffusion processes in the data space, MING employs a novel functional denoising probabilistic process, which jointly denoises information in both the function's input and output spaces by leveraging an expectation-maximization procedure for latent implicit neural representations of data. This approach enables a simple yet effective model design that accurately captures underlying function distributions. Experimental results on molecule-related datasets demonstrate MING's superior performance and ability to generate plausible molecular samples, surpassing state-of-the-art data-space methods while offering a more streamlined architecture and significantly faster generation times. The code is available at https://github.com/v18nguye/MING.

Paper Structure

This paper contains 47 sections, 2 theorems, 21 equations, 8 figures, 6 tables, 1 algorithm.

Key Result

Proposition 3.1

For a triplet-based representation of molecule function evaluation $s_{t} \triangleq (\phi_{t}, z_{t}, y_{t})$, by assuming the topological invariance $\phi_{t} \simeq \phi_{0}$ and the temporal conditional independence of latent posterior $q(z_{t}|\phi_0, y_{t})$, the KL divergence between two func

Figures (8)

  • Figure 1: MING Graphical Model. (Top) A forward process $q(s_{t} | s_{t-1})$ defines a noise schedule on function's output space. We introduce an INR-based denoising network $\theta$ that learns to estimate reverse diffusion processes on function space. (Bottom) A noisy signal $y_t$ is a function of trainable latent $z_t$ and coordinate-wise inputs $\phi_t$. We parameterize the dependency by an another INR-based network $\psi$. The unobserved latent input $z_t$ can be obtained from the noisy output $y_t$ via gradient-descent optimization.
  • Figure 2: TwinINR Operational Flows. (Left) TwinINR predicts a multi-dimension vector that encodes atom-type, bond-type, and a null token probability. While the node-coordinate system $\mathcal{T}$ bases on the eigenvector space of molecular graph Laplacian, we introduce the edge-coordinate system by taking the product of node-coordinate spaces $\mathcal{T}\times\mathcal{T}$. (Right) The TwinINR's internal architecture is composed of two identical (twin) conditional INR networks $\psi$ and $\theta$. The latent network $\psi$ estimates the latent-input representation $z$ of the diffused signal $y_t$ at the diffusion step $t$ by maximizing the latent model likelihood. The denoising network $\theta$ predicts the clean signal $y_0$ from the noisy latent input $z$. We adopt the latent-modulation strategy mehta2021modulated for each network.
  • Figure 3: Benchmarking model efficiency on ZINC250k
  • Figure 4: Validity, uniqueness, and novelty results across different number of diffusion time steps $T$ on ZINC250k
  • Figure 5: Validity (Left), uniqueness (Middle), and novelty (Right) results on QM9 using SINE and ReLU activation functions.
  • ...and 3 more figures

Theorems & Definitions (2)

  • Proposition 3.1
  • Proposition 3.2