All-atom Diffusion Transformers: Unified generative modelling of molecules and materials

Chaitanya K. Joshi; Xiang Fu; Yi-Lun Liao; Vahe Gharakhanyan; Benjamin Kurt Miller; Anuroop Sriram; Zachary W. Ulissi

All-atom Diffusion Transformers: Unified generative modelling of molecules and materials

Chaitanya K. Joshi, Xiang Fu, Yi-Lun Liao, Vahe Gharakhanyan, Benjamin Kurt Miller, Anuroop Sriram, Zachary W. Ulissi

TL;DR

ADiT presents a unified latent diffusion framework that jointly models molecules and materials via a shared all-atom latent space learned by a Transformer-based VAE, followed by a Diffusion Transformer that denoises latents and decodes them into valid structures. The approach achieves state-of-the-art or competitive results across QM9 molecules and MP20 crystals, with substantial inference speedups over equivariant diffusion models and scalable performance gains as model size increases. Joint training on both domains enables transfer learning and improves sampling validity and stability, while maintaining strong results on larger datasets like GEOM-DRUGS. This work points toward broadly generalizable foundation models for generative chemistry with practical implications for fast, cross-domain inverse design.

Abstract

Diffusion models are the standard toolkit for generative modelling of 3D atomic systems. However, for different types of atomic systems -- such as molecules and materials -- the generative processes are usually highly specific to the target system despite the underlying physics being the same. We introduce the All-atom Diffusion Transformer (ADiT), a unified latent diffusion framework for jointly generating both periodic materials and non-periodic molecular systems using the same model: (1) An autoencoder maps a unified, all-atom representations of molecules and materials to a shared latent embedding space; and (2) A diffusion model is trained to generate new latent embeddings that the autoencoder can decode to sample new molecules or materials. Experiments on MP20, QM9 and GEOM-DRUGS datasets demonstrate that jointly trained ADiT generates realistic and valid molecules as well as materials, obtaining state-of-the-art results on par with molecule and crystal-specific models. ADiT uses standard Transformers with minimal inductive biases for both the autoencoder and diffusion model, resulting in significant speedups during training and inference compared to equivariant diffusion models. Scaling ADiT up to half a billion parameters predictably improves performance, representing a step towards broadly generalizable foundation models for generative chemistry. Open source code: https://github.com/facebookresearch/all-atom-diffusion-transformer

All-atom Diffusion Transformers: Unified generative modelling of molecules and materials

TL;DR

Abstract

All-atom Diffusion Transformers: Unified generative modelling of molecules and materials

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)