Materium: An Autoregressive Approach for Material Generation
Niklas Dobberstein, Jan Hamaekers
TL;DR
Materium introduces a discrete-token autoregressive framework for crystal-structure generation by encoding lattice parameters, fractional coordinates, elements, and oxidation states into a single sequence. The model is lightweight (≈43M parameters) and trains rapidly on a single GPU, enabling fast unconditional and multi-conditional generation for properties such as band gap, magnetic density, density, space group, and HHI. Empirical results show strong unconditional performance with high StructureMatcher match rates and energy proximity to relaxed structures, along with robust conditioned generation across several properties, though novelty lags behind some state-of-the-art diffusion methods. The approach offers a computationally efficient alternative to diffusion/flow models and provides room for reinforcement-learning fine-tuning to enhance output novelty and diversity.
Abstract
We present Materium: an autoregressive transformer for generating crystal structures that converts 3D material representations into token sequences. These sequences include elements with oxidation states, fractional coordinates and lattice parameters. Unlike diffusion approaches, which refine atomic positions iteratively through many denoising steps, Materium places atoms at precise fractional coordinates, enabling fast, scalable generation. With this design, the model can be trained in a few hours on a single GPU and generate samples much faster on GPUs and CPUs than diffusion-based approaches. The model was trained and evaluated using multiple properties as conditions, including fundamental properties, such as density and space group, as well as more practical targets, such as band gap and magnetic density. In both single and combined conditions, the model performs consistently well, producing candidates that align with the requested inputs.
