Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models

Cong Liu; Chengyue Gong; Zhenyu Liu; Jiale Zhao; Yuxuan Zhang

Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models

Cong Liu, Chengyue Gong, Zhenyu Liu, Jiale Zhao, Yuxuan Zhang

TL;DR

A two-stage generative framework that combines the strengths of large language models (LLMs) and flow-based models for flexible and precise material generation, and achieves competitive performance on material generation and crystal structure prediction tasks.

Abstract

Generative models hold great promise for accelerating material discovery but are often limited by their inflexible single-stage generative process in designing valid and diverse materials. To address this, we propose a two-stage generative framework, Lang2Str, that combines the strengths of large language models (LLMs) and flow-based models for flexible and precise material generation. Our method frames the generative process as a conditional generative task, where an LLM provides high-level conditions by generating descriptions of material unit cells' geometric layouts and properties. These descriptions, informed by the LLM's extensive background knowledge, ensure reasonable structure designs. A conditioned flow model then decodes these textual conditions into precise continuous coordinates and unit cell parameters. This staged approach combines the structured reasoning of LLMs and the distribution modeling capabilities of flow models. Experimental results show that our method achieves competitive performance on \textit{ab initio} material generation and crystal structure prediction tasks, with generated structures exhibiting closer alignment to ground truth in both geometry and energy levels, surpassing state-of-the-art models. The flexibility and modularity of our framework further enable fine-grained control over the generation process, potentially leading to more efficient and customizable material design.

Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models

TL;DR

Abstract

Paper Structure (24 sections, 6 equations, 2 figures, 7 tables)

This paper contains 24 sections, 6 equations, 2 figures, 7 tables.

Introduction
Preliminaries
Crystal and Crystal Structure Representations
Space Group
Task Definition
Methods
Flow Matching Models for Crystal Structures
Lattice parameters $\mathbf{L}$
Fraction coordinates $\mathbf{F}$
Language Descriptor for Crystals
Experiments
Ab Initio Generation
Dataset
Results on Proxy Metrics
Results on CHGNet-related Metrics
...and 9 more sections

Figures (2)

Figure 1: The generation of crystal structures.
Figure 2: Pipeline for Generating Crystal Structures from Textual Descriptions With Flows. This figure illustrates the overall pipeline for generating crystal structures using our framework at time step $t$. The process begins with retrieving space group information given chemical composition (e.g., GaTe) with CSPML. A fine-tuned LLM generates a textual description of the crystal structure given chemical composition and the retrieved space group, capturing its detailed geometric properties. The resulting text embedding is integrated with the node embeddings of the crystal structure through cross-attention layers, enabling efficient interactions between the textual and structural representations.

Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models

TL;DR

Abstract

Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models

Authors

TL;DR

Abstract

Table of Contents

Figures (2)