CatFlow: Co-generation of Slab-Adsorbate Systems via Flow Matching
Minkyu Kim, Nayoung Kim, Honghui Kim, Sungsoo Ahn
TL;DR
The paper tackles the challenge of designing heterogeneous catalysts by enabling end-to-end co-generation of slab structures and adsorbate coordinates. It introduces CatFlow, a flow-matching framework that uses a factorized slab-adsorbate representation—including primitive cell, transformation matrix, vacuum scaling factor, and adsorbate—to reduce modeling complexity while preserving surface orientation. A transformer-based architecture with continuous and discrete flow components learns the joint distribution for de novo generation and structure prediction, validated on OC20 against strong baselines. Results show CatFlow yields higher structural fidelity and adsorption-energy realism, approaching thermodynamic local minima and enabling efficient exploration of catalyst surfaces. This approach lays the groundwork for scalable inverse design and multi-adsorbate catalyst discovery by tightly coupling surface geometry and adsorption phenomena in a single end-to-end model.
Abstract
Discovering heterogeneous catalysts tailored for specific reaction intermediates remains a fundamental bottleneck in materials science. While traditional trial-and-error methods and recent generative models have shown promise, they struggle to capture the intrinsic coupling between surface geometry and adsorbate interactions. To address this limitation, we propose CatFlow, a flow matching-based framework for de novo design and structure prediction of heterogeneous catalysts. Our model operates on a primitive cell-based factorized representation of the slab-adsorbate complex, reducing the number of learnable variables by an average of 9.2x while explicitly encoding the surface orientation of the slab-adsorbate interface. Experiments on the Open Catalyst 2020 dataset demonstrate that CatFlow significantly improves the structural fidelity of generated catalysts compared to autoregressive and sequential baselines. Further experiments show that the generated structures accurately capture the adsorption energy distributions of physically plausible interfaces and lie closer to thermodynamic local minima.
