PDDFormer: Pairwise Distance Distribution Graph Transformer for Crystal Material Property Prediction
Xiangxiang Shen, Zheng Wan, Lingfeng Wen, Licheng Sun, Jian Yang, Xuan Tang, Shing-Ho J. Lin, Xiao He, Mingsong Chen, Xian Wei
TL;DR
This work tackles crystal data fuzziness by introducing atom-weighted and unit-cell pairwise distance distributions (WPDD/UPDD) to form a continuous, geometrically complete crystal graph. By modeling PDD as global information and integrating it into matrix-based message passing, the authors present PDDFormer, a transformer-based framework that achieves state-of-the-art performance on Materials Project and JARVIS benchmarks while maintaining computational efficiency. Theoretical guarantees are provided for continuity under perturbations via Earth Mover's Distance and for geometric completeness (WPDD) and continuity (UPDD), with practical proofs and ablations supporting the design. The approach bridges traditional crystal descriptors and dynamic atomic behavior, enabling more reliable and scalable crystal property predictions with robust invariance to perturbations.
Abstract
Crystal structures can be simplified as a periodic point set that repeats across three-dimensional space along an underlying lattice. Traditionally, crystal representation methods characterize the structure using descriptors such as lattice parameters, symmetry, and space groups. However, in reality, atoms in materials always vibrate above absolute zero, causing their positions to fluctuate continuously. This dynamic behavior disrupts the fundamental periodicity of the lattice, making crystal graphs based on static lattice parameters and conventional descriptors discontinuous under slight perturbations. Chemists proposed the pairwise distance distribution (PDD) method to address this problem. However, the completeness of PDD requires defining a large number of neighboring atoms, leading to high computational costs. Additionally, PDD does not account for atomic information, making it challenging to apply it directly to crystal material property prediction tasks. To tackle these challenges, we introduce the atom-Weighted Pairwise Distance Distribution (WPDD) and Unit cell Pairwise Distance Distribution (UPDD) and apply them to the construction of multi-edge crystal graphs. We demonstrate the continuity and general completeness of crystal graphs under slight atomic position perturbations. Moreover, by modeling PDD as global information and integrating it into matrix-based message passing, we significantly reduce computational costs. Comprehensive evaluation results show that WPDDFormer achieves state-of-the-art predictive accuracy across tasks on benchmark datasets such as the Materials Project and JARVIS-DFT.
