Rethinking Crystal Symmetry Prediction: A Decoupled Perspective
Liheng Yu, Zhe Zhao, Xucong Wang, Di Wu, Pengkun Wang
TL;DR
The paper tackles the SPC-driven challenge of predicting crystal space groups from PXRD data. It proposes XRDecoupler, a decoupled framework that injects chemical knowledge via multidimensional superclasses (crystal systems, Bravais lattices, and point groups) and combines a hierarchical PXRD pattern learner with a multi-objective optimization to balance learning across subsymbolic properties. The approach comprises two core components: Superclass-Guided Optimization, which maximizes discriminative information across sub-properties using a Pareto-optimized gradient direction, and Hierarchical PXRD Pattern Learning, which fuses local peak relations and global pattern context into a joint representation $E=\text{Concat}(E_{global},E_{local})$. Empirical results on MOF, CoREMOF, and InorganicData show XRDecoupler achieving state-of-the-art accuracy and better generalization to out-of-domain data, validating the value of embedding chemical principles into symmetry prediction and of balancing multiple superclass objectives.
Abstract
Efficiently and accurately determining the symmetry is a crucial step in the structural analysis of crystalline materials. Existing methods usually mindlessly apply deep learning models while ignoring the underlying chemical rules. More importantly, experiments show that they face a serious sub-property confusion SPC problem. To address the above challenges, from a decoupled perspective, we introduce the XRDecoupler framework, a problem-solving arsenal specifically designed to tackle the SPC problem. Imitating the thinking process of chemists, we innovatively incorporate multidimensional crystal symmetry information as superclass guidance to ensure that the model's prediction process aligns with chemical intuition. We further design a hierarchical PXRD pattern learning model and a multi-objective optimization approach to achieve high-quality representation and balanced optimization. Comprehensive evaluations on three mainstream databases (e.g., CCDC, CoREMOF, and InorganicData) demonstrate that XRDecoupler excels in performance, interpretability, and generalization.
