Machine learning for structure-property relationships: Scalability and limitations
Zhongzheng Tian, Sheng Zhang, Gia-Wei Chern
TL;DR
The paper addresses the challenge of scalable prediction of intensive properties and phase classification in many-body systems by exploiting locality through a block-based approach with block size $\ell$ and averaging over blocks to reach large systems. It introduces a symmetry-aware lattice descriptor based on group-theoretical irreducible representations under $D_4$ and $Z_2$, including reference IR coefficients, fed into a CNN+FFN to predict block quantities $\mathcal Q_{\alpha}$. Results on the 2D Ising model reveal a scaling relation between block size and correlation length: the prediction error scales as $\varepsilon \sim \ell^{-\theta} \Phi(\ell/\xi)$ with $\theta \approx 0.12$ and the critical window scales as $\Delta T \sim \ell^{-1/\nu}$ with $\nu = 1$. The framework provides a practical, transferable path to large-scale predictions while clarifying fundamental scalability limits set by the diverging correlation length, and it points to extensions to electronic properties and unsupervised methods.
Abstract
We present a scalable machine learning (ML) framework for predicting intensive properties and particularly classifying phases of many-body systems. Scalability and transferability are central to the unprecedented computational efficiency of ML methods. In general, linear-scaling computation can be achieved through the divide and conquer approach, and the locality of physical properties is key to partitioning the system into sub-domains that can be solved separately. Based on the locality assumption, ML model is developed for the prediction of intensive properties of a finite-size block. Predictions of large-scale systems can then be obtained by averaging results of the ML model from randomly sampled blocks of the system. We show that the applicability of this approach depends on whether the block-size of the ML model is greater than the characteristic length scale of the system. In particular, in the case of phase identification across a critical point, the accuracy of the ML prediction is limited by the diverging correlation length. The two-dimensional Ising model is used to demonstrate the proposed framework. We obtain an intriguing scaling relation between the prediction accuracy and the ratio of ML block size over the spin-spin correlation length. Implications for practical applications are also discussed.
