Table of Contents
Fetching ...

Atomistic Machine Learning with Cartesian Natural Tensors

Qun Chen, A. S. L. Subrahmanyam Pattamatta, David J. Srolovitz, Mingjian Wen

TL;DR

This work tackles the lack of a principled symmetry-preserving Cartesian framework for representing high-rank tensors in atomistic ML. It introduces the Cartesian Natural Tensor Network CarNet, based on irreducible Cartesian natural tensors and three core operations, to achieve $E(3)$-equivariant representations directly in Cartesian space. CarNet delivers accurate interatomic potentials for LiPS and water and high-fidelity tensor predictions including dipole moments, polarizabilities, shielding tensors, and the full rank-4 elastic-constant tensor, while maintaining MD stability and physical fidelity. The approach enables robust structure–property discovery and materials design in Cartesian coordinates, with broad potential extensions to other point-cloud domains.

Abstract

Atomistic machine learning (ML) is a transformative tool for accurate and efficient investigation of material behavior at the atomic scale. While such models have been constructed within Cartesian space to harness geometric information and preserve intuitive physical representations, they face inherent challenges - primarily due to the lack of a systematic symmetry-preserving framework for representing arbitrary physical tensors. We address these challenges by proposing Cartesian Natural Tensor Networks (CarNet) as a general framework for atomistic ML. We first develop the theory of irreducible representations using Cartesian natural tensors (their creation, operation, as well as the decomposition and reconstruction of physical tensors such as the elastic constant tensor). Leveraging this machinery, we design an equivariant Cartesian model and demonstrate its exceptional performance across diverse atomistic ML tasks. CarNet enables the development of highly accurate and reliable interatomic potentials for both materials and molecular systems. Furthermore, structure-property relationships can be readily constructed for tensorial quantities ranging from simple properties like the dipole moment to arbitrary high-rank tensors with complex symmetries such as the elastic constant tensor -- capabilities that were previously inaccessible. This work removes theoretical barriers and unleashes the power of Cartesian approaches for advanced atomistic ML in the understanding and design of new materials.

Atomistic Machine Learning with Cartesian Natural Tensors

TL;DR

This work tackles the lack of a principled symmetry-preserving Cartesian framework for representing high-rank tensors in atomistic ML. It introduces the Cartesian Natural Tensor Network CarNet, based on irreducible Cartesian natural tensors and three core operations, to achieve -equivariant representations directly in Cartesian space. CarNet delivers accurate interatomic potentials for LiPS and water and high-fidelity tensor predictions including dipole moments, polarizabilities, shielding tensors, and the full rank-4 elastic-constant tensor, while maintaining MD stability and physical fidelity. The approach enables robust structure–property discovery and materials design in Cartesian coordinates, with broad potential extensions to other point-cloud domains.

Abstract

Atomistic machine learning (ML) is a transformative tool for accurate and efficient investigation of material behavior at the atomic scale. While such models have been constructed within Cartesian space to harness geometric information and preserve intuitive physical representations, they face inherent challenges - primarily due to the lack of a systematic symmetry-preserving framework for representing arbitrary physical tensors. We address these challenges by proposing Cartesian Natural Tensor Networks (CarNet) as a general framework for atomistic ML. We first develop the theory of irreducible representations using Cartesian natural tensors (their creation, operation, as well as the decomposition and reconstruction of physical tensors such as the elastic constant tensor). Leveraging this machinery, we design an equivariant Cartesian model and demonstrate its exceptional performance across diverse atomistic ML tasks. CarNet enables the development of highly accurate and reliable interatomic potentials for both materials and molecular systems. Furthermore, structure-property relationships can be readily constructed for tensorial quantities ranging from simple properties like the dipole moment to arbitrary high-rank tensors with complex symmetries such as the elastic constant tensor -- capabilities that were previously inaccessible. This work removes theoretical barriers and unleashes the power of Cartesian approaches for advanced atomistic ML in the understanding and design of new materials.

Paper Structure

This paper contains 8 sections, 18 equations, 5 figures, 4 tables.

Figures (5)

  • Figure 1: Schematic illustration of Cartesian natural tensor operations.a. Construction of natural tensors of different ranks from a unit vector $\hat{\bm r}$. b. Tensor product between a rank-1 and a rank-2 natural tensor generates three natural tensors of ranks 1, 2, and 3. c. Any physical tensor (e.g., the nuclear shielding tensor) can be decomposed into a set of natural tensors and, conversely, reconstructed from them. $\otimes$ is the product between ordinary tensors, $\hat{\otimes}$ is the product between natural tensors, and $\oplus$ is the direct sum of natural tensors.
  • Figure 2: Overview of the CarNet model architecture. The relative distance vector $\bm r$ of an atom from its neighbor is encoded using a set of radial basis and natural tensors. The atomic species $z$ is encoded using a learnable embedding to generate the initial atom features $\bm h$. With the radial part $R$, the angular part $\bm X$, and the atom features $\bm h$, each GNN layer first constructs the atomic moment and then the hyper moment using natural tensor products. Finally, the atomic features are mapped to the target properties using an output head.
  • Figure 3: MD simulation results of bulk LiPS and water systems.a--c: Crystal structure, RDF, and MSD of Li$^+$ ions versus time of LiPS. d--f: Simulation cell, RDF of oxygen-oxygen pairs, and MSD versus time of bulk water. The MD simulations using CarNet were performed at a temperature of 520 K for LiPS and 300 K for water. The reference AIMD and experimental results are at the same temperatures, except for the X-ray diffraction data, which is at 295 K skinner2014the. Five MD simulations with different initial velocities were performed, and the reported diffusion coefficients $D$ are the average over these runs. The water cell in panel d is shown for demonstration; the actual MD simulation used a 2×2×2 replication of this cell. Simulation cells are plotted using AtomViz atomviz. Atom colors: purple (Li), orange (P), yellow (S), red (O), and white (H). RDF: radial distribution function; MSD: mean square displacement.
  • Figure 4: Performance of CarNet in predicting elastic properties. a. Predicted bulk modulus $K$, shear modulus $G$, and Young's modulus $E$ compared with reference DFT values (84 elements). b. Normalized error by crystal system. c. Directional Young's modulus $E_d$ of CaS predicted by the model. The cubic symmetry of rocksalt CaS is clearly reflected in the predicted $E_d$. MAE is the mean absolute error, and MAD is the mean absolute deviation.
  • Figure 5: Learning curve for ethanol dipole moment determination. MAE in the dipole moment $\bm\mu$ training as a function of the size of the training set for models with: a. tensor product mode 'full', 'level' and 'lite'; b. maximum tensor rank $L = 1,2,3$; c. maximum correlation degree $v=1,2,3$. The slope $s$ of each linearly fitted curve in log-log space is also reported.