Geographic-style maps with a local novelty distance help navigate the materials space
Daniel E Widdowson, Vitaliy A Kurlin
TL;DR
The paper defines the Crystal Isometry Space $\mathrm{CRIS}(\mathbb{R}^3)$ to formalize comparisons of crystals under isometry and introduces the Local Novelty Distance (LND) metric, which uses generically complete invariants derived from the Pointwise Distance Distribution $\mathrm{PDD}(S;k)$ and its PDA representation, with distances computed via Earth Mover's Distance (EMD). It demonstrates real-time novelty assessment for 43 A-lab crystals by locating near-duplicates in the ICSD and Materials Project, revealing many pre-existing matches and mapping the material universe with invariant coordinates. The authors show that these invariant maps provide stable, continuous navigation of the materials space, overcoming cell-based discontinuities and enabling rapid novelty checks. The work suggests broad utility for self-driving labs and data integrity, and points to future work connecting invariant coordinates to structure–property relationships. Overall, the approach offers a scalable, geometry-driven framework for detecting duplicates and exploring the materials landscape.
Abstract
With the advent of self-driving labs promising to synthesize large numbers of new materials, new automated tools are required for checking potential duplicates in existing structural databases before a material can be claimed as novel. To avoid duplication, we rigorously define the novelty metric of any periodic material as the smallest distance to its nearest neighbor among already known materials. Using ultra-fast structural invariants, all such nearest neighbors can be found within seconds on a typical computer even if a given crystal is disguised by changing a unit cell, perturbing atoms, or replacing chemical elements. This real-time novelty check is demonstrated by finding near-duplicates of the 43 materials produced by Berkeley's A-lab in the world's largest collections of inorganic structures, the Inorganic Crystal Structure Database and the Materials Project. To help future self-driving labs successfully identify novel materials, we propose navigation maps of the materials space where any new structure can be quickly located by its invariant descriptors similar to a geographic location on Earth.
