Known Unknowns: Out-of-Distribution Property Prediction in Materials and Molecules

Nofit Segal; Aviv Netanyahu; Kevin P. Greenman; Pulkit Agrawal; Rafael Gomez-Bombarelli

Known Unknowns: Out-of-Distribution Property Prediction in Materials and Molecules

Nofit Segal, Aviv Netanyahu, Kevin P. Greenman, Pulkit Agrawal, Rafael Gomez-Bombarelli

TL;DR

The paper tackles extrapolating material and molecular properties to out-of-distribution values. It introduces Bilinear Transduction, an anchor-based, transductive method that leverages analogical input–target changes to enable zero-shot OOD extrapolation. Across solid-state and molecular benchmarks, the approach yields substantial gains in OOD true positive rate and precision, and often improves OOD prediction accuracy compared with non-transductive baselines. This method enhances screening efficiency for high-potential candidates and provides interpretable analogies that reflect chemical changes, with broad applicability to other materials and molecular tasks.

Abstract

Discovery of high-performance materials and molecules requires identifying extremes with property values that fall outside the known distribution. Therefore, the ability to extrapolate to out-of-distribution (OOD) property values is critical for both solid-state materials and molecular design. Our objective is to train predictor models that extrapolate zero-shot to higher ranges than in the training data, given the chemical compositions of solids or molecular graphs and their property values. We propose using a transductive approach to OOD property prediction, achieving improvements in prediction accuracy. In particular, the True Positive Rate (TPR) of OOD classification of materials and molecules improved by 3x and 2.5x, respectively, and precision improved by 2x and 1.5x compared to non-transductive baselines. Our method leverages analogical input-target relations in the training and test sets, enabling generalization beyond the training target support, and can be applied to any other material and molecular tasks.

Known Unknowns: Out-of-Distribution Property Prediction in Materials and Molecules

TL;DR

Abstract

Known Unknowns: Out-of-Distribution Property Prediction in Materials and Molecules

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)