From Regression to Classification: Exploring the Benefits of Categorical Representations of Energy in MLIPs
Ahmad Ali
TL;DR
The paper investigates replacing scalar regression in machine-learned interatomic potentials with a histogram-based, categorical energy representation using HL-Gauss. This approach preserves competitive accuracy while providing a principled measure of epistemic uncertainty through distribution entropy. Across experiments on UMA and OMol25, 128-bin HL-Gauss configurations perform comparably to scalar baselines, with entropy offering an informative but imperfect uncertainty signal. The work highlights potential benefits for uncertainty quantification and stability in scaling MLIPs, while noting the need to refine the uncertainty metric further.
Abstract
Density Functional Theory (DFT) is a widely used computational method for estimating the energy and behavior of molecules. Machine Learning Interatomic Potentials (MLIPs) are models trained to approximate DFT-level energies and forces at dramatically lower computational cost. Many modern MLIPs rely on a scalar regression formulation; given information about a molecule, they predict a single energy value and corresponding forces while minimizing absolute error with DFT's calculations. In this work, we explore a multi-class classification formulation that predicts a categorical distribution over energy/force values, providing richer supervision through multiple targets. Most importantly, this approach offers a principled way to quantify model uncertainty. In particular, our method predicts a histogram of the energy/force distribution, converts scalar targets into histograms, and trains the model using cross-entropy loss. Our results demonstrate that this categorical formulation can achieve absolute error performance comparable to regression baselines. Furthermore, this representation enables the quantification of epistemic uncertainty through the entropy of the predicted distribution, offering a measure of model confidence absent in scalar regression approaches.
