Commutative Algebra Modeling in Materials Science -- A Case Study on Metal-Organic Frameworks (MOFs)
Caleb Simiyu Khaemba, Hongsong Feng, Dong Chen, Chun-Long Chen, Guo-Wei Wei
TL;DR
This work introduces category-specific commutative algebra (CSCA) as the first algebraic modeling framework for MOFs, translating multi-scale chemical connectivity into persistent facet ideals and f-vector descriptors. By partitioning atoms into chemically meaningful categories and constructing alpha-filtrations, CSCA yields category-aware, fixed-length embeddings that feed a gradient-boosting learner to predict MOF adsorption properties. The approach achieves competitive accuracy across four properties while enhancing interpretability through explicit algebraic and combinatorial descriptors tied to chemical categories. The method offers a rigorous, generalizable paradigm for structure–property relationships in porous materials and a new avenue for data-efficient, interpretable discovery in materials science.
Abstract
Metal-organic frameworks (MOFs) are a class of important crystalline and highly porous materials whose hierarchical geometry and chemistry hinder interpretable predictions in materials properties. Commutative algebra is a branch of abstract algebra that has been rarely applied in data and material sciences. We introduce the first ever commutative algebra modeling and prediction in materials science. Specifically, category-specific commutative algebra (CSCA) is proposed as a new framework for MOF representation and learning. It integrates element-based categorization with multiscale algebraic invariants to encode both local coordination motifs and global network organization of MOFs. These algebraically consistent, chemically aware representations enable compact, interpretable, and data efficient modeling of MOF properties such as Henry's constants and uptake capacities for common gases. Compared to traditional geometric and graph-based approaches, CSCA achieves comparable or superior predictive accuracy while substantially improving interpretability and stability across data sets. By aligning commutative algebra with the chemical hierarchy, the CSCA establishes a rigorous and generalizable paradigm for understanding structure and property relationships in porous materials and provides a nonlinear algebra-based framework for data-driven material discovery.
