Unifying Mixed Gas Adsorption in Molecular Sieve Membranes and MOFs using Machine Learning
Subhadeep Dasgupta, Amal R S, Prabal K. Maiti
TL;DR
This work tackles the challenge of predicting gas adsorption isotherms across two distinct classes of porous materials—carbon molecular sieve membranes (CMSMs) and metal-organic frameworks (MOFs)—including binary mixtures. It introduces a data-fusion approach that trains both neural networks and gradient-boosted trees on a combined dataset built from CMSM Grand Canonical Monte Carlo simulations and the CoRE MOF database, using a compact fingerprint of physical descriptors to predict pure and mixed gas uptakes. The study shows that joint training across material classes is essential for transferability, with neural networks (and to a comparable extent XGBoost) accurately capturing adsorption trends and smoothing isotherms, even for unseen materials like CALF-20 at 313K. The results offer a data-efficient pathway to screen gas-separation materials, potentially reducing the need for exhaustive simulations across the vast material landscape while enabling reliable predictions for mixtures. All mathematical notation is presented with proper delimiters, e.g., $q_A$, $q_B$, $d_A$, $d_B$, $T$, $P$, $GSA$, $\xi$, $\rho^{-1}$, $D_i$, and $d_{max}$.
Abstract
Recent machine learning models to accurately obtain gas adsorption isotherms focus on polymers or metal-organic frameworks (MOFs) separately. The difficulty in creating a unified model that can predict the adsorption trends in both types of adsorbents is challenging, owing to the diversity in their chemical structures. Moreover, models trained only on single gas adsorption data are incapable of predicting adsorption isotherms for binary gas mixtures. In this work, we address these problems using feature vectors comprising only the physical properties of the gas mixtures and adsorbents. Our model is trained on adsorption isotherms of both single and binary mixed gases inside carbon molecular sieving membrane (CMSM), together with data available from CoRE MOF database. The trained models are capable of accurately predicting the adsorption trends in both classes of materials, for both pure and binary components. ML architecture designed for one class of material, is not suitable for predicting the other class, even after proper training, signifying that the model must be trained jointly for proper predictions and transferability. The model is used to predict with good accuracy the CO2 uptake inside CALF-20 framework. This work opens up a new avenue for predicting complex adsorption processes for gas mixtures in a wide range of materials.
