Advancing Thermodynamic Group-Contribution Methods by Machine Learning: UNIFAC 2.0
Nicolas Hayer, Thorsten Wendel, Stephan Mandt, Hans Hasse, Fabian Jirasek
TL;DR
This work addresses the incomplete parameterization of thermodynamic GC methods, notably UNIFAC, by embedding a matrix-completion ML module to predict all pair-interaction parameters $a_{mn}$. The resulting UNIFAC 2.0 is trained end-to-end on $\ln\gamma_i$ from binary VLE data, yielding a gap-free parameter table and significantly improved accuracy (nearly halving the MSE) while greatly expanding applicability to thousands of mixtures in the Dortmund DDB. The method demonstrates robust extrapolation to unseen components and unseen pair-interaction parameters, and remains easily updatable and adaptable for tailored applications. Practically, UNIFAC 2.0 can be implemented with a simple parameter-table replacement in existing simulators, enabling more reliable and extensive thermodynamic predictions for process design and optimization.
Abstract
Accurate prediction of thermodynamic properties is pivotal in chemical engineering for optimizing process efficiency and sustainability. Physical group-contribution (GC) methods are widely employed for this purpose but suffer from historically grown, incomplete parameterizations, limiting their applicability and accuracy. In this work, we overcome these limitations by combining GC with matrix completion methods (MCM) from machine learning. We use the novel approach to predict a complete set of pair-interaction parameters for the most successful GC method: UNIFAC, the workhorse for predicting activity coefficients in liquid mixtures. The resulting new method, UNIFAC 2.0, is trained and validated on more than 224,000 experimental data points, showcasing significantly enhanced prediction accuracy (e.g., nearly halving the mean squared error) and increased scope by eliminating gaps in the original model's parameter table. Moreover, the generic nature of the approach facilitates updating the method with new data or tailoring it to specific applications.
