Graph Neural Networks embedded into Margules model for vapor-liquid equilibria prediction
Edgar Ivan Sanchez Medina, Kai Sundmacher
TL;DR
The paper investigates embedding a Graph Neural Network (GH-GNN) within the extended Margules model to predict vapor–liquid equilibria by extrapolating finite-concentration activity coefficients from infinite-dilution data. Using the GH-GNN to infer $\gamma_i^\infty$ and embedding it in a thermodynamically consistent Margules head, the approach is benchmarked against UNIFAC-Dortmund across extensive binary and limited ternary VLE data. Results show UNIFAC-Dortmund generally delivers higher accuracy, but GH-GNN–Margules offers a meaningful baseline and niche advantages, particularly when expansion data are scarce or certain mixtures lack applicable UNIFAC parameters. The study underscores the value of combining data-driven models with thermodynamic constraints, and highlights data curation and broader VLE training data as key pathways to improve predictive performance in early-stage design.
Abstract
Predictive thermodynamic models are crucial for the early stages of product and process design. In this paper the performance of Graph Neural Networks (GNNs) embedded into a relatively simple excess Gibbs energy model, the extended Margules model, for predicting vapor-liquid equilibrium is analyzed. By comparing its performance against the established UNIFAC-Dortmund model it has been shown that GNNs embedded in Margules achieves an overall lower accuracy. However, higher accuracy is observed in the case of various types of binary mixtures. Moreover, since group contribution methods, like UNIFAC, are limited due to feasibility of molecular fragmentation or availability of parameters, the GNN in Margules model offers an alternative for VLE estimation. The findings establish a baseline for the predictive accuracy that simple excess Gibbs energy models combined with GNNs trained solely on infinite dilution data can achieve.
