A physics-based data-driven model for CO$_2$ gas diffusion electrodes to drive automated laboratories
Ivan Grega, Félix Therrien, Abhishek Soni, Karry Ocean, Kevan Dettelbach, Ribwar Ahmadi, Mehrdad Mokhtari, Curtis P. Berlinguette, Yoshua Bengio
TL;DR
This work addresses the optimization of gas diffusion electrodes (GDEs) for CO$_2$ electroreduction in automated laboratories. It combines a differentiable, physics-based 1D cathode model, whose latent parameters are inferred from data, with a Gaussian process surrogate. The model accommodates multi-product pathways (CO and C$_2$H$_4$) through Tafel-type kinetics and surface-fraction dynamics, and it supports uncertainty-aware Bayesian optimization. Latent microstructural and kinetic parameters are inferred from AdaCarbon data, yielding interpretable ranges and insights into Cu- versus Ag-rich behavior. In simulated pool-based active learning, the approach is roughly 3× more efficient than random sampling, which indicates its potential to guide autonomous experiments in electrode design. Overall, the framework offers a scalable, interpretable route to optimizing CO$_2$ reduction devices and could allow autonomous laboratories to iteratively improve GDE performance.
Abstract
The electrochemical reduction of atmospheric CO$_2$ into high-energy molecules using renewable energy is a promising avenue for energy storage that can take advantage of existing infrastructure, especially in areas where sustainable alternatives to fossil fuels do not exist. Automated laboratories are currently being developed and used to optimize the composition and operating conditions of gas diffusion electrodes (GDEs), the devices in which this reaction takes place. Improving the efficiency of GDEs is crucial for this technology to become viable. Here we present a modeling framework to efficiently explore the high-dimensional parameter space of GDE designs in an active learning context. At the core of the framework is an uncertainty-aware physics model calibrated with experimental data. The model is flexible enough to capture various input parameter spaces and any carbon products that can be modeled with Tafel kinetics. It is interpretable, and a Gaussian process layer can capture deviations of real data from the function space of the physical model itself. We deploy the model in a simulated active learning setup with real electrochemical data gathered by the AdaCarbon automated laboratory and show that it can be used to efficiently traverse the multi-dimensional parameter space.
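To make the reference to Tafel kinetics concrete, the following is a minimal sketch of how Tafel-type partial current densities for two products could be turned into Faradaic efficiencies. It is not the authors' model: the function names, the parameter dictionary, and all numerical values (exchange current densities `j0`, transfer coefficients `alpha`) are hypothetical, and the sketch ignores mass transport, surface-fraction dynamics, and competing reactions such as hydrogen evolution.

```python
import math

F = 96485.332  # Faraday constant, C/mol
R = 8.314462   # molar gas constant, J/(mol K)


def tafel_current(j0, alpha, eta, temperature=298.15):
    """Magnitude of a cathodic Tafel partial current density.

    j0: exchange current density (result has the same units)
    alpha: dimensionless transfer coefficient
    eta: overpotential in volts, negative for cathodic operation
    """
    return j0 * math.exp(-alpha * F * eta / (R * temperature))


def faradaic_efficiencies(params, eta, temperature=298.15):
    """Share of the total modeled current carried by each product."""
    currents = {
        name: tafel_current(p["j0"], p["alpha"], eta, temperature)
        for name, p in params.items()
    }
    total = sum(currents.values())
    return {name: j / total for name, j in currents.items()}


# Hypothetical illustrative parameters, not fitted to any data.
PARAMS = {
    "CO": {"j0": 1e-3, "alpha": 0.5},
    "C2H4": {"j0": 1e-5, "alpha": 0.7},
}

for eta in (-0.4, -0.7, -1.0):
    fe = faradaic_efficiencies(PARAMS, eta)
    print(eta, {name: round(value, 3) for name, value in fe.items()})
```

In this simplified picture, the product with the larger transfer coefficient gains current share as the overpotential becomes more negative. In a calibrated model, quantities playing the role of `j0` and `alpha` would be among the latent kinetic parameters inferred from experimental data.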
