Benchmarking Domain Adaptation for Chemical Processes on the Tennessee Eastman Process
Eduardo Fernandes Montesuma, Michela Mulas, Fred Ngolè Mboula, Francesco Corona, Antoine Souloumiac
TL;DR
The paper addresses distribution shift in fault diagnosis for chemical processes by introducing a Tennessee Eastman Process–based benchmark and evaluating 11 domain adaptation (DA) strategies in single- and multi-source settings. It finds that optimal transport-based methods (e.g., JDOT, WJDOT, WBT, DaDiL) outperform MMD- and $d_{\mathcal{H}}$-based approaches, with multi-source DA offering gains when the sources are informative. The study details the benchmark construction, presents an exploratory data analysis, and compares the methods on time-series data; an open-source implementation is provided to facilitate replication. The work highlights practical implications for robust cross-mode fault diagnosis and motivates further research at the intersection of DA and chemical-process monitoring.
Abstract
In system monitoring, automatic fault diagnosis seeks to infer the system's state from sensor readings, e.g., through machine learning models. In this context, it is of key importance that models trained on historical data are able to generalize to incoming data. At the same time, many factors may induce changes in the data probability distribution, hindering the ability of such models to generalize. Domain adaptation is therefore an important framework for adapting models to different probability distributions. In this paper, we propose a new benchmark, based on the Tennessee Eastman Process of Downs and Vogel (1993), for evaluating domain adaptation methods in the context of chemical processes. Besides describing the process and its relevance for domain adaptation, we describe a series of data processing steps for reproducing our benchmark. We then test 11 domain adaptation strategies on this novel benchmark, showing that optimal transport-based techniques outperform other strategies.
