The Artificial Neural Twin -- Process Optimization and Continual Learning in Distributed Process Chains
Johannes Emmert, Ronald Mendez, Houman Mirzaalian Dastjerdi, Christopher Syben, Andreas Maier
TL;DR
The paper tackles decentralized industrial optimization under data sovereignty constraints and distribution drift, proposing the Artificial Neural Twin (ANT) that unifies MPC, Bayesian differentiable data fusion, and decentral sensor networks. ANT enables state inference across distributed process steps and backpropagation of losses to local parameters or AI-models, while preserving privacy by exchanging only material-stream information. The authors develop a differentiable data fusion framework with MAP derivatives, incorporate Covariance Intersection to handle unknown correlations, and implement an ANT algorithm with information and backpropagation phases orchestrated by a loss node. A Unity-based virtual plastic recycling facility demonstrates state estimation accuracy and near-optimal parameter convergence under constant and dynamic mass-flow scenarios, illustrating ANT’s potential for decentralized optimization and continual learning without data-model sharing. The work suggests ANT as a flexible, model-agnostic connector capable of retraining AI-based sensors and process models in response to drift, with practical relevance for industry and data-security concerns.
Abstract
Industrial process optimization and control is crucial to increase economic and ecologic efficiency. However, data sovereignty, differing goals, or the required expert knowledge for implementation impede holistic implementation. Further, the increasing use of data-driven AI-methods in process models and industrial sensory often requires regular fine-tuning to accommodate distribution drifts. We propose the Artificial Neural Twin, which combines concepts from model predictive control, deep learning, and sensor networks to address these issues. Our approach introduces differentiable data fusion to estimate the state of distributed process steps and their dependence on input data. By treating the interconnected process steps as a quasi neural-network, we can backpropagate loss gradients for process optimization or model fine-tuning to process parameters or AI models respectively. The concept is demonstrated on a virtual machine park simulated in Unity, consisting of bulk material processes in plastic recycling.
