Recurrent Auto-Encoder Model for Large-Scale Industrial Sensor Signal Analysis
Timothy Wong, Zhiyuan Luo
TL;DR
The paper addresses the challenge of analysing unbounded, high-dimensional industrial sensor streams by learning compact representations with a recurrent auto-encoder. It introduces a partial reconstruction scheme and rolling-window sampling, producing fixed-length context vectors that summarize full-system dynamics while only reconstructing a subset of sensors. Empirical results show that partial reconstruction outperforms full reconstruction in training and validation errors, and context vectors can be visualized and clustered to reflect operating states, enabling online diagnostics and maintenance planning. This approach scales to very high dimensional data and provides a practical framework for identifying process states and anomalies from unlabelled streams in large-scale industrial environments.
Abstract
Recurrent auto-encoder model summarises sequential data through an encoder structure into a fixed-length vector and then reconstructs the original sequence through the decoder structure. The summarised vector can be used to represent time series features. In this paper, we propose relaxing the dimensionality of the decoder output so that it performs partial reconstruction. The fixed-length vector therefore represents features in the selected dimensions only. In addition, we propose using rolling fixed window approach to generate training samples from unbounded time series data. The change of time series features over time can be summarised as a smooth trajectory path. The fixed-length vectors are further analysed using additional visualisation and unsupervised clustering techniques. The proposed method can be applied in large-scale industrial processes for sensors signal analysis purpose, where clusters of the vector representations can reflect the operating states of the industrial system.
