Representation Learning of Multivariate Time Series using Attention and Adversarial Training
Leon Scharwächter, Sebastian Otte
TL;DR
This work tackles robust representation learning for multivariate time series under data scarcity by combining a Transformer-based autoencoder with adversarial training to generate plausible synthetic sequences. The approach maps a prior $p_{noise}$ through a generator $G$ to produce artificial sequences, aiming to align the learned latent space with the data distribution $p_{data}$ via an adversarial discriminator. Evaluations using DTW, multivariate entropy, and t-SNE reveal that the Transformer with a Wasserstein GAN (TAE-WGAN) achieves the best dataset-matching similarity, though mode coverage and diversity remain challenging for some variants. Overall, the method shows promise for data augmentation and counterfactual generation in multivariate time series, with code and experiments available for reproducibility.
Abstract
A critical factor in trustworthy machine learning is to develop robust representations of the training data. Only under this guarantee methods are legitimate to artificially generate data, for example, to counteract imbalanced datasets or provide counterfactual explanations for blackbox decision-making systems. In recent years, Generative Adversarial Networks (GANs) have shown considerable results in forming stable representations and generating realistic data. While many applications focus on generating image data, less effort has been made in generating time series data, especially multivariate signals. In this work, a Transformer-based autoencoder is proposed that is regularized using an adversarial training scheme to generate artificial multivariate time series signals. The representation is evaluated using t-SNE visualizations, Dynamic Time Warping (DTW) and Entropy scores. Our results indicate that the generated signals exhibit higher similarity to an exemplary dataset than using a convolutional network approach.
