Fully Embedded Time-Series Generative Adversarial Networks
Joe Beck, Subhadeep Chakraborty
TL;DR
FETSGAN addresses the challenge of realistic real-valued time-series generation by jointly modeling static and temporal distributions through a seq2seq-style adversarial autoencoder that embeds entire sequences into a latent space. A First Above Threshold (FAT) operator stabilizes training by enforcing reconstruction loss progressively at a single time instance, enabling robust handling of long sequences without autoregressive exposure bias. The method introduces a three-pronged objective with discriminators in both feature and latent spaces plus a reconstruction term, yielding improved distribution matching, selective sampling ability, and state-of-the-art predictive and discriminative performance among direct feature-space adversaries. This approach provides an interpretable latent space for style-controlled synthesis and has practical implications for generating realistic synthetic time-series while mitigating mode collapse and training instability, with considerations for privacy and ethical use in real-world applications.
Abstract
Generative Adversarial Networks (GANs) should produce synthetic data that fits the underlying distribution of the data being modeled. For real valued time-series data, this implies the need to simultaneously capture the static distribution of the data, but also the full temporal distribution of the data for any potential time horizon. This temporal element produces a more complex problem that can potentially leave current solutions under-constrained, unstable during training, or prone to varying degrees of mode collapse. In FETSGAN, entire sequences are translated directly to the generator's sampling space using a seq2seq style adversarial auto encoder (AAE), where adversarial training is used to match the training distribution in both the feature space and the lower dimensional sampling space. This additional constraint provides a loose assurance that the temporal distribution of the synthetic samples will not collapse. In addition, the First Above Threshold (FAT) operator is introduced to supplement the reconstruction of encoded sequences, which improves training stability and the overall quality of the synthetic data being generated. These novel contributions demonstrate a significant improvement to the current state of the art for adversarial learners in qualitative measures of temporal similarity and quantitative predictive ability of data generated through FETSGAN.
