Table of Contents
Fetching ...

Diffusion Transformers for Tabular Data Time Series Generation

Fabrizio Garuti, Enver Sangineto, Simone Luetto, Lorenzo Forni, Rita Cucchiara

TL;DR

This work addresses the generation of time series composed of heterogeneous tabular rows with variable length. It introduces TabDiT, a latent diffusion model built on a Diffusion Transformer that encodes individual rows with an autoregressive VAE and then uses a Transformer denoiser to assemble temporally coherent sequences, while handling variable lengths via end-of-sequence signaling. A variable-range numeric representation and an autoregressive VAE decoder enable robust encoding/decoding of mixed numerical and categorical features. Across six public datasets and a large-scale bank dataset, TabDiT outperforms strong baselines in both unconditional and conditional settings, achieving strong diversity and realism and demonstrating scalability to long sequences and large data volumes.

Abstract

Tabular data generation has recently attracted a growing interest due to its different application scenarios. However, generating time series of tabular data, where each element of the series depends on the others, remains a largely unexplored domain. This gap is probably due to the difficulty of jointly solving different problems, the main of which are the heterogeneity of tabular data (a problem common to non-time-dependent approaches) and the variable length of a time series. In this paper, we propose a Diffusion Transformers (DiTs) based approach for tabular data series generation. Inspired by the recent success of DiTs in image and video generation, we extend this framework to deal with heterogeneous data and variable-length sequences. Using extensive experiments on six datasets, we show that the proposed approach outperforms previous work by a large margin.

Diffusion Transformers for Tabular Data Time Series Generation

TL;DR

This work addresses the generation of time series composed of heterogeneous tabular rows with variable length. It introduces TabDiT, a latent diffusion model built on a Diffusion Transformer that encodes individual rows with an autoregressive VAE and then uses a Transformer denoiser to assemble temporally coherent sequences, while handling variable lengths via end-of-sequence signaling. A variable-range numeric representation and an autoregressive VAE decoder enable robust encoding/decoding of mixed numerical and categorical features. Across six public datasets and a large-scale bank dataset, TabDiT outperforms strong baselines in both unconditional and conditional settings, achieving strong diversity and realism and demonstrating scalability to long sequences and large data volumes.

Abstract

Tabular data generation has recently attracted a growing interest due to its different application scenarios. However, generating time series of tabular data, where each element of the series depends on the others, remains a largely unexplored domain. This gap is probably due to the difficulty of jointly solving different problems, the main of which are the heterogeneity of tabular data (a problem common to non-time-dependent approaches) and the variable length of a time series. In this paper, we propose a Diffusion Transformers (DiTs) based approach for tabular data series generation. Inspired by the recent success of DiTs in image and video generation, we extend this framework to deal with heterogeneous data and variable-length sequences. Using extensive experiments on six datasets, we show that the proposed approach outperforms previous work by a large margin.

Paper Structure

This paper contains 25 sections, 9 equations, 19 figures, 13 tables.

Figures (19)

  • Figure 1: A schematic illustration of the denoising (a) and the VAE (b) network of TabDiT.
  • Figure 2: Age2 dataset. Analysis of the influence of the number of diffusion time steps $T$ jointly with the $\beta_{max}$ value using the MLD-TS $\downarrow$ metric.
  • Figure 3: Real and generated distributions of the values of the “secs_elapsed” attribute using Quantize as the numerical field value representation.
  • Figure 4: Real and generated distributions of “secs_elapsed” using Linear transf.
  • Figure 5: Real and generated distributions of “secs_elapsed” using Fixed digit seq.
  • ...and 14 more figures