Table of Contents
Fetching ...

Time Series Diffusion in the Frequency Domain

Jonathan Crabbé, Nicolas Huynh, Jan Stanczuk, Mihaela van der Schaar

TL;DR

This paper analyzes whether representing time series in the frequency domain is a useful inductive bias for score-based diffusion models, and shows how to adapt the denoising score matching approach to implement diffusion models in the frequency domain.

Abstract

Fourier analysis has been an instrumental tool in the development of signal processing. This leads us to wonder whether this framework could similarly benefit generative modelling. In this paper, we explore this question through the scope of time series diffusion models. More specifically, we analyze whether representing time series in the frequency domain is a useful inductive bias for score-based diffusion models. By starting from the canonical SDE formulation of diffusion in the time domain, we show that a dual diffusion process occurs in the frequency domain with an important nuance: Brownian motions are replaced by what we call mirrored Brownian motions, characterized by mirror symmetries among their components. Building on this insight, we show how to adapt the denoising score matching approach to implement diffusion models in the frequency domain. This results in frequency diffusion models, which we compare to canonical time diffusion models. Our empirical evaluation on real-world datasets, covering various domains like healthcare and finance, shows that frequency diffusion models better capture the training distribution than time diffusion models. We explain this observation by showing that time series from these datasets tend to be more localized in the frequency domain than in the time domain, which makes them easier to model in the former case. All our observations point towards impactful synergies between Fourier analysis and diffusion models.

Time Series Diffusion in the Frequency Domain

TL;DR

This paper analyzes whether representing time series in the frequency domain is a useful inductive bias for score-based diffusion models, and shows how to adapt the denoising score matching approach to implement diffusion models in the frequency domain.

Abstract

Fourier analysis has been an instrumental tool in the development of signal processing. This leads us to wonder whether this framework could similarly benefit generative modelling. In this paper, we explore this question through the scope of time series diffusion models. More specifically, we analyze whether representing time series in the frequency domain is a useful inductive bias for score-based diffusion models. By starting from the canonical SDE formulation of diffusion in the time domain, we show that a dual diffusion process occurs in the frequency domain with an important nuance: Brownian motions are replaced by what we call mirrored Brownian motions, characterized by mirror symmetries among their components. Building on this insight, we show how to adapt the denoising score matching approach to implement diffusion models in the frequency domain. This results in frequency diffusion models, which we compare to canonical time diffusion models. Our empirical evaluation on real-world datasets, covering various domains like healthcare and finance, shows that frequency diffusion models better capture the training distribution than time diffusion models. We explain this observation by showing that time series from these datasets tend to be more localized in the frequency domain than in the time domain, which makes them easier to model in the former case. All our observations point towards impactful synergies between Fourier analysis and diffusion models.
Paper Structure (23 sections, 8 theorems, 39 equations, 12 figures, 2 tables)

This paper contains 23 sections, 8 theorems, 39 equations, 12 figures, 2 tables.

Key Result

Lemma 3.0

(DFT of standard Brownian motion). Let $\bm{w}$ be a standard Brownian motion on $\mathbb{R}^{d_X}$ with $d_X = N\cdot M$, where $N \in \mathbb{N}^+$ is the number of time series steps and $M \in \mathbb{N}^+$ is the number of features tracked over time. Then $\bm{v} = U\bm{w}$ is a continuous stoch

Figures (12)

  • Figure 1: Localization of time series in the time and frequency domains. Time series are more localized in the frequency domain.
  • Figure 2: By evaluating our delocalization metrics in the time domain ($\color{NavyBlue} \Delta_\mathrm{time}$) and the frequency domain ($\color{ForestGreen} \Delta_\mathrm{freq}$), we quantitatively confirm that all the datasets are significantly more localized in the frequency domain. All the metrics are averaged over $\mathcal{D}_{\mathrm{train}}$, their mean is reported with a $95\%$ confidence interval.
  • Figure 3: Sliced Wasserstein distances of time and frequency models (blue) and localization metrics in time and frequency domains (red) when smoothing the spectral representations of the time series with Gaussian kernels of variable width. Increasing the kernel width removes the localization in the frequency domain and increases the localization in the time domain. Coincidentally, the time diffusion model becomes better than the frequency diffusion model.
  • Figure 4: Sliced Wasserstein distances of time and frequency diffusion models.
  • Figure 5: Marginal Wasserstein distances of time and frequency diffusion models.
  • ...and 7 more figures

Theorems & Definitions (17)

  • Lemma 3.0
  • proof
  • Remark 3.1
  • Proposition 3.1
  • proof
  • Proposition 3.1
  • proof
  • Proposition A.1: Unitarity of the DFT operator
  • proof
  • Proposition A.2
  • ...and 7 more