Table of Contents
Fetching ...

Probabilistic Forecasting of Irregular Time Series via Conditional Flows

Vijaya Krishna Yalavarthi, Randolf Scholz, Stefan Born, Lars Schmidt-Thieme

TL;DR

This work tackles probabilistic forecasting for irregularly sampled multivariate time series with missing values by learning conditional joint distributions $p(y\mid x^{\text{obs}}, x^{\text{qry}})$ using a novel normalizing-flow framework. ProFITi combines a sorted invertible triangular attention (SITA) with an invertible activation (Shiesh) inside an invariant conditional normalizing flow, enabling dynamic, permutation-invariant joint densities over variable-length query sets. The encoder (GraFITi) provides rich, equivariant conditioning, and training optimizes the normalized joint negative log-likelihood (njNLL). Empirical results on four real-world datasets show substantial improvements in joint likelihoods over baselines, highlighting ProFITi's ability to capture complex dependencies and non-Gaussian uncertainty in irregular IMTS data. These advancements have practical implications for domains like healthcare and climate science where accurate, uncertainty-aware forecasting of multivariate, irregular signals is crucial.

Abstract

Probabilistic forecasting of irregularly sampled multivariate time series with missing values is an important problem in many fields, including health care, astronomy, and climate. State-of-the-art methods for the task estimate only marginal distributions of observations in single channels and at single timepoints, assuming a fixed-shape parametric distribution. In this work, we propose a novel model, ProFITi, for probabilistic forecasting of irregularly sampled time series with missing values using conditional normalizing flows. The model learns joint distributions over the future values of the time series conditioned on past observations and queried channels and times, without assuming any fixed shape of the underlying distribution. As model components, we introduce a novel invertible triangular attention layer and an invertible non-linear activation function on and onto the whole real line. We conduct extensive experiments on four datasets and demonstrate that the proposed model provides $4$ times higher likelihood over the previously best model.

Probabilistic Forecasting of Irregular Time Series via Conditional Flows

TL;DR

This work tackles probabilistic forecasting for irregularly sampled multivariate time series with missing values by learning conditional joint distributions using a novel normalizing-flow framework. ProFITi combines a sorted invertible triangular attention (SITA) with an invertible activation (Shiesh) inside an invariant conditional normalizing flow, enabling dynamic, permutation-invariant joint densities over variable-length query sets. The encoder (GraFITi) provides rich, equivariant conditioning, and training optimizes the normalized joint negative log-likelihood (njNLL). Empirical results on four real-world datasets show substantial improvements in joint likelihoods over baselines, highlighting ProFITi's ability to capture complex dependencies and non-Gaussian uncertainty in irregular IMTS data. These advancements have practical implications for domains like healthcare and climate science where accurate, uncertainty-aware forecasting of multivariate, irregular signals is crucial.

Abstract

Probabilistic forecasting of irregularly sampled multivariate time series with missing values is an important problem in many fields, including health care, astronomy, and climate. State-of-the-art methods for the task estimate only marginal distributions of observations in single channels and at single timepoints, assuming a fixed-shape parametric distribution. In this work, we propose a novel model, ProFITi, for probabilistic forecasting of irregularly sampled time series with missing values using conditional normalizing flows. The model learns joint distributions over the future values of the time series conditioned on past observations and queried channels and times, without assuming any fixed shape of the underlying distribution. As model components, we introduce a novel invertible triangular attention layer and an invertible non-linear activation function on and onto the whole real line. We conduct extensive experiments on four datasets and demonstrate that the proposed model provides times higher likelihood over the previously best model.
Paper Structure (52 sections, 2 theorems, 36 equations, 5 figures, 12 tables)

This paper contains 52 sections, 2 theorems, 36 equations, 5 figures, 12 tables.

Key Result

Lemma 1

For any $K\times K$ matrix $A$ and $\epsilon>0$, the matrix $\mathbb{I}_K + \frac{1}{\|A\|_2 + \epsilon}A$ is invertible. Here, $\|A\|_2\coloneqq\max\limits_{x\ne 0} \frac{\|Ax\|_2}{\|x\|}$ denotes the spectral norm.

Figures (5)

  • Figure 1: (left) Shiesh function, (right) partial derivative.
  • Figure 2: ProFITi architecture; $\bigotimes$: dot product, $\bigodot$: Hadamard product, $\bigoplus$: addition. Functions referred to their equation numbers: sort, $\mathop{\mathrm{\text{argsort}}}\nolimits$ (eq. \ref{['eq:pi']}), GraFITi (eq. \ref{['eq:grafiti']}), SITA (eq. \ref{['eq:SITA']}), EL (eq. \ref{['eq:ft']}), Shiesh (eq. \ref{['eq:fa']}). For efficiency, we perform, sorting only once directly on $x^\textsc{qry}$ and $y$.
  • Figure 3: Demonstration of $\text{Shiesh}$ activation function with varying $b$.
  • Figure 4: Statistical test on the results of various channel orders for ProFITi.
  • Figure 5: Demonstrating (10) trajectories generated using ProFITi for Physionet'12 dataset.

Theorems & Definitions (8)

  • Example 1: Demonstration of sorting in SITA
  • Example 2: Demonstration of $S$ and $\pi$ for SITA, sort by channel id. followed by timepoint
  • Example 3: Demonstration of $S$ and $\pi$ for SITA, sort by timepoint in descending order followed by channel id. in ascending order
  • Example 4: Demonstration of $S$ and $\pi$ for SITA, sort by timepoint followed by altered order of channel id.
  • Lemma 1
  • proof
  • Theorem 1
  • proof