Probabilistic Forecasting of Irregular Time Series via Conditional Flows

Vijaya Krishna Yalavarthi; Randolf Scholz; Stefan Born; Lars Schmidt-Thieme

Probabilistic Forecasting of Irregular Time Series via Conditional Flows

Vijaya Krishna Yalavarthi, Randolf Scholz, Stefan Born, Lars Schmidt-Thieme

TL;DR

This work tackles probabilistic forecasting for irregularly sampled multivariate time series with missing values by learning conditional joint distributions $p(y\mid x^{\text{obs}}, x^{\text{qry}})$ using a novel normalizing-flow framework. ProFITi combines a sorted invertible triangular attention (SITA) with an invertible activation (Shiesh) inside an invariant conditional normalizing flow, enabling dynamic, permutation-invariant joint densities over variable-length query sets. The encoder (GraFITi) provides rich, equivariant conditioning, and training optimizes the normalized joint negative log-likelihood (njNLL). Empirical results on four real-world datasets show substantial improvements in joint likelihoods over baselines, highlighting ProFITi's ability to capture complex dependencies and non-Gaussian uncertainty in irregular IMTS data. These advancements have practical implications for domains like healthcare and climate science where accurate, uncertainty-aware forecasting of multivariate, irregular signals is crucial.

Abstract

Probabilistic forecasting of irregularly sampled multivariate time series with missing values is an important problem in many fields, including health care, astronomy, and climate. State-of-the-art methods for the task estimate only marginal distributions of observations in single channels and at single timepoints, assuming a fixed-shape parametric distribution. In this work, we propose a novel model, ProFITi, for probabilistic forecasting of irregularly sampled time series with missing values using conditional normalizing flows. The model learns joint distributions over the future values of the time series conditioned on past observations and queried channels and times, without assuming any fixed shape of the underlying distribution. As model components, we introduce a novel invertible triangular attention layer and an invertible non-linear activation function on and onto the whole real line. We conduct extensive experiments on four datasets and demonstrate that the proposed model provides $4$ times higher likelihood over the previously best model.

Probabilistic Forecasting of Irregular Time Series via Conditional Flows

TL;DR

This work tackles probabilistic forecasting for irregularly sampled multivariate time series with missing values by learning conditional joint distributions

using a novel normalizing-flow framework. ProFITi combines a sorted invertible triangular attention (SITA) with an invertible activation (Shiesh) inside an invariant conditional normalizing flow, enabling dynamic, permutation-invariant joint densities over variable-length query sets. The encoder (GraFITi) provides rich, equivariant conditioning, and training optimizes the normalized joint negative log-likelihood (njNLL). Empirical results on four real-world datasets show substantial improvements in joint likelihoods over baselines, highlighting ProFITi's ability to capture complex dependencies and non-Gaussian uncertainty in irregular IMTS data. These advancements have practical implications for domains like healthcare and climate science where accurate, uncertainty-aware forecasting of multivariate, irregular signals is crucial.

Abstract

times higher likelihood over the previously best model.

Paper Structure (52 sections, 2 theorems, 36 equations, 5 figures, 12 tables)

This paper contains 52 sections, 2 theorems, 36 equations, 5 figures, 12 tables.

Introduction
Literature Review
Probabilistic Forecasting Models for IMTS.
Normalizing Flows for variable input size.
Conditioning Normalizing Flows.
Flows with Invertible Attention.
Problem Setting & Analysis
The IMTS Forecasting Problem.
Problem Analysis and Characteristics.
Invariant Conditional Normalizing Flow Models
Normalizing flows.
Invariant conditional normalizing flows.
Invariant conditional normalizing flows via invertible attention.
Shiesh: A New Activation Function for Normalizing Flows
Overall ProFITi Model Architecture
...and 37 more sections

Key Result

Lemma 1

For any $K\times K$ matrix $A$ and $\epsilon>0$, the matrix $\mathbb{I}_K + \frac{1}{\|A\|_2 + \epsilon}A$ is invertible. Here, $\|A\|_2\coloneqq\max\limits_{x\ne 0} \frac{\|Ax\|_2}{\|x\|}$ denotes the spectral norm.

Figures (5)

Figure 1: (left) Shiesh function, (right) partial derivative.
Figure 2: ProFITi architecture; $\bigotimes$: dot product, $\bigodot$: Hadamard product, $\bigoplus$: addition. Functions referred to their equation numbers: sort, $\mathop{\mathrm{\text{argsort}}}\nolimits$ (eq. \ref{['eq:pi']}), GraFITi (eq. \ref{['eq:grafiti']}), SITA (eq. \ref{['eq:SITA']}), EL (eq. \ref{['eq:ft']}), Shiesh (eq. \ref{['eq:fa']}). For efficiency, we perform, sorting only once directly on $x^\textsc{qry}$ and $y$.
Figure 3: Demonstration of $\text{Shiesh}$ activation function with varying $b$.
Figure 4: Statistical test on the results of various channel orders for ProFITi.
Figure 5: Demonstrating (10) trajectories generated using ProFITi for Physionet'12 dataset.

Theorems & Definitions (8)

Example 1: Demonstration of sorting in SITA
Example 2: Demonstration of $S$ and $\pi$ for SITA, sort by channel id. followed by timepoint
Example 3: Demonstration of $S$ and $\pi$ for SITA, sort by timepoint in descending order followed by channel id. in ascending order
Example 4: Demonstration of $S$ and $\pi$ for SITA, sort by timepoint followed by altered order of channel id.
Lemma 1
proof
Theorem 1
proof

Probabilistic Forecasting of Irregular Time Series via Conditional Flows

TL;DR

Abstract

Probabilistic Forecasting of Irregular Time Series via Conditional Flows

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (8)