Table of Contents
Fetching ...

Learning from the past, predicting the statistics for the future, learning an evolving system

Daniel Levin, Terry Lyons, Hao Ni

TL;DR

The paper addresses learning from streaming data by representing full streams with signatures from rough path theory to enable non-parametric regression on path space.It introduces the expected signature (ES) framework, where outputs are modeled as linear functionals of input signatures plus noise, yielding a powerful, dimension-reducing feature set with strong approximation properties.By connecting ES to time series via time-joined embeddings, the authors show that classical models like AR and ARCH are special cases of ES, unifying parametric and non-parametric approaches under a common path-space regression paradigm.Empirical results on simulated time series demonstrate that ES achieves GP-level predictive accuracy with substantially lower computational cost and favorable robustness, highlighting practical benefits for streaming-data inference.

Abstract

We bring the theory of rough paths to the study of non-parametric statistics on streamed data. We discuss the problem of regression where the input variable is a stream of information, and the dependent response is also (potentially) a stream. A certain graded feature set of a stream, known in the rough path literature as the signature, has a universality that allows formally, linear regression to be used to characterise the functional relationship between independent explanatory variables and the conditional distribution of the dependent response. This approach, via linear regression on the signature of the stream, is almost totally general, and yet it still allows explicit computation. The grading allows truncation of the feature set and so leads to an efficient local description for streams (rough paths). In the statistical context this method offers potentially significant, even transformational dimension reduction. By way of illustration, our approach is applied to stationary time series including the familiar AR model and ARCH model. In the numerical examples we examined, our predictions achieve similar accuracy to the Gaussian Process (GP) approach with much lower computational cost especially when the sample size is large.

Learning from the past, predicting the statistics for the future, learning an evolving system

TL;DR

The paper addresses learning from streaming data by representing full streams with signatures from rough path theory to enable non-parametric regression on path space.It introduces the expected signature (ES) framework, where outputs are modeled as linear functionals of input signatures plus noise, yielding a powerful, dimension-reducing feature set with strong approximation properties.By connecting ES to time series via time-joined embeddings, the authors show that classical models like AR and ARCH are special cases of ES, unifying parametric and non-parametric approaches under a common path-space regression paradigm.Empirical results on simulated time series demonstrate that ES achieves GP-level predictive accuracy with substantially lower computational cost and favorable robustness, highlighting practical benefits for streaming-data inference.

Abstract

We bring the theory of rough paths to the study of non-parametric statistics on streamed data. We discuss the problem of regression where the input variable is a stream of information, and the dependent response is also (potentially) a stream. A certain graded feature set of a stream, known in the rough path literature as the signature, has a universality that allows formally, linear regression to be used to characterise the functional relationship between independent explanatory variables and the conditional distribution of the dependent response. This approach, via linear regression on the signature of the stream, is almost totally general, and yet it still allows explicit computation. The grading allows truncation of the feature set and so leads to an efficient local description for streams (rough paths). In the statistical context this method offers potentially significant, even transformational dimension reduction. By way of illustration, our approach is applied to stationary time series including the familiar AR model and ARCH model. In the numerical examples we examined, our predictions achieve similar accuracy to the Gaussian Process (GP) approach with much lower computational cost especially when the sample size is large.

Paper Structure

This paper contains 30 sections, 16 theorems, 82 equations, 11 figures, 6 tables.

Key Result

Theorem 2.8

Let $Y: [0, T] \rightarrow W$ be a continuous path and satisfy (controlledEquation), where $X: [0, T] \rightarrow E$ is a path of finite 1-variation and $g: W \rightarrow L(E, W)$ is a smooth vector field. Then for any integer $N >0$,

Figures (11)

  • Figure 1: The case for that the news happens earlier than the price move
  • Figure 2: The case for that the news happens later than the price move
  • Figure 3: $K = 250$. In Subfigure \ref{['LinearRegressionIncrements']}, we plot the estimated output against the actual output via linear regression w/t regularization represented by yellow/blue dots respectively. In Subfigures \ref{['T025Deg2']}, \ref{['T025Deg4']} and \ref{['T025Deg6']} we plot the fitting results obtained via linear regression on the truncated signatures of order $2, 4$ and $6$ respectively.
  • Figure 4: On the upper panel of figures \ref{['timestep750increments']} and \ref{['timestep750sig']}, $K = 750$, while on the lower panel of figures \ref{['timestep1000increments']} and \ref{['timestep1000sig']}, $K = 1000$. On the left panel, subfigures \ref{['timestep750increments']} and \ref{['timestep1000increments']} plot the fitting results for the testing set via linear regression on the increment features, while subfigures \ref{['timestep1000increments']} and \ref{['timestep750sig']} plot the fitting results for the testing results via linear regression on the truncated signature of order $4$.
  • Figure 5: Embedding the time series into the continuous function $R$.
  • ...and 6 more figures

Theorems & Definitions (51)

  • Example 2.1
  • Definition 2.2: The Signature of a Path
  • Example 2.3
  • Remark 2.5
  • Example 2.6
  • Example 2.7: Linear Controlled Differential Equation
  • Theorem 2.8
  • Definition 2.9
  • Theorem 2.10: Chen's Identity
  • Example 2.11
  • ...and 41 more