On Consistency of Signature Using Lasso
Xin Guo, Binnan Wang, Ruixun Zhang, Chaoyi Zhao
TL;DR
This work analyzes the statistical consistency of Lasso regression when applied to signature features of stochastic paths, contrasting Itô and Stratonovich definitions across Brownian motion, OU processes, and their discrete-time analogs. It establishes a probabilistic uniqueness of the signature-based linear representation, characterizes the correlation structures of signature components, and proves both asymptotic and finite-sample consistency under irrepresentability-type conditions. Simulations show Itô signatures are typically more consistent for Brownian-like data, while Stratonovich signatures perform better for mean-reverting dynamics; these insights extend to nonlinear learning and option pricing, where signature-based models yield high accuracy and interpretability via signature derivatives. The results provide practical guidance on selecting signature definitions and truncation orders to optimize predictive performance in time-series analysis and financial applications.
Abstract
Signatures are iterated path integrals of continuous and discrete-time processes, and their universal nonlinearity linearizes the problem of feature selection in time series data analysis. This paper studies the consistency of signature using Lasso regression, both theoretically and numerically. We establish conditions under which the Lasso regression is consistent both asymptotically and in finite sample. Furthermore, we show that the Lasso regression is more consistent with the Itô signature for time series and processes that are closer to the Brownian motion and with weaker inter-dimensional correlations, while it is more consistent with the Stratonovich signature for mean-reverting time series and processes. We demonstrate that signature can be applied to learn nonlinear functions and option prices with high accuracy, and the performance depends on properties of the underlying process and the choice of the signature.
