Signature Reconstruction from Randomized Signatures
Mie Glückstad, Nicola Muca Cirone, Josef Teichmann
TL;DR
This work investigates how signature features of a bounded-variation path $X$ can be recovered from controlled differential equations driven by randomized neural-like vector fields, with a focus on depth-two constructions. By representing iterated vector-field operators as tree-like structures and employing Taylor-expansion techniques, the authors show that, under a linear-independence condition on these tree-like fields up to order $L$, signature components up to order $L$ can be reconstructed from the family of CDE solutions, and that depth-two randomizations can render these components independent up to higher orders when $N\ge L-1$. The results extend to Lie groups, yielding a general framework that connects randomized signatures, rough-path theory, and algebraic properties of vector-field Lie polynomials. The findings quantify how increasing hidden dimension $N$ yields an exponential growth in the number of reconstructible signature features, with implications for understanding the expressive power of randomized neural-like CDEs in capturing path information. These insights bridge Lie-algebraic structure and machine-learning inspired models, offering pathways for robust feature extraction from sequential data.
Abstract
Controlled ordinary differential equations driven by continuous bounded variation curves can be considered a continuous time analogue of recurrent neural networks for the construction of expressive features of the input curves. We ask up to which extent well known signature features of such curves can be reconstructed from controlled ordinary differential equations with (untrained) random vector fields. The answer turns out to be algebraically involved, but essentially the number of signature features, which can be reconstructed from the non-linear flow of the controlled ordinary differential equation, is exponential in its hidden dimension, when the vector fields are chosen to be neural with depth two. Moreover, we characterize a general linear independence condition on arbitrary vector fields, under which the signature features up to some fixed order can always be reconstructed. Algebraically speaking this complements in a quantitative manner several well known results from the theory of Lie algebras of vector fields and puts them in a context of machine learning.
