A Primer on the Signature Method in Machine Learning
Ilya Chevyrev, Andrey Kormilitzin
TL;DR
The paper introduces the signature method as a principled, geometry-driven representation for time-ordered data by encoding a path with the infinite sequence of iterated integrals $S(X)_{a,b}$. It develops the theoretical foundations—definition, invariances, and algebraic structure via the shuffle product, Chen's identity, time-reversal, and the log-signature—while connecting signatures to rough path theory and the moment problem for random paths. On the practical side, it details how to convert discrete data into paths, apply augmentations and transformations, compute signatures with available software, and use signature/log-signature features in machine learning, including a handwritten digit classification example. The framework provides a flexible, nonparametric feature extraction pipeline with strong geometric interpretation, enabling effective modeling of sequential data across finance, healthcare, computer vision, and beyond.
Abstract
We provide an introduction to the signature method, focusing on its theoretical properties and machine learning applications. Our presentation is divided into two parts. In the first part, we present the definition and fundamental properties of the signature of a path. The signature is a sequence of numbers associated with a path that captures many of its important analytic and geometric properties. As a sequence of numbers, the signature serves as a compact description (dimension reduction) of a path. In presenting its theoretical properties, we assume only familiarity with classical real analysis and integration, and supplement theory with straightforward examples. We also mention several advanced topics, including the role of the signature in rough path theory. In the second part, we present practical applications of the signature to the area of machine learning. The signature method is a non-parametric way of transforming data into a set of features that can be used in machine learning tasks. In this method, data are converted into multi-dimensional paths, by means of embedding algorithms, of which the signature is then computed. We describe this pipeline in detail, making a link with the properties of the signature presented in the first part. We furthermore review some of the developments of the signature method in machine learning and, as an illustrative example, present a detailed application of the method to handwritten digit classification.
