A User's Guide to $\texttt{KSig}$: GPU-Accelerated Computation of the Signature Kernel
Csaba Tóth, Danilo Jr Dela Cruz, Harald Oberhauser
TL;DR
The paper introduces KSig, a GPU-accelerated, Scikit-Learn-compatible library for computing the signature kernel on sequential data, combining exact dual methods with scalable primal approximations. It formalizes the dual approaches (Signature Kernels, Truncated Kernels, and SigPDE) and contrasts them with primal random-feature strategies (RFF, RFSF, DP, TRP, TS), analyzing their computational trade-offs. A scalability study shows how sequence length, truncation level, and feature-map size affect memory, time, and accuracy, and recommends DP-1D as a practical starting point for large-scale tasks. The work also provides practical guidance, implementation details, and datasets that illustrate deployment, emphasizing the gains that GPU acceleration and a modular design bring to large time-series problems. The resulting framework supports scalable, kernel-based modeling of sequential data in real-world applications.
Abstract
The signature kernel is a positive definite kernel for sequential and temporal data that has become increasingly popular in machine learning applications due to powerful theoretical guarantees, strong empirical performance, and a range of recently introduced scalable variants. In this chapter, we give a short introduction to $\texttt{KSig}$, a $\texttt{Scikit-Learn}$-compatible Python package that implements various GPU-accelerated algorithms for computing signature kernels and performing downstream learning tasks. We also introduce a new algorithm based on tensor sketches, which gives strong performance compared to existing algorithms. The package is available at https://github.com/tgcsaba/ksig.
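To make the object of study concrete, the following is a minimal NumPy sketch of a signature kernel truncated at level 2, computed from explicit (primal) features of piecewise-linear paths with a linear static kernel. It does not use the $\texttt{KSig}$ API: the function names `truncated_signature_level2` and `signature_gram` are hypothetical, chosen for illustration only, and the level-2 truncation is an assumption made to keep the example short.

```python
import numpy as np


def truncated_signature_level2(path):
    """Signature levels 0, 1, 2 of a piecewise-linear path.

    path: array of shape (length, dim). Returns a flat feature vector
    of size 1 + dim + dim**2. (Illustrative helper, not part of KSig.)
    """
    dx = np.diff(path, axis=0)            # increments, shape (length - 1, dim)
    level1 = dx.sum(axis=0)               # total increment of the path
    prev = np.cumsum(dx, axis=0) - dx     # sum of increments strictly before each step
    # sum_{i<j} dx_i (x) dx_j  +  1/2 * sum_i dx_i (x) dx_i
    level2 = prev.T @ dx + 0.5 * (dx.T @ dx)
    return np.concatenate([[1.0], level1, level2.ravel()])


def signature_gram(paths):
    """Gram matrix of the level-2 truncated signature kernel."""
    feats = np.stack([truncated_signature_level2(p) for p in paths])
    return feats @ feats.T


# Paths of different lengths are handled naturally.
rng = np.random.default_rng(0)
paths = [rng.standard_normal((n, 2)) for n in (5, 8, 11)]
K = signature_gram(paths)
```

The explicit feature vector grows as $d^m$ in the path dimension $d$ and truncation level $m$, which is the cost that dual and random-feature formulations are designed to avoid.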
