Matching correlated VAR time series
Ernesto Araya, Hemant Tyagi
TL;DR
This work tackles the problem of matching two correlated time series under a CVAR(1,d,T) model, in which a base VAR process is perturbed by noise and permuted by an unknown π^*. The authors derive the maximum-likelihood estimator (MLE), which reduces to a quadratic assignment problem, and propose tractable alternatives: an estimator based on linear assignment (LA), and convex relaxations over the Birkhoff polytope solved by alternating minimization. They establish recovery guarantees for the LA estimator under the condition ∥A^*∥_2 < 1, yielding regimes of exact, partial, and sublinear recovery as a function of the noise level σ. They also develop relaxed-MLE algorithms with rounding strategies, and numerical experiments show that LA often matches or outperforms the MLE-relaxation approaches. The results extend planted-matching analysis from i.i.d. point clouds to correlated time-series data and offer practical algorithms for aligning unordered correlated time series, with applications in privacy and sensor fusion.
Abstract
We study the problem of matching correlated VAR time series databases, where a multivariate time series is observed along with a perturbed and permuted version, and the goal is to recover the unknown matching between them. To model this, we introduce a probabilistic framework in which two time series $(x_t)_{t\in[T]},(x^\#_t)_{t\in[T]}$ are jointly generated such that $x^\#_t=x_{π^*(t)}+σ\tilde{x}_{π^*(t)}$, where $π^*$ is a hidden permutation and $(x_t)_{t\in[T]},(\tilde{x}_t)_{t\in[T]}$ are independent and identically distributed vector autoregressive (VAR) time series of order $1$ with Gaussian increments. The objective is to recover $π^*$ from the observation of $(x_t)_{t\in[T]},(x^\#_t)_{t\in[T]}$. This generalizes the classical problem of matching independent point clouds to the time series setting. We derive the maximum likelihood estimator (MLE), which leads to a quadratic optimization over permutations, and theoretically analyze an estimator based on linear assignment. For the latter, we establish recovery guarantees, identifying thresholds for $σ$ that allow for perfect or partial recovery. Additionally, we propose solving the MLE via convex relaxations of the set of permutation matrices (e.g., over the Birkhoff polytope). This allows for efficient estimation of $π^*$ and the VAR parameters via alternating minimization. Empirically, we find that linear assignment often matches or outperforms MLE-relaxation-based approaches.
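As a rough illustration of the setup described above, the following Python sketch simulates the model $x^\#_t=x_{π^*(t)}+σ\tilde{x}_{π^*(t)}$ and estimates $π^*$ by linear assignment. Several details are assumptions made for the example and are not taken from the paper: the function names `simulate_var1` and `match_by_linear_assignment`, the choice $A^*=0.5\,I$ (so that ∥A^*∥_2 < 1), the initialization $x_1=ξ_1$, standard Gaussian increments, and the cost $-\langle x^\#_t, x_s\rangle$ as one natural linear-assignment formulation. The paper's exact estimator may differ.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def simulate_var1(A, T, rng):
    """Simulate x_t = A x_{t-1} + xi_t with standard Gaussian increments.

    Assumption for this sketch: the first sample is a pure noise draw.
    Returns an array of shape (T, d), one row per time step.
    """
    d = A.shape[0]
    X = np.zeros((T, d))
    X[0] = rng.standard_normal(d)
    for t in range(1, T):
        X[t] = A @ X[t - 1] + rng.standard_normal(d)
    return X


def match_by_linear_assignment(X, X_sharp):
    """Estimate the permutation by solving a linear assignment problem.

    cost[t, s] = -<x#_t, x_s>. Over permutations, minimizing this cost is
    equivalent to minimizing sum_t ||x#_t - x_{pi(t)}||^2, because the sums
    of squared norms do not depend on the permutation.
    """
    cost = -(X_sharp @ X.T)
    _, cols = linear_sum_assignment(cost)
    return cols  # cols[t] is the index s matched to x#_t


rng = np.random.default_rng(0)
d, T, sigma = 20, 50, 0.05
A = 0.5 * np.eye(d)  # hypothetical VAR matrix with spectral norm 0.5 < 1

X = simulate_var1(A, T, rng)        # base series (x_t)
X_tilde = simulate_var1(A, T, rng)  # independent copy driving the perturbation
pi_star = rng.permutation(T)        # hidden permutation

# Row t of X_sharp is x_{pi*(t)} + sigma * x~_{pi*(t)}.
X_sharp = X[pi_star] + sigma * X_tilde[pi_star]

pi_hat = match_by_linear_assignment(X, X_sharp)
overlap = np.mean(pi_hat == pi_star)  # fraction of correctly matched indices
print(f"fraction of correctly matched indices: {overlap:.2f}")
```

Increasing `sigma` in this sketch gives a quick, informal way to see the recovered fraction degrade from exact toward partial recovery. It does not reproduce the paper's thresholds or experiments.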
