Table of Contents
Fetching ...

Efficient Approximation of Volterra Series for High-Dimensional Systems

Navin Khoshnan, Claudia K Petritsch, Bryce-Allen Bagley

TL;DR

The paper tackles the curse of dimensionality in identifying high-dimensional nonlinear dynamical systems via Volterra series. It introduces Tensor Head Averaging (THA), an ensemble of localized MVMALS heads trained on small input subsets and averaged to predict outputs, thereby replacing full-dimension learning with subset-based modeling. The authors derive observable, finite-sample error bounds and an exact geometric decomposition that reveals a baking gain: optimization over localized heads implicitly compensates for omitted dynamics, yielding accuracy beyond simple truncation. THA achieves a dramatic reduction in computational complexity, from a dependence on the full input dimension to a dependence on the much smaller subset size, enabling scalable identification of previously intractable high-dimensional systems while preserving interpretability of Volterra kernels. The work provides a principled framework for scalable Volterra identification with theoretical guarantees and motivates future empirical validation and adaptive subset strategies.

Abstract

The identification of high-dimensional nonlinear dynamical systems via the Volterra series has significant potential, but has been severely hindered by the curse of dimensionality. Tensor Network (TN) methods such as the Modified Alternating Linear Scheme (MVMALS) have been a breakthrough for the field, offering a tractable approach by exploiting the low-rank structure in Volterra kernels. However, these techniques still encounter prohibitive computational and memory bottlenecks due to high-order polynomial scaling with respect to input dimension. To overcome this barrier, we introduce the Tensor Head Averaging (THA) algorithm, which significantly reduces complexity by constructing an ensemble of localized MVMALS models trained on small subsets of the input space. In this paper, we present a theoretical foundation for the THA algorithm. We establish observable, finite-sample bounds on the error between the THA ensemble and a full MVMALS model, and we derive an exact decomposition of the squared error. This decomposition is used to analyze the manner in which subset models implicitly compensate for omitted dynamics. We quantify this effect, and prove that correlation between the included and omitted dynamics creates an optimization incentive which drives THA's performance toward accuracy superior to a simple truncation of a full MVMALS model. THA thus offers a scalable and theoretically grounded approach for identifying previously intractable high-dimensional systems.

Efficient Approximation of Volterra Series for High-Dimensional Systems

TL;DR

The paper tackles the curse of dimensionality in identifying high-dimensional nonlinear dynamical systems via Volterra series. It introduces Tensor Head Averaging (THA), an ensemble of localized MVMALS heads trained on small input subsets and averaged to predict outputs, thereby replacing full-dimension learning with subset-based modeling. The authors derive observable, finite-sample error bounds and an exact geometric decomposition that reveals a baking gain: optimization over localized heads implicitly compensates for omitted dynamics, yielding accuracy beyond simple truncation. THA achieves a dramatic reduction in computational complexity, from a dependence on the full input dimension to a dependence on the much smaller subset size, enabling scalable identification of previously intractable high-dimensional systems while preserving interpretability of Volterra kernels. The work provides a principled framework for scalable Volterra identification with theoretical guarantees and motivates future empirical validation and adaptive subset strategies.

Abstract

The identification of high-dimensional nonlinear dynamical systems via the Volterra series has significant potential, but has been severely hindered by the curse of dimensionality. Tensor Network (TN) methods such as the Modified Alternating Linear Scheme (MVMALS) have been a breakthrough for the field, offering a tractable approach by exploiting the low-rank structure in Volterra kernels. However, these techniques still encounter prohibitive computational and memory bottlenecks due to high-order polynomial scaling with respect to input dimension. To overcome this barrier, we introduce the Tensor Head Averaging (THA) algorithm, which significantly reduces complexity by constructing an ensemble of localized MVMALS models trained on small subsets of the input space. In this paper, we present a theoretical foundation for the THA algorithm. We establish observable, finite-sample bounds on the error between the THA ensemble and a full MVMALS model, and we derive an exact decomposition of the squared error. This decomposition is used to analyze the manner in which subset models implicitly compensate for omitted dynamics. We quantify this effect, and prove that correlation between the included and omitted dynamics creates an optimization incentive which drives THA's performance toward accuracy superior to a simple truncation of a full MVMALS model. THA thus offers a scalable and theoretically grounded approach for identifying previously intractable high-dimensional systems.

Paper Structure

This paper contains 32 sections, 9 theorems, 58 equations, 1 algorithm.

Key Result

Theorem 1

Let $\widehat{Y}^{\mathrm{full}}$ be the prediction of the full MVMALS model, and $\widehat{Y}^{\mathrm{THA}}$ be the prediction of the THA ensemble. The squared normalized empirical Frobenius error between these predictions is given exactly by the following geometric decomposition: All terms in this decomposition are observable.

Theorems & Definitions (35)

  • Definition 1: Empirical Frobenius Norm and Inner Product
  • Definition 2: Empirical Gram Matrix
  • Definition 3: Full MVMALS Model Parameters $B^\star$
  • Definition 4: Mask Operator $P_{S_k}$
  • Definition 5: Coverage Operator $C_\omega$
  • Definition 6: Truncated THA Model
  • Definition 7: Tail and Empirical Coverage Bias
  • Definition 8: Empirical Coverage Bias
  • Definition 9: Empirical Baking Adjustment
  • Definition 10: Baking Alignment
  • ...and 25 more