Table of Contents
Fetching ...

A randomized algorithm to solve reduced rank operator regression

Giacomo Turri, Vladimir Kostic, Pietro Novelli, Massimiliano Pontil

TL;DR

This work tackles vector-valued regression between RKHSs by learning low-rank linear operators under regularized empirical risk minimization. It introduces Randomized Reduced Rank Regression (R$^4$), a Gaussian sketching framework that yields efficient primal and dual algorithms for computing $r$-rank estimators, with explicit error bounds on the expected risk that improve as the sketch size and power iterations grow. Theoretical results show that the randomized estimators approach the optimal risk, while numerical experiments on synthetic data, large-scale neuroscience data, and Koopman-operator regression demonstrate substantial speedups (including GPU-enabled large-scale gains) without sacrificing accuracy. The methods offer a tractable route to scalable operator learning in infinite-dimensional settings and open avenues for combining with Nyström or random features to further reduce complexity and extend applicability to broader vector-valued regression tasks. Overall, R$^4$ provides a principled, fast alternative to eigenvalue-based solvers for reduced-rank operator regression with strong theoretical guarantees and practical impact on large-scale problems.

Abstract

We present and analyze an algorithm designed for addressing vector-valued regression problems involving possibly infinite-dimensional input and output spaces. The algorithm is a randomized adaptation of reduced rank regression, a technique to optimally learn a low-rank vector-valued function (i.e. an operator) between sampled data via regularized empirical risk minimization with rank constraints. We propose Gaussian sketching techniques both for the primal and dual optimization objectives, yielding Randomized Reduced Rank Regression (R4) estimators that are efficient and accurate. For each of our R4 algorithms we prove that the resulting regularized empirical risk is, in expectation w.r.t. randomness of a sketch, arbitrarily close to the optimal value when hyper-parameteres are properly tuned. Numerical expreriments illustrate the tightness of our bounds and show advantages in two distinct scenarios: (i) solving a vector-valued regression problem using synthetic and large-scale neuroscience datasets, and (ii) regressing the Koopman operator of a nonlinear stochastic dynamical system.

A randomized algorithm to solve reduced rank operator regression

TL;DR

This work tackles vector-valued regression between RKHSs by learning low-rank linear operators under regularized empirical risk minimization. It introduces Randomized Reduced Rank Regression (R), a Gaussian sketching framework that yields efficient primal and dual algorithms for computing -rank estimators, with explicit error bounds on the expected risk that improve as the sketch size and power iterations grow. Theoretical results show that the randomized estimators approach the optimal risk, while numerical experiments on synthetic data, large-scale neuroscience data, and Koopman-operator regression demonstrate substantial speedups (including GPU-enabled large-scale gains) without sacrificing accuracy. The methods offer a tractable route to scalable operator learning in infinite-dimensional settings and open avenues for combining with Nyström or random features to further reduce complexity and extend applicability to broader vector-valued regression tasks. Overall, R provides a principled, fast alternative to eigenvalue-based solvers for reduced-rank operator regression with strong theoretical guarantees and practical impact on large-scale problems.

Abstract

We present and analyze an algorithm designed for addressing vector-valued regression problems involving possibly infinite-dimensional input and output spaces. The algorithm is a randomized adaptation of reduced rank regression, a technique to optimally learn a low-rank vector-valued function (i.e. an operator) between sampled data via regularized empirical risk minimization with rank constraints. We propose Gaussian sketching techniques both for the primal and dual optimization objectives, yielding Randomized Reduced Rank Regression (R4) estimators that are efficient and accurate. For each of our R4 algorithms we prove that the resulting regularized empirical risk is, in expectation w.r.t. randomness of a sketch, arbitrarily close to the optimal value when hyper-parameteres are properly tuned. Numerical expreriments illustrate the tightness of our bounds and show advantages in two distinct scenarios: (i) solving a vector-valued regression problem using synthetic and large-scale neuroscience datasets, and (ii) regressing the Koopman operator of a nonlinear stochastic dynamical system.
Paper Structure (8 sections, 9 theorems, 36 equations, 4 figures, 1 table, 2 algorithms)

This paper contains 8 sections, 9 theorems, 36 equations, 4 figures, 1 table, 2 algorithms.

Key Result

Proposition 2.1

\newlabelprop:RRR_recap0 For the empirical risk minimization eq:RRR the following results hold.

Figures (4)

  • Figure 1: Comparison between the upper bounds (solid lines) derived in Theorems \ref{['thm:main_1']} and \ref{['thm:main_2']} and the estimated error (dashed lines) as a function of rank (left, $s=5$) and of oversampling parameter $s$ (right, $r=5$). The estimated error is represented by the average ($\pm$ 95% confidence interval) over 1000 independent seed initializations.
  • Figure 2: Risk difference evaluated on the training (left) and test (center) sets, and comparison of fit time between R$^4$ and R$^3$ (right). The estimated quantities are represented by the average over 10 independent seed initializations.
  • Figure 3: Assessment of the performance in predicting fMRI activity in terms of squared correlation, $r^2$ (the higher the better), and fit time. We show $r^2$ for baseline and R$^3$ (left), R$^3$ and R$^4$ (center), and the fit time for R$^3$ and R$^4$ (right). The estimated quantities are represented by the average over 10 independent seed initializations.
  • Figure 4: True and estimated eigenvalues of the logistic map (left), DHD between R$^3$ estimated eigenvalues and true eigenvalues (center), and DHD between R$^3$ and R$^4$ estimated eigenvalues.

Theorems & Definitions (14)

  • Proposition 2.1
  • Proposition 2.2
  • Proposition 2.3
  • Proof 1
  • Lemma 3.1
  • Proof 2
  • Proposition 4.1
  • Proof 3
  • Theorem 4.2: Theorem 3.17 and Remark 3.18 of Diouane2022
  • Theorem 4.3
  • ...and 4 more