On the Benefits of Active Data Collection in Operator Learning
Unique Subedi, Ambuj Tewari
TL;DR
The paper addresses data efficiency in learning PDE solution operators by introducing active data collection when inputs are drawn from mean-zero processes with covariance kernel $K$. It proposes sampling along the covariance operator’s eigenfunctions and a linear estimator $\widehat{\mathcal{F}}_n=\sum_{i=1}^n w_i\otimes \varphi_i$, achieving a risk bound $\mathbb{E}_{v\sim\mu}\|\widehat{\mathcal{F}}_n(v)-\mathcal{F}(v)\|_{L^2}^2 \le \varepsilon^2\sum_{i=1}^n \lambda_i + \|\mathcal{F}\|_{op}^2 \sum_{i>n} \lambda_i$, with $\lambda_i$ as the eigenvalues of the covariance operator. By tailoring the eigenvalue decay of $K$, arbitrarily fast convergence rates can be obtained, in stark contrast to passive i.i.d. data whose risk cannot vanish faster than linear in $n$; a minimax lower bound confirms the advantage of active data collection. The approach leverages the Karhunen–Loève decomposition and is demonstrated on covariance kernels including fractional Laplacian inverses, RBF, and Brownian motion, with experiments on Poisson and heat equations confirming data-efficiency gains. The work suggests natural extensions to nonlinear operators and operator RKHS frameworks, along with connections to PCANet-like approaches for even faster rates in practice.
Abstract
We study active data collection strategies for operator learning when the target operator is linear and the input functions are drawn from a mean-zero stochastic process with continuous covariance kernels. With an active data collection strategy, we establish an error convergence rate in terms of the decay rate of the eigenvalues of the covariance kernel. We can achieve arbitrarily fast error convergence rates with sufficiently rapid eigenvalue decay of the covariance kernels. This contrasts with the passive (i.i.d.) data collection strategies, where the convergence rate is never faster than linear decay ($\sim n^{-1}$). In fact, for our setting, we show a \emph{non-vanishing} lower bound for any passive data collection strategy, regardless of the eigenvalues decay rate of the covariance kernel. Overall, our results show the benefit of active data collection strategies in operator learning over their passive counterparts.
