Table of Contents
Fetching ...

Optimal inference for the mean of random functions

Omar Kassi, Valentin Patilea

TL;DR

This work addresses inference for the mean function $\mu$ of random functions on $\mathcal{T}=[0,1]^D$ under random design and heteroscedastic noise by introducing a Fourier-series–based estimator built from de La Vallée Poussin projections. A fast control-variate linear-integration scheme is used to estimate Fourier coefficients, achieving minimax rates over Hölder classes and providing non-asymptotic $L^2$ risk bounds with explicit constants, plus non-asymptotic Gaussian approximations for coefficient vectors to enable pointwise and uniform confidence sets. The approach is made adaptive by plug-in estimators for the Hölder regularity $\alpha_\mu$, with concentration bounds ensuring reliable selection of the smoothing level and Hölder constant. The results cover both dense and sparse sampling regimes through a phase-transition in rates, and yield practical inference tools (confidence bands and regions) that respect the randomness of the design and heteroscedastic noise. Overall, the paper advances adaptive, nonparametric mean-function inference for random functional data in high dimensions using a Fourier-analytic framework with non-asymptotic guarantees.

Abstract

We study estimation and inference for the mean of real-valued random functions defined on a hypercube. The independent random functions are observed on a discrete, random subset of design points, possibly with heteroscedastic noise. We propose a novel optimal-rate estimator based on Fourier series expansions and establish a sharp non-asymptotic error bound in $L^2-$norm. Additionally, we derive a non-asymptotic Gaussian approximation bound for our estimated Fourier coefficients. Pointwise and uniform confidence sets are constructed. Our approach is made adaptive by a plug-in estimator for the Hölder regularity of the mean function, for which we derive non-asymptotic concentration bounds.

Optimal inference for the mean of random functions

TL;DR

This work addresses inference for the mean function of random functions on under random design and heteroscedastic noise by introducing a Fourier-series–based estimator built from de La Vallée Poussin projections. A fast control-variate linear-integration scheme is used to estimate Fourier coefficients, achieving minimax rates over Hölder classes and providing non-asymptotic risk bounds with explicit constants, plus non-asymptotic Gaussian approximations for coefficient vectors to enable pointwise and uniform confidence sets. The approach is made adaptive by plug-in estimators for the Hölder regularity , with concentration bounds ensuring reliable selection of the smoothing level and Hölder constant. The results cover both dense and sparse sampling regimes through a phase-transition in rates, and yield practical inference tools (confidence bands and regions) that respect the randomness of the design and heteroscedastic noise. Overall, the paper advances adaptive, nonparametric mean-function inference for random functional data in high dimensions using a Fourier-analytic framework with non-asymptotic guarantees.

Abstract

We study estimation and inference for the mean of real-valued random functions defined on a hypercube. The independent random functions are observed on a discrete, random subset of design points, possibly with heteroscedastic noise. We propose a novel optimal-rate estimator based on Fourier series expansions and establish a sharp non-asymptotic error bound in norm. Additionally, we derive a non-asymptotic Gaussian approximation bound for our estimated Fourier coefficients. Pointwise and uniform confidence sets are constructed. Our approach is made adaptive by a plug-in estimator for the Hölder regularity of the mean function, for which we derive non-asymptotic concentration bounds.

Paper Structure

This paper contains 12 sections, 20 theorems, 130 equations.

Key Result

Proposition 1

Assume $\mu \in \Sigma (\alpha_\mu, C_\mu; D) \cap \mathcal{F}_D$ with $\alpha_\mu$ and $C_\mu$ are two positive numbers. Then, a positive constant $\mathcal{C}_\mu$ exists, depending only on $\alpha_\mu$, such that

Theorems & Definitions (40)

  • Definition 1
  • Proposition 1
  • Definition 2: Leave-one-out neighbors, and Voronoi cells and volumes
  • Definition 3: Degree and cumulative length
  • Proposition 2
  • Proposition 3
  • Proposition 4
  • Remark 1
  • Proposition 5
  • Corollary 1
  • ...and 30 more