Nonparametric Regression for Random Unbiased Perturbations
Anna Lyubarskaja, Dominik Rothenhäusler
TL;DR
This work introduces random unbiased perturbations (RUPs) as dataset-level, mean-zero shifts in the conditional law $Y|X$ with fixed covariate distribution $P^X$, distinct from adversarial or per-sample shifts. It derives an extended bias–variance decomposition that adds a distributional variance term, and shows that RUPs effectively reduce the information available to an estimator through an effective sample size $n_{\mathrm{eff}} = n/(1+n\tau)$, where $\tau$ captures perturbation strength and correlation. For local polynomial estimators, the paper establishes that optimal bandwidth scales with $h^\star \asymp (\tfrac{1}{n}+\tau)^{1/(2\beta+1)}$, leading to convergence rates in terms of $n_{\mathrm{eff}}$ and, in certain regimes, to perturbation-dominated scaling $h^\star \propto \tau^{1/(2\beta+1)}$. Minimax lower bounds show these rates are fundamental under RUPs, illustrating a new uncertainty regime that shapes both tuning and limits. The results offer practical guidance for bandwidth selection and evaluation under distributional randomness and open avenues for extending the RUP framework to other nonparametric tools and problems.
Abstract
We study nonparametric regression with covariates $X$ and outcome $Y$ under random unbiased perturbations (RUPs) of the conditional distribution $Y|X$, where the marginal distribution of covariates, $P^X$, remains fixed but the conditional law, $P^{Y|X}$, varies randomly across datasets. Unlike adversarial distribution shift frameworks that yield conservative worst-case guarantees, RUPs induce dataset-level variance inflation rather than systematic bias. We provide examples of RUPs and show that this distributional uncertainty reduces the effective sample size to $n_{\mathrm{eff}} = n/(1 + n τ)$, where $τ\in [0,1]$ quantifies the perturbation strength. For local polynomial estimators, we derive an extended bias-variance decomposition that includes a distributional variance term with the same bandwidth scaling as classical sampling variance. This leads to a modified bandwidth selection principle: when distributional uncertainty dominates sampling uncertainty ($τ\gg 1/n$), optimal bandwidths scale as $τ^{1/(2β+1)}$ rather than the usual $n^{-1/(2β+1)}$, where $β$ indicates the smoothness of the function class considered. We also establish matching minimax lower bounds showing that there exists an RUP for which this effective sample size $n_{\mathrm{eff}}$ is fundamental. Our results demonstrate that random dataset-level perturbations create a distinct mode of uncertainty that affects both practical tuning and fundamental statistical limits.
