Sensitivity analysis from a single input/output sample
Sébastien Da Veiga, Fabrice Gamboa, Thierry Klein, Agnès Lagnoux, Clémentine Prieur
TL;DR
This work addresses how to estimate closed Sobol' indices $S^X=rac{ ext{Var}( ext{E}[Y|X])}{ ext{Var}(Y)}$ from a single i.i.d. input/output sample, without restrictive input independence. It introduces two mirror-type, high-order kernel regression estimators for the regression function $m(x)=\text{E}[Y|X=x]$, derives efficient influence-function-based estimators for $T=\text{E}[\text{E}[Y|X]^2]$, and proves $\sqrt{n}$-consistency and asymptotic efficiency via central limit theorems. The paper compares these estimators to PF, NN, and other kernel approaches, showing asymptotic efficiency and favorable finite-sample performance on standard test functions and a flood model. The resulting methodology provides a practical, theoretically sound tool for global sensitivity analysis when only a single $n$-sample is available.
Abstract
The main objective of this paper is to estimate optimally Sobol' indices at any order when a unique input/output i.i.d.\ sample is available. Our approach stands on three main ingredients: semi-parametric estimation theory, high-order kernel estimation (inspired by the paper of Doksum in 1995), and mirror-type transformations as introduced in Bertin 2020 and Pujol 2022. We propose two different estimators. We prove that these estimators are asymptotically normal and efficient. Furthermore, we illustrate their numerical properties on standard examples.
