Gradient-enhanced global sensitivity analysis with Poincar{é} chaos expansions
O Roustant, N Lüthen, D Heredia, B Sudret
TL;DR
The paper addresses efficient global sensitivity analysis when gradient information is available, by introducing gradient-enhanced chaos expansions based on a Poincaré basis. It proves that the derivative-stable property (orthogonal derivatives) uniquely identifies a Poincaré basis tied to weighted Poincaré inequalities and Sturm-Liouville problems. A full framework combines sparse gradient-enhanced regression with flexible derivative weighting (including the Stein kernel weight $w_{\mathrm{lin}}$) to estimate Sobol' indices and DGSM for broad distributions. Numerical experiments on toy and flood models show consistent improvements in Sobol' index estimation and surrogate accuracy, especially with weighting and derivative augmentation. The approach broadens applicability of gradient-enhanced GSA to non-classical distributions and offers practical, open-source tooling.
Abstract
Chaos expansions are widely used in global sensitivity analysis (GSA), as they leverage orthogonal bases of L2 spaces to efficiently compute Sobol' indices, particularly in data-scarce settings. When derivatives are available, we argue that a desirable property is for the derivatives of the basis functions to also form an orthogonal basis. We demonstrate that the only basis satisfying this property is the one associated with weighted Poincar{é} inequalities and Sturm-Liouville eigenvalue problems, which we refer to as the Poincar{é} basis. We then introduce a comprehensive framework for gradient-enhanced GSA that integrates recent advances in sparse, gradient-enhanced regression for surrogate modeling with the construction of weighting schemes for derivative-based sensitivity analysis. The proposed methodology is applicable to a broad class of probability measures and supports various choices of weights. We illustrate the effectiveness of the approach on a challenging flood modeling case study, where Sobol' indices are accurately estimated using limited data.
