An adaptive importance sampling algorithm for risk-averse optimization
Sandra Pieraccini, Tommaso Vanzan
TL;DR
This paper addresses risk-averse optimization by minimizing $CVaR_\beta[f(z,\boldsymbol{\xi})]$ and proposes an adaptive importance sampling algorithm that jointly tunes the sampling distribution and the sample size using a reduced-order model (ROM) of $f(z,\cdot)$. The ROM-based biasing densities oversample the current risk region, reducing gradient variance and enabling smaller per-iteration samples while preserving the linear convergence rate. A convergence analysis for the alternating optimization is provided, and numerical experiments on PDE-constrained problems demonstrate substantial computational savings. The work situates itself among adaptive sampling and ROM-based CVaR evaluation, offering a practical, scalable framework for risk-averse stochastic optimization.
Abstract
Adaptive sampling algorithms are modern and efficient methods that dynamically adjust the sample size throughout the optimization process. However, they may encounter difficulties in risk-averse settings, particularly due to the challenge of accurately sampling from the tails of the underlying distribution of random inputs. This often leads to a much faster growth of the sample size compared to risk-neutral problems. In this work, we propose a novel adaptive sampling algorithm that adapts both the sample size and the sampling distribution at each iteration. The biasing distributions are constructed on the fly, leveraging a reduced-order model of the objective function to be minimized, and are designed to oversample a so-called risk region. As a result, a reduction of the variance of the gradients is achieved, which permits to use fewer samples per iteration compared to a standard algorithm, while still preserving the asymptotic convergence rate. Our focus is on the minimization of the Conditional Value-at-Risk (CVaR), and we establish the convergence of the proposed computational framework. Numerical experiments confirm the substantial computational savings achieved by our approach.
