Adjoint Monte Carlo Method
Russel Caflisch, Yunan Yang
TL;DR
The article surveys adjoint Monte Carlo methods for optimization problems governed by kinetic PDEs, addressing the challenge of high-dimensional phase space and stochastic objective evaluations. It develops a unified framework that blends Monte Carlo gradient estimators (score function, pathwise/reparameterization, and coupling) with PDE-constrained optimization via the adjoint-state method, and demonstrates two kinetic case studies: the Radiative Transport Equation and the Boltzmann equation. Key contributions include deriving adjoint MC formulations under both OTD and DTO paradigms, introducing adjoint particle methods and adjoint DSMC for efficient, correlated forward-adjoint computations, and providing numerical evidence of variance-controlled gradient estimation and scalability in high-dimensional settings. The results highlight the practical impact of adjoint MC methods for design, control, and topology optimization in radiative transfer, rarefied gas dynamics, and related kinetic systems. Overall, the work offers a robust, extensible framework for integrating MC gradient estimators with PDE-constrained optimization to tackle complex, high-dimensional kinetic problems.
Abstract
This survey explores the development of adjoint Monte Carlo methods for solving optimization problems governed by kinetic equations, a common challenge in areas such as plasma control and device design. These optimization problems are particularly demanding due to the high dimensionality of the phase space and the randomness in evaluating the objective functional, a consequence of using a forward Monte Carlo solver. To overcome these difficulties, a range of ``adjoint Monte Carlo methods'' have been devised. These methods skillfully combine Monte Carlo gradient estimators with PDE-constrained optimization, introducing innovative solutions tailored for kinetic applications. In this review, we begin by examining three primary strategies for Monte Carlo gradient estimation: the score function approach, the reparameterization trick, and the coupling method. We also delve into the adjoint-state method, an essential element in PDE-constrained optimization. Focusing on applications in the radiative transfer equation and the nonlinear Boltzmann equation, we provide a comprehensive guide on how to integrate Monte Carlo gradient techniques within both the optimize-then-discretize and the discretize-then-optimize frameworks from PDE-constrained optimization. This approach leads to the formulation of effective adjoint Monte Carlo methods, enabling efficient gradient estimation in complex, high-dimensional optimization problems.
