An adjoint-free algorithm for conditional nonlinear optimal perturbations (CNOPs) via sampling
Bin Shi, Guodong Sun
TL;DR
The paper addresses computing conditional nonlinear optimal perturbations (CNOPs) without adjoint-gradient information by introducing a sampling-based, zeroth-order optimization method. It leverages a high-dimensional Stokes' theorem to express the gradient of a ball-averaged objective as an expectation and uses a sample average to approximate this gradient, with a Chernoff-type concentration bound guaranteeing probabilistic accuracy. The method is demonstrated on two models—the Burgers equation with small viscosity and the Lorenz-96 system—showing that, for comparable accuracy, the sampling approach can achieve substantial reductions in computation time relative to traditional adjoint-based or direct methods. Overall, the sampling CNOP algorithm offers a practical, scalable alternative for predictability analyses in atmospheric and oceanic contexts where adjoint models are unavailable or costly. The results indicate near-alignment with baseline methods in objective value and pattern fidelity, with clear time advantages when using a modest number of samples.
Abstract
In this paper, we propose a sampling algorithm based on state-of-the-art statistical machine learning techniques to obtain conditional nonlinear optimal perturbations (CNOPs), which is different from traditional (deterministic) optimization methods.1 Specifically, the traditional approach is unavailable in practice, which requires numerically computing the gradient (first-order information) such that the computation cost is expensive, since it needs a large number of times to run numerical models. However, the sampling approach directly reduces the gradient to the objective function value (zeroth-order information), which also avoids using the adjoint technique that is unusable for many atmosphere and ocean models and requires large amounts of storage. We show an intuitive analysis for the sampling algorithm from the law of large numbers and further present a Chernoff-type concentration inequality to rigorously characterize the degree to which the sample average probabilistically approximates the exact gradient. The experiments are implemented to obtain the CNOPs for two numerical models, the Burgers equation with small viscosity and the Lorenz-96 model. We demonstrate the CNOPs obtained with their spatial patterns, objective values, computation times, and nonlinear error growth. Compared with the performance of the three approaches, all the characters for quantifying the CNOPs are nearly consistent, while the computation time using the sampling approach with fewer samples is much shorter. In other words, the new sampling algorithm shortens the computation time to the utmost at the cost of losing little accuracy.
