Distribution-Free Robust Functional Predict-Then-Optimize
Yash Patel, Ambuj Tewari
TL;DR
This work tackles uncertainty quantification for neural operator surrogates solving PDEs by extending conformal prediction to infinite-dimensional Sobolev function spaces. It builds a robust predict-then-optimize pipeline that leverages calibrated, spectrum-aware prediction regions to bound downstream decision suboptimality. Theoretical results provide coverage guarantees across elliptic and parabolic PDE families, along with finite-projection and multi-stage optimization analyses to enable scalable, discretization-invariant procedures. Empirical results on quantum state discrimination and PDE-based collection problems demonstrate improved robustness and decision quality over nominal approaches. The approach offers distribution-free uncertainty control with practical implications for engineering design and quantum information tasks.
Abstract
The solution of PDEs in decision-making tasks is increasingly being undertaken with the help of neural operator surrogate models due to the need for repeated evaluation. Such methods, while significantly more computationally favorable compared to their numerical counterparts, fail to provide any calibrated notions of uncertainty in their predictions. Current methods approach this deficiency typically with ensembling or Bayesian posterior estimation. However, these approaches either require distributional assumptions that fail to hold in practice or lack practical scalability, limiting their applications in practice. We, therefore, propose a novel application of conformal prediction to produce distribution-free uncertainty quantification over the function spaces mapped by neural operators. We then demonstrate how such prediction regions enable a formal regret characterization if leveraged in downstream robust decision-making tasks. We further demonstrate how such posited robust decision-making tasks can be efficiently solved using an infinite-dimensional generalization of Danskin's Theorem and calculus of variations and empirically demonstrate the superior performance of our proposed method over more restrictive modeling paradigms, such as Gaussian Processes, across several engineering tasks.
