Overfitting in Adaptive Robust Optimization
Karl Zhu, Dimitris Bertsimas
TL;DR
Adaptive robust optimization (ARO) can be brittle because adaptivity makes some originally uncertainty-free constraints depend on realized uncertainty, potentially causing infeasibilities outside the modeled set. The authors propose constraint-dependent uncertainty sets and robust counterparts (RCs) to enforce probabilistic guarantees for softer constraints and deterministic guarantees for hard constraints, revealing that RCs act as regularization on the adaptive coefficients $V$ and shrink or permit adaptivity depending on the strength of the guarantees. They derive probabilistic RCs under Gaussian and distribution-free settings for ellipsoidal, ball-box, and budget uncertainty sets, as well as deterministic RCs for bounded and semi-bounded supports, and interpret these guarantees through a regularization lens that links guarantee strength to reduced adaptivity. The work provides principled guidance for designing uncertainty sets that balance robustness and adaptivity in ARO, with implications for energy systems and other domains where uncertainty is high but not uniformly critical across constraints.
Abstract
Adaptive robust optimization (ARO) extends static robust optimization by allowing decisions to depend on the realized uncertainty - weakly dominating static solutions within the modeled uncertainty set. However, ARO makes previous constraints that were independent of uncertainty now dependent, making it vulnerable to additional infeasibilities when realizations fall outside the uncertainty set. This phenomenon of adaptive policies being brittle is analogous to overfitting in machine learning. To mitigate against this, we propose assigning constraint-specific uncertainty set sizes, with harder constraints given stronger probabilistic guarantees. Interpreted through the overfitting lens, this acts as regularization: tighter guarantees shrink adaptive coefficients to ensure stability, while looser ones preserve useful flexibility. This view motivates a principled approach to designing uncertainty sets that balances robustness and adaptivity.
