Designing Ambiguity Sets for Distributionally Robust Optimization Using Structural Causal Optimal Transport
Ahmad-Reza Ehyaei, Golnoosh Farnadi, Samira Samadi
TL;DR
This work addresses distributional robustness when data follow a structural causal model (SCM) by introducing a structural causal optimal transport (OT) framework. It defines a structural causal ambiguity set $\\mathcal{B}^{\\mathcal{F}}(\\mathbb{P},\\delta)$ that leverages structural equations and a bijective reduced-form mapping $g$ to connect endogenous and exogenous spaces, yielding a more realistic DAS than existing $\\mathcal{A}$ and $\\mathcal{G}$ variants. A relaxed, entropy-regularized version $W^{\\mathcal{F}_{\\varepsilon}}$ enables a difference-of-convex formulation and a Sinkhorn-based algorithm, with finite-sample guarantees when SCMs are estimated and a dimension-free shrinkage rate arising from exogenous independence. The paper demonstrates, via a simple two-variable ANM example, that incorporating structural equations reduces worst-case loss and produces more coherent ambiguity sets, offering a scalable approach for designing DAS under causal structure in DRO problems.
Abstract
Distributionally robust optimization tackles out-of-sample issues like overfitting and distribution shifts by adopting an adversarial approach over a range of possible data distributions, known as the ambiguity set. To balance conservatism and accuracy, these sets must include realistic probability distributions by leveraging information from the nominal distribution. Assuming that nominal distributions arise from a structural causal model with a directed acyclic graph $\mathcal{G}$ and structural equations, previous methods such as adapted and $\mathcal{G}$-causal optimal transport have only utilized causal graph information in designing ambiguity sets. In this work, we propose incorporating structural equations, which include causal graph information, to enhance ambiguity sets, resulting in more realistic distributions. We introduce structural causal optimal transport and its associated ambiguity set, demonstrating their advantages and connections to previous methods. A key benefit of our approach is a relaxed version, where a regularization term replaces the complex causal constraints, enabling an efficient algorithm via difference-of-convex programming to solve structural causal optimal transport. We also show that when structural information is absent and must be estimated, our approach remains effective and provides finite sample guarantees. Lastly, we address the radius of ambiguity sets, illustrating how our method overcomes the curse of dimensionality in optimal transport problems, achieving faster shrinkage with dimension-free order.
