Wasserstein Distributionally Robust Online Learning

Guixian Chen; Salar Fattahi; Soroosh Shafiee

Wasserstein Distributionally Robust Online Learning

Guixian Chen, Salar Fattahi, Soroosh Shafiee

TL;DR

The key insight is a novel connection between the worst-case expectation problem, an inherently infinite-dimensional optimization problem, and a classical and tractable budget allocation problem, which is of independent interest.

Abstract

We study distributionally robust online learning, where a risk-averse learner updates decisions sequentially to guard against worst-case distributions drawn from a Wasserstein ambiguity set centered at past observations. While this paradigm is well understood in the offline setting through Wasserstein Distributionally Robust Optimization (DRO), its online extension poses significant challenges in both convergence and computation. In this paper, we address these challenges. First, we formulate the problem as an online saddle-point stochastic game between a decision maker and an adversary selecting worst-case distributions, and propose a general framework that converges to a robust Nash equilibrium coinciding with the solution of the corresponding offline Wasserstein DRO problem. Second, we address the main computational bottleneck, which is the repeated solution of worst-case expectation problems. For the important class of piecewise concave loss functions, we propose a tailored algorithm that exploits problem geometry to achieve substantial speedups over state-of-the-art solvers such as Gurobi. The key insight is a novel connection between the worst-case expectation problem, an inherently infinite-dimensional optimization problem, and a classical and tractable budget allocation problem, which is of independent interest.

Wasserstein Distributionally Robust Online Learning

TL;DR

Abstract

Paper Structure (28 sections, 22 theorems, 156 equations, 4 algorithms)

This paper contains 28 sections, 22 theorems, 156 equations, 4 algorithms.

Introduction
Summary of Contributions
Related Works
Wasserstein DRO
Online Wasserstein DRO
Online Algorithms for Robust Optimization
Adversarial Training and Domain Shift
Risk-Averse Online Learning
Distributionally Robust Regret Optimization
Notation and Outline
Problem Setup
Convergence Analysis
Wasserstein Oracle
Conclusion
Algorithmic Details
...and 13 more sections

Key Result

lemma 1

Suppose Assumptions asp::region and asp::convexity hold. Moreover, if $p=1$, suppose in addition that either $\Xi$ is compact or there exists a constant $g>0$ such that $\ell(x,\xi) \le g(1+\|\xi\|^r)$ for all $\xi \in \Xi$ and some $r \in (0,1)$. Then,

Theorems & Definitions (36)

lemma 1: Existence of Saddle Point
lemma 2
lemma 3
lemma 4
theorem 1
corollary 1
theorem 2
lemma 5
lemma 6
lemma 7
...and 26 more

Wasserstein Distributionally Robust Online Learning

TL;DR

Abstract

Wasserstein Distributionally Robust Online Learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (36)