Table of Contents
Fetching ...

Online $\mathrm{L}^{\natural}$-Convex Minimization

Ken Yokoyama, Shinji Ito, Tatsuya Matsuoka, Kei Kimura, Makoto Yokoo

TL;DR

The paper addresses online decision problems with nonlinear discrete objectives that fall outside the online submodular framework by introducing online $L^{\natural}$-convex minimization on integer lattices. It develops efficient full-information and bandit algorithms based on convex extensions (the Lovász extension and a piecewise-linear convex extension) and provides regret analyses, achieving $\mathbb{E}[R_T]=O(\hat{L}N\sqrt{dT})$ in the full information setting and $\mathbb{E}[R_T]=O(MdNT^{2/3})$ in the bandit setting, with a matching lower bound $\Omega(\hat{L}N\sqrt{dT})$ for the former. The framework is illustrated through applications to online spare-parts inventory and call-center shift scheduling, showing that these problems can be formulated as online $L^{\natural}$-convex minimization and solved with provable guarantees. Overall, the work extends online convex/ submodular optimization to discrete, lattice-based domains, enabling efficient handling of a broader class of nonlinear, combinatorial online problems with practical impact.

Abstract

An online decision-making problem is a learning problem in which a player repeatedly makes decisions in order to minimize the long-term loss. These problems that emerge in applications often have nonlinear combinatorial objective functions, and developing algorithms for such problems has attracted considerable attention. An existing general framework for dealing with such objective functions is the online submodular minimization. However, practical problems are often out of the scope of this framework, since the domain of a submodular function is limited to a subset of the unit hypercube. To manage this limitation of the existing framework, we in this paper introduce the online $\mathrm{L}^{\natural}$-convex minimization, where an $\mathrm{L}^{\natural}$-convex function generalizes a submodular function so that the domain is a subset of the integer lattice. We propose computationally efficient algorithms for the online $\mathrm{L}^{\natural}$-convex function minimization in two major settings: the full information and the bandit settings. We analyze the regrets of these algorithms and show in particular that our algorithm for the full information setting obtains a tight regret bound up to a constant factor. We also demonstrate several motivating examples that illustrate the usefulness of the online $\mathrm{L}^{\natural}$-convex minimization.

Online $\mathrm{L}^{\natural}$-Convex Minimization

TL;DR

The paper addresses online decision problems with nonlinear discrete objectives that fall outside the online submodular framework by introducing online -convex minimization on integer lattices. It develops efficient full-information and bandit algorithms based on convex extensions (the Lovász extension and a piecewise-linear convex extension) and provides regret analyses, achieving in the full information setting and in the bandit setting, with a matching lower bound for the former. The framework is illustrated through applications to online spare-parts inventory and call-center shift scheduling, showing that these problems can be formulated as online -convex minimization and solved with provable guarantees. Overall, the work extends online convex/ submodular optimization to discrete, lattice-based domains, enabling efficient handling of a broader class of nonlinear, combinatorial online problems with practical impact.

Abstract

An online decision-making problem is a learning problem in which a player repeatedly makes decisions in order to minimize the long-term loss. These problems that emerge in applications often have nonlinear combinatorial objective functions, and developing algorithms for such problems has attracted considerable attention. An existing general framework for dealing with such objective functions is the online submodular minimization. However, practical problems are often out of the scope of this framework, since the domain of a submodular function is limited to a subset of the unit hypercube. To manage this limitation of the existing framework, we in this paper introduce the online -convex minimization, where an -convex function generalizes a submodular function so that the domain is a subset of the integer lattice. We propose computationally efficient algorithms for the online -convex function minimization in two major settings: the full information and the bandit settings. We analyze the regrets of these algorithms and show in particular that our algorithm for the full information setting obtains a tight regret bound up to a constant factor. We also demonstrate several motivating examples that illustrate the usefulness of the online -convex minimization.
Paper Structure (16 sections, 16 theorems, 37 equations, 4 tables, 3 algorithms)

This paper contains 16 sections, 16 theorems, 37 equations, 4 tables, 3 algorithms.

Key Result

lemma 1

Let $f:D\rightarrow \mathbb{R}$ be a submodular function with $D$ being a simple distributive lattice and let $\hat{f}:\overline{D}\rightarrow \mathbb{R}$ be the Lovász extention of $f$. The following properties hold for $\hat{f}$:

Theorems & Definitions (34)

  • definition 1: $\mathrm{L}^{\natural}$-convex set fujishige2000notes
  • definition 2: $\mathrm{L}^{\natural}$-convex function fujishige2000notes
  • definition 3: simple distributive lattice
  • definition 4: chain
  • definition 5: Lovász extension on a simple distributive lattice fujishige2005submodular
  • lemma 1: Properties of the Lovász extension bach2013learningfujishige2005submodular
  • definition 6: maximal chain associated with $x$
  • definition 7: convex extension of $\mathrm{L}^{\natural}$-convex functions fujishige2005submodular
  • lemma 2: fujishige2005submodular
  • lemma 3
  • ...and 24 more