Online Submodular Maximization via Online Convex Optimization
Tareq Si Salem, Gözde Özcan, Iasonas Nikolaou, Evimaria Terzi, Stratis Ioannidis
TL;DR
The paper develops a general framework to tackle online submodular maximization under matroid constraints by reducing it to online convex optimization through concave relaxations and randomized rounding. It introduces the Rounding Augmented OCO (RAOCO) policy and proves that, for weighted threshold potential (WTP) functions, the OCO regret transfers to an $oldsymbol{\alpha}$-regret in the online submodular setting, with the approximation factor improving beyond the classic $1-1/e$ when the threshold degree is finite. The authors extend the reduction to dynamic, optimistic, and bandit variants, offering sublinear dynamic regret bounds and optimistic guarantees, and provide specialized results for matroid polytopes using negatively correlated rounding (swap/pipage). Empirically, RAOCO with OGA/OMA delivers strong performance across influence maximization, facility location, and related problems, significantly outperforming baselines in both integral and fractional settings while maintaining favorable computational efficiency. The work thus offers a principled, scalable approach to online submodular optimization with practical implications for a broad class of combinatorial problems.
Abstract
We study monotone submodular maximization under general matroid constraints in the online setting. We prove that online optimization of a large class of submodular functions, namely, weighted threshold potential functions, reduces to online convex optimization (OCO). This is precisely because functions in this class admit a concave relaxation; as a result, OCO policies, coupled with an appropriate rounding scheme, can be used to achieve sublinear regret in the combinatorial setting. We show that our reduction extends to many different versions of the online learning problem, including the dynamic regret, bandit, and optimistic-learning settings.
