Table of Contents
Fetching ...

Boosting Gradient Ascent for Continuous DR-submodular Maximization

Qixin Zhang, Zongqi Wan, Zengde Deng, Zaiyi Chen, Xiaoming Sun, Jialin Zhang, Yu Yang

TL;DR

The paper addresses the suboptimal approximation of projected gradient ascent for continuous DR-submodular maximization by introducing a non-oblivious auxiliary function F that reweights gradient information. By solving a factor-revealing optimization, the authors derive an optimal weight function, yielding stationary points of F with a (1−e^{−γ})-approximation for monotone γ-weak DR-submodular objectives and a (1−∥x̲∥∞)/4-approximation for non-monotone objectives, thereby boosting PGA in offline, online, bandit, and minimax contexts. They develop unbiased gradient estimators for F, extend the boosting framework to online delayed and bandit feedback, and prove regret guarantees that match or improve existing bounds, along with empirical demonstrations on four problem families. The approach provides a scalable, theory-backed method to achieve tight approximation ratios in diverse settings, offering practical impact for large-scale continuous submodular optimization. Overall, the work unifies and advances boosting techniques for continuous submodular maximization, with broad applicability and provable performance improvements.

Abstract

Projected Gradient Ascent (PGA) is the most commonly used optimization scheme in machine learning and operations research areas. Nevertheless, numerous studies and examples have shown that the PGA methods may fail to achieve the tight approximation ratio for continuous DR-submodular maximization problems. To address this challenge, we present a boosting technique in this paper, which can efficiently improve the approximation guarantee of the standard PGA to \emph{optimal} with only small modifications on the objective function. The fundamental idea of our boosting technique is to exploit non-oblivious search to derive a novel auxiliary function $F$, whose stationary points are excellent approximations to the global maximum of the original DR-submodular objective $f$. Specifically, when $f$ is monotone and $γ$-weakly DR-submodular, we propose an auxiliary function $F$ whose stationary points can provide a better $(1-e^{-γ})$-approximation than the $(γ^2/(1+γ^2))$-approximation guaranteed by the stationary points of $f$ itself. Similarly, for the non-monotone case, we devise another auxiliary function $F$ whose stationary points can achieve an optimal $\frac{1-\min_{\boldsymbol{x}\in\mathcal{C}}\|\boldsymbol{x}\|_{\infty}}{4}$-approximation guarantee where $\mathcal{C}$ is a convex constraint set. In contrast, the stationary points of the original non-monotone DR-submodular function can be arbitrarily bad~\citep{chen2023continuous}. Furthermore, we demonstrate the scalability of our boosting technique on four problems. In all of these four problems, our resulting variants of boosting PGA algorithm beat the previous standard PGA in several aspects such as approximation ratio and efficiency. Finally, we corroborate our theoretical findings with numerical experiments, which demonstrate the effectiveness of our boosting PGA methods.

Boosting Gradient Ascent for Continuous DR-submodular Maximization

TL;DR

The paper addresses the suboptimal approximation of projected gradient ascent for continuous DR-submodular maximization by introducing a non-oblivious auxiliary function F that reweights gradient information. By solving a factor-revealing optimization, the authors derive an optimal weight function, yielding stationary points of F with a (1−e^{−γ})-approximation for monotone γ-weak DR-submodular objectives and a (1−∥x̲∥∞)/4-approximation for non-monotone objectives, thereby boosting PGA in offline, online, bandit, and minimax contexts. They develop unbiased gradient estimators for F, extend the boosting framework to online delayed and bandit feedback, and prove regret guarantees that match or improve existing bounds, along with empirical demonstrations on four problem families. The approach provides a scalable, theory-backed method to achieve tight approximation ratios in diverse settings, offering practical impact for large-scale continuous submodular optimization. Overall, the work unifies and advances boosting techniques for continuous submodular maximization, with broad applicability and provable performance improvements.

Abstract

Projected Gradient Ascent (PGA) is the most commonly used optimization scheme in machine learning and operations research areas. Nevertheless, numerous studies and examples have shown that the PGA methods may fail to achieve the tight approximation ratio for continuous DR-submodular maximization problems. To address this challenge, we present a boosting technique in this paper, which can efficiently improve the approximation guarantee of the standard PGA to \emph{optimal} with only small modifications on the objective function. The fundamental idea of our boosting technique is to exploit non-oblivious search to derive a novel auxiliary function , whose stationary points are excellent approximations to the global maximum of the original DR-submodular objective . Specifically, when is monotone and -weakly DR-submodular, we propose an auxiliary function whose stationary points can provide a better -approximation than the -approximation guaranteed by the stationary points of itself. Similarly, for the non-monotone case, we devise another auxiliary function whose stationary points can achieve an optimal -approximation guarantee where is a convex constraint set. In contrast, the stationary points of the original non-monotone DR-submodular function can be arbitrarily bad~\citep{chen2023continuous}. Furthermore, we demonstrate the scalability of our boosting technique on four problems. In all of these four problems, our resulting variants of boosting PGA algorithm beat the previous standard PGA in several aspects such as approximation ratio and efficiency. Finally, we corroborate our theoretical findings with numerical experiments, which demonstrate the effectiveness of our boosting PGA methods.
Paper Structure (58 sections, 38 theorems, 136 equations, 6 figures, 9 tables, 4 algorithms)

This paper contains 58 sections, 38 theorems, 136 equations, 6 figures, 9 tables, 4 algorithms.

Key Result

Lemma 3

If $f$ is a differentiable monotone $\gamma$-weakly DR-submodular function, then for any stationary point $\boldsymbol{x}\in\mathcal{C}$ of $f$, we have

Figures (6)

  • Figure 1: In \ref{['graph1']}, we test the performance of the four algorithms for the special monotone submodular function in hassani2017gradient where the GA(5) and BGA(5) start from $\boldsymbol{x}_{loc}$. Simultaneously, we present the results for all algorithm starting from the origin in \ref{['graph2']}. \ref{['graph3']} show the performance of four algorithms for the special non-monotone submodular function in chen2023continuous where the GA(5) and BGA(5) start from $\widetilde{\boldsymbol{x}}_{loc}$. Similarly, \ref{['graph4']} presents the results from the origin point.
  • Figure 2: \ref{['graph31']} shows the performance of GA(5),BGA(5),CG,and SCG in monotone movie recommendation task. In \ref{['graph32']}, we report the results of GA(5),BGA(5),Measured FW,Measured FW-VR and Non-mono FW in non-monotone movie recommendation.
  • Figure 3: In \ref{['graph41']}-\ref{['graph43']}, we report the results for the online monotone movie recommendation task under full information, delayed feedback and bandit feedback. Similarly, \ref{['graph44']}-\ref{['graph46']} show the results of three different scenarios about online non-monotone movie recommendation tasks.
  • Figure 4: In \ref{['graph_minimax_1']}-\ref{['graph_minimax_2']}, we show the comparison of our proposed methods for Convex-facility Location. Similarly, the results about attack for item recommendation are presented in \ref{['graph_minimax_3']}-\ref{['graph_minimax_4']}.
  • Figure 5: In \ref{['graph21']}, we test the performance of GA(5),BGA(5),CG,and SCG in simulated continuous monotone DR-submodular quadratic programming. \ref{['graph22']} reports the results of GA(5),BGA(5),Measured FW,Measured FW-VR and Non-mono FW in simulated non-monotone DR-submodular quadratic programming.
  • ...and 1 more figures

Theorems & Definitions (53)

  • Definition 1
  • Remark 2
  • Lemma 3: hassani2017gradient
  • Remark 4
  • Lemma 5: Proof in \ref{['proof:lem2']}
  • Theorem 6: Proof in the \ref{['proof:thm1']}
  • Corollary 7
  • Remark 8
  • Theorem 9: Proof in \ref{['proof:thm2']}
  • Remark 10
  • ...and 43 more