Table of Contents
Fetching ...

A Benchmark Study of Deep-RL Methods for Maximum Coverage Problems over Graphs

Zhicheng Liang, Yu Yang, Xiangyu Ke, Xiaokui Xiao, Yunjun Gao

TL;DR

The paper benchmarks five Deep-RL methods (S2V-DQN, Geometric-QN, GCOMB, RL4IM, LeNSE) for MCP and IM against strong baselines, revealing that Lazy Greedy and IMM/OPIM typically outperform Deep-RL in MCP and IM respectively. It systematically analyzes training-time considerations, graph-distribution generalization, and data-size effects, highlighting practical limitations of current Deep-RL approaches. The study demonstrates that, despite data-driven ambitions, traditional submodular optimization and RIS-based techniques remain more effective and scalable under the IC model, with notable exceptions in atypical budget regimes. The findings underscore challenges in generalization, efficiency, and baseline adequacy, offering concrete directions for improving Deep-RL methods in graph-based combinatorial optimization.

Abstract

Recent years have witnessed a growing trend toward employing deep reinforcement learning (Deep-RL) to derive heuristics for combinatorial optimization (CO) problems on graphs. Maximum Coverage Problem (MCP) and its probabilistic variant on social networks, Influence Maximization (IM), have been particularly prominent in this line of research. In this paper, we present a comprehensive benchmark study that thoroughly investigates the effectiveness and efficiency of five recent Deep-RL methods for MCP and IM. These methods were published in top data science venues, namely S2V-DQN, Geometric-QN, GCOMB, RL4IM, and LeNSE. Our findings reveal that, across various scenarios, the Lazy Greedy algorithm consistently outperforms all Deep-RL methods for MCP. In the case of IM, theoretically sound algorithms like IMM and OPIM demonstrate superior performance compared to Deep-RL methods in most scenarios. Notably, we observe an abnormal phenomenon in IM problem where Deep-RL methods slightly outperform IMM and OPIM when the influence spread nearly does not increase as the budget increases. Furthermore, our experimental results highlight common issues when applying Deep-RL methods to MCP and IM in practical settings. Finally, we discuss potential avenues for improving Deep-RL methods. Our benchmark study sheds light on potential challenges in current deep reinforcement learning research for solving combinatorial optimization problems.

A Benchmark Study of Deep-RL Methods for Maximum Coverage Problems over Graphs

TL;DR

The paper benchmarks five Deep-RL methods (S2V-DQN, Geometric-QN, GCOMB, RL4IM, LeNSE) for MCP and IM against strong baselines, revealing that Lazy Greedy and IMM/OPIM typically outperform Deep-RL in MCP and IM respectively. It systematically analyzes training-time considerations, graph-distribution generalization, and data-size effects, highlighting practical limitations of current Deep-RL approaches. The study demonstrates that, despite data-driven ambitions, traditional submodular optimization and RIS-based techniques remain more effective and scalable under the IC model, with notable exceptions in atypical budget regimes. The findings underscore challenges in generalization, efficiency, and baseline adequacy, offering concrete directions for improving Deep-RL methods in graph-based combinatorial optimization.

Abstract

Recent years have witnessed a growing trend toward employing deep reinforcement learning (Deep-RL) to derive heuristics for combinatorial optimization (CO) problems on graphs. Maximum Coverage Problem (MCP) and its probabilistic variant on social networks, Influence Maximization (IM), have been particularly prominent in this line of research. In this paper, we present a comprehensive benchmark study that thoroughly investigates the effectiveness and efficiency of five recent Deep-RL methods for MCP and IM. These methods were published in top data science venues, namely S2V-DQN, Geometric-QN, GCOMB, RL4IM, and LeNSE. Our findings reveal that, across various scenarios, the Lazy Greedy algorithm consistently outperforms all Deep-RL methods for MCP. In the case of IM, theoretically sound algorithms like IMM and OPIM demonstrate superior performance compared to Deep-RL methods in most scenarios. Notably, we observe an abnormal phenomenon in IM problem where Deep-RL methods slightly outperform IMM and OPIM when the influence spread nearly does not increase as the budget increases. Furthermore, our experimental results highlight common issues when applying Deep-RL methods to MCP and IM in practical settings. Finally, we discuss potential avenues for improving Deep-RL methods. Our benchmark study sheds light on potential challenges in current deep reinforcement learning research for solving combinatorial optimization problems.
Paper Structure (30 sections, 17 figures, 9 tables, 1 algorithm)

This paper contains 30 sections, 17 figures, 9 tables, 1 algorithm.

Figures (17)

  • Figure 1: Performance overview of methods on MCP and IM: Values represented as Method (Coverage standard deviation, Runtime standard deviation). Geometry-QN was excluded from (b) due to the scalability issues on most graphs.
  • Figure 2: Benchmarking Framework
  • Figure 3: General pipeline of Deep-RL methods for solving MCP/IM over Graphs
  • Figure 4: MCP: Coverage and Runtime curve
  • Figure 5: IM: influence curve under different weight models
  • ...and 12 more figures