A Reduction-Driven Local Search for the Generalized Independent Set Problem
Yiping Liu, Yi Zhou, Zhenxiang Xu, Mingyu Xiao, Jin-Kao Hao
TL;DR
The paper tackles Generalized Independent Set (GIS), an extension of MIS with vertex profits and edge penalties, formulated on $G=(V,E_p\cup E_r)$ and objective $\alpha(G)=\max_S nb(S)$. It introduces 14 data-reduction rules with optimality guarantees and a reduction-driven local search (RLS) that tightly integrates reductions into preprocessing, initialization, and neighborhood search. Empirical evaluation on 278 graphs shows that RLS delivers superior solution quality and scalability, solving instances with up to $2.6\times 10^8$ edges where prior methods fail, and the data reductions are a key factor in achieving this performance. The approach demonstrates the practicality of kernelization-style reductions for GIS and highlights directions for further enhancement and application to dynamic GIS settings.
Abstract
The Generalized Independent Set (GIS) problem extends the classical maximum independent set problem by incorporating profits for vertices and penalties for edges. This generalized problem has been identified in diverse applications in fields such as forest harvest planning, competitive facility location, social network analysis, and even machine learning. However, solving the GIS problem in large-scale, real-world networks remains computationally challenging. In this paper, we explore data reduction techniques to address this challenge. We first propose 14 reduction rules that can reduce the input graph with rigorous optimality guarantees. We then present a reduction-driven local search (RLS) algorithm that integrates these reduction rules into the pre-processing, the initial solution generation, and the local search components in a computationally efficient way. The RLS is empirically evaluated on 278 graphs arising from different application scenarios. The results indicates that the RLS is highly competitive -- For most graphs, it achieves significantly superior solutions compared to other known solvers, and it effectively provides solutions for graphs exceeding 260 million edges, a task at which every other known method fails. Analysis also reveals that the data reduction plays a key role in achieving such a competitive performance.
