Reheated Gradient-based Discrete Sampling for Combinatorial Optimization
Muheng Li, Ruqi Zhang
TL;DR
This paper identifies wandering in contours as a fundamental inefficiency in gradient-based discrete samplers for combinatorial optimization. It introduces ReSCO, a reheating mechanism that resets the SA temperature to a theoretically informed critical temperature based on specific heat, enabling better exploration after wandering begins. The authors provide extensive experiments across MIS, MC, MaxCut, and Graph Balanced Partition showing that ReSCO improves solution quality with minimal overhead and broad applicability to gradient-based samplers. The work offers a practical, theoretically motivated approach to balance exploration and exploitation in discrete gradient-based CO solvers, with strong empirical gains and directions for future theoretical analysis.
Abstract
Recently, gradient-based discrete sampling has emerged as a highly efficient, general-purpose solver for various combinatorial optimization (CO) problems, achieving performance comparable to or surpassing the popular data-driven approaches. However, we identify a critical issue in these methods, which we term ''wandering in contours''. This behavior refers to sampling new different solutions that share very similar objective values for a long time, leading to computational inefficiency and suboptimal exploration of potential solutions. In this paper, we introduce a novel reheating mechanism inspired by the concept of critical temperature and specific heat in physics, aimed at overcoming this limitation. Empirically, our method demonstrates superiority over existing sampling-based and data-driven algorithms across a diverse array of CO problems.
