A Scalable Lift-and-Project Differentiable Approach For the Maximum Cut Problem
Ismail Alkhouri, Mian Wu, Cunxi Yu, Jia Liu, Rongrong Wang, Alvaro Velasquez
TL;DR
This work tackles scaling MaxCut on large graphs with a training-data-free, differentiable approach. It introduces two continuous relaxations, an unlifted QUCO and a lifted LUCO, and augments them with a dimension-alternating DECO algorithm, degree-based initializations, and an evolutionary hyper-parameter search. Theoretical fixed-point analyses accompany practical GPU-accelerated optimizers, and extensive experiments show competitive cuts against neural and GPU-based sampling baselines, especially on very large graphs where exact or learned methods struggle. The result is a scalable, data-less optimization framework that leverages over-parameterization and parallelism to outperform several baselines while maintaining tractability.
Abstract
We propose a scalable framework for solving the Maximum Cut (MaxCut) problem in large graphs using projected gradient ascent on quadratic objectives. Our approach is differentiable and leverages GPUs for gradient-based optimization. It is not a machine learning method and does not require training data. Starting from a continuous relaxation of the classical quadratic binary formulation, we present a parallelized strategy that explores multiple initialization vectors in batch. We analyze the relaxed objective, showing it is convex and has fixed-points corresponding to local optima, particularly at boundary points, highlighting a key challenge in non-convex optimization. To improve exploration, we introduce a lifted quadratic formulation that over-parameterizes the solution space. We also provide a theoretical characterization of these lifted fixed-points. Finally, we propose DECO, a dimension-alternating algorithm that switches between the unlifted and lifted formulations, combined with importance-based degree initialization and a population-based evolutionary hyper-parameter search. Experiments on diverse graph families show that our methods attain comparable or superior performance relative to recent neural networks and GPU-accelerated sampling approaches.
