Adaptive Stabilization Based on Machine Learning for Column Generation
Yunzhuang Shen, Yuan Sun, Xiaodong Li, Zhiguang Cao, Andrew Eberhard, Guangquan Zhang
TL;DR
This paper addresses slow convergence in column generation due to dual oscillations by introducing ASCG-ML, which combines machine-learned predictions of the optimal dual solution with an adaptive stabilization penalty. The method defines a generalized dual that penalizes deviations from ML predictions and dynamically tunes the penalty via the Lagrangian gap to balance guidance and convergence. Empirical results on graph coloring show substantial improvements in convergence speed and solution times, with ML models (FFNN and GCN) offering complementary strengths across instance families. The approach demonstrates robust generalization and suggests a path to integrating ML more deeply into CG across problems beyond graph coloring, aided by exact and heuristic pricing variants. The work provides both theoretical convergence guarantees and practical evidence of accelerated CG performance.
Abstract
Column generation (CG) is a well-established method for solving large-scale linear programs. It involves iteratively optimizing a subproblem containing a subset of columns and using its dual solution to generate new columns with negative reduced costs. This process continues until the dual values converge to the optimal dual solution to the original problem. A natural phenomenon in CG is the heavy oscillation of the dual values during iterations, which can lead to a substantial slowdown in the convergence rate. Stabilization techniques are devised to accelerate the convergence of dual values by using information beyond the state of the current subproblem. However, there remains a significant gap in obtaining more accurate dual values at an earlier stage. To further narrow this gap, this paper introduces a novel approach consisting of 1) a machine learning approach for accurate prediction of optimal dual solutions and 2) an adaptive stabilization technique that effectively capitalizes on accurate predictions. On the graph coloring problem, we show that our method achieves a significantly improved convergence rate compared to traditional methods.
