Machine Learning for Combinatorial Optimization: a Methodological Tour d'Horizon
Yoshua Bengio, Andrea Lodi, Antoine Prouvost
TL;DR
This survey addresses how machine learning can aid combinatorial optimization by learning decisions within CO algorithms or by producing end-to-end solutions. It distinguishes demonstration (imitation) and experience (reinforcement learning) as two learning paradigms and discusses policy learning and algorithm configuration. It introduces a unifying learning objective framework across multi-instance distributions, surrogate rewards, and generalization concerns. It argues that hybrid approaches—combining data-driven components with exact optimization guarantees—are the most practical path forward for CO.
Abstract
This paper surveys the recent attempts, both from the machine learning and operations research communities, at leveraging machine learning to solve combinatorial optimization problems. Given the hard nature of these problems, state-of-the-art algorithms rely on handcrafted heuristics for making decisions that are otherwise too expensive to compute or mathematically not well defined. Thus, machine learning looks like a natural candidate to make such decisions in a more principled and optimized way. We advocate for pushing further the integration of machine learning and combinatorial optimization and detail a methodology to do so. A main point of the paper is seeing generic optimization problems as data points and inquiring what is the relevant distribution of problems to use for learning on a given task.
