From Small to Large: A Graph Convolutional Network Approach for Solving Assortment Optimization Problems
Guokai Li, Pin Gao, Stefanus Jasin, Zizhuo Wang
TL;DR
This paper tackles the NP-hard constrained assortment optimization problem under MMNL by proposing a graph convolutional network (GCN) framework that learns to map problem parameters to likely optimal assortments. A graph representation captures customers, products, and linear constraints, enabling a GCN to output product inclusion probabilities, which drive three inference policies: GI, GILS, and GIP. The approach demonstrates strong generalization from small-scale training to large-scale instances (up to 2,000 products), achieving competitive ratios around 85-96% with milliseconds-to-seconds inference times, and outpacing traditional heuristics. The framework also extends to unknown-choice-model settings via a three-stage pipeline with choice-GCN and solution-GCN, yielding fast, scalable, and high-quality solutions. Overall, this work presents the first learning-to-optimize framework for constrained assortment problems and highlights the practical impact of data-driven, scalable decision-support in revenue management.
Abstract
Assortment optimization seeks to select a subset of substitutable products, subject to constraints, to maximize expected revenue. The problem is NP-hard due to its combinatorial and nonlinear nature and arises frequently in industries such as e-commerce, where platforms must solve thousands of such problems each minute. We propose a graph convolutional network (GCN) framework to efficiently solve constrained assortment optimization problems. Our approach constructs a graph representation of the problem, trains a GCN to learn the mapping from problem parameters to optimal assortments, and develops three inference policies based on the GCN's output. Owing to the GCN's ability to generalize across instance sizes, patterns learned from small-scale samples can be transferred to large-scale problems. Numerical experiments show that a GCN trained on instances with 20 products achieves over 85% of the optimal revenue on problems with up to 2,000 products within seconds, outperforming existing heuristics in both accuracy and efficiency. We further extend the framework to settings with an unknown choice model using transaction data and demonstrate similar performance and scalability.
