HGCN2SP: Hierarchical Graph Convolutional Network for Two-Stage Stochastic Programming
Yang Wu, Yifan Zhang, Zhenxing Liang, Jian Cheng
TL;DR
This work tackles the challenge of solving large-scale two-stage stochastic programming (2SP) by learning to select representative scenarios. It introduces HGCN2SP, a hierarchical graph model that encodes each scenario as a bipartite subgraph and captures cross-scenario relationships with a higher-level graph, processed by a hierarchical GCN and an attention-based decoder. The policy is trained with PPO using solver feedback as a reward, balancing solution quality and solving time. Empirical results on CFLP and NDP show that HGCN2SP achieves high-quality decisions in substantially less time than baselines and generalizes well to larger-scale problems and more scenarios, with ablations confirming the importance of the hierarchical architecture. The approach offers a practical, scalable path to efficient scenario reduction in 2SP, with potential impact on how uncertain, large-scale decision problems are solved in practice.
Abstract
Two-stage Stochastic Programming (2SP) is a standard framework for modeling decision-making problems under uncertainty. While numerous methods exist, solving such problems with many scenarios remains challenging. Selecting representative scenarios is a practical method for accelerating solutions. However, current approaches typically rely on clustering or Monte Carlo sampling, failing to integrate scenario information deeply and overlooking the significant impact of the scenario order on solving time. To address these issues, we develop HGCN2SP, a novel model with a hierarchical graph designed for 2SP problems, encoding each scenario and modeling their relationships hierarchically. The model is trained in a reinforcement learning paradigm to utilize the feedback of the solver. The policy network is equipped with a hierarchical graph convolutional network for feature encoding and an attention-based decoder for scenario selection in proper order. Evaluation of two classic 2SP problems demonstrates that HGCN2SP provides high-quality decisions in a short computational time. Furthermore, HGCN2SP exhibits remarkable generalization capabilities in handling large-scale instances, even with a substantial number of variables or scenarios that were unseen during the training phase.
