Dynamic Basis Function Generation for Network Revenue Management
Daniel Adelman, Christiane Barz, Alba V. Olivares-Nadal
TL;DR
This work develops an iterative, basis-function–driven ADP framework for Network Revenue Management, introducing the Nonlinear Incremental Algorithm (NLIAlg) that incrementally adds and optimizes nonlinear basis functions to approximate the value function. To tackle large instances, it proposes a Two-Phase Incremental Algorithm (2PIAlg) that first uses flow-balance principles to estimate fixed basis-function parameters (Phase I) and then solves a linearized master problem to optimize remaining parameters (Phase II), with a heuristic variant (H-2PIAlg) enabling add-on mode on very large problems. A key theoretical insight extends flow-balance concepts to a stochastic NRM setting, guiding new basis-function generation via imbalance maximization and contraction of the nonlinear search space through norm-constrained exponential ridge bases. Computational results show that H-2PIAlg substantially improves policies and upper bounds over benchmarks like AA, SPLA, and NSEP, particularly when capacity is scarce or memory limits hinder alternative methods. The framework also offers a flexible stand-alone or add-on workflow, providing scalable, progressively better value-function approximations for high-dimensional NRM problems.
Abstract
This paper introduces an algorithm that dynamically generates basis functions to approximate the value function in Network Revenue Management. Unlike existing algorithms sampling the parameters of new basis functions, this Nonlinear Incremental Algorithm (NLIAlg) iteratively refines the value function approximation by optimizing these parameters. For larger instances, the Two-Phase Incremental Algorithm (2PIAlg) modifies NLIAlg to leverage the efficiency of LP solvers. It reduces the size of a large-dimensional nonlinear problem and transforms it into an LP by fixing the basis function parameters, which are then optimized in a second phase using the flow imbalance ideas from Adelman and Klabjan (2012). This marks the first application of these techniques in a stochastic setting. The algorithms can operate in two modes: (1) Standalone mode, to construct a value function approximation from scratch, and (2) Add-on mode, to refine an existing approximation. Our numerical experiments indicate that while NLIAlg and 2PIAlg in standalone mode are only feasible for small-scale problems, the heuristic version of 2PIAlg (H-2PIAlg) in add-on mode, using the Affine Approximation and exponential ridge basis functions, can handle extremely large instances that may cause benchmark network revenue management methods to run out of memory. In these scenarios, H-2PIAlg delivers substantially better policies and upper bounds than the Affine Approximation. Furthermore, H-2PIAlg achieves higher average revenues in policy simulations compared to network revenue management benchmarks in instances with limited capacity.
