PASTA: A Unified Framework for Offline Assortment Learning
Juncheng Dong, Weibin Mo, Zhengling Qi, Cong Shi, Ethan X. Fang, Vahid Tarokh
TL;DR
PASTA addresses offline assortment optimization with an unknown choice model and data with incomplete coverage. It introduces a pessimistic framework that constructs a likelihood-ratio–based uncertainty set and solves a max–min revenue problem to robustly identify an optimal assortment. The authors prove finite-sample regret bounds for MNL and NL families and establish a minimax lower bound under MNL, confirming the method's optimality in sample and model complexity. Empirical results on simulated data demonstrate substantial improvements over standard as-if optimization, especially when the optimal assortment is sparsely represented in the offline data.
Abstract
We study a broad class of assortment optimization problems in an offline and data-driven setting. In such problems, a firm lacks prior knowledge of the underlying choice model, and aims to determine an optimal assortment based on historical customer choice data. The combinatorial nature of assortment optimization often results in insufficient data coverage, posing a significant challenge in designing provably effective solutions. To address this, we introduce a novel Pessimistic Assortment Optimization (PASTA) framework that leverages the principle of pessimism to achieve optimal expected revenue under general choice models. Notably, PASTA requires only that the offline data distribution contains an optimal assortment, rather than providing the full coverage of all feasible assortments. Theoretically, we establish the first finite-sample regret bounds for offline assortment optimization across several widely used choice models, including the multinomial logit and nested logit models. Additionally, we derive a minimax regret lower bound, proving that PASTA is minimax optimal in terms of sample and model complexity. Numerical experiments further demonstrate that our method outperforms existing baseline approaches.
