Optimizing Agricultural Order Fulfillment Systems: A Hybrid Tree Search Approach
Pranay Thangeda, Hoda Helmi, Melkior Ornik
TL;DR
The paper tackles seed order fulfillment under stochastic arrivals and tight deadlines by modeling the wave scheduling as a Markov Decision Process and solving it with an Adaptive Hybrid Tree Search that augments Monte Carlo Tree Search with problem-specific action-space reduction. It leverages historical data to build time-varying arrival models and uses a dual-criteria candidate set and a dynamic peak-reducing factor to prune actions, enabling long-horizon planning in a large, uncertain domain. Empirical results from a high-fidelity warehouse simulator show that the proposed method significantly outperforms greedy, rolling-horizon, and vanilla MCTS baselines in on-time fulfillment and average delays, across multiple sensitivity analyses. The work demonstrates the practical viability of online, domain-informed planning for complex agricultural fulfillment systems and suggests avenues for real-time adaptation and robustness enhancements.
Abstract
Efficient order fulfillment is vital in the agricultural industry, particularly due to the seasonal nature of seed supply chains. This paper addresses the challenge of optimizing seed orders fulfillment in a centralized warehouse where orders are processed in waves, taking into account the unpredictable arrival of seed stocks and strict order deadlines. We model the wave scheduling problem as a Markov decision process and propose an adaptive hybrid tree search algorithm that combines Monte Carlo tree search with domain-specific knowledge to efficiently navigate the complex, dynamic environment of seed distribution. By leveraging historical data and stochastic modeling, our method enables forecast-informed scheduling decisions that balance immediate requirements with long-term operational efficiency. The key idea is that we can augment Monte Carlo tree search algorithm with problem-specific side information that dynamically reduces the number of candidate actions at each decision step to handle the large state and action spaces that render traditional solution methods computationally intractable. Extensive simulations with realistic parameters, including a diverse range of products, a high volume of orders, and authentic seasonal durations, demonstrate that the proposed approach significantly outperforms existing industry standard methods.
