DeliverAI: Reinforcement Learning Based Distributed Path-Sharing Network for Food Deliveries
Ashman Mehra, Snehanshu Saha, Vaskar Raychoudhury, Archana Mathur
TL;DR
DeliverAI addresses the last-mile food delivery problem by learning path-sharing policies via multi-agent reinforcement learning on a city-scale Overlay Network. It employs Q-learning with inter-agent communication through Preferred Action Sets to route deliveries across hotspots, balancing distance, fleet size, and delivery time to obtain a Pareto-optimal trade-off. Across Chicago-based simulations, DeliverAI reduces total delivery distance by about 13% and fleet size by about 12%, while achieving roughly 50% higher fleet utilization and maintaining high delivery success rates. The approach enables real-time, scalable decision-making for city-wide on-demand food delivery networks, with potential for larger fleets and more advanced RL methods in future work.
Abstract
Delivery of items from the producer to the consumer has experienced significant growth over the past decade and has been greatly fueled by the recent pandemic. Amazon Fresh, Shopify, UberEats, InstaCart, and DoorDash are rapidly growing and are sharing the same business model of consumer items or food delivery. Existing food delivery methods are sub-optimal because each delivery is individually optimized to go directly from the producer to the consumer via the shortest time path. We observe a significant scope for reducing the costs associated with completing deliveries under the current model. We model our food delivery problem as a multi-objective optimization, where consumer satisfaction and delivery costs, both, need to be optimized. Taking inspiration from the success of ride-sharing in the taxi industry, we propose DeliverAI - a reinforcement learning-based path-sharing algorithm. Unlike previous attempts for path-sharing, DeliverAI can provide real-time, time-efficient decision-making using a Reinforcement learning-enabled agent system. Our novel agent interaction scheme leverages path-sharing among deliveries to reduce the total distance traveled while keeping the delivery completion time under check. We generate and test our methodology vigorously on a simulation setup using real data from the city of Chicago. Our results show that DeliverAI can reduce the delivery fleet size by 12\%, the distance traveled by 13%, and achieve 50% higher fleet utilization compared to the baselines.
