InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains
Yinzhu Quan, Zefang Liu
TL;DR
This work proposes InvAgent, a large language model (LLM)–based zero-shot multi-agent system for inventory management in multi-echelon supply chains. By assigning an LLM-driven agent to each stage and a user proxy to coordinate, InvAgent enables adaptive, explainable decisions through chain-of-thought reasoning and structured prompts without prior training. The framework is evaluated across multiple demand scenarios, showing competitive performance, especially under variability, while highlighting the benefits of prompt design and CoT for transparency and robustness. The findings suggest that LLM-based autonomous agents can reduce inventory costs and stockouts, offering a scalable, explainable alternative to traditional heuristics and reinforcement learning approaches, with avenues for RL fine-tuning and real-world data integration in future work.
Abstract
Supply chain management (SCM) involves coordinating the flow of goods, information, and finances across various entities to deliver products efficiently. Effective inventory management is crucial in today's volatile and uncertain world. Previous research has demonstrated the superiority of heuristic methods and reinforcement learning applications in inventory management. However, the application of large language models (LLMs) as autonomous agents in multi-agent systems for inventory management remains underexplored. This study introduces a novel approach using LLMs to manage multi-agent inventory systems. Leveraging their zero-shot learning capabilities, our model, InvAgent, enhances resilience and improves efficiency across the supply chain network. Our contributions include utilizing LLMs for zero-shot learning to enable adaptive and informed decision-making without prior training, providing explainability and clarity through chain-of-thought, and demonstrating dynamic adaptability to varying demand scenarios while reducing costs and preventing stockouts. Extensive evaluations across different scenarios highlight the efficiency of our model in SCM.
