Table of Contents
Fetching ...

Learning production functions for supply chains with graph neural networks

Serina Chang, Zhiyin Lin, Benjamin Yan, Swapnil Bembde, Qi Xiu, Chi Heem Wong, Yu Qin, Frank Kloster, Alex Luo, Raj Palleti, Jure Leskovec

TL;DR

This work defines temporal production graphs (TPGs) where unobserved production functions inside firms govern the flow from inputs to outputs and external transactions form time-evolving hyperedges. The authors propose a class of models that jointly learn production functions via an inventory module and forecast future transactions by coupling this module with extended temporal GNNs (SC-TGN, SC-GraphMixer), capable of handling hyperedges and predicting edge weights. They introduce a trainable inventory-based attention mechanism, a specialized inventory loss, and a two-stage decoder, enabling both production-function inference (MAP) and transaction forecasting (MRR, RMSE). The approach is validated on real transaction-level data and on SupplySim-generated data that mimics real-world properties and shocks, showing substantial improvements over strong baselines in both production-function learning (up to 50% MAP gains) and edge forecasting (up to 62% improvement). The work advances supply-chain ML and temporal graph learning by providing open-source tooling (SupplySim) and demonstrating practical applicability to demand forecasting, risk detection, and inventory optimization in production networks.

Abstract

The global economy relies on the flow of goods over supply chain networks, with nodes as firms and edges as transactions between firms. While we may observe these external transactions, they are governed by unseen production functions, which determine how firms internally transform the input products they receive into output products that they sell. In this setting, it can be extremely valuable to infer these production functions, to improve supply chain visibility and to forecast future transactions more accurately. However, existing graph neural networks (GNNs) cannot capture these hidden relationships between nodes' inputs and outputs. Here, we introduce a new class of models for this setting by combining temporal GNNs with a novel inventory module, which learns production functions via attention weights and a special loss function. We evaluate our models extensively on real supply chains data and data generated from our new open-source simulator, SupplySim. Our models successfully infer production functions, outperforming the strongest baseline by 6%-50% (across datasets), and forecast future transactions, outperforming the strongest baseline by 11%-62%

Learning production functions for supply chains with graph neural networks

TL;DR

This work defines temporal production graphs (TPGs) where unobserved production functions inside firms govern the flow from inputs to outputs and external transactions form time-evolving hyperedges. The authors propose a class of models that jointly learn production functions via an inventory module and forecast future transactions by coupling this module with extended temporal GNNs (SC-TGN, SC-GraphMixer), capable of handling hyperedges and predicting edge weights. They introduce a trainable inventory-based attention mechanism, a specialized inventory loss, and a two-stage decoder, enabling both production-function inference (MAP) and transaction forecasting (MRR, RMSE). The approach is validated on real transaction-level data and on SupplySim-generated data that mimics real-world properties and shocks, showing substantial improvements over strong baselines in both production-function learning (up to 50% MAP gains) and edge forecasting (up to 62% improvement). The work advances supply-chain ML and temporal graph learning by providing open-source tooling (SupplySim) and demonstrating practical applicability to demand forecasting, risk detection, and inventory optimization in production networks.

Abstract

The global economy relies on the flow of goods over supply chain networks, with nodes as firms and edges as transactions between firms. While we may observe these external transactions, they are governed by unseen production functions, which determine how firms internally transform the input products they receive into output products that they sell. In this setting, it can be extremely valuable to infer these production functions, to improve supply chain visibility and to forecast future transactions more accurately. However, existing graph neural networks (GNNs) cannot capture these hidden relationships between nodes' inputs and outputs. Here, we introduce a new class of models for this setting by combining temporal GNNs with a novel inventory module, which learns production functions via attention weights and a special loss function. We evaluate our models extensively on real supply chains data and data generated from our new open-source simulator, SupplySim. Our models successfully infer production functions, outperforming the strongest baseline by 6%-50% (across datasets), and forecast future transactions, outperforming the strongest baseline by 11%-62%
Paper Structure (63 sections, 20 equations, 7 figures, 8 tables)

This paper contains 63 sections, 20 equations, 7 figures, 8 tables.

Figures (7)

  • Figure 1: (a) Illustration of our problem setting: we observe time-varying transactions between firms and do not observe production functions within firms. Our goals are to learn the production functions and predict future transactions. (b) Example of our model architecture, combining our inventory module with our extended version of TGN, SC-TGN.
  • Figure 2: SupplySim generates data matching real data on key characteristics: (a) power law degree distribution, (b) community structure, (c) low clustering, (d) time-varying transactions, with possible shocks or missing data.
  • Figure 3: True production functions (left) and predictions from inventory module (right), trained on SS-std.
  • Figure 4: Visualizing products in our synthetic datasets. Each point represents the position of one of the 50 products, and points are color-coded by the product's tier. We also denote part-product relations between Tier 1 and Tier 2 products, where an arrow from product $p_1$ to product $p_2$ means that $p_1$ is required to make $p_2$. For each product, we sample its number of parts from {1, 2, 3, 4} uniformly, then assign its parts to the closest products in the previous tier, resulting in commonly co-occurring parts.
  • Figure 5: Comparing inventory module's loss \ref{['eqn:inv-loss']} vs. MAP on ground-truth production functions, trained on SS-std.
  • ...and 2 more figures