Reconstructing Large Scale Production Networks
Ashwin Bhattathiripad, Vipin P Veetil
TL;DR
The paper addresses the challenge of reconstructing large-scale, granular production networks from publicly available sectoral flows and firm-size distributions. It introduces a four-step framework: a sector-aware augmented logistic gravity model to generate link probabilities, a Bernoulli backbone to form directed unweighted graphs, a Markov-regularity closure to ensure irreducibility and aperiodicity, and a minimum-energy weighting to produce a stable, weighted network that preserves firm and sector margins. The authors apply the method to the full US economy, reconstructing a network with over 5 million firms and 100 million-plus links, and show that the resulting topology aligns with known properties of real networks and is robust to parameter variation via bootstrap analysis. They also extend the approach to networks with multiple production units (factories) and discuss parallelization and binning strategies that render the method scalable to truly large systems. Overall, the work provides a practical, open-source pipeline for simulating shocks and dynamics on large-scale, granular production networks, enabling policy-relevant analyses that were previously infeasible due to data limitations.
Abstract
This paper develops an algorithm to reconstruct large weighted firm-to-firm networks using information about the size of the firms and sectoral input-output flows. Our algorithm is based on a four-step procedure. We first generate a matrix of probabilities of connections between all firms in the economy using an augmented gravity model embedded in a logistic function that takes firm size as mass. The model is parameterized to allow for the probability of a link between two firms to depend not only on their sizes but also on flows across the sectors to which they belong. We then use a Bernoulli draw to construct a directed but unweighted random graph from the probability distribution generated by the logistic-gravity function. We make the graph aperiodic by adding self-loops and irreducible by adding links between Strongly Connected Components while limiting distortions to sectoral flows. We convert the unweighted network to a weighted network by solving a convex quadratic programming problem that minimizes the Euclidean norm of the weights. The solution preserves the observed firm sizes and sectoral flows within reasonable bounds, while limiting the strength of the self-loops. Computationally, the algorithm is O(N2) in the worst case, but it can be evaluated in O(N) via sector-wise binning of firm sizes, albeit with an approximation error. We implement the algorithm to reconstruct the full US production network with more than 5 million firms and 100 million buyer-seller connections. The reconstructed network exhibits topological properties consistent with small samples of the real US buyer-seller networks, including fat-tails in degree distribution, mild clustering, and near-zero reciprocity. We provide open-source code of the algorithm to enable researchers to reconstruct large-scale granular production networks from publicly available data.
