ImPORTance: Machine Learning-Driven Analysis of Global Port Significance and Network Dynamics for Improved Operational Efficiency
Emanuele Carlini, Domenico Di Gangi, Vinicius Monteiro de Lira, Hanna Kavalionak, Amilcar Soares, Gabriel Spadon
TL;DR
This work tackles the question of what makes certain ports globally central by building a Ports Network from three years of AIS data and predicting port centrality from World Port Index features using a Random Forest classifier. Centrality is defined as $A(p) = \frac{\sum_{c \in \mathscr{C}} z(c,p)}{|\,\mathscr{C}|}$, aggregating six measures including in/out degree, PageRank variants, betweenness, and closeness, with SHAP and SAGE used for local and global interpretability. The study finds that cargo depth and longitude are among the strongest predictors of centrality, with harbor size also contributing, and demonstrates robust predictive performance (AUC up to ~0.88) to identify central ports. These findings support data-driven planning for port development and resource allocation and offer a framework to extend analysis to other vessel modalities and regions.
Abstract
Seaports play a crucial role in the global economy, and researchers have sought to understand their significance through various studies. In this paper, we aim to explore the common characteristics shared by important ports by analyzing the network of connections formed by vessel movement among them. To accomplish this task, we adopt a bottom-up network construction approach that combines three years' worth of AIS (Automatic Identification System) data from around the world, constructing a Ports Network that represents the connections between different ports. Through this representation, we utilize machine learning to assess the relative significance of various port features. Our model examined such features and revealed that geographical characteristics and the port's depth are indicators of a port's importance to the Ports Network. Accordingly, this study employs a data-driven approach and utilizes machine learning to provide a comprehensive understanding of the factors contributing to the extent of ports. Our work aims to inform decision-making processes related to port development, resource allocation, and infrastructure planning within the industry.
