MSD-LLM: Predicting Ship Detention in Port State Control Inspections with Large Language Model

Jiongchao Jin; Xiuju Fu; Xiaowei Gao; Tao Cheng; Ran Yan

MSD-LLM: Predicting Ship Detention in Port State Control Inspections with Large Language Model

Jiongchao Jin, Xiuju Fu, Xiaowei Gao, Tao Cheng, Ran Yan

TL;DR

MSD-LLM tackles ship detention prediction in Port State Control data by fusing a Dual Subspace Recovery Auto-Encoder (DSRAE) with a multimodal Large Language Model (M-LLM). The DSRAE extracts robust PSC representations through a dual-branch reconstruction framework and a margin-based separation loss, while progressive learning mitigates extreme data imbalance. The LLM is trained via a grouping and ranking strategy, blending DSRAE features with prompts to yield detention probability scores and flexible thresholds for decision-making. Evaluated on extensive Tokyo MoU PSC records, MSD-LLM delivers substantial gains in AUC and related metrics over strong baselines, demonstrating practical robustness to real-world data shifts and paving the way for integrated maritime risk prediction using domain-aware LLMs.

Abstract

Maritime transportation is the backbone of global trade, making ship inspection essential for ensuring maritime safety and environmental protection. Port State Control (PSC), conducted by national ports, enforces compliance with safety regulations, with ship detention being the most severe consequence, impacting both ship schedules and company reputations. Traditional machine learning methods for ship detention prediction are limited by the capacity of representation learning and thus suffer from low accuracy. Meanwhile, autoencoder-based deep learning approaches face challenges due to the severe data imbalance in learning historical PSC detention records. To address these limitations, we propose Maritime Ship Detention with Large Language Models (MSD-LLM), integrating a dual robust subspace recovery (DSR) layer-based autoencoder with a progressive learning pipeline to handle imbalanced data and extract meaningful PSC representations. Then, a large language model groups and ranks features to identify likely detention cases, enabling dynamic thresholding for flexible detention predictions. Extensive evaluations on 31,707 PSC inspection records from the Asia-Pacific region show that MSD-LLM outperforms state-of-the-art methods more than 12\% on Area Under the Curve (AUC) for Singapore ports. Additionally, it demonstrates robustness to real-world challenges, making it adaptable to diverse maritime risk assessment scenarios.

MSD-LLM: Predicting Ship Detention in Port State Control Inspections with Large Language Model

TL;DR

Abstract

MSD-LLM: Predicting Ship Detention in Port State Control Inspections with Large Language Model

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)