Table of Contents
Fetching ...

MH-GIN: Multi-scale Heterogeneous Graph-based Imputation Network for AIS Data (Extended Version)

Hengyu Liu, Tianyi Li, Yuqiang He, Kristian Torp, Yushuai Li, Christian S. Jensen

TL;DR

MH-GIN introduces a novel framework for imputing missing AIS data by explicitly modeling multi-scale dependencies among heterogeneous attributes. It combines type-specific encoders, a hierarchical temporal feature extractor, and a two-stage multi-scale graph propagation that aligns within scales and fuses cross-scale information across attributes. The approach achieves substantial imputation improvements over state-of-the-art methods (average gains around 57%) while maintaining efficiency, demonstrated on two real AIS datasets. This work enables more accurate maritime safety and monitoring applications by recovering high-fidelity spatio-temporal, cyclical, continuous, and discrete attributes across multiple update rates.

Abstract

Location-tracking data from the Automatic Identification System, much of which is publicly available, plays a key role in a range of maritime safety and monitoring applications. However, the data suffers from missing values that hamper downstream applications. Imputing the missing values is challenging because the values of different heterogeneous attributes are updated at diverse rates, resulting in the occurrence of multi-scale dependencies among attributes. Existing imputation methods that assume similar update rates across attributes are unable to capture and exploit such dependencies, limiting their imputation accuracy. We propose MH-GIN, a Multi-scale Heterogeneous Graph-based Imputation Network that aims improve imputation accuracy by capturing multi-scale dependencies. Specifically, MH-GIN first extracts multi-scale temporal features for each attribute while preserving their intrinsic heterogeneous characteristics. Then, it constructs a multi-scale heterogeneous graph to explicitly model dependencies between heterogeneous attributes to enable more accurate imputation of missing values through graph propagation. Experimental results on two real-world datasets find that MH-GIN is capable of an average 57% reduction in imputation errors compared to state-of-the-art methods, while maintaining computational efficiency. The source code and implementation details of MH-GIN are publicly available https://github.com/hyLiu1994/MH-GIN.

MH-GIN: Multi-scale Heterogeneous Graph-based Imputation Network for AIS Data (Extended Version)

TL;DR

MH-GIN introduces a novel framework for imputing missing AIS data by explicitly modeling multi-scale dependencies among heterogeneous attributes. It combines type-specific encoders, a hierarchical temporal feature extractor, and a two-stage multi-scale graph propagation that aligns within scales and fuses cross-scale information across attributes. The approach achieves substantial imputation improvements over state-of-the-art methods (average gains around 57%) while maintaining efficiency, demonstrated on two real AIS datasets. This work enables more accurate maritime safety and monitoring applications by recovering high-fidelity spatio-temporal, cyclical, continuous, and discrete attributes across multiple update rates.

Abstract

Location-tracking data from the Automatic Identification System, much of which is publicly available, plays a key role in a range of maritime safety and monitoring applications. However, the data suffers from missing values that hamper downstream applications. Imputing the missing values is challenging because the values of different heterogeneous attributes are updated at diverse rates, resulting in the occurrence of multi-scale dependencies among attributes. Existing imputation methods that assume similar update rates across attributes are unable to capture and exploit such dependencies, limiting their imputation accuracy. We propose MH-GIN, a Multi-scale Heterogeneous Graph-based Imputation Network that aims improve imputation accuracy by capturing multi-scale dependencies. Specifically, MH-GIN first extracts multi-scale temporal features for each attribute while preserving their intrinsic heterogeneous characteristics. Then, it constructs a multi-scale heterogeneous graph to explicitly model dependencies between heterogeneous attributes to enable more accurate imputation of missing values through graph propagation. Experimental results on two real-world datasets find that MH-GIN is capable of an average 57% reduction in imputation errors compared to state-of-the-art methods, while maintaining computational efficiency. The source code and implementation details of MH-GIN are publicly available https://github.com/hyLiu1994/MH-GIN.

Paper Structure

This paper contains 41 sections, 4 theorems, 43 equations, 4 figures, 10 tables.

Key Result

Lemma 1

The multi-scale propagation mechanism is numerically stable if the spectral radius of each propagation matrix satisfies $\rho(\mathbf{P}) = \mathbf{D}^{-1/2} \hat{\mathbf{A}} \mathbf{D}^{-1/2} \leq 1$.

Figures (4)

  • Figure 1: An example of AIS records from a vessel.
  • Figure 2: An example of missing attribute imputation based on a multi-scale heterogeneous graph.
  • Figure 3: An example of multi-scale heterogeneous graph.
  • Figure 4: Overview of the multi-scale heterogeneous graph imputation network for AIS data.

Theorems & Definitions (17)

  • Example 1
  • Definition 1: AIS Record
  • Definition 2: Vessel-specific AIS Record Sequence
  • Definition 3: Intrinsic Characteristics of AIS Attributes
  • Definition 4: Multiple Time Scales
  • Definition 5: Multi-scale Temporal Features
  • Example 2
  • Definition 6: Multi-scale Heterogeneous Graph
  • Example 3
  • Definition 7: Observation Mask
  • ...and 7 more