CloudTracks: A Dataset for Localizing Ship Tracks in Satellite Images of Clouds

Muhammad Ahmed Chaudhry; Lyna Kim; Jeremy Irvin; Yuzu Ido; Sonia Chu; Jared Thomas Isobe; Andrew Y. Ng; Duncan Watson-Parris

CloudTracks: A Dataset for Localizing Ship Tracks in Satellite Images of Clouds

Muhammad Ahmed Chaudhry, Lyna Kim, Jeremy Irvin, Yuzu Ido, Sonia Chu, Jared Thomas Isobe, Andrew Y. Ng, Duncan Watson-Parris

TL;DR

The paper tackles the challenge of localizing ship tracks in satellite imagery to study anthropogenic aerosol effects on clouds. It introduces CloudTracks, a dataset of 3,560 MODIS images with over 12,000 ship-track instances, along with semantic and instance segmentation masks derived from careful labeling and preprocessing. Benchmarking shows state-of-the-art localization performance (IoU up to 61.29 with semantic segmentation) and improved instance counting (MAE as low as 1.64) compared to prior work, while also highlighting the remaining difficulties with thin, overlapping tracks. The work aims to spur new ML approaches for elongated, occluded features in geospatial imagery and to support climate research on aerosol–cloud interactions, with the dataset openly released to the community.

Abstract

Clouds play a significant role in global temperature regulation through their effect on planetary albedo. Anthropogenic emissions of aerosols can alter the albedo of clouds, but the extent of this effect, and its consequent impact on temperature change, remains uncertain. Human-induced clouds caused by ship aerosol emissions, commonly referred to as ship tracks, provide visible manifestations of this effect distinct from adjacent cloud regions and therefore serve as a useful sandbox to study human-induced clouds. However, the lack of large-scale ship track data makes it difficult to deduce their general effects on cloud formation. Towards developing automated approaches to localize ship tracks at scale, we present CloudTracks, a dataset containing 3,560 satellite images labeled with more than 12,000 ship track instance annotations. We train semantic segmentation and instance segmentation model baselines on our dataset and find that our best model substantially outperforms previous state-of-the-art for ship track localization (61.29 vs. 48.65 IoU). We also find that the best instance segmentation model is able to identify the number of ship tracks in each image more accurately than the previous state-of-the-art (1.64 vs. 4.99 MAE). However, we identify cases where the best model struggles to accurately localize and count ship tracks, so we believe CloudTracks will stimulate novel machine learning approaches to better detect elongated and overlapping features in satellite images. We release our dataset openly at {zenodo.org/records/10042922}.

CloudTracks: A Dataset for Localizing Ship Tracks in Satellite Images of Clouds

TL;DR

Abstract

Paper Structure (21 sections, 5 figures, 4 tables)

This paper contains 21 sections, 5 figures, 4 tables.

Introduction
Data
MODIS Images
Labeling Methodology
Dataset Preprocessing
Experiments
Semantic Segmentation
Instance Segmentation
Evaluation
Localization
Instance Counting
Results
Localization
Instance Counting
Discussion
...and 6 more sections

Figures (5)

Figure 1: Example images in CloudTracks with ship track instance annotations overlaid. Images in the dataset can contain a single ship track instance (a), intersecting ship track instances (b), densely packed ship track instances (c), and no instances (d).
Figure 2: Example images in the test set showcasing the patterns of improvements of our models over the baseline semantic segmentation model. The images from left to right are: False-colored satellite image, Ground-truth annotation, watson2022shipping prediction, Best Semantic Segmentation prediction, and Best Instance Segmentation prediction. The first row shows more contiguous tracks predicted by the improved models, second row shows an example of increased sensitivity to ship tracks, and third shows less extraneous predictions.
Figure 3: Instance counting performance of the three segmentation models on the CloudTracks test set. The optimal line is shown in teal and best fit line between the model predicted counts and ground truth counts is shown in pink.
Figure 4: Example instance segmentation predictions on the test set. Ground truth ship track polygons are shown in red and ship track predicted masks are shown in green with corresponding predicted bounding box and confidence score in blue. The model often successfully identified and localized instances of long ship tracks (a) and densely packed tracks (b). Common mistakes included difficulties identifying single instances in cases where many ship tracks cross and images with features that mimic ship tracks, leading to false positive predictions.
Figure 5: Example demonstrating the motivation to use buffering when computing relaxed evaluation metrics. Importantly, using the original metrics without buffering, an almost perfect prediction (middle figure) translated by a few pixels achieves a very low IoU score.

CloudTracks: A Dataset for Localizing Ship Tracks in Satellite Images of Clouds

TL;DR

Abstract

CloudTracks: A Dataset for Localizing Ship Tracks in Satellite Images of Clouds

Authors

TL;DR

Abstract

Table of Contents

Figures (5)