Whale Detection Enhancement through Synthetic Satellite Images
Akshaj Gaur, Cheng Liu, Xiaomin Lin, Nare Karapetyan, Yiannis Aloimonos
TL;DR
This work tackles the data scarcity barrier in whale detection within aerial and satellite imagery by introducing SeaDroneSim2, a Blender-based simulator that generates photorealistic maritime scenes with ground-truth segmentation masks. It offers a configurable pipeline to render varying water properties, lighting, turbidity, and camera parameters, producing datasets that can augment limited real data. Empirical results show that combining synthetic data with a small amount of real data significantly improves detection/segmentation performance (up to around 15% relative gains in $DR$ metrics) over using real data alone, while also highlighting some limits of sim-to-real transfer when real data is abundant. The authors open-source both the SeaDroneSim2 tool and the generated dataset, enabling broader adoption and extension to additional marine objects and under challenging conditions such as ice cover.
Abstract
With a number of marine populations in rapid decline, collecting and analyzing data about marine populations has become increasingly important to develop effective conservation policies for a wide range of marine animals, including whales. Modern computer vision algorithms allow us to detect whales in images in a wide range of domains, further speeding up and enhancing the monitoring process. However, these algorithms heavily rely on large training datasets, which are challenging and time-consuming to collect particularly in marine or aquatic environments. Recent advances in AI however have made it possible to synthetically create datasets for training machine learning algorithms, thus enabling new solutions that were not possible before. In this work, we present a solution - SeaDroneSim2 benchmark suite, which addresses this challenge by generating aerial, and satellite synthetic image datasets to improve the detection of whales and reduce the effort required for training data collection. We show that we can achieve a 15% performance boost on whale detection compared to using the real data alone for training, by augmenting a 10% real data. We open source both the code of the simulation platform SeaDroneSim2 and the dataset generated through it.
