Hierarchical Multi-Label Classification with Missing Information for Benthic Habitat Imagery
Isaac Xu, Benjamin Misiuk, Scott C. Lowe, Martin Gillis, Craig J. Brown, Thomas Trappenberg
TL;DR
The paper tackles hierarchical multi-label classification of benthic habitat imagery under real-world missing-information conditions. It proposes a two-stage approach: self-supervised pretraining on a large in-domain benthic image corpus (BenthicNet) followed by hierarchical heads evaluation and constrained outputs (via C-HMCNN). Key contributions include establishing methods to train HML models with incomplete annotations and benchmarking in-domain SSL pretraining against ImageNet baselines, particularly benefiting small regional datasets. The findings demonstrate that in-domain SSL pretraining can extend deeper into the annotation hierarchy and improve performance under missing information, with practical implications for automated underwater image annotation and other domains with hierarchical labels.
Abstract
In this work, we apply state-of-the-art self-supervised learning techniques on a large dataset of seafloor imagery, \textit{BenthicNet}, and study their performance for a complex hierarchical multi-label (HML) classification downstream task. In particular, we demonstrate the capacity to conduct HML training in scenarios where there exist multiple levels of missing annotation information, an important scenario for handling heterogeneous real-world data collected by multiple research groups with differing data collection protocols. We find that, when using smaller one-hot image label datasets typical of local or regional scale benthic science projects, models pre-trained with self-supervision on a larger collection of in-domain benthic data outperform models pre-trained on ImageNet. In the HML setting, we find the model can attain a deeper and more precise classification if it is pre-trained with self-supervision on in-domain data. We hope this work can establish a benchmark for future models in the field of automated underwater image annotation tasks and can guide work in other domains with hierarchical annotations of mixed resolution.
