Improved detection of discarded fish species through BoxAL active learning
Maria Sokolova, Pieter M. Blok, Angelo Mencarelli, Arjan Vroegop, Aloysius van Helmond, Gert Kootstra
TL;DR
This paper presents BoxAL, an active-learning framework that injects epistemic uncertainty estimation into Faster R-CNN via Monte-Carlo dropout to efficiently select informative images for annotating discarded fish on trawlers. By decomposing uncertainty into semantic, spatial, and occurrence components and aggregating them into image-level certainty, BoxAL reduces labeling needs while improving mean AP on a challenging, highly variable dataset. The approach achieves a final mAP of $39.0\pm1.6$ with certainty-based sampling, outperforming random sampling ($34.8\pm1.8$) and reaching comparable performance with $700$ labeled images, i.e., $400$ fewer than the random baseline. The work demonstrates practical gains for electronic monitoring of bycatch, offering a scalable, open-source solution to support the transition from landing obligations to documentation obligations in fisheries.
Abstract
In recent years, powerful data-driven deep-learning techniques have been developed and applied for automated catch registration. However, these methods are dependent on the labelled data, which is time-consuming, labour-intensive, expensive to collect and need expert knowledge. In this study, we present an active learning technique, named BoxAL, which includes estimation of epistemic certainty of the Faster R-CNN object-detection model. The method allows selecting the most uncertain training images from an unlabeled pool, which are then used to train the object-detection model. To evaluate the method, we used an open-source image dataset obtained with a dedicated image-acquisition system developed for commercial trawlers targeting demersal species. We demonstrated, that our approach allows reaching the same object-detection performance as with the random sampling using 400 fewer labelled images. Besides, mean AP score was significantly higher at the last training iteration with 1100 training images, specifically, 39.0±1.6 and 34.8±1.8 for certainty-based sampling and random sampling, respectively. Additionally, we showed that epistemic certainty is a suitable method to sample images that the current iteration of the model cannot deal with yet. Our study additionally showed that the sampled new data is more valuable for training than the remaining unlabeled data. Our software is available on https://github.com/pieterblok/boxal.
