Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets
Wolfgang Boettcher, Lukas Hoyer, Ozan Unal, Jan Eric Lenssen, Bernt Schiele
TL;DR
This work tackles the high labeling cost of dense semantic segmentation by introducing Scribbles for All, an automatic scribble generator that converts dense annotations into scribble labels across multiple datasets. It creates four new scribble datasets (s4Pascal, s4Cityscapes, s4KITTI360, s4ADE20K) and demonstrates that state-of-the-art scribble-based methods can achieve competitive performance relative to fully supervised baselines, enabling robust benchmarking in diverse domains. The paper provides a detailed algorithm with design objectives to mimic human scribbles and validates the approach through extensive experiments, including scribble-length ablations. Overall, Scribbles for All broadens the evaluation of scribble-supervised segmentation, supports domain-adaptive research, and encourages using scribble-labeled data to leverage foundation models for practical, low-cost annotation strategies.
Abstract
In this work, we introduce Scribbles for All, a label and training data generation algorithm for semantic segmentation trained on scribble labels. Training or fine-tuning semantic segmentation models with weak supervision has become an important topic recently and was subject to significant advances in model quality. In this setting, scribbles are a promising label type to achieve high quality segmentation results while requiring a much lower annotation effort than usual pixel-wise dense semantic segmentation annotations. The main limitation of scribbles as source for weak supervision is the lack of challenging datasets for scribble segmentation, which hinders the development of novel methods and conclusive evaluations. To overcome this limitation, Scribbles for All provides scribble labels for several popular segmentation datasets and provides an algorithm to automatically generate scribble labels for any dataset with dense annotations, paving the way for new insights and model advancements in the field of weakly supervised segmentation. In addition to providing datasets and algorithm, we evaluate state-of-the-art segmentation models on our datasets and show that models trained with our synthetic labels perform competitively with respect to models trained on manual labels. Thus, our datasets enable state-of-the-art research into methods for scribble-labeled semantic segmentation. The datasets, scribble generation algorithm, and baselines are publicly available at https://github.com/wbkit/Scribbles4All
