FISBe: A real-world benchmark dataset for instance segmentation of long-range thin filamentous structures
Lisa Mais, Peter Hirsch, Claire Managan, Ramya Kandarpa, Josef Lorenz Rumberger, Annika Reinke, Lena Maier-Hein, Gudrun Ihrke, Dagmar Kainmueller
TL;DR
FISBe introduces a real-world 3D light microscopy benchmark for instance segmentation of long-range, thin filamentous neurons, addressing a gap where existing datasets rely on synthetic data. It provides 101 MCFO brain images with pixel‑wise neuron masks and a tailored evaluation suite combining centerline-based avF1 and centerline coverage into a composite score $S = 0.5 \times avF1 + 0.5 \times C$, plus FS and FM for overlap errors. The authors benchmark three baselines (PatchPerPix, FFN, and color clustering) and show that current methods struggle with long-range dependencies and overlaps, highlighting the need for new approaches. By releasing the data, metrics, and baselines, the work aims to spur advances in long-range data modeling and to enable downstream neuroscience analyses, while acknowledging biases toward sparser samples and computational demands. The dataset is hosted with CC BY 4.0 licensing, enabling community use and extension, and future work includes self-supervised pretraining and novel long-range models.
Abstract
Instance segmentation of neurons in volumetric light microscopy images of nervous systems enables groundbreaking research in neuroscience by facilitating joint functional and morphological analyses of neural circuits at cellular resolution. Yet said multi-neuron light microscopy data exhibits extremely challenging properties for the task of instance segmentation: Individual neurons have long-ranging, thin filamentous and widely branching morphologies, multiple neurons are tightly inter-weaved, and partial volume effects, uneven illumination and noise inherent to light microscopy severely impede local disentangling as well as long-range tracing of individual neurons. These properties reflect a current key challenge in machine learning research, namely to effectively capture long-range dependencies in the data. While respective methodological research is buzzing, to date methods are typically benchmarked on synthetic datasets. To address this gap, we release the FlyLight Instance Segmentation Benchmark (FISBe) dataset, the first publicly available multi-neuron light microscopy dataset with pixel-wise annotations. In addition, we define a set of instance segmentation metrics for benchmarking that we designed to be meaningful with regard to downstream analyses. Lastly, we provide three baselines to kick off a competition that we envision to both advance the field of machine learning regarding methodology for capturing long-range data dependencies, and facilitate scientific discovery in basic neuroscience.
