A Dataset for Semantic Segmentation in the Presence of Unknowns
Zakaria Laskar, Tomas Vojir, Matej Grcic, Iaroslav Melekhov, Shankar Gangisettye, Juho Kannala, Jiri Matas, Giorgos Tolias, C. V. Jawahar
TL;DR
ISSU addresses the gap in evaluating semantic segmentation under unknowns by providing a real-world, open-set annotated dataset with both closed-set and anomaly labels. It comprises ISSU-Train, ISSU-Test-Static, and ISSU-Test-Temporal collected from Indian roads, enabling in-domain and cross-domain, as well as cross-sensor and temporal evaluations. The paper benchmarks multiple pixel- and mask-level baselines, conducts extensive ablations on anomaly size and lighting, and demonstrates significant generalization gaps under domain shifts and sensor changes. The dataset's static and temporal splits, along with ROI-focused anomaly labeling, facilitate robust evaluation and spur development of domain-general anomaly segmentation methods with practical impact for safety-critical driving.
Abstract
Before deployment in the real-world deep neural networks require thorough evaluation of how they handle both knowns, inputs represented in the training data, and unknowns (anomalies). This is especially important for scene understanding tasks with safety critical applications, such as in autonomous driving. Existing datasets allow evaluation of only knowns or unknowns - but not both, which is required to establish "in the wild" suitability of deep neural network models. To bridge this gap, we propose a novel anomaly segmentation dataset, ISSU, that features a diverse set of anomaly inputs from cluttered real-world environments. The dataset is twice larger than existing anomaly segmentation datasets, and provides a training, validation and test set for controlled in-domain evaluation. The test set consists of a static and temporal part, with the latter comprised of videos. The dataset provides annotations for both closed-set (knowns) and anomalies, enabling closed-set and open-set evaluation. The dataset covers diverse conditions, such as domain and cross-sensor shift, illumination variation and allows ablation of anomaly detection methods with respect to these variations. Evaluation results of current state-of-the-art methods confirm the need for improvements especially in domain-generalization, small and large object segmentation.
