Tree-based Semantic Losses: Application to Sparsely-supervised Large Multi-class Hyperspectral Segmentation
Junwen Wang, Oscar Maccormac, William Rochford, Aaron Kujawa, Jonathan Shapey, Tom Vercauteren
TL;DR
This work tackles sparse, multi-class segmentation in hyperspectral surgical imaging by exploiting a predefined label hierarchy through two semantically informed losses. It introduces a Wasserstein-based loss and a tree-based semantic cross-entropy loss that leverage hierarchical label structure, and integrates them with an OOD-aware, sparse-annotation training framework. On a 107-class dataset gathered from neurosurgical procedures, the proposed losses achieve state-of-the-art performance for both top-level and leaf-node segmentation, while enabling pixel-level OOD detection without harming in-distribution accuracy. Across extensive ablations, the authors show that top-level hierarchical weighting substantially influences performance and that the methods yield more semantically coherent distinctions between normal and abnormal tissues, with clear qualitative and quantitative gains.
Abstract
Hyperspectral imaging (HSI) shows great promise for surgical applications, offering detailed insights into biological tissue differences beyond what the naked eye can perceive. Refined labelling efforts are underway to train vision systems to distinguish large numbers of subtly varying classes. However, commonly used learning methods for biomedical segmentation tasks penalise all errors equivalently and thus fail to exploit any inter-class semantics in the label space. In this work, we introduce two tree-based semantic loss functions which take advantage of a hierarchical organisation of the labels. We further incorporate our losses in a recently proposed approach for training with sparse, background-free annotations. Extensive experiments demonstrate that our proposed method reaches state-of-the-art performance on a sparsely annotated HSI dataset comprising $107$ classes organised in a clinically-defined semantic tree structure. Furthermore, our method enables effective detection of out-of-distribution (OOD) pixels without compromising segmentation performance on in-distribution (ID) pixels.
