MUSE: Multi-Scale Dense Self-Distillation for Nucleus Detection and Classification
Zijiang Yang, Hanqing Chao, Bokai Zhao, Yelin Yang, Yunshuo Zhang, Dongmei Fu, Junping Zhang, Le Lu, Ke Yan, Dakai Jin, Minfeng Xu, Yun Bian, Hui Jiang
TL;DR
MUSE introduces NuLo, a nucleus-based local self-distillation mechanism, to enable flexible cross-scale self-supervision for nucleus detection and classification. The approach blends a lightweight encoder-decoder ViT backbone with multi-scale patching and a large-field-of-view semi-supervised fine-tuning pipeline, leveraging unlabeled pathology data to learn discriminative nucleus representations. Empirical results on multiple benchmarks show that MUSE outperforms supervised baselines and generic pathology foundation models, demonstrating strong data efficiency and robustness across tissue types and magnifications. The work highlights the importance of task-specific pretraining and cross-scale nucleus-context learning for dense nucleus-level prediction in histopathology.
Abstract
Nucleus detection and classification (NDC) in histopathology analysis is a fundamental task that underpins a wide range of high-level pathology applications. However, existing methods heavily rely on labor-intensive nucleus-level annotations and struggle to fully exploit large-scale unlabeled data for learning discriminative nucleus representations. In this work, we propose MUSE (MUlti-scale denSE self-distillation), a novel self-supervised learning method tailored for NDC. At its core is NuLo (Nucleus-based Local self-distillation), a coordinate-guided mechanism that enables flexible local self-distillation based on predicted nucleus positions. By removing the need for strict spatial alignment between augmented views, NuLo allows critical cross-scale alignment, thus unlocking the capacity of models for fine-grained nucleus-level representation. To support MUSE, we design a simple yet effective encoder-decoder architecture and a large field-of-view semi-supervised fine-tuning strategy that together maximize the value of unlabeled pathology images. Extensive experiments on three widely used benchmarks demonstrate that MUSE effectively addresses the core challenges of histopathological NDC. The resulting models not only surpass state-of-the-art supervised baselines but also outperform generic pathology foundation models.
