Adaptive Multi-Scale Integration Unlocks Robust Cell Annotation in Histopathology Images
Yinuo Xu, Yan Cui, Mingyao Li, Zhi Huang
TL;DR
NuClass addresses the gap in per-cell phenotyping in histopathology by combining nucleus-focused morphology with tissue-context signals through a gated, multi-scale framework. It leverages a marker-guided Xenium dataset spanning 8 organs and 16 classes to train two specialized paths—Path local for nuclear features and Path global for contextual cues—and a per-cell gate that fuses their predictions in probability space. The approach yields strong, interpretable performance across three held-out cohorts, achieving up to 96% F1 on its best class and producing calibrated confidence estimates and Grad-CAM explanations. This work demonstrates that selective, uncertainty-aware fusion of multi-scale information can bridge slide-level foundation models and reliable cell-level phenotype prediction, with practical implications for scalable, cross-organ cellular analysis in pathology.
Abstract
Identifying cell types and subtypes in routine histopathology is fundamental for understanding disease. Existing tile-based models capture nuclear detail but miss the broader tissue context that influences cell identity. Current human annotations are coarse-grained and uneven across studies, making fine-grained, subtype-level classification difficult. In this study, we build a marker-guided dataset from Xenium spatial transcriptomics with single-cell resolution labels for more than two million cells across eight organs and 16 classes to address the lack of high-quality annotations. Leveraging this data resource, we introduce NuClass, a pathologist workflow inspired framework for cell-wise multi-scale integration of nuclear morphology and microenvironmental context. It combines Path local, which focuses on nuclear morphology from 224x224 pixel crops, and Path global, which models the surrounding 1024x1024 pixel neighborhood, through a learnable gating module that balances local and global information. An uncertainty-guided objective directs the global path to prioritize regions where the local path is uncertain, and we provide calibrated confidence estimates and Grad-CAM maps for interpretability. Evaluated on three fully held-out cohorts, NuClass achieves up to 96 percent F1 for its best-performing class, outperforming strong baselines. Our results demonstrate that multi-scale, uncertainty-aware fusion can bridge the gap between slide-level pathological foundation models and reliable, cell-level phenotype prediction.
