SegAssess: Panoramic quality mapping for robust and transferable unsupervised segmentation assessment
Bingnan Yang, Mi Zhang, Zhili Zhang, Zhan Zhang, Yuanxin Zhao, Xiangyun Hu, Jianya Gong
TL;DR
SegAssess reframes segmentation quality assessment as Panoramic Quality Mapping (PQM), enabling pixel-level classification into TP, FP, TN, and FN to produce a complete quality map without ground truth. Built on an enhanced HQ-SAM backbone, it introduces an Edge Guided Compaction (EGC) with Aggregated Semantic Filter (ASF) and an Augmented Mixup Sampling (AMS) strategy to boost edge accuracy and cross-domain generalization. The approach achieves state-of-the-art performance on multiple remote sensing SQA benchmarks and demonstrates strong zero-shot transfer to unseen masks from both model- and human-sourced origins, addressing transferability gaps in prior methods. Collectively, PQM and SegAssess offer a robust, transferable framework for unsupervised SQA in large-scale geospatial analysis, with practical implications for quality control across diverse RS datasets and sensor modalities.
Abstract
High-quality image segmentation is fundamental to pixel-level geospatial analysis in remote sensing, necessitating robust segmentation quality assessment (SQA), particularly in unsupervised settings lacking ground truth. Although recent deep learning (DL) based unsupervised SQA methods show potential, they often suffer from coarse evaluation granularity, incomplete assessments, and poor transferability. To overcome these limitations, this paper introduces Panoramic Quality Mapping (PQM) as a new paradigm for comprehensive, pixel-wise SQA, and presents SegAssess, a novel deep learning framework realizing this approach. SegAssess distinctively formulates SQA as a fine-grained, four-class panoramic segmentation task, classifying pixels within a segmentation mask under evaluation into true positive (TP), false positive (FP), true negative (TN), and false negative (FN) categories, thereby generating a complete quality map. Leveraging an enhanced Segment Anything Model (SAM) architecture, SegAssess uniquely employs the input mask as a prompt for effective feature integration via cross-attention. Key innovations include an Edge Guided Compaction (EGC) branch with an Aggregated Semantic Filter (ASF) module to refine predictions near challenging object edges, and an Augmented Mixup Sampling (AMS) training strategy integrating multi-source masks to significantly boost cross-domain robustness and zero-shot transferability. Comprehensive experiments demonstrate that SegAssess achieves state-of-the-art (SOTA) performance and exhibits remarkable zero-shot transferability to unseen masks. The code is available at https://github.com/Yangbn97/SegAssess.
