SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets
Hao Liu, Pengyu Guo, Siyuan Yang, Zeqing Jiang, Qinglei Hu, Dongyu Li
TL;DR
SpaceSeg tackles the problem of high-precision segmentation of multiple on-orbit spacecraft under diverse deep-space conditions by extending a vision foundation model with four innovations: the Multi-Scale Hierarchical Attention Refinement Decoder (MSHARD), Multi-spacecraft Connected Component Analysis (MS-CCA), Spatial Domain Adaptation Transform (SDAT), and a task-specific loss function. By freezing a hierarchical MAE-based encoder, adapting the SAM2 mask decoder via spatial selective dual-stream fine-tuning, and integrating the novel decoding and labeling modules, SpaceSeg achieves a state-of-the-art $mIoU$ of $89.87\%$ and $mAcc$ of $99.98\%$ on the SpaceES dataset. The SpaceES dataset encompasses 1–4 spacecraft, 17 target types, multi-scale scenes, and realistic space imaging disturbances, enabling robust evaluation across multi-spacecraft and multi-background scenarios. The work demonstrates that vision foundation models can be effectively specialized for space perception tasks, with tangible benefits for space situational awareness, debris monitoring, docking, and autonomous servicing in deep-space missions.
Abstract
With the continuous advancement of human exploration into deep space, intelligent perception and high-precision segmentation technology for on-orbit multi-spacecraft targets have become critical factors for ensuring the success of modern space missions. However, the complex deep space environment, diverse imaging conditions, and high variability in spacecraft morphology pose significant challenges to traditional segmentation methods. This paper proposes SpaceSeg, an innovative vision foundation model-based segmentation framework with four core technical innovations: First, the Multi-Scale Hierarchical Attention Refinement Decoder (MSHARD) achieves high-precision feature decoding through cross-resolution feature fusion via hierarchical attention. Second, the Multi-spacecraft Connected Component Analysis (MS-CCA) effectively resolves topological structure confusion in dense targets. Third, the Spatial Domain Adaptation Transform framework (SDAT) eliminates cross-domain disparities and resist spatial sensor perturbations through composite enhancement strategies. Finally, a custom Multi-Spacecraft Segmentation Task Loss Function is created to significantly improve segmentation robustness in deep space scenarios. To support algorithm validation, we construct the first multi-scale on-orbit multi-spacecraft semantic segmentation dataset SpaceES, which covers four types of spatial backgrounds and 17 typical spacecraft targets. In testing, SpaceSeg achieves state-of-the-art performance with 89.87$\%$ mIoU and 99.98$\%$ mAcc, surpassing existing best methods by 5.71 percentage points. The dataset and code are open-sourced at https://github.com/Akibaru/SpaceSeg to provide critical technical support for next-generation space situational awareness systems.
