Table of Contents
Fetching ...

Occlusion-Aware Seamless Segmentation

Yihong Cao, Jiaming Zhang, Hao Shi, Kunyu Peng, Yuhongxuan Zhang, Hui Zhang, Rainer Stiefelhagen, Kailun Yang

TL;DR

This work introduces a novel task, Occlusion-Aware Seamless Seamless Segmentation (OASS), which simultaneously tackles all three challenges of panoramic semantic segmentation, and proposes the first solution UnmaskFormer, aiming at unmasking the narrow FoV, occlusions, and domain gaps all at once.

Abstract

Panoramic images can broaden the Field of View (FoV), occlusion-aware prediction can deepen the understanding of the scene, and domain adaptation can transfer across viewing domains. In this work, we introduce a novel task, Occlusion-Aware Seamless Segmentation (OASS), which simultaneously tackles all these three challenges. For benchmarking OASS, we establish a new human-annotated dataset for Blending Panoramic Amodal Seamless Segmentation, i.e., BlendPASS. Besides, we propose the first solution UnmaskFormer, aiming at unmasking the narrow FoV, occlusions, and domain gaps all at once. Specifically, UnmaskFormer includes the crucial designs of Unmasking Attention (UA) and Amodal-oriented Mix (AoMix). Our method achieves state-of-the-art performance on the BlendPASS dataset, reaching a remarkable mAPQ of 26.58% and mIoU of 43.66%. On public panoramic semantic segmentation datasets, i.e., SynPASS and DensePASS, our method outperforms previous methods and obtains 45.34% and 48.08% in mIoU, respectively. The fresh BlendPASS dataset and our source code are available at https://github.com/yihong-97/OASS.

Occlusion-Aware Seamless Segmentation

TL;DR

This work introduces a novel task, Occlusion-Aware Seamless Seamless Segmentation (OASS), which simultaneously tackles all three challenges of panoramic semantic segmentation, and proposes the first solution UnmaskFormer, aiming at unmasking the narrow FoV, occlusions, and domain gaps all at once.

Abstract

Panoramic images can broaden the Field of View (FoV), occlusion-aware prediction can deepen the understanding of the scene, and domain adaptation can transfer across viewing domains. In this work, we introduce a novel task, Occlusion-Aware Seamless Segmentation (OASS), which simultaneously tackles all these three challenges. For benchmarking OASS, we establish a new human-annotated dataset for Blending Panoramic Amodal Seamless Segmentation, i.e., BlendPASS. Besides, we propose the first solution UnmaskFormer, aiming at unmasking the narrow FoV, occlusions, and domain gaps all at once. Specifically, UnmaskFormer includes the crucial designs of Unmasking Attention (UA) and Amodal-oriented Mix (AoMix). Our method achieves state-of-the-art performance on the BlendPASS dataset, reaching a remarkable mAPQ of 26.58% and mIoU of 43.66%. On public panoramic semantic segmentation datasets, i.e., SynPASS and DensePASS, our method outperforms previous methods and obtains 45.34% and 48.08% in mIoU, respectively. The fresh BlendPASS dataset and our source code are available at https://github.com/yihong-97/OASS.
Paper Structure (5 sections, 1 equation, 4 figures)

This paper contains 5 sections, 1 equation, 4 figures.

Figures (4)

  • Figure S1: The process of modeling amodal-oriented masked source images.
  • Figure S2: Visualization results of Panoptic Segmentation. From top to bottom are (a) Image, (b) Ground truth, (c) DATR zheng2023look_neighbor, (d) Trans4PASS zhang2022bending, (e) UniDAPS zhang2022UniDAPS, (f) EDAPS saha2023edaps, (g) Source-Only, and (h) UnmaskFormer (Ours).
  • Figure S3: Visualization results of Semantic Segmentation. From top to bottom are (a) Image, (b) Ground truth, (c) DATR zheng2023look_neighbor, (d) Trans4PASS zhang2022bending, (e) UniDAPS zhang2022UniDAPS, (f) EDAPS saha2023edaps, (g) Source-Only, and (h) UnmaskFormer (Ours).
  • Figure S4: More visualization results of SynPASS. From top to bottom are: Image, the prediction of Trans4PASS zhang2022bending, the prediction of our UnmaskFormer, and Ground truth.