Synthesize Boundaries: A Boundary-aware Self-consistent Framework for Weakly Supervised Salient Object Detection
Binwei Xu, Haoran Liang, Ronghua Liang, Peng Chen
TL;DR
This work tackles boundary-accurate salient object detection under scribble-based weak supervision by introducing boundary supervision through synthetic concave regions inserted into scribble-labeled objects. A self-consistent framework with a Global Integral Branch (GIB) and a Boundary-Aware Branch (BAB) leverages real and synthetic images to jointly learn complete object saliency and sharp boundaries, reinforced by a carefully designed loss that fuses saliency structure, local coherence, and cross-branch consistency. Empirical results on five benchmarks demonstrate clear gains over state-of-the-art scribble-based methods and competitive performance against fully supervised approaches, validating the effectiveness of boundary-oriented synthetic data and the self-consistent training strategy. The approach offers a practical path to improving boundary quality in weakly supervised SOD without requiring extra data, with potential extensions to learning-based synthetic generation and broader weakly supervised tasks.
Abstract
Fully supervised salient object detection (SOD) has made considerable progress based on expensive and time-consuming data with pixel-wise annotations. Recently, to relieve the labeling burden while maintaining performance, some scribble-based SOD methods have been proposed. However, learning precise boundary details from scribble annotations that lack edge information is still difficult. In this paper, we propose to learn precise boundaries from our designed synthetic images and labels without introducing any extra auxiliary data. The synthetic image creates boundary information by inserting synthetic concave regions that simulate the real concave regions of salient objects. Furthermore, we propose a novel self-consistent framework that consists of a global integral branch (GIB) and a boundary-aware branch (BAB) to train a saliency detector. GIB aims to identify integral salient objects, whose input is the original image. BAB aims to help predict accurate boundaries, whose input is the synthetic image. These two branches are connected through a self-consistent loss to guide the saliency detector to predict precise boundaries while identifying salient objects. Experimental results on five benchmarks demonstrate that our method outperforms the state-of-the-art weakly supervised SOD methods and further narrows the gap with the fully supervised methods.
