Proposal-free Network for Instance-level Object Segmentation
Xiaodan Liang, Yunchao Wei, Xiaohui Shen, Jianchao Yang, Liang Lin, Shuicheng Yan
TL;DR
PFN addresses the challenging task of instance-level segmentation without region proposals by jointly predicting category-level masks, per-pixel instance location vectors, and per-category instance counts. It introduces multi-scale instance location prediction with coordinate maps and uses spectral clustering to convert pixel-level outputs into instance masks, aided by a category-level protection to ensure consistency. The approach is trained in two stages (category-level followed by instance-level) and demonstrates substantial performance gains on PASCAL VOC 2012, achieving $AP^r$ of $58.7\%$ at 0.5 IoU, far surpassing prior proposal-based methods. This work offers a faster, simpler, and more scalable alternative for accurate instance segmentation with significant practical impact for downstream vision tasks.
Abstract
Instance-level object segmentation is an important yet under-explored task. The few existing studies are almost all based on region proposal methods to extract candidate segments and then utilize object classification to produce final results. Nonetheless, generating accurate region proposals itself is quite challenging. In this work, we propose a Proposal-Free Network (PFN ) to address the instance-level object segmentation problem, which outputs the instance numbers of different categories and the pixel-level information on 1) the coordinates of the instance bounding box each pixel belongs to, and 2) the confidences of different categories for each pixel, based on pixel-to-pixel deep convolutional neural network. All the outputs together, by using any off-the-shelf clustering method for simple post-processing, can naturally generate the ultimate instance-level object segmentation results. The whole PFN can be easily trained in an end-to-end way without the requirement of a proposal generation stage. Extensive evaluations on the challenging PASCAL VOC 2012 semantic segmentation benchmark demonstrate that the proposed PFN solution well beats the state-of-the-arts for instance-level object segmentation. In particular, the $AP^r$ over 20 classes at 0.5 IoU reaches 58.7% by PFN, significantly higher than 43.8% and 46.3% by the state-of-the-art algorithms, SDS [9] and [16], respectively.
