AINet+: Advancing Superpixel Segmentation via Cascaded Association Implantation
Yaxiong Wang, Yunchao Wei, Yujiao Wu, Xueming Qian, Li Zhu, Yi Yang
TL;DR
The paper tackles the limitation of conventional CNN-based superpixel methods that rely on restricted receptive fields, hindering explicit modeling of pixel-grid interactions. It introduces Association Implantation (AI), which embeds grid features around each pixel and applies a 3×3 convolution to distill pixel–grid context, enabling progressive refinement through hierarchical association learning and a boundary-perceiving loss to sharpen boundary delineation. The proposed AINet+ architecture combines AI at multiple layers and a boundary-focused objective, achieving state-of-the-art performance on BSDS500, NYUv2, ISIC-2017, and ACDC, while also improving downstream tasks such as object proposal generation and stereo matching. This approach provides a principled, explicit pixel-grid modeling framework for superpixel segmentation with strong cross-domain generalization and practical downstream impact.
Abstract
Superpixel segmentation has seen significant progress benefiting from the deep convolutional networks. The typical approach entails initial division of the image into grids, followed by a learning process that assigns each pixel to adjacent grid segments. However, reliance on convolutions with confined receptive fields results in an implicit, rather than explicit, understanding of pixel-grid interactions. This limitation often leads to a deficit of contextual information during the mapping of associations. To counteract this, we introduce the Association Implantation (AI) module, designed to allow networks to explicitly engage with pixel-grid relationships. This module embeds grid features directly into the vicinity of the central pixel and employs convolutional operations on an enlarged window, facilitating an adaptive transfer of knowledge. This approach enables the network to explicitly extract context at the pixel-grid level, which is more aligned with the objectives of superpixel segmentation than mere pixel-wise interactions. By integrating the AI module across various layers, we enable a progressive refinement of pixel-superpixel relationships from coarse to fine. To further enhance the assignment of boundary pixels, we've engineered a boundary-aware loss function. This function aids in the discrimination of boundary-adjacent pixels at the feature level, thereby empowering subsequent modules to precisely identify boundary pixels and enhance overall boundary accuracy. Our method has been rigorously tested on four benchmarks, including BSDS500, NYUv2, ACDC, and ISIC2017, and our model can achieve competitive performance with comparison methods.
