Patch-Based Deep Unsupervised Image Segmentation using Graph Cuts
Isaac Wasserman, Jeova Farias Sales Rocha Neto
TL;DR
GraPL tackles unsupervised pixel-level image segmentation by learning a patch-level CNN classifier guided by an iterative graph-cut energy. It alternates between patch-label optimization via min-st-cut and gradient updates of the network parameters, yielding a fully convolutional segmenter without labeled data. The method can leverage pretrained patch embeddings (e.g., DINOv2) as affinity cues while relying on graph cuts for regularization, achieving state-of-the-art performance on BSDS500 (mIoU ≈ 0.53, accuracy ≈ 0.57). This single-image, postprocessing-free framework highlights the value of integrating graph-based regularization into deep patch-based segmentation.
Abstract
Unsupervised image segmentation aims at grouping different semantic patterns in an image without the use of human annotation. Similarly, image clustering searches for groupings of images based on their semantic content without supervision. Classically, both problems have captivated researchers as they drew from sound mathematical concepts to produce concrete applications. With the emergence of deep learning, the scientific community turned its attention to complex neural network-based solvers that achieved impressive results in those domains but rarely leveraged the advances made by classical methods. In this work, we propose a patch-based unsupervised image segmentation strategy that bridges advances in unsupervised feature extraction from deep clustering methods with the algorithmic help of classical graph-based methods. We show that a simple convolutional neural network, trained to classify image patches and iteratively regularized using graph cuts, naturally leads to a state-of-the-art fully-convolutional unsupervised pixel-level segmenter. Furthermore, we demonstrate that this is the ideal setting for leveraging the patch-level pairwise features generated by vision transformer models. Our results on real image data demonstrate the effectiveness of our proposed methodology.
