Clicks2Line: Using Lines for Interactive Image Segmentation
Chaewon Lee, Chang-Su Kim
TL;DR
The paper addresses interactive image segmentation by reducing user clicks for elongated objects through an adaptive input strategy that can switch between clicks and lines. It introduces Clicks2Line, which uses a line-generation pipeline with a line candidate map $\\boldsymbol{\mathcal{L}}$, a target map $\\mathcal{T}$, and a weight map $\\mathcal{W}$ to select an optimal line via $i^{*}=\\arg \\max_{i} (\\mathcal{L} \\times \\mathcal{W})^{i}$ and then converts the line intersection with $\\mathcal{T}$ into two input clicks. The method is validated on GrabCut and Berkeley datasets with a ViT-B backbone and NoC metrics, showing competitive or superior performance in most scenarios, especially for elongated shapes, while qualitatively reducing the number of required user interactions. This work demonstrates that line-guided guidance can significantly lower user effort in interactive segmentation and suggests broader applicability of line-based hints for long objects.
Abstract
For click-based interactive segmentation methods, reducing the number of clicks required to obtain a desired segmentation result is essential. Although recent click-based methods yield decent segmentation results, we observe that substantial amount of clicks are required to segment elongated regions. To reduce the amount of user-effort required, we propose using lines instead of clicks for such cases. In this paper, an interactive segmentation algorithm which adaptively adopts either clicks or lines as input is proposed. Experimental results demonstrate that using lines can generate better segmentation results than clicks for several cases.
