Iteratively Trained Interactive Segmentation
Sabarinath Mahadevan, Paul Voigtlaender, Bastian Leibe
TL;DR
This work presents Iteratively Trained Interactive Segmentation (ITIS), a click-based interactive segmentation framework that uses an iterative training loop to add correction clicks reflecting actual user behavior. By encoding clicks as Gaussians, optionally incorporating a distance-transform mask, and training with an iterative correction strategy, ITIS achieves state-of-the-art performance with fewer interactions on standard benchmarks. The approach also demonstrates robustness to test-time click strategies and proves effective in video mask correction and KITTI instance annotation, indicating practical impact for large-scale annotation workflows.
Abstract
Deep learning requires large amounts of training data to be effective. For the task of object segmentation, manually labeling data is very expensive, and hence interactive methods are needed. Following recent approaches, we develop an interactive object segmentation system which uses user input in the form of clicks as the input to a convolutional network. While previous methods use heuristic click sampling strategies to emulate user clicks during training, we propose a new iterative training strategy. During training, we iteratively add clicks based on the errors of the currently predicted segmentation. We show that our iterative training strategy together with additional improvements to the network architecture results in improved results over the state-of-the-art.
