Rapid Object Annotation
Misha Denil
TL;DR
This work tackles the bottleneck of annotating bounding boxes for novel objects in video by introducing an interactive annotation tool that leverages an objectness prior and label propagation. The approach combines CenterNet-inspired descriptor maps, continuous tracking, and feature caching to deliver substantial speedups—roughly fivefold—over traditional extreme-click methods while maintaining reasonable label quality. Through experiments on three target objects and multiple annotation styles, the authors demonstrate dramatic reductions in annotation time and provide IoU-based assessments of label accuracy. The findings suggest practical utility for rapid data collection to train detectors for new objects, albeit with attention to UI latency and generalization to diverse scenes.
Abstract
In this report we consider the problem of rapidly annotating a video with bounding boxes for a novel object. We describe a UI and associated workflow designed to make this process fast for an arbitrary novel target.
