HOLa: HoloLens Object Labeling
Michael Schwimmbeck, Serouj Khajarian, Konstantin Holzapfel, Johannes Schmidt, Stefanie Remmele
TL;DR
The paper presents HOLa, a Unity/Python application that leverages SAM-Track to enable fully automatic single-object labeling on HoloLens 2 with minimal user input, addressing the data annotation bottleneck in medical AR. By integrating a seed-point prompted SAM-Track within a two-mode workflow (recording and labeling), HOLa performs frame-wise pixel labeling across sequences while maintaining a simple initialization process. Across five experiments spanning phantom and open-liver clinical scenes, HOLa achieves Dice scores between 0.875 and 0.982 and delivers substantial labeling speedups (~500x) over manual annotation, with performance comparable to inter-rater variability. The work demonstrates the feasibility of applying foundation-model-based tracking to AR data, discusses limitations related to object fragmentation and image quality, and provides open-source tooling to facilitate rapid data management in AR research.
Abstract
In the context of medical Augmented Reality (AR) applications, object tracking is a key challenge and requires a significant amount of annotation masks. As segmentation foundation models like the Segment Anything Model (SAM) begin to emerge, zero-shot segmentation requires only minimal human participation obtaining high-quality object masks. We introduce a HoloLens-Object-Labeling (HOLa) Unity and Python application based on the SAM-Track algorithm that offers fully automatic single object annotation for HoloLens 2 while requiring minimal human participation. HOLa does not have to be adjusted to a specific image appearance and could thus alleviate AR research in any application field. We evaluate HOLa for different degrees of image complexity in open liver surgery and in medical phantom experiments. Using HOLa for image annotation can increase the labeling speed by more than 500 times while providing Dice scores between 0.875 and 0.982, which are comparable to human annotators. Our code is publicly available at: https://github.com/mschwimmbeck/HOLa
