Benchmarking SAM2-based Trackers on FMOX
Senem Aktas, Charles Markham, John McDonald, Rozenn Dahyot
TL;DR
The study benchmarks SAM2-based trackers on fast-moving object datasets (FMOX) to reveal limitations in current methods. It compares SAM2, DAM4SAM, SAMURAI, and EfficientTAM using mIoU and mDice across 46 FMOX sequences, with initialization from ground-truth bounding boxes. The results show DAM4SAM and SAMURAI consistently outperform SAM2 and EfficientTAM on challenging sequences, while EfficientTAM offers faster compute with a trade-off in accuracy. The findings highlight the importance of memory management and motion-aware strategies for FMOs and provide guidance for deploying SAM2-based trackers in real-time, high-speed contexts.
Abstract
Several object tracking pipelines extending Segment Anything Model 2 (SAM2) have been proposed in the past year, where the approach is to follow and segment the object from a single exemplar template provided by the user on a initialization frame. We propose to benchmark these high performing trackers (SAM2, EfficientTAM, DAM4SAM and SAMURAI) on datasets containing fast moving objects (FMO) specifically designed to be challenging for tracking approaches. The goal is to understand better current limitations in state-of-the-art trackers by providing more detailed insights on the behavior of these trackers. We show that overall the trackers DAM4SAM and SAMURAI perform well on more challenging sequences.
