ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking
Tingyang Zhang, Chen Wang, Zhiyang Dou, Qingzhe Gao, Jiahui Lei, Baoquan Chen, Lingjie Liu
TL;DR
ProTracker addresses the challenge of long-term, arbitrary-point video tracking by unifying short-term, local optical flow with long-term, global correspondences through a probabilistic integration framework. A hybrid filter prunes unreliable predictions, and bidirectional flow integration aggregates multiple noisy estimates to produce drift-resistant trajectories. The method further enhances robustness by jointly incorporating long-term keypoints, enabling re-localization after disappearance and improved occlusion handling. Across TAP-Vid and BADJA benchmarks, ProTracker achieves state-of-the-art performance among optimization-based trackers and competitive results against supervised methods, illustrating strong practical impact for drift-free, dense point tracking in dynamic scenes.
Abstract
We propose ProTracker, a novel framework for accurate and robust long-term dense tracking of arbitrary points in videos. Previous methods relying on global cost volumes effectively handle large occlusions and scene changes but lack precision and temporal awareness. In contrast, local iteration-based methods accurately track smoothly transforming scenes but face challenges with occlusions and drift. To address these issues, we propose a probabilistic framework that marries the strengths of both paradigms by leveraging local optical flow for predictions and refined global heatmaps for observations. This design effectively combines global semantic information with temporally aware low-level features, enabling precise and robust long-term tracking of arbitrary points in videos. Extensive experiments demonstrate that ProTracker attains state-of-the-art performance among optimization-based approaches and surpasses supervised feed-forward methods on multiple benchmarks. The code and model will be released after publication.
