EgoPoints: Advancing Point Tracking for Egocentric Videos

Ahmad Darkhalil; Rhodri Guerrier; Adam W. Harley; Dima Damen

EgoPoints: Advancing Point Tracking for Egocentric Videos

Ahmad Darkhalil, Rhodri Guerrier, Adam W. Harley, Dima Damen

TL;DR

This work introduces EgoPoints, the first dense point-tracking benchmark tailored to egocentric videos, featuring 517 sequences with 4.7K tracks and new metrics to quantify in-view, out-of-view, and re-identification performance. To address the observed deficiencies, the authors propose K-EPIC, a semi-real training pipeline that fuses scene points from EPIC Fields with dynamic-object points from Kubric, generating 11K sequences and 22.1M tracks for robust fine-tuning. Empirical results show that fine-tuning state-of-the-art trackers (notably CoTracker and PIPs++) on K-EPIC improves EgoPoints performance across multiple metrics, while preserving accuracy on traditional third-person benchmarks; however, re-identification remains a central challenge with substantial headroom for improvement. Overall, EgoPoints provides a valuable benchmark and data-generation approach that promotes progress in egocentric dense point tracking, with practical implications for human-robot collaboration and augmented reality.

Abstract

We introduce EgoPoints, a benchmark for point tracking in egocentric videos. We annotate 4.7K challenging tracks in egocentric sequences. Compared to the popular TAP-Vid-DAVIS evaluation benchmark, we include 9x more points that go out-of-view and 59x more points that require re-identification (ReID) after returning to view. To measure the performance of models on these challenging points, we introduce evaluation metrics that specifically monitor tracking performance on points in-view, out-of-view, and points that require re-identification. We then propose a pipeline to create semi-real sequences, with automatic ground truth. We generate 11K such sequences by combining dynamic Kubric objects with scene points from EPIC Fields. When fine-tuning point tracking methods on these sequences and evaluating on our annotated EgoPoints sequences, we improve CoTracker across all metrics, including the tracking accuracy $δ^\star_{\text{avg}}$ by 2.7 percentage points and accuracy on ReID sequences (ReID$δ_{\text{avg}}$) by 2.4 points. We also improve $δ^\star_{\text{avg}}$ and ReID$δ_{\text{avg}}$ of PIPs++ by 0.3 and 2.8 respectively.

EgoPoints: Advancing Point Tracking for Egocentric Videos

TL;DR

Abstract

EgoPoints: Advancing Point Tracking for Egocentric Videos

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)