EgoTraj-Bench: Towards Robust Trajectory Prediction Under Ego-view Noisy Observations

Jiayi Liu; Jiaming Zhou; Ke Ye; Kun-Yu Lin; Allan Wang; Junwei Liang

EgoTraj-Bench: Towards Robust Trajectory Prediction Under Ego-view Noisy Observations

Jiayi Liu, Jiaming Zhou, Ke Ye, Kun-Yu Lin, Allan Wang, Junwei Liang

TL;DR

This work tackles robust trajectory forecasting from ego-centric observations by introducing EgoTraj-Bench, the first real-world benchmark that injects authentic ego-view noise into BEV-grounded supervision. It proposes BiFlow, a dual-stream flow-matching model with EgoAnchor that jointly denoises histories and predicts futures, leveraging a shared latent encoder and intention priors to stabilize predictions under occlusion, ID switches, and drift. Empirical results show BiFlow achieving state-of-the-art performance and improved robustness (reducing errors by approximately 10–15% on average) across real-world EgoTraj-Bench and related datasets. The benchmark and method together provide a practical foundation for deploying ego-centric trajectory prediction systems robust to deployment-level perceptual disturbances.

Abstract

Reliable trajectory prediction from an ego-centric perspective is crucial for robotic navigation in human-centric environments. However, existing methods typically assume idealized observation histories, failing to account for the perceptual artifacts inherent in first-person vision, such as occlusions, ID switches, and tracking drift. This discrepancy between training assumptions and deployment reality severely limits model robustness. To bridge this gap, we introduce EgoTraj-Bench, the first real-world benchmark that grounds noisy, first-person visual histories in clean, bird's-eye-view future trajectories, enabling robust learning under realistic perceptual constraints. Building on this benchmark, we propose BiFlow, a dual-stream flow matching model that concurrently denoises historical observations and forecasts future motion by leveraging a shared latent representation. To better model agent intent, BiFlow incorporates our EgoAnchor mechanism, which conditions the prediction decoder on distilled historical features via feature modulation. Extensive experiments show that BiFlow achieves state-of-the-art performance, reducing minADE and minFDE by 10-15% on average and demonstrating superior robustness. We anticipate that our benchmark and model will provide a critical foundation for developing trajectory forecasting systems truly resilient to the challenges of real-world, ego-centric perception.

EgoTraj-Bench: Towards Robust Trajectory Prediction Under Ego-view Noisy Observations

TL;DR

Abstract

EgoTraj-Bench: Towards Robust Trajectory Prediction Under Ego-view Noisy Observations

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)