GOHOME: Graph-Oriented Heatmap Output for future Motion Estimation
Thomas Gilles, Stefano Sabatini, Dzmitry Tsishkou, Bogdan Stanciulescu, Fabien Moutarde
TL;DR
GOHOME introduces a graph-oriented heatmap approach for future motion estimation by encoding HD-Map lanelets with a graph neural network and generating lane-level rasters that project into a global heatmap of future positions. This design avoids full-image CNNs, enables multimodal and uncertainty-aware predictions, and uses sparse sampling plus ranking to produce final trajectories efficiently. The method achieves competitive or state-of-the-art results across Argoverse, NuScenes, and Interaction datasets, with substantial speed-ups and memory reductions compared to CNN-based baselines, and benefits further from model ensembling. Overall, GOHOME demonstrates scalable, accurate, and versatile trajectory prediction suitable for real-time autonomous navigation and decision-making.
Abstract
In this paper, we propose GOHOME, a method leveraging graph representations of the High Definition Map and sparse projections to generate a heatmap output representing the future position probability distribution for a given agent in a traffic scene. This heatmap output yields an unconstrained 2D grid representation of agent future possible locations, allowing inherent multimodality and a measure of the uncertainty of the prediction. Our graph-oriented model avoids the high computation burden of representing the surrounding context as squared images and processing it with classical CNNs, but focuses instead only on the most probable lanes where the agent could end up in the immediate future. GOHOME reaches 2$nd$ on Argoverse Motion Forecasting Benchmark on the MissRate$_6$ metric while achieving significant speed-up and memory burden diminution compared to Argoverse 1$^{st}$ place method HOME. We also highlight that heatmap output enables multimodal ensembling and improve 1$^{st}$ place MissRate$_6$ by more than 15$\%$ with our best ensemble on Argoverse. Finally, we evaluate and reach state-of-the-art performance on the other trajectory prediction datasets nuScenes and Interaction, demonstrating the generalizability of our method.
