SITUATE: Indoor Human Trajectory Prediction through Geometric Features and Self-Supervised Vision Representation
Luigi Capogrosso, Andrea Toaiari, Andrea Avogaro, Uzair Khan, Aditya Jivoji, Franco Fummi, Marco Cristani
TL;DR
Indoor human trajectory forecasting is challenged by dense indoor layouts and semantic constraints. SITUATE addresses this by fusing equivariant/invariant geometric feature learning with a self-supervised vision representation (BEiT) of scene layouts, enabling scene-aware and transformation-consistent predictions. The model combines EquiGCN and InvGCN blocks, scene tokens, and a principled feature initialization to predict future trajectories, achieving state-of-the-art results on THÖR and Supermarket and competitive outdoor performance, with ablations confirming the importance of the scene representation and regularization components. This approach offers practical impact for indoor robotics and location-based services by providing accurate, context-aware forecasts that generalize across indoor and outdoor scenarios, and the authors provide public code for reproducibility.
Abstract
Patterns of human motion in outdoor and indoor environments are substantially different due to the scope of the environment and the typical intentions of people therein. While outdoor trajectory forecasting has received significant attention, indoor forecasting is still an underexplored research area. This paper proposes SITUATE, a novel approach to cope with indoor human trajectory prediction by leveraging equivariant and invariant geometric features and a self-supervised vision representation. The geometric learning modules model the intrinsic symmetries and human movements inherent in indoor spaces. This concept becomes particularly important because self-loops at various scales and rapid direction changes often characterize indoor trajectories. On the other hand, the vision representation module is used to acquire spatial-semantic information about the environment to predict users' future locations more accurately. We evaluate our method through comprehensive experiments on the two most famous indoor trajectory forecasting datasets, i.e., THÖR and Supermarket, obtaining state-of-the-art performance. Furthermore, we also achieve competitive results in outdoor scenarios, showing that indoor-oriented forecasting models generalize better than outdoor-oriented ones. The source code is available at https://github.com/intelligolabs/SITUATE.
