DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling
Jianbo Zhao, Taiyu Ban, Zhihao Liu, Hangning Zhou, Xiyang Wang, Qibin Zhou, Hailong Qin, Mu Yang, Lei Liu, Bin Li
TL;DR
DRoPE addresses efficient inter-agent interaction modeling for autonomous driving trajectory generation by introducing a directional extension of Rotary Position Embedding (RoPE) that encodes relative angular information. By unifying angular scalars into a global embedding, DRoPE enables angular-aware attention (DRoPE-RoPE) with space complexity $O(N)$ and preserved transformer-style time complexity, avoiding the $O(N^2)$ space blows of explicit RPE. Theoretical analysis demonstrates correctness in modeling periodic angles, and empirical results on Waymo Motion show state-of-the-art minADE with reduced memory and FLOPs compared to RPE-based approaches. This approach yields scalable, accurate trajectory generation suitable for real-time deployment and offers a path to applying efficient periodic angle modeling in other domains.
Abstract
Accurate and efficient modeling of agent interactions is essential for trajectory generation, the core of autonomous driving systems. Existing methods, scene-centric, agent-centric, and query-centric frameworks, each present distinct advantages and drawbacks, creating an impossible triangle among accuracy, computational time, and memory efficiency. To break this limitation, we propose Directional Rotary Position Embedding (DRoPE), a novel adaptation of Rotary Position Embedding (RoPE), originally developed in natural language processing. Unlike traditional relative position embedding (RPE), which introduces significant space complexity, RoPE efficiently encodes relative positions without explicitly increasing complexity but faces inherent limitations in handling angular information due to periodicity. DRoPE overcomes this limitation by introducing a uniform identity scalar into RoPE's 2D rotary transformation, aligning rotation angles with realistic agent headings to naturally encode relative angular information. We theoretically analyze DRoPE's correctness and efficiency, demonstrating its capability to simultaneously optimize trajectory generation accuracy, time complexity, and space complexity. Empirical evaluations compared with various state-of-the-art trajectory generation models, confirm DRoPE's good performance and significantly reduced space complexity, indicating both theoretical soundness and practical effectiveness. The video documentation is available at https://drope-traj.github.io/.
