ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding
Byeongjun Park, Byung-Hoon Kim, Hyungjin Chung, Jong Chul Ye
TL;DR
ReDirector tackles the challenge of generating realistic video retakes for variable-length inputs under dynamic camera motion. It rectifies prior RoPE misuses by applying a shared 3D RoPE to both input and target videos and introduces Rotary Camera Encoding (RoCE), a camera-conditioned RoPE phase shift that encodes multi-view geometry within attention. The method couples RoCE with geometry-aware attention, yielding improved geometric consistency, accurate dynamic object localization, and better background preservation across long sequences and out-of-distribution trajectories. Evaluations on DAVIS, iPhone datasets, and ReCamMaster trajectories show robust camera controllability and superior visual quality compared to warping-based and implicit baselines, with strong generalization to unseen lengths and resolutions. The work advances practical camera-controlled video retargeting by enabling flexible, high-fidelity retakes without reliance on external depth or geometry estimators, using token-level multi-view reasoning within a diffusion-Transformer framework.
Abstract
We present ReDirector, a novel camera-controlled video retake generation method for dynamically captured variable-length videos. In particular, we rectify a common misuse of RoPE in previous works by aligning the spatiotemporal positions of the input video and the target retake. Moreover, we introduce Rotary Camera Encoding (RoCE), a camera-conditioned RoPE phase shift that captures and integrates multi-view relationships within and across the input and target videos. By integrating camera conditions into RoPE, our method generalizes to out-of-distribution camera trajectories and video lengths, yielding improved dynamic object localization and static background preservation. Extensive experiments further demonstrate significant improvements in camera controllability, geometric consistency, and video quality across various trajectories and lengths.
