U-ARE-ME: Uncertainty-Aware Rotation Estimation in Manhattan Environments
Aalok Patwardhan, Callum Rhodes, Gwangbin Bae, Andrew J. Davison
TL;DR
U-ARE-ME tackles monocular camera rotation estimation without depth or intrinsics by exploiting Manhattan World priors and per-pixel surface normals with learned uncertainty. It combines a single-frame uncertainty-weighted rotation estimation on $SO(3)$ with a robust multi-frame sliding-window factor-graph that enforces temporal consistency and accounts for varying information content across frames. The method demonstrates competitive accuracy to RGB‑D approaches and superior robustness to dropped frames and non-Manhattan regions, validated on ICL-NUIM, TUM RGB‑D, and ScanNet, and extends to applications such as up-vector estimation, horizon detection in non-inertial frames, and ground segmentation. Its ability to produce per-frame uncertainty further stabilizes long sequences and enables reliable real-time deployment on RGB-only data.
Abstract
Camera rotation estimation from a single image is a challenging task, often requiring depth data and/or camera intrinsics, which are generally not available for in-the-wild videos. Although external sensors such as inertial measurement units (IMUs) can help, they often suffer from drift and are not applicable in non-inertial reference frames. We present U-ARE-ME, an algorithm that estimates camera rotation along with uncertainty from uncalibrated RGB images. Using a Manhattan World assumption, our method leverages the per-pixel geometric priors encoded in single-image surface normal predictions and performs optimisation over the SO(3) manifold. Given a sequence of images, we can use the per-frame rotation estimates and their uncertainty to perform multi-frame optimisation, achieving robustness and temporal consistency. Our experiments demonstrate that U-ARE-ME performs comparably to RGB-D methods and is more robust than sparse feature-based SLAM methods. We encourage the reader to view the accompanying video at https://callum-rhodes.github.io/U-ARE-ME for a visual overview of our method.
