Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization
Shaohan Li, Yunpeng Shi, Gilad Lerman
TL;DR
Cycle-Sync presents a robust global framework for camera pose estimation that jointly recovers rotations and locations without bundle adjustment. It introduces a Welsh-type objective and an MPLS-based cycle-consistency strategy to solve location estimation, along with a plug-and-play outlier rejection module and an integrated rotation synchronization via MPLS-cycle. Theoretical guarantees establish deterministic exact recovery under adversarial corruption with improved sample complexity, and extensive synthetic and real-data experiments show Cycle-Sync outperforming state-of-the-art pose estimators and full SfM pipelines. The approach enhances robustness to highly corrupted and variable-distance measurements, enabling accurate 3D reconstruction in challenging SfM scenarios with practical efficiency gains.
Abstract
We introduce Cycle-Sync, a robust and global framework for estimating camera poses (both rotations and locations). Our core innovation is a location solver that adapts message-passing least squares (MPLS) -- originally developed for group synchronization -- to camera location estimation. We modify MPLS to emphasize cycle-consistent information, redefine cycle consistencies using estimated distances from previous iterations, and incorporate a Welsch-type robust loss. We establish the strongest known deterministic exact-recovery guarantee for camera location estimation, showing that cycle consistency alone -- without access to inter-camera distances -- suffices to achieve the lowest sample complexity currently known. To further enhance robustness, we introduce a plug-and-play outlier rejection module inspired by robust subspace recovery, and we fully integrate cycle consistency into MPLS for rotation synchronization. Our global approach avoids the need for bundle adjustment. Experiments on synthetic and real datasets show that Cycle-Sync consistently outperforms leading pose estimators, including full structure-from-motion pipelines with bundle adjustment.
