fCOP: Focal Length Estimation from Category-level Object Priors
Xinyue Zhang, Jiaqi Yang, Xiangting Meng, Abdelrahman Mohamed, Laurent Kneip
TL;DR
The paper tackles monocular focal length estimation from a single image without strong scene geometry priors by leveraging category-level object priors and monocular depth predictions. It introduces fCOP, a minimal closed-form solver that recovers focal length from triplets of correspondences while decoupling focal length from object scale and pose, and it embeds robust estimation via Interval Stabbing with frame-wise consistency enforcement. Comprehensive experiments on synthetic data and real datasets (REAL275 and MultiFocals) show state-of-the-art focal-length accuracy and strong generalization to out-of-domain data, outperforming existing monocular intrinsic estimation methods. The estimated focal length is demonstrated to improve downstream category-level object pose estimation using RGB-D cues, highlighting practical impact for 3D understanding with uncalibrated monocular input.
Abstract
In the realm of computer vision, the perception and reconstruction of the 3D world through vision signals heavily rely on camera intrinsic parameters, which have long been a subject of intense research within the community. In practical applications, without a strong scene geometry prior like the Manhattan World assumption or special artificial calibration patterns, monocular focal length estimation becomes a challenging task. In this paper, we propose a method for monocular focal length estimation using category-level object priors. Based on two well-studied existing tasks: monocular depth estimation and category-level object canonical representation learning, our focal solver takes depth priors and object shape priors from images containing objects and estimates the focal length from triplets of correspondences in closed form. Our experiments on simulated and real world data demonstrate that the proposed method outperforms the current state-of-the-art, offering a promising solution to the long-standing monocular focal length estimation problem.
