Kalib: Easy Hand-Eye Calibration with Reference Point Tracking
Tutian Tang, Minghao Liu, Wenqiang Xu, Cewu Lu
TL;DR
Kalib tackles the burden of hand-eye calibration in unstructured settings by proposing a markerless, training-free pipeline that tracks a fixed reference point on the robot via visual foundation models and uses forward kinematics plus a PnP solver to estimate the camera–robot transform. The method supports both eye-in-hand and eye-on-base configurations and requires only the robot’s kinematic chain and a single reference point, eliminating the need for fiducial boards or precise mesh models. Across simulation and real-world benchmarks, Kalib achieves competitive accuracy with substantially reduced manual setup and demonstrates robustness to noisy backgrounds and occlusions. The work highlights practical potential for continuous operation in unstructured environments and lays groundwork for further improvements in tracking robustness and calibration reliability.
Abstract
Hand-eye calibration aims to estimate the transformation between a camera and a robot. Traditional methods rely on fiducial markers, which require considerable manual effort and precise setup. Recent advances in deep learning have introduced markerless techniques but come with more prerequisites, such as retraining networks for each robot, and accessing accurate mesh models for data generation. In this paper, we propose Kalib, an automatic and easy-to-setup hand-eye calibration method that leverages the generalizability of visual foundation models to overcome these challenges. It features only two basic prerequisites, the robot's kinematic chain and a predefined reference point on the robot. During calibration, the reference point is tracked in the camera space. Its corresponding 3D coordinates in the robot coordinate can be inferred by forward kinematics. Then, a PnP solver directly estimates the transformation between the camera and the robot without training new networks or accessing mesh models. Evaluations in simulated and real-world benchmarks show that Kalib achieves good accuracy with a lower manual workload compared with recent baseline methods. We also demonstrate its application in multiple real-world settings with various robot arms and grippers. Kalib's user-friendly design and minimal setup requirements make it a possible solution for continuous operation in unstructured environments.
