iKap: Kinematics-aware Planning with Imperative Learning
Qihang Li, Zhuoqun Chen, Haoze Zheng, Haonan He, Zitong Zhan, Shaoshu Su, Junyi Geng, Chen Wang
TL;DR
iKap Tackles the challenge of vision-to-planning under realistic robot kinematics by formulating end-to-end learning as a self-supervised bi-level optimization (BLO) problem that couples a perception-planning network with a differentiable MPC layer enforcing $x_{t+1}=F(x_t,u_t)$ and trajectory feasibility. The system comprises a depth-based perception encoder, a planning network that outputs keypoint references, and a differentiable MPC using $ ext{SE}(2)$-based Dubins-car kinematics to produce executable trajectories, with gradients flowing through the BLO via implicit differentiation. The upper-level cost $oldsymbol{U}=oldsymbol{A}oldsymbol{C}^ ext{F}+oldsymbol{B}oldsymbol{C}^ ext{E}+oldsymbol{G}oldsymbol{C}^ ext{T}$ guides learning to minimize collisions, environmental proximity, and trajectory tracking error. Results in simulation and on a real Unitree GO2 show higher success rates and lower tracking errors than the state-of-the-art baselines, with real-world feasibility demonstrated despite noisy depth data, underscoring the method’s practical impact and adaptability to different controllers.
Abstract
Trajectory planning in robotics aims to generate collision-free pose sequences that can be reliably executed. Recently, vision-to-planning systems have gained increasing attention for their efficiency and ability to interpret and adapt to surrounding environments. However, traditional modular systems suffer from increased latency and error propagation, while purely data-driven approaches often overlook the robot's kinematic constraints. This oversight leads to discrepancies between planned trajectories and those that are executable. To address these challenges, we propose iKap, a novel vision-to-planning system that integrates the robot's kinematic model directly into the learning pipeline. iKap employs a self-supervised learning approach and incorporates the state transition model within a differentiable bi-level optimization framework. This integration ensures the network learns collision-free waypoints while satisfying kinematic constraints, enabling gradient back-propagation for end-to-end training. Our experimental results demonstrate that iKap achieves higher success rates and reduced latency compared to the state-of-the-art methods. Besides the complete system, iKap offers a visual-to-planning network that seamlessly works with various controllers, providing a robust solution for robots navigating complex environments.
