SDPRLayers: Certifiable Backpropagation Through Polynomial Optimization Problems in Robotics
Connor Holmes, Frederike Dümbgen, Timothy D. Barfoot
TL;DR
This work tackles the vulnerability of differentiable optimization in robotics to convergence on spurious local minima, which can distort backpropagated gradients. It introduces SDPRLayers, a differentiable optimization layer that solves semidefinite relaxations of polynomial optimization problems (POPs) and, when the relaxation is tight, yields gradients corresponding to the globally optimal solution through implicit differentiation and certificate matrices. The authors provide theoretical conditions for when gradients are certifiably correct, describe multiple backpropagation schemes (implicit selections, classic IFT, and SDP-based differentiation), and demonstrate strong performance on synthetic polynomial problems, stereo localization, and a deep-learning–aided localization pipeline. The results show that relying on SDP-based gradients avoids the pitfalls of local minima, improves convergence reliability, and enables robust end-to-end learning in robotics, with open-source PyTorch tooling for practical adoption.
Abstract
A recent set of techniques in the robotics community, known as certifiably correct methods, frames robotics problems as polynomial optimization problems (POPs) and applies convex, semidefinite programming (SDP) relaxations to either find or certify their global optima. In parallel, differentiable optimization allows optimization problems to be embedded into end-to-end learning frameworks and has received considerable attention in the robotics community. In this paper, we consider the ill effect of convergence to spurious local minima in the context of learning frameworks that use differentiable optimization. We present SDPRLayers, an approach that seeks to address this issue by combining convex relaxations with implicit differentiation techniques to provide certifiably correct solutions and gradients throughout the training process. We provide theoretical results that outline conditions for the correctness of these gradients and provide efficient means for their computation. Our approach is first applied to two simple-but-demonstrative simulated examples, which expose the potential pitfalls of reliance on local optimization in existing, state-of-the-art, differentiable optimization methods. We then apply our method in a real-world application: we train a deep neural network to detect image keypoints for robot localization in challenging lighting conditions. We provide our open-source, PyTorch implementation of SDPRLayers.
