Interpreting and Improving Optimal Control Problems with Directional Corrections
Trevor Barron, Xiaojing Zhang
TL;DR
The paper presents a directional-correction framework to interpret and improve optimal control problems (OCPs) in robotics by introducing a consistency score that measures how each cost component aligns with a desired plan change. It extends corrections to state-based directions, closed-loop MPC settings, and constrained OCPs, and analyzes cross-stage effects via a sensitivity map $F$. It further proposes an LP-based weight-optimization method to maximize consistency with a set of corrections, enabling automatic tuning of weights $w_k^{(r)}$ with a margin $m$ under nonnegativity and horizon-sum constraints. Through nonlinear unicycle experiments, the approach identifies inconsistent components (e.g., obstacle costs) and demonstrates open- and closed-loop parameter optimization that yields more consistent and desirable plan behavior, even in the presence of prediction noise. This framework provides a practical, data-driven means to diagnose and refine OCP formulations for robust robotic control.
Abstract
Many robotics tasks, such as path planning or trajectory optimization, are formulated as optimal control problems (OCPs). The key to obtaining high performance lies in the design of the OCP's objective function. In practice, the objective function consists of a set of individual components that must be carefully modeled and traded off such that the OCP has the desired solution. It is often challenging to balance multiple components to achieve the desired solution and to understand, when the solution is undesired, the impact of individual cost components. In this paper, we present a framework addressing these challenges based on the concept of directional corrections. Specifically, given the solution to an OCP that is deemed undesirable, and access to an expert providing the direction of change that would increase the desirability of the solution, our method analyzes the individual cost components for their "consistency" with the provided directional correction. This information can be used to improve the OCP formulation, e.g., by increasing the weight of consistent cost components, or reducing the weight of - or even redesigning - inconsistent cost components. We also show that our framework can automatically tune parameters of the OCP to achieve consistency with a set of corrections.
