Data-efficient, Explainable and Safe Box Manipulation: Illustrating the Advantages of Physical Priors in Model-Predictive Control
Achkan Salehi, Stephane Doncieux
TL;DR
This work tackles data efficiency, explainability, and safety in robotics control by injecting physical priors into Model Predictive Control for a box-rotation task on the SOTO2 conveyor gripper. It replaces a black-box environment model with a gray-box that learns a voxel-based mass distribution $\hat{\Pi}$ from a short exploration, then computes center of mass $r_c$ and inertia $I_c$ to plan via MPC using the dynamics $M\ddot{r}_c=F$ and $\dot{\omega}=I_c^{-1}\tau$, with torque estimated from contact surfaces. The approach yields zero-shot generalization to unseen mass distributions, improved data-efficiency, and built-in safety via early aborts, outperforming a black-box baseline in safety-critical scenarios. Limitations include supervision needs for $\hat{\Pi}$, sim-to-real transfer, and manual priors, suggesting directions such as domain randomization and automated prior extraction for broader applicability.
Abstract
Model-based RL/control have gained significant traction in robotics. Yet, these approaches often remain data-inefficient and lack the explainability of hand-engineered solutions. This makes them difficult to debug/integrate in safety-critical settings. However, in many systems, prior knowledge of environment kinematics/dynamics is available. Incorporating such priors can help address the aforementioned problems by reducing problem complexity and the need for exploration, while also facilitating the expression of the decisions taken by the agent in terms of physically meaningful entities. Our aim with this paper is to illustrate and support this point of view via a case-study. We model a payload manipulation problem based on a real robotic system, and show that leveraging prior knowledge about the dynamics of the environment in an MPC framework can lead to improvements in explainability, safety and data-efficiency, leading to satisfying generalization properties with less data.
