GenCHiP: Generating Robot Policy Code for High-Precision and Contact-Rich Manipulation Tasks
Kaylee Burns, Ajinkya Jain, Keegan Go, Fei Xia, Michael Stark, Stefan Schaal, Karol Hausman
TL;DR
GenCHiP investigates whether large language models can generate robot policy code for high-precision, contact-rich manipulation by reparameterizing the action space to enforce impedance/admittance constraints. By exposing a compliant control space and using targeted prompting strategies, GenCHiP enables LLMs to produce executable policies that reason about forces and contacts, even under perceptual noise. Across the Functional Manipulation Benchmark and NIST Task Board tasks, GenCHiP outperforms contact-unaware baselines by more than 3x–4x, demonstrating strong generalization to varied object geometries and task conditions. The work highlights the practical potential of LLMs to automate low-level parameterization of dexterous manipulation policies, reducing manual tuning and enabling broader applicability in real-world robotics.
Abstract
Large Language Models (LLMs) have been successful at generating robot policy code, but so far these results have been limited to high-level tasks that do not require precise movement. It is an open question how well such approaches work for tasks that require reasoning over contact forces and working within tight success tolerances. We find that, with the right action space, LLMs are capable of successfully generating policies for a variety of contact-rich and high-precision manipulation tasks, even under noisy conditions, such as perceptual errors or grasping inaccuracies. Specifically, we reparameterize the action space to include compliance with constraints on the interaction forces and stiffnesses involved in reaching a target pose. We validate this approach on subtasks derived from the Functional Manipulation Benchmark (FMB) and NIST Task Board Benchmarks. Exposing this action space alongside methods for estimating object poses improves policy generation with an LLM by greater than 3x and 4x when compared to non-compliant action spaces
