Inference of Human-derived Specifications of Object Placement via Demonstration
Alex Cuellar, Ho Chit Siu, Julie A Shah
TL;DR
This work addresses the challenge of capturing human preferences for spatial object arrangement by introducing Positionally-Augmented RCC (PARCC), a logic that extends RCC with directional and class-relational capabilities. It provides a CNF-based PARCC formula framework and an inference pipeline that learns specifications from demonstrations, using a two-stage process: discovering disjunctive clauses that all demonstrations satisfy, then assessing which clauses reflect the demonstrator's intent via non-specification demonstrations. A human study shows that specifications inferred from demonstrations ($\Phi_D$) align better with participants' demonstrated patterns than human-provided specifications ($\Phi_S$), and that non-specification demonstrations offer a reasonable surrogate for unlabeled behavior. The approach enables robust, human-aligned planning for object placement in tasks like packing and sorting, with potential integration into automated robotic pipelines where spatial relationships matter for safety and efficiency.
Abstract
As robots' manipulation capabilities improve for pick-and-place tasks (e.g., object packing, sorting, and kitting), methods focused on understanding human-acceptable object configurations remain limited expressively with regard to capturing spatial relationships important to humans. To advance robotic understanding of human rules for object arrangement, we introduce positionally-augmented RCC (PARCC), a formal logic framework based on region connection calculus (RCC) for describing the relative position of objects in space. Additionally, we introduce an inference algorithm for learning PARCC specifications via demonstrations. Finally, we present the results from a human study, which demonstrate our framework's ability to capture a human's intended specification and the benefits of learning from demonstration approaches over human-provided specifications.
