The Law of Task-Achieving Body Motion: Axiomatizing Success of Robot Manipulation Actions
Malte Huerkamp, Jonas Dech, Michael Beetz
TL;DR
The paper addresses the lack of guarantees in open-world robot manipulation by proposing the Law of Task-Achieving Body Motion, an axiomatic correctness framework that ties task requests to physical execution within scoped physics. It introduces Task--Environment--Embodiment (TEE) classes and Semantic Digital Twins (SDTs) to formalize state, dynamics, and validity, and defines three verifiable predicates—$SatisfiesRequest$, $Causes$, and $CanPerform$—to certify task achievement via a single CanAchieve condition. The framework enables motion synthesis, verification, and diagnostic reasoning (including counterfactuals) while clearly delimiting scope through physics validity intervals $I_{\Phi}$. The authors instantiate the approach on articulated container manipulation in kitchen settings across three mobile manipulators, demonstrating practical use modes: generation, verification, learning from observations, and cross-embodiment analysis. The work advances a modular, certifiable approach to combining semantics, causality, and embodiment in robotic manipulation, with future directions toward deformable objects, more realistic physics, and integration with foundation models for generated task plans and verifications.
Abstract
Autonomous agents that perform everyday manipulation actions need to ensure that their body motions are semantically correct with respect to a task request, causally effective within their environment, and feasible for their embodiment. In order to enable robots to verify these properties, we introduce the Law of Task-Achieving Body Motion as an axiomatic correctness specification for body motions. To that end we introduce scoped Task-Environment-Embodiment (TEE) classes that represent world states as Semantic Digital Twins (SDTs) and define applicable physics models to decompose task achievement into three predicates: SatisfiesRequest for semantic request satisfaction over SDT state evolution; Causes for causal sufficiency under the scoped physics model; and CanPerform for safety and feasibility verification at the embodiment level. This decomposition yields a reusable, implementation-independent interface that supports motion synthesis and the verification of given body motions. It also supports typed failure diagnosis (semantic, causal, embodiment and out-of-scope), feasibility across robots and environments, and counterfactual reasoning about robot body motions. We demonstrate the usability of the law in practice by instantiating it for articulated container manipulation in kitchen environments on three contrasting mobile manipulation platforms
