Remotely Detectable Robot Policy Watermarking
Michael Amir, Manon Flageat, Amanda Prorok
TL;DR
This work tackles the problem of verifying the provenance of robotic policies using only remote observations, introducing the Physical Observation Gap as the core obstacle. It proposes Colored Noise Coherency (CoNoCo), a frequency-domain watermarking method that injects Colored Gaussian Noise into the policy's exploration signals and detects the watermark via spectral coherency, robust to unknown robot dynamics. The authors formalize the problem, prove marginal distribution preservation, and demonstrate robust remote detectability across simulated and real-world tasks and multiple observation modalities, including video, motion capture, and CCTV-like footage. This enables non-invasive IP protection and safety accountability for physical policies in robotics, with practical performance preserved and strong resistance to adversarial attempts.
Abstract
The success of machine learning for real-world robotic systems has created a new form of intellectual property: the trained policy. This raises a critical need for novel methods that verify ownership and detect unauthorized, possibly unsafe misuse. While watermarking is established in other domains, physical policies present a unique challenge: remote detection. Existing methods assume access to the robot's internal state, but auditors are often limited to external observations (e.g., video footage). This ``Physical Observation Gap'' means the watermark must be detected from signals that are noisy, asynchronous, and filtered by unknown system dynamics. We formalize this challenge using the concept of a \textit{glimpse sequence}, and introduce Colored Noise Coherency (CoNoCo), the first watermarking strategy designed for remote detection. CoNoCo embeds a spectral signal into the robot's motions by leveraging the policy's inherent stochasticity. To show it does not degrade performance, we prove CoNoCo preserves the marginal action distribution. Our experiments demonstrate strong, robust detection across various remote modalities, including motion capture and side-way/top-down video footage, in both simulated and real-world robot experiments. This work provides a necessary step toward protecting intellectual property in robotics, offering the first method for validating the provenance of physical policies non-invasively, using purely remote observations.
