Task Tokens: A Flexible Approach to Adapting Behavior Foundation Models
Ron Vainshtein, Zohar Rimon, Shie Mannor, Chen Tessler
TL;DR
This work introduces Task Tokens, a parameter-efficient method to adapt Goal-Conditioned Behavior Foundation Models (GC-BFMs) like MaskedMimic by learning a Task Encoder that generates task-specific tokens while keeping the base model frozen. By fusing Prior Tokens, Task Tokens, and State Tokens, the approach enables hybrid goal- and reward-driven control, achieving rapid convergence and high success across diverse humanoid tasks. Experimental results show improved task adaptation, robustness to out-of-distribution perturbations, and enhanced perceived motion realism via human studies, with complementary gains when combined with other prompting modalities. Overall, Task Tokens offer a practical path to specialize BFMs for complex control tasks without sacrificing the generalization and naturalness of the underlying motion prior.
Abstract
Recent advancements in imitation learning have led to transformer-based behavior foundation models (BFMs) that enable multi-modal, human-like control for humanoid agents. While excelling at zero-shot generation of robust behaviors, BFMs often require meticulous prompt engineering for specific tasks, potentially yielding suboptimal results. We introduce "Task Tokens", a method to effectively tailor BFMs to specific tasks while preserving their flexibility. Our approach leverages the transformer architecture of BFMs to learn a new task-specific encoder through reinforcement learning, keeping the original BFM frozen. This allows incorporation of user-defined priors, balancing reward design and prompt engineering. By training a task encoder to map observations to tokens, used as additional BFM inputs, we guide performance improvement while maintaining the model's diverse control characteristics. We demonstrate Task Tokens' efficacy across various tasks, including out-of-distribution scenarios, and show their compatibility with other prompting modalities. Our results suggest that Task Tokens offer a promising approach for adapting BFMs to specific control tasks while retaining their generalization capabilities.
