Time-Aware Policy Learning for Adaptive and Punctual Robot Control

Yinsen Jia; Boyuan Chen

Time-Aware Policy Learning for Adaptive and Punctual Robot Control

Yinsen Jia, Boyuan Chen

TL;DR

Time-aware policy learning is introduced, a reinforcement learning framework that enables robots to explicitly perceive and reason with time as a first-class variable, providing a unified foundation for efficient, robust, resilient, and human-aligned robot autonomy.

Abstract

Temporal awareness underlies intelligent behavior in both animals and humans, guiding how actions are sequenced, paced, and adapted to changing goals and environments. Yet most robot learning algorithms remain blind to time. We introduce time-aware policy learning, a reinforcement learning framework that enables robots to explicitly perceive and reason with time as a first-class variable. The framework augments conventional reinforcement policies with two complementary temporal signals, the remaining time and a time ratio, which allow a single policy to modulate its behavior continuously from rapid and dynamic to cautious and precise execution. By jointly optimizing punctuality and stability, the robot learns to balance efficiency, robustness, resiliency, and punctuality without re-training or reward adjustment. Across diverse manipulation domains from long-horizon pick and place, to granular-media pouring, articulated-object handling, and multi-agent object delivery, the time-aware policy produces adaptive behaviors that outperform standard reinforcement learning baselines by up to 48% in efficiency, 8 times more robust in sim-to-real transfer, and 90% in acoustic quietness while maintaining near-perfect success rates. Explicit temporal reasoning further enables real-time human-in-the-loop control and multi-agent coordination, allowing robots to recover from disturbances, re-synchronize after delays, and align motion tempo with human intent. By treating time not as a constraint but as a controllable dimension of behavior, time-aware policy learning provides a unified foundation for efficient, robust, resilient, and human-aligned robot autonomy.

Time-Aware Policy Learning for Adaptive and Punctual Robot Control

TL;DR

Abstract

Time-Aware Policy Learning for Adaptive and Punctual Robot Control

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)