Subconscious Robotic Imitation Learning
Jun Xie, Zhicheng Wang, Jianwei Tan, Huanxu Lin, Xiaoguang Ma
TL;DR
The paper addresses the latency challenges in robotic imitation learning (RIL) by introducing Subconscious Robotic Imitation Learning (SRIL), which leverages subconscious downsampling, pattern-augmented learning, and cognitive offloading to accelerate policy inference. SRIL combines a SPAM-driven downsampling pipeline with a Transformer-based action predictor that fuses visual and state data, and it adds an execution strategy that uses exponential-weighted future-action blocks gated by a Cognitive Offloading Readiness (COR) metric. Experiments in six dual-arm simulations and three real-robot tasks show SRIL achieves 100–200% faster task execution with comparable or superior success rates compared to state-of-the-art policies like ACT and diffusion-based methods. The results indicate SRIL's potential to deliver real-time, high-accuracy manipulation in dynamic environments and industrial settings, with future work focusing on generalization, adaptive inference, and multi-task robustness.
Abstract
Although robotic imitation learning (RIL) is promising for embodied intelligent robots, existing RIL approaches rely on computationally intensive multi-model trajectory predictions, resulting in slow execution and limited real-time responsiveness. Instead, human beings subconscious can constantly process and store vast amounts of information from their experiences, perceptions, and learning, allowing them to fulfill complex actions such as riding a bike, without consciously thinking about each. Inspired by this phenomenon in action neurology, we introduced subconscious robotic imitation learning (SRIL), wherein cognitive offloading was combined with historical action chunkings to reduce delays caused by model inferences, thereby accelerating task execution. This process was further enhanced by subconscious downsampling and pattern augmented learning policy wherein intent-rich information was addressed with quantized sampling techniques to improve manipulation efficiency. Experimental results demonstrated that execution speeds of the SRIL were 100\% to 200\% faster over SOTA policies for comprehensive dual-arm tasks, with consistently higher success rates.
