Table of Contents
Fetching ...

DRAE: Dynamic Retrieval-Augmented Expert Networks for Lifelong Learning and Task Adaptation in Robotics

Yayu Long, Kewei Chen, Long Jin, Mingsheng Shang

TL;DR

DRAE tackles lifelong learning and catastrophic forgetting in robotic control by unifying dynamic Mixture-of-Experts routing with parameterized Retrieval-Augmented Generation and a hierarchical Reinforcement Learning framework coordinated by ReflexNet-SchemaPlanner-HyperOptima (RSHO). It introduces a non-parametric memory module (DPMM) to preserve older skills while enabling continual growth, and a unified objective that adaptively balances short-term adaptation with long-term knowledge retention. The approach yields state-of-the-art results across diverse robotic benchmarks ( MimicGen, DexArt, Adroit, NAVSIM) and shows strong generalization, reduced forgetting, and competitive real-world deployment performance, with theoretical guarantees on dynamic regret and sample complexity. By enabling dynamic expert expansion guided by external knowledge, DRAE provides a scalable, memory-efficient foundation for lifelong learning in dynamic robotic environments, with practical impact on manipulation, navigation, and humanoid motion tasks. Key results include an 82.5% task success rate on dynamic robotic manipulation tasks (vs. 74.2% for static MoEs), and substantial improvements in real-world deployment metrics such as higher success rates and faster adaptation when transferring to unseen domains.

Abstract

We introduce Dynamic Retrieval-Augmented Expert Networks (DRAE), a groundbreaking architecture that addresses the challenges of lifelong learning, catastrophic forgetting, and task adaptation by combining the dynamic routing capabilities of Mixture-of-Experts (MoE); leveraging the knowledge-enhancement power of Retrieval-Augmented Generation (RAG); incorporating a novel hierarchical reinforcement learning (RL) framework; and coordinating through ReflexNet-SchemaPlanner-HyperOptima (RSHO).DRAE dynamically routes expert models via a sparse MoE gating mechanism, enabling efficient resource allocation while leveraging external knowledge through parametric retrieval (P-RAG) to augment the learning process. We propose a new RL framework with ReflexNet for low-level task execution, SchemaPlanner for symbolic reasoning, and HyperOptima for long-term context modeling, ensuring continuous adaptation and memory retention. Experimental results show that DRAE significantly outperforms baseline approaches in long-term task retention and knowledge reuse, achieving an average task success rate of 82.5% across a set of dynamic robotic manipulation tasks, compared to 74.2% for traditional MoE models. Furthermore, DRAE maintains an extremely low forgetting rate, outperforming state-of-the-art methods in catastrophic forgetting mitigation. These results demonstrate the effectiveness of our approach in enabling flexible, scalable, and efficient lifelong learning for robotics.

DRAE: Dynamic Retrieval-Augmented Expert Networks for Lifelong Learning and Task Adaptation in Robotics

TL;DR

DRAE tackles lifelong learning and catastrophic forgetting in robotic control by unifying dynamic Mixture-of-Experts routing with parameterized Retrieval-Augmented Generation and a hierarchical Reinforcement Learning framework coordinated by ReflexNet-SchemaPlanner-HyperOptima (RSHO). It introduces a non-parametric memory module (DPMM) to preserve older skills while enabling continual growth, and a unified objective that adaptively balances short-term adaptation with long-term knowledge retention. The approach yields state-of-the-art results across diverse robotic benchmarks ( MimicGen, DexArt, Adroit, NAVSIM) and shows strong generalization, reduced forgetting, and competitive real-world deployment performance, with theoretical guarantees on dynamic regret and sample complexity. By enabling dynamic expert expansion guided by external knowledge, DRAE provides a scalable, memory-efficient foundation for lifelong learning in dynamic robotic environments, with practical impact on manipulation, navigation, and humanoid motion tasks. Key results include an 82.5% task success rate on dynamic robotic manipulation tasks (vs. 74.2% for static MoEs), and substantial improvements in real-world deployment metrics such as higher success rates and faster adaptation when transferring to unseen domains.

Abstract

We introduce Dynamic Retrieval-Augmented Expert Networks (DRAE), a groundbreaking architecture that addresses the challenges of lifelong learning, catastrophic forgetting, and task adaptation by combining the dynamic routing capabilities of Mixture-of-Experts (MoE); leveraging the knowledge-enhancement power of Retrieval-Augmented Generation (RAG); incorporating a novel hierarchical reinforcement learning (RL) framework; and coordinating through ReflexNet-SchemaPlanner-HyperOptima (RSHO).DRAE dynamically routes expert models via a sparse MoE gating mechanism, enabling efficient resource allocation while leveraging external knowledge through parametric retrieval (P-RAG) to augment the learning process. We propose a new RL framework with ReflexNet for low-level task execution, SchemaPlanner for symbolic reasoning, and HyperOptima for long-term context modeling, ensuring continuous adaptation and memory retention. Experimental results show that DRAE significantly outperforms baseline approaches in long-term task retention and knowledge reuse, achieving an average task success rate of 82.5% across a set of dynamic robotic manipulation tasks, compared to 74.2% for traditional MoE models. Furthermore, DRAE maintains an extremely low forgetting rate, outperforming state-of-the-art methods in catastrophic forgetting mitigation. These results demonstrate the effectiveness of our approach in enabling flexible, scalable, and efficient lifelong learning for robotics.

Paper Structure

This paper contains 132 sections, 5 theorems, 43 equations, 3 figures, 46 tables.

Key Result

Theorem 3.1

Under Lipschitz assumptions on $\Gamma$ and $\Psi$, DRAE with DPMM-based lifelong learning yields: where $P_T$ models environment non-stationarity.

Figures (3)

  • Figure 1: The DRAE architecture integrates four core components: (1) MoE-based dynamic routing for expert selection, (2) P-RAG for external knowledge fusion, (3) ReflexNet-SchemaPlanner-HyperOptima (RSHO) hierarchical control, and (4) DPMM for lifelong knowledge retention. The upper right detail shows critical component interactions including memory guidance, knowledge integration, and state feedback mechanisms. Key information flows demonstrate enhanced control input from augmented states to RSHO, task routing guidance from SchemaPlanner to Classifier, and execution feedback from ReflexNet to decoder.
  • Figure 2: Dynamic intermediate state transitions in coffee cup grasping and pouring task execution. The Multi-Task MoE panels reveal internal expert activation patterns evolving across task phases. P-RAG knowledge queries evolve from material property assessments to manipulation strategy requirements, while vision-guided processing states (right panel) show internal attention shifts from scene analysis to focused manipulation points. Interactive dialogue bubbles illustrate real-time decision-making, and DPMM encodes these transient patterns for future retention. This demonstrates DRAE's ability to maintain coherent representations while dynamically adapting intermediate states.
  • Figure 3: Dynamic regret of DRAE. DRAE achieves sublinear regret ($\mathcal{O}(\sqrt{T(1+P_T)}$), validating its theoretical guarantees for lifelong learning.

Theorems & Definitions (8)

  • Theorem 3.1: Sublinear Dynamic Regret
  • Theorem 3.2: Sample Complexity
  • Theorem C.1: Convergence of Expert Model
  • proof
  • Theorem C.2: Sample Complexity of DRAE
  • proof
  • Theorem C.3: Sublinear Regret for DRAE
  • proof