Parental Guidance: Efficient Lifelong Learning through Evolutionary Distillation
Octi Zhang, Quanquan Peng, Rosario Scalise, Bryon Boots
TL;DR
This paper addresses the challenge of generalization for robotic agents across diverse environments while sustaining continual learning. It introduces Parental Guidance, an evolution-inspired framework that distributes the learning process and merges imitation learning with reinforcement learning to inherit and refine behaviors across generations. A central DAG-based orchestrator coordinates distributed training, while offspring undergo behavioral distillation via DAgger followed by PPO-based RL refinement, enabling IL-to-RL transitions. Preliminary experiments show improved exploration efficiency and open-ended learning in a multi-terrain setting, suggesting a scalable path to lifelong adaptation without manual reward shaping.
Abstract
Developing robotic agents that can perform well in diverse environments while showing a variety of behaviors is a key challenge in AI and robotics. Traditional reinforcement learning (RL) methods often create agents that specialize in narrow tasks, limiting their adaptability and diversity. To overcome this, we propose a preliminary, evolution-inspired framework that includes a reproduction module, similar to natural species reproduction, balancing diversity and specialization. By integrating RL, imitation learning (IL), and a coevolutionary agent-terrain curriculum, our system evolves agents continuously through complex tasks. This approach promotes adaptability, inheritance of useful traits, and continual learning. Agents not only refine inherited skills but also surpass their predecessors. Our initial experiments show that this method improves exploration efficiency and supports open-ended learning, offering a scalable solution where sparse reward coupled with diverse terrain environments induces a multi-task setting.
