The Self-Learning Agent with a Progressive Neural Network Integrated Transformer
Ajay Sivakumar, Shalini, Vasantha Raj, Sebastian Sylvester
TL;DR
The paper tackles catastrophic forgetting in continual learning by integrating Progressive Neural Networks with a pre-trained LLaMA 3.2 transformer and an autonomous data-collection agent to enable task-incremental learning for conversational AI and code generation. It combines Meta-Learning for rapid adaptation, LoRA for efficient fine-tuning, and Elastic Weight Consolidation to preserve prior knowledge, orchestrated by an agent that gathers Wikipedia-derived data. Empirical results show lower perplexities (e.g., 22.1 for conversation and 19.8 for coding) and strong retention across tasks, with BLEU and code accuracy supporting practical performance gains. The approach demonstrates scalable, low-data continual learning and advances toward AGI-like adaptability in dynamic, multi-domain environments, with potential applications in scalable, autonomous AI systems.
Abstract
This paper introduces a self-learning agent that integrates LLaMA 3.2 with a Progressive Neural Network (PNN) for continual learning in conversational AI and code generation. The framework dynamically collects data, fine-tunes tasks with minimal samples, and leverages Meta-Learning for rapid adaptation. LoRA optimizes fine-tuning, while Elastic Weight Consolidation (EWC) enhances knowledge retention. Experimental results demonstrate improved adaptability and memory stability, positioning this approach as a scalable step toward Artificial General Intelligence (AGI).
