Agential AI for Integrated Continual Learning, Deliberative Behavior, and Comprehensible Models
Zeki Doruk Erden, Boi Faltings
TL;DR
The paper addresses fundamental limitations of gradient-based ML—namely continual learning, incomprehensibility, and poor integration with deliberative planning—by introducing Agential AI (AAI), a unified framework consisting of Modelleyen (varsel, non-gradient structure learning), Planlayan (goal-directed planning on a learned model), and Behavior Encapsulation (automatic hierarchical decomposition of learned plans). It demonstrates, in a proof-of-principle on a simple FSM environment, that the system can learn continually without destructive adaptation, perform planning on a learned model without reward-driven relearning, and produce a human-comprehensible, hierarchical representation of behavior. The approach leverages discrete state modeling with BSVs, DSVs, and CSVs, and introduces concepts like upstream conditioning and normalized causal effect (NCE) to manage complexity and significance of learned relations. Together, these components aim to deliver integrated, controllable AI that reasons over structured environment models and produces interpretable subpolicies, with future work targeting continuous spaces, higher-order conditioning, and live integration of behavior encapsulation.
Abstract
Contemporary machine learning paradigm excels in statistical data analysis, solving problems that classical AI couldn't. However, it faces key limitations, such as a lack of integration with planning, incomprehensible internal structure, and inability to learn continually. We present the initial design for an AI system, Agential AI (AAI), in principle operating independently or on top of statistical methods, designed to overcome these issues. AAI's core is a learning method that models temporal dynamics with guarantees of completeness, minimality, and continual learning, using component-level variation and selection to learn the structure of the environment. It integrates this with a behavior algorithm that plans on a learned model and encapsulates high-level behavior patterns. Preliminary experiments on a simple environment show AAI's effectiveness and potential.
