A theory of understanding for artificial intelligence: composability, catalysts, and learning
Zijian Zhang, Sara Aronowitz, Alán Aspuru-Guzik
TL;DR
The paper addresses the hard problem of defining understanding in AI and other subjects by proposing a verifier-based, minimal framework centered on composability. It formalizes understanding as the set $S$ of input-output tuples $(\vec{I}, I_{\rm out})$ that a verifier deems related to an object, and introduces catalysts and subject decomposition as tools to analyze and reveal internal structure and learning. Key contributions include a practical operationalization of understanding via composition, the concepts of inner catalysts and acquisition of understanding, and an argument that autocatalysis enabled by large language models (LLMs) is a promising path toward general intelligence. The work highlights how catalysts (e.g., explanations, RAG, and chain-of-thought prompts) can improve outputs, how decomposing subjects reveals cognitive architecture, and how learning corresponds to updating inner catalysts, suggesting a framework for building more general, AI-enabled learners with broad, cross-domain applicability.
Abstract
Understanding is a crucial yet elusive concept in artificial intelligence (AI). This work proposes a framework for analyzing understanding based on the notion of composability. Given any subject (e.g., a person or an AI), we suggest characterizing its understanding of an object in terms of its ability to process (compose) relevant inputs into satisfactory outputs from the perspective of a verifier. This highly universal framework can readily apply to non-human subjects, such as AIs, non-human animals, and institutions. Further, we propose methods for analyzing the inputs that enhance output quality in compositions, which we call catalysts. We show how the structure of a subject can be revealed by analyzing its components that act as catalysts and argue that a subject's learning ability can be regarded as its ability to compose inputs into its inner catalysts. Finally we examine the importance of learning ability for AIs to attain general intelligence. Our analysis indicates that models capable of generating outputs that can function as their own catalysts, such as language models, establish a foundation for potentially overcoming existing limitations in AI understanding.
