Emergent Explainability: Adding a causal chain to neural network inference
Adam Perrett
TL;DR
The paper addresses the opacity of neural networks in high-stakes settings like healthcare and proposes emergent communication (EmCom) to achieve causal explainability. A contextualiser (trained via reinforcement learning) and an actor (trained via supervised learning) exchange task-informed messages that accompany outputs, enabling a causal grounding of decisions. In synthetic experiments with $n=3$ inputs and $256$ truth tables (and $1024$ training examples), the study shows that greater cross-agent information sharing improves generalization to unseen tasks, supporting the viability of causal, task-aware explanations and potential for distributed learning and transfer. The work suggests significant implications for healthcare xAI and beyond, while noting the need for further validation and the development of human-interpretable communications and broader real-world testing.
Abstract
This position paper presents a theoretical framework for enhancing explainable artificial intelligence (xAI) through emergent communication (EmCom), focusing on creating a causal understanding of AI model outputs. We explore the novel integration of EmCom into AI systems, offering a paradigm shift from conventional associative relationships between inputs and outputs to a more nuanced, causal interpretation. The framework aims to revolutionize how AI processes are understood, making them more transparent and interpretable. While the initial application of this model is demonstrated on synthetic data, the implications of this research extend beyond these simple applications. This general approach has the potential to redefine interactions with AI across multiple domains, fostering trust and informed decision-making in healthcare and in various sectors where AI's decision-making processes are critical. The paper discusses the theoretical underpinnings of this approach, its potential broad applications, and its alignment with the growing need for responsible and transparent AI systems in an increasingly digital world.
