Editable XAI: Toward Bidirectional Human-AI Alignment with Co-Editable Explanations of Interpretable Attributes
Haoyang Chen, Jingwen Bai, Fang Tian, Brian Y Lim
TL;DR
Editable XAI introduces a bidirectional framework for aligning human domain knowledge with AI reasoning by making explanations editable. CoExplain pairs a neural predictor with a faith-proxy decision-tree explainer and supports writing user rules that can be parsed into neural networks, as well as AI-assisted enhancements that adjust thresholds or restructure topology. Across a user study (N=43), editable explanations improved user-AI faithfulness and understanding relative to read-only explanations, with CoExplain achieving near-optimal accuracy while maintaining alignment to user rules and reducing editing effort. The work demonstrates the value of writable explanations for collaborative human–AI reasoning and outlines design guidelines, limitations, and future directions for scalable, modular editable AI systems.
Abstract
While Explainable AI (XAI) helps users understand AI decisions, misalignment in domain knowledge can lead to disagreement. This inconsistency hinders understanding, and because explanations are often read-only, users lack the control to improve alignment. We propose making XAI editable, allowing users to write rules to improve control and gain deeper understanding through the generation effect of active learning. We developed CoExplain, leveraging a neural network for universal representation and symbolic rules for intuitive reasoning on interpretable attributes. CoExplain explains the neural network with a faithful proxy decision tree, parses user-written rules as an equivalent neural network graph, and collaboratively optimizes the decision tree. In a user study (N=43), CoExplain and manually editable XAI improved user understanding and model alignment compared to read-only XAI. CoExplain was easier to use with fewer edits and less time. This work contributes Editable XAI for bidirectional AI alignment, improving understanding and control.
