MindCraft: How Concept Trees Take Shape In Deep Models

Bowei Tian; Yexiao He; Wanghao Ye; Ziyao Wang; Meng Liu; Ang Li

MindCraft: How Concept Trees Take Shape In Deep Models

Bowei Tian, Yexiao He, Wanghao Ye, Ziyao Wang, Meng Liu, Ang Li

TL;DR

The MindCraft framework built upon Concept Trees establishes a widely applicable and powerful framework that enables in-depth analysis of conceptual representations in deep models, marking a significant step forward in the foundation of interpretable AI.

Abstract

Large-scale foundation models demonstrate strong performance across language, vision, and reasoning tasks. However, how they internally structure and stabilize concepts remains elusive. Inspired by causal inference, we introduce the MindCraft framework built upon Concept Trees. By applying spectral decomposition at each layer and linking principal directions into branching Concept Paths, Concept Trees reconstruct the hierarchical emergence of concepts, revealing exactly when they diverge from shared representations into linearly separable subspaces. Empirical evaluations across diverse scenarios across disciplines, including medical diagnosis, physics reasoning, and political decision-making, show that Concept Trees recover semantic hierarchies, disentangle latent concepts, and can be widely applied across multiple domains. The Concept Tree establishes a widely applicable and powerful framework that enables in-depth analysis of conceptual representations in deep models, marking a significant step forward in the foundation of interpretable AI.

MindCraft: How Concept Trees Take Shape In Deep Models

TL;DR

Abstract

MindCraft: How Concept Trees Take Shape In Deep Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)

Theorems & Definitions (2)