Digging Deeper: Learning Multi-Level Concept Hierarchies

Oscar Hill; Mateo Espinosa Zarlenga; Mateja Jamnik

Digging Deeper: Learning Multi-Level Concept Hierarchies

Oscar Hill, Mateo Espinosa Zarlenga, Mateja Jamnik

TL;DR

Multi-Level Concept Splitting (MLCS), which discovers multi-level concept hierarchies from only top-level supervision, and Deep-HiCEMs, an architecture that represents these discovered hierarchies and enables interventions at multiple levels of abstraction are presented.

Abstract

Although concept-based models promise interpretability by explaining predictions with human-understandable concepts, they typically rely on exhaustive annotations and treat concepts as flat and independent. To circumvent this, recent work has introduced Hierarchical Concept Embedding Models (HiCEMs) to explicitly model concept relationships, and Concept Splitting to discover sub-concepts using only coarse annotations. However, both HiCEMs and Concept Splitting are restricted to shallow hierarchies. We overcome this limitation with Multi-Level Concept Splitting (MLCS), which discovers multi-level concept hierarchies from only top-level supervision, and Deep-HiCEMs, an architecture that represents these discovered hierarchies and enables interventions at multiple levels of abstraction. Experiments across multiple datasets show that MLCS discovers human-interpretable concepts absent during training and that Deep-HiCEMs maintain high accuracy while supporting test-time concept interventions that can improve task performance.

Digging Deeper: Learning Multi-Level Concept Hierarchies

TL;DR

Abstract

Paper Structure (32 sections, 3 figures, 5 tables)

This paper contains 32 sections, 3 figures, 5 tables.

Introduction
Background and Related Work
Concept learning
Concept Splitting and HiCEMs
Multi-Level Concept Splitting
Hierarchical Sparse Autoencoder
Depth of the Hierarchy
Deep Hierarchical CEMs
Concept Structure
Architecture
Sub-concepts Modules
Experiments
PseudoKitchens-2
Setup
Datasets
...and 17 more sections

Figures (3)

Figure 1: Deep-HiCEM: as in a HiCEM, from a latent code $\mathbf{h}$, we learn two embeddings per concept ($\mathbf{\hat{c}_i^{+\prime}}$ and $\mathbf{\hat{c}_i^{-\prime}}$), which are then passed through sub-concepts modules, which produce new embeddings ($\mathbf{\hat{c}_i^{+}}$ and $\mathbf{\hat{c}_i^{-}}$) that include information about sub-concepts and their descendants.
Figure 2: Task accuracy as discovered concepts are intervened on. Intervening on discovered concepts improves task accuracy, with a few exceptions that would benefit from further investigation.
Figure 3: Change in task accuracy as provided concepts are intervened on. Provided concept interventions work just as well in Deep-HiCEMs as they do in HiCEMs.

Digging Deeper: Learning Multi-Level Concept Hierarchies

TL;DR

Abstract

Digging Deeper: Learning Multi-Level Concept Hierarchies

Authors

TL;DR

Abstract

Table of Contents

Figures (3)