Table of Contents
Fetching ...

A Case Study on Concept Induction for Neuron-Level Interpretability in CNN

Moumita Sen Sarma, Samatha Ereshi Akkamahadevi, Pascal Hitzler

TL;DR

This case study investigates whether the Concept Induction-based framework for hidden neuron analysis generalizes by applying it to the SUN2012 dataset, a large-scale scene recognition benchmark, and confirms that the method transfers to SUN2012, showing its broader applicability.

Abstract

Deep Neural Networks (DNNs) have advanced applications in domains such as healthcare, autonomous systems, and scene understanding, yet the internal semantics of their hidden neurons remain poorly understood. Prior work introduced a Concept Induction-based framework for hidden neuron analysis and demonstrated its effectiveness on the ADE20K dataset. In this case study, we investigate whether the approach generalizes by applying it to the SUN2012 dataset, a large-scale scene recognition benchmark. Using the same workflow, we assign interpretable semantic labels to neurons and validate them through web-sourced images and statistical testing. Our findings confirm that the method transfers to SUN2012, showing its broader applicability.

A Case Study on Concept Induction for Neuron-Level Interpretability in CNN

TL;DR

This case study investigates whether the Concept Induction-based framework for hidden neuron analysis generalizes by applying it to the SUN2012 dataset, a large-scale scene recognition benchmark, and confirms that the method transfers to SUN2012, showing its broader applicability.

Abstract

Deep Neural Networks (DNNs) have advanced applications in domains such as healthcare, autonomous systems, and scene understanding, yet the internal semantics of their hidden neurons remain poorly understood. Prior work introduced a Concept Induction-based framework for hidden neuron analysis and demonstrated its effectiveness on the ADE20K dataset. In this case study, we investigate whether the approach generalizes by applying it to the SUN2012 dataset, a large-scale scene recognition benchmark. Using the same workflow, we assign interpretable semantic labels to neurons and validate them through web-sourced images and statistical testing. Our findings confirm that the method transfers to SUN2012, showing its broader applicability.
Paper Structure (3 sections, 1 equation, 2 tables)

This paper contains 3 sections, 1 equation, 2 tables.