Multifaceted Feature Visualization: Uncovering the Different Types of Features Learned By Each Neuron in Deep Neural Networks
Anh Nguyen, Jason Yosinski, Jeff Clune
TL;DR
The paper tackles interpretability of deep neural networks by showing that individual neurons can be multifaceted rather than single-feature detectors. It introduces Multifaceted Feature Visualization (MFV), which identifies distinct facets of a neuron by clustering class exemplars in a low-dimensional space and initializing activation maximization from facet-specific mean images; it also adds center-biased regularization to produce coherent visuals. Results demonstrate that higher-layer neurons exhibit greater facet diversity, and MFV yields more natural colors and globally consistent facet visuals, thereby advancing the state of activation maximization. This approach enhances understanding of feature representations across layers and suggests practical avenues for more interpretable network design and visualization tools.
Abstract
We can better understand deep neural networks by identifying which features each of their neurons have learned to detect. To do so, researchers have created Deep Visualization techniques including activation maximization, which synthetically generates inputs (e.g. images) that maximally activate each neuron. A limitation of current techniques is that they assume each neuron detects only one type of feature, but we know that neurons can be multifaceted, in that they fire in response to many different types of features: for example, a grocery store class neuron must activate either for rows of produce or for a storefront. Previous activation maximization techniques constructed images without regard for the multiple different facets of a neuron, creating inappropriate mixes of colors, parts of objects, scales, orientations, etc. Here, we introduce an algorithm that explicitly uncovers the multiple facets of each neuron by producing a synthetic visualization of each of the types of images that activate a neuron. We also introduce regularization methods that produce state-of-the-art results in terms of the interpretability of images obtained by activation maximization. By separately synthesizing each type of image a neuron fires in response to, the visualizations have more appropriate colors and coherent global structure. Multifaceted feature visualization thus provides a clearer and more comprehensive description of the role of each neuron.
