Learning Important Features Through Propagating Activation Differences

Avanti Shrikumar; Peyton Greenside; Anshul Kundaje

Learning Important Features Through Propagating Activation Differences

Avanti Shrikumar, Peyton Greenside, Anshul Kundaje

TL;DR

DeepLIFT introduces a reference-based attribution framework that propagates feature importance through neural networks by backpropagating differences from a chosen reference input. By defining multipliers and a chain rule, it enables efficient, forward-compatible attributions without relying solely on gradients, and it separates positive and negative contributions to reveal interactions that gradient-based methods miss. The RevealCancel rule further refines attributions by approximating Shapley values and mitigating cancellation artifacts. Empirical results on MNIST and simulated genomic data demonstrate that DeepLIFT, especially with RevealCancel, provides more accurate and robust feature importance than gradient-based approaches, with practical implications for interpretability in vision and genomics tasks.

Abstract

The purported "black box" nature of neural networks is a barrier to adoption in applications where interpretability is essential. Here we present DeepLIFT (Deep Learning Important FeaTures), a method for decomposing the output prediction of a neural network on a specific input by backpropagating the contributions of all neurons in the network to every feature of the input. DeepLIFT compares the activation of each neuron to its 'reference activation' and assigns contribution scores according to the difference. By optionally giving separate consideration to positive and negative contributions, DeepLIFT can also reveal dependencies which are missed by other approaches. Scores can be computed efficiently in a single backward pass. We apply DeepLIFT to models trained on MNIST and simulated genomic data, and show significant advantages over gradient-based methods. Video tutorial: http://goo.gl/qKb7pL, ICML slides: bit.ly/deeplifticmlslides, ICML talk: https://vimeo.com/238275076, code: http://goo.gl/RM8jvH.

Learning Important Features Through Propagating Activation Differences

TL;DR

Abstract

Learning Important Features Through Propagating Activation Differences

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)