Visualizing Deep Neural Network Decisions: Prediction Difference Analysis
Luisa M Zintgraf, Taco S Cohen, Tameem Adel, Max Welling
TL;DR
The paper tackles the challenge of interpreting deep neural network decisions for image classification, including medical imaging. It introduces Prediction Difference Analysis with three key enhancements: conditional sampling, multivariate patch analysis, and deep visualization of hidden layers. These improvements yield image-specific, evidence-for/evidence-against explanations that are more faithful than prior saliency methods or weight-based approaches, demonstrated on ImageNet models and HIV-related MRI data. While computationally intensive, the authors argue that precomputation and GPU acceleration can enable practical, interactive 3D visualizations with potential clinical impact.
Abstract
This article presents the prediction difference analysis method for visualizing the response of a deep neural network to a specific input. When classifying images, the method highlights areas in a given input image that provide evidence for or against a certain class. It overcomes several shortcoming of previous methods and provides great additional insight into the decision making process of classifiers. Making neural network decisions interpretable through visualization is important both to improve models and to accelerate the adoption of black-box classifiers in application areas such as medicine. We illustrate the method in experiments on natural images (ImageNet data), as well as medical images (MRI brain scans).
