ConvNets and ImageNet Beyond Accuracy: Understanding Mistakes and Uncovering Biases
Pierre Stock, Moustapha Cisse
TL;DR
The paper argues that image classification benchmarks and model assessments based on raw accuracy underestimate the true end-user experience, due to misalignments with human judgment and hidden biases. By combining human subject studies, feature- and example-based explanations, and adversarial-perturbation-based model criticism, it demonstrates that (a) ImageNet accuracy/robustness are underappreciated, (b) explanations help users interpret and trust predictions even for adversarial inputs, and (c) biases learned by models can be exposed and quantified. The findings show substantial reductions in rectified error when misclassifications are judged by humans, improved end-user agreement with adversarial predictions when explanations are provided, and effective uncovering of undesirable biases through adversarial model criticism. These results advocate for broader use of explanations and bias-detection tools to build more reliable, trustworthy vision systems and to motivate the development of richer benchmarks beyond single-label accuracy.
Abstract
ConvNets and Imagenet have driven the recent success of deep learning for image classification. However, the marked slowdown in performance improvement combined with the lack of robustness of neural networks to adversarial examples and their tendency to exhibit undesirable biases question the reliability of these methods. This work investigates these questions from the perspective of the end-user by using human subject studies and explanations. The contribution of this study is threefold. We first experimentally demonstrate that the accuracy and robustness of ConvNets measured on Imagenet are vastly underestimated. Next, we show that explanations can mitigate the impact of misclassified adversarial examples from the perspective of the end-user. We finally introduce a novel tool for uncovering the undesirable biases learned by a model. These contributions also show that explanations are a valuable tool both for improving our understanding of ConvNets' predictions and for designing more reliable models.
