Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models
Adrien LeCoz, Houssem Ouertatani, Stéphane Herbin, Faouzi Adjed
TL;DR
The paper tackles reliable image classifier evaluation under distribution shift by leveraging text-conditioned diffusion models to synthesize failure-inducing images. It introduces an iterative pipeline that alternates image generation, classifier evaluation, and attribute-based subdomain selection, with selection guided by GA or Bayesian optimization. Empirical results show that GA and BO outperform random and combinatorial testing, enabling efficient identification of high-risk subdomains despite the substantial generation cost. The approach improves interpretability of failures by tying them to textual attributes and points to future gains from multi-fidelity evaluations and embedding-based representations to scale benchmarking.
Abstract
Image classifiers should be used with caution in the real world. Performance evaluated on a validation set may not reflect performance in the real world. In particular, classifiers may perform well for conditions that are frequently encountered during training, but poorly for other infrequent conditions. In this study, we hypothesize that recent advances in text-to-image generative models make them valuable for benchmarking computer vision models such as image classifiers: they can generate images conditioned by textual prompts that cause classifier failures, allowing failure conditions to be described with textual attributes. However, their generation cost becomes an issue when a large number of synthetic images need to be generated, which is the case when many different attribute combinations need to be tested. We propose an image classifier benchmarking method as an iterative process that alternates image generation, classifier evaluation, and attribute selection. This method efficiently explores the attributes that ultimately lead to poor behavior detection.
