Intriguing properties of generative classifiers
Priyank Jaini, Kevin Clark, Robert Geirhos
TL;DR
This work compares zero-shot generative classifiers derived from text-to-image models to discriminative models and human data on 17 challenging OOD datasets and perceptual tasks. By estimating class-conditioned likelihoods $p_{ heta}(oldsymbol{x}|y= ext{y}_k)$—via diffusion variational lower bounds for diffusion models or exact likelihoods for autoregressive models—the authors classify by $\tilde{y}=\arg\max_k \log p_{ heta}(oldsymbol{x}|y=\text{y}_k)$. They report four key findings: (i) generative classifiers exhibit human-like shape bias (e.g., Imagen 99%), (ii) near human-level OOD robustness, (iii) strong alignment with human error patterns, and (iv) the ability to capture certain perceptual illusions. The results suggest that generative pre-training can yield robust, human-aligned object recognition and may offer insights for integrating generative and discriminative approaches in vision systems, despite current speed limitations and cross-model confounds. Key equations include $\tilde{y}=\arg\max_k p(y=\text{y}_k|\boldsymbol{x})$ and $\log p_{ heta}(oldsymbol{x}|y=\text{y}_k)$ approximations via $p$-models.
Abstract
What is the best paradigm to recognize objects -- discriminative inference (fast but potentially prone to shortcut learning) or using a generative model (slow but potentially more robust)? We build on recent advances in generative modeling that turn text-to-image models into classifiers. This allows us to study their behavior and to compare them against discriminative models and human psychophysical data. We report four intriguing emergent properties of generative classifiers: they show a record-breaking human-like shape bias (99% for Imagen), near human-level out-of-distribution accuracy, state-of-the-art alignment with human classification errors, and they understand certain perceptual illusions. Our results indicate that while the current dominant paradigm for modeling human object recognition is discriminative inference, zero-shot generative models approximate human object recognition data surprisingly well.
