QPM: Discrete Optimization for Globally Interpretable Image Classification
Thomas Norrenbrock, Timo Kaiser, Sovan Biswas, Ramesh Manuvinakurike, Bodo Rosenhahn
TL;DR
QPM addresses global interpretability in image classification by learning a compact, contrastive class representation: each class is described by a binary assignment of a fixed number of features (typically $k=5$) that are shared across classes. A binary quadratic program over feature selection $\mathbf{s}$ and class-feature assignments $\mathbf{W}$ optimizes a composite objective $Z = Z_A + Z_R + Z_B$, where $Z_A$ encodes class–feature similarity, $Z_R$ penalizes feature redundancy, and $Z_B$ injects steerable biases, with constraints enforcing equal feature budgets per class and unique associations. After solving for $\mathbf{W}^*$, the features are fine-tuned to align with the assigned classes, producing faithful global explanations and unprecedented structural grounding while maintaining competitive accuracy across datasets. The approach is backbone-agnostic, provides measurable interpretability gains (e.g., SID@5, Class-Independence, Structural Grounding), and supports steerability to tailor feature selection to domain criteria.
Abstract
Understanding the classifications of deep neural networks, e.g. used in safety-critical situations, is becoming increasingly important. While recent models can locally explain a single decision, to provide a faithful global explanation about an accurate model's general behavior is a more challenging open task. Towards that goal, we introduce the Quadratic Programming Enhanced Model (QPM), which learns globally interpretable class representations. QPM represents every class with a binary assignment of very few, typically 5, features, that are also assigned to other classes, ensuring easily comparable contrastive class representations. This compact binary assignment is found using discrete optimization based on predefined similarity measures and interpretability constraints. The resulting optimal assignment is used to fine-tune the diverse features, so that each of them becomes the shared general concept between the assigned classes. Extensive evaluations show that QPM delivers unprecedented global interpretability across small and large-scale datasets while setting the state of the art for the accuracy of interpretable models.
