Extremal Contours: Gradient-driven contours for compact visual attribution

Reza Karimzadeh; Albert Alonso; Frans Zdyb; Julius B. Kirkegaard; Bulat Ibragimov

Extremal Contours: Gradient-driven contours for compact visual attribution

Reza Karimzadeh, Albert Alonso, Frans Zdyb, Julius B. Kirkegaard, Bulat Ibragimov

TL;DR

This work introduces a training-free explanation method that replaces dense perturbation masks with smooth, single, star-convex contours parameterized by a truncated Fourier series. The approach optimizes an extremal preserve/delete objective using classifier gradients, with adaptive area and spectral regularization to ensure compact, stable, and topologically simple explanations. It achieves competitive fidelity with substantially reduced parameter counts, provides explicit area control for fidelity–area analysis, and extends naturally to multiple contours for multi-object attribution, showing strong robustness and favorable performance on both supervised and self-supervised vision models. The framework offers a practical, interpretable alternative to dense masks, with clear pathways to medical imaging applications and future enhancements for more complex topologies.

Abstract

Faithful yet compact explanations for vision models remain a challenge, as commonly used dense perturbation masks are often fragmented and overfitted, needing careful post-processing. Here, we present a training-free explanation method that replaces dense masks with smooth tunable contours. A star-convex region is parameterized by a truncated Fourier series and optimized under an extremal preserve/delete objective using the classifier gradients. The approach guarantees a single, simply connected mask, cuts the number of free parameters by orders of magnitude, and yields stable boundary updates without cleanup. Restricting solutions to low-dimensional, smooth contours makes the method robust to adversarial masking artifacts. On ImageNet classifiers, it matches the extremal fidelity of dense masks while producing compact, interpretable regions with improved run-to-run consistency. Explicit area control also enables importance contour maps, yielding a transparent fidelity-area profiles. Finally, we extend the approach to multi-contour and show how it can localize multiple objects within the same framework. Across benchmarks, the method achieves higher relevance mass and lower complexity than gradient and perturbation based baselines, with especially strong gains on self-supervised DINO models where it improves relevance mass by over 15% and maintains positive faithfulness correlations.

Extremal Contours: Gradient-driven contours for compact visual attribution

TL;DR

Abstract

Extremal Contours: Gradient-driven contours for compact visual attribution

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)