SAGE: Spuriousness-Aware Guided Prompt Exploration for Mitigating Multimodal Bias

Wenqian Ye; Di Wang; Guangtao Zheng; Bohan Liu; Aidong Zhang

SAGE: Spuriousness-Aware Guided Prompt Exploration for Mitigating Multimodal Bias

Wenqian Ye, Di Wang, Guangtao Zheng, Bohan Liu, Aidong Zhang

TL;DR

This paper tackles multimodal spurious bias in zero-shot vision-language models like CLIP, where background or contextual cues can mislead object-level predictions. It introduces Spuriousness-Aware Guided Exploration (SAGE), a training-free method that searches a diverse set of prompt templates and selects those that maximize inter-class separation in the joint image-text space, thereby reducing reliance on spurious features. The authors provide a theoretical analysis linking higher class-separation in prompt-induced embeddings to robustness, and validate SAGE across four real-world benchmarks and five backbone models, showing improvements in zero-shot accuracy and worst-group robustness without any data annotations or model updates. Overall, SAGE offers a practical out-of-the-box debiasing approach for CLIP-like systems with strong generalization, balancing accuracy and fairness in zero-shot inference.

Abstract

Large vision-language models, such as CLIP, have shown strong zero-shot classification performance by aligning images and text in a shared embedding space. However, CLIP models often develop multimodal spurious biases, which is the undesirable tendency to rely on spurious features. For example, CLIP may infer object types in images based on frequently co-occurring backgrounds rather than the object's core features. This bias significantly impairs the robustness of pre-trained CLIP models on out-of-distribution data, where such cross-modal associations no longer hold. Existing methods for mitigating multimodal spurious bias typically require fine-tuning on downstream data or prior knowledge of the bias, which undermines the out-of-the-box usability of CLIP. In this paper, we first theoretically analyze the impact of multimodal spurious bias in zero-shot classification. Based on this insight, we propose Spuriousness-Aware Guided Exploration (SAGE), a simple and effective method that mitigates spurious bias through guided prompt selection. SAGE requires no training, fine-tuning, or external annotations. It explores a space of prompt templates and selects the prompts that induce the largest semantic separation between classes, thereby improving worst-group robustness. Extensive experiments on four real-world benchmark datasets and five popular backbone models demonstrate that SAGE consistently improves zero-shot performance and generalization, outperforming previous zero-shot approaches without any external knowledge or model updates.

SAGE: Spuriousness-Aware Guided Prompt Exploration for Mitigating Multimodal Bias

TL;DR

Abstract

SAGE: Spuriousness-Aware Guided Prompt Exploration for Mitigating Multimodal Bias

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (3)