Table of Contents
Fetching ...

More is Less? A Simulation-Based Approach to Dynamic Interactions between Biases in Multimodal Models

Mounia Drissi

TL;DR

This work tackles how biases from text and images interact in multimodal models by proposing a simulation-based, probabilistic framework that computes text bias ($St$), image bias ($Si$), and multimodal bias ($Sm$) on the MMBias dataset. It classifies interactions as amplification ($Sm > \max(St,Si)$), mitigation ($Sm < \min(St,Si)$), or neutrality, using a rule-based approach and conditional probabilities to reveal modality dominance. Results show amplification in 22% of cases, mitigation in 11%, and neutrality in 67%, with image bias often dominating and neutral interactions common when text does not significantly diverge. The framework offers an interpretable tool for auditing and designing fair multimodal systems, with potential transfer to other context-based datasets and modalities, guiding bias mitigation across preprocessing, in-processing, and post-processing stages.

Abstract

Multimodal machine learning models, such as those that combine text and image modalities, are increasingly used in critical domains including public safety, security, and healthcare. However, these systems inherit biases from their single modalities. This study proposes a systemic framework for analyzing dynamic multimodal bias interactions. Using the MMBias dataset, which encompasses categories prone to bias such as religion, nationality, and sexual orientation, this study adopts a simulation-based heuristic approach to compute bias scores for text-only, image-only, and multimodal embeddings. A framework is developed to classify bias interactions as amplification (multimodal bias exceeds both unimodal biases), mitigation (multimodal bias is lower than both), and neutrality (multimodal bias lies between unimodal biases), with proportional analyzes conducted to identify the dominant mode and dynamics in these interactions. The findings highlight that amplification (22\%) occurs when text and image biases are comparable, while mitigation (11\%) arises under the dominance of text bias, highlighting the stabilizing role of image bias. Neutral interactions (67\%) are related to a higher text bias without divergence. Conditional probabilities highlight the text's dominance in mitigation and mixed contributions in neutral and amplification cases, underscoring complex modality interplay. In doing so, the study encourages the use of this heuristic, systemic, and interpretable framework to analyze multimodal bias interactions, providing insight into how intermodal biases dynamically interact, with practical applications for multimodal modeling and transferability to context-based datasets, all essential for developing fair and equitable AI models.

More is Less? A Simulation-Based Approach to Dynamic Interactions between Biases in Multimodal Models

TL;DR

This work tackles how biases from text and images interact in multimodal models by proposing a simulation-based, probabilistic framework that computes text bias (), image bias (), and multimodal bias () on the MMBias dataset. It classifies interactions as amplification (), mitigation (), or neutrality, using a rule-based approach and conditional probabilities to reveal modality dominance. Results show amplification in 22% of cases, mitigation in 11%, and neutrality in 67%, with image bias often dominating and neutral interactions common when text does not significantly diverge. The framework offers an interpretable tool for auditing and designing fair multimodal systems, with potential transfer to other context-based datasets and modalities, guiding bias mitigation across preprocessing, in-processing, and post-processing stages.

Abstract

Multimodal machine learning models, such as those that combine text and image modalities, are increasingly used in critical domains including public safety, security, and healthcare. However, these systems inherit biases from their single modalities. This study proposes a systemic framework for analyzing dynamic multimodal bias interactions. Using the MMBias dataset, which encompasses categories prone to bias such as religion, nationality, and sexual orientation, this study adopts a simulation-based heuristic approach to compute bias scores for text-only, image-only, and multimodal embeddings. A framework is developed to classify bias interactions as amplification (multimodal bias exceeds both unimodal biases), mitigation (multimodal bias is lower than both), and neutrality (multimodal bias lies between unimodal biases), with proportional analyzes conducted to identify the dominant mode and dynamics in these interactions. The findings highlight that amplification (22\%) occurs when text and image biases are comparable, while mitigation (11\%) arises under the dominance of text bias, highlighting the stabilizing role of image bias. Neutral interactions (67\%) are related to a higher text bias without divergence. Conditional probabilities highlight the text's dominance in mitigation and mixed contributions in neutral and amplification cases, underscoring complex modality interplay. In doing so, the study encourages the use of this heuristic, systemic, and interpretable framework to analyze multimodal bias interactions, providing insight into how intermodal biases dynamically interact, with practical applications for multimodal modeling and transferability to context-based datasets, all essential for developing fair and equitable AI models.

Paper Structure

This paper contains 15 sections, 1 equation, 4 figures.

Figures (4)

  • Figure 1: Sample images from the MMBias dataset. Each row corresponds to one of the target classes: religion, nationality, disability, and sexual orientation. Source: 46, reproduced with permission.
  • Figure 2: Bias Scores Across Categories (Author’s compilation)
  • Figure 3: Average Bias Scores by Interaction Type (Author’s compilation).
  • Figure 4: Proportion of Cases by Text vs Image Dominance (Author’s compilation).