Table of Contents
Fetching ...

A multimodal slice discovery framework for systematic failure detection and explanation in medical image classification

Yixuan Liu, Kanwal K. Bhatia, Ahmed E. Fetit

TL;DR

This work introduces the first automated auditing framework that extends slice discovery methods to multimodal representations specifically for medical applications and shows that multimodal information generally allows more comprehensive and effective auditing of classifiers, while unimodal variants beyond image-only inputs exhibit strong potential in scenarios where resources are constrained.

Abstract

Despite advances in machine learning-based medical image classifiers, the safety and reliability of these systems remain major concerns in practical settings. Existing auditing approaches mainly rely on unimodal features or metadata-based subgroup analyses, which are limited in interpretability and often fail to capture hidden systematic failures. To address these limitations, we introduce the first automated auditing framework that extends slice discovery methods to multimodal representations specifically for medical applications. Comprehensive experiments were conducted under common failure scenarios using the MIMIC-CXR-JPG dataset, demonstrating the framework's strong capability in both failure discovery and explanation generation. Our results also show that multimodal information generally allows more comprehensive and effective auditing of classifiers, while unimodal variants beyond image-only inputs exhibit strong potential in scenarios where resources are constrained.

A multimodal slice discovery framework for systematic failure detection and explanation in medical image classification

TL;DR

This work introduces the first automated auditing framework that extends slice discovery methods to multimodal representations specifically for medical applications and shows that multimodal information generally allows more comprehensive and effective auditing of classifiers, while unimodal variants beyond image-only inputs exhibit strong potential in scenarios where resources are constrained.

Abstract

Despite advances in machine learning-based medical image classifiers, the safety and reliability of these systems remain major concerns in practical settings. Existing auditing approaches mainly rely on unimodal features or metadata-based subgroup analyses, which are limited in interpretability and often fail to capture hidden systematic failures. To address these limitations, we introduce the first automated auditing framework that extends slice discovery methods to multimodal representations specifically for medical applications. Comprehensive experiments were conducted under common failure scenarios using the MIMIC-CXR-JPG dataset, demonstrating the framework's strong capability in both failure discovery and explanation generation. Our results also show that multimodal information generally allows more comprehensive and effective auditing of classifiers, while unimodal variants beyond image-only inputs exhibit strong potential in scenarios where resources are constrained.
Paper Structure (12 sections, 5 equations, 1 figure, 1 table)

This paper contains 12 sections, 5 equations, 1 figure, 1 table.

Figures (1)

  • Figure 1: Average Precision@10 under noisy-label settings with 20% and 30% underperforming samples in the test set.