Exploiting Causality Signals in Medical Images: A Pilot Study with Empirical Results
Gianluca Carloni, Sara Colantonio
TL;DR
This work introduces a causality-driven augmentation to CNNs for medical image classification by extracting causality factors from feature-map interactions to weight maps via a Mulcat module. Causality signals are captured as asymmetric relationships between feature maps using $P(F^i|F^j)$ and $P(F^j|F^i)$ estimated with Max or Lehmer means, then used to produce a causality-driven feature representation. Across BreakHis and PI-CAI datasets, Mulcat variants consistently improve accuracy and, in qualitative CAM analyses, focus attention on diagnostically relevant regions, with only a small memory overhead and compatibility with existing attention modules like BAM. The approach also demonstrates potential in few-shot learning and enhances explainability, suggesting a practical, data-driven path to integrating causal reasoning into medical image analysis.
Abstract
We present a novel technique to discover and exploit weak causal signals directly from images via neural networks for classification purposes. This way, we model how the presence of a feature in one part of the image affects the appearance of another feature in a different part of the image. Our method consists of a convolutional neural network backbone and a causality-factors extractor module, which computes weights to enhance each feature map according to its causal influence in the scene. We develop different architecture variants and empirically evaluate all the models on two public datasets of prostate MRI images and breast histopathology slides for cancer diagnosis. We study the effectiveness of our module both in fully-supervised and few-shot learning, we assess its addition to existing attention-based solutions, we conduct ablation studies, and investigate the explainability of our models via class activation maps. Our findings show that our lightweight block extracts meaningful information and improves the overall classification, together with producing more robust predictions that focus on relevant parts of the image. That is crucial in medical imaging, where accurate and reliable classifications are essential for effective diagnosis and treatment planning.
