A Causality-Inspired Model for Intima-Media Thickening Assessment in Ultrasound Videos
Shuo Gao, Jingyang Zhang, Jun Xue, Meng Yang, Yang Chen, Guangquan Zhou
TL;DR
This work tackles frame-wise IMT assessment in carotid ultrasound videos, where cross-frame style shifts create spurious correlations that can mislead thickening detection. It introduces a causality-inspired framework built on a Structural Causal Model (SCM) to block noncausal backdoor paths $S \dashrightarrow Y$ and strengthen the causal chain $C \rightarrow F \rightarrow Y$ through three modules: Spurious Correlation Elimination (SCE), Causal Equivalence Consolidation (CEC), and Causal Transition Augmentation (CTA). SCE perturbs auxiliary style statistics with AdaIN and enforces prediction invariance; CEC performs content randomization with adversarial training to emphasize content-driven predictions; CTA adds a text-based auxiliary pathway guided by chain-of-thought and aligns it with image features via contrastive learning. On an in-house dataset of 120 carotid ultrasound videos, the approach achieves $86.93\%$ accuracy and outperforms several baselines, demonstrating the practical value of incorporating causal analysis and multimodal prompts for robust frame-wise IMT assessment.
Abstract
Carotid atherosclerosis represents a significant health risk, with its early diagnosis primarily dependent on ultrasound-based assessments of carotid intima-media thickening. However, during carotid ultrasound screening, significant view variations cause style shifts, impairing content cues related to thickening, such as lumen anatomy, which introduces spurious correlations that hinder assessment. Therefore, we propose a novel causal-inspired method for assessing carotid intima-media thickening in frame-wise ultrasound videos, which focuses on two aspects: eliminating spurious correlations caused by style and enhancing causal content correlations. Specifically, we introduce a novel Spurious Correlation Elimination (SCE) module to remove non-causal style effects by enforcing prediction invariance with style perturbations. Simultaneously, we propose a Causal Equivalence Consolidation (CEC) module to strengthen causal content correlation through adversarial optimization during content randomization. Simultaneously, we design a Causal Transition Augmentation (CTA) module to ensure smooth causal flow by integrating an auxiliary pathway with text prompts and connecting it through contrastive learning. The experimental results on our in-house carotid ultrasound video dataset achieved an accuracy of 86.93\%, demonstrating the superior performance of the proposed method. Code is available at \href{https://github.com/xielaobanyy/causal-imt}{https://github.com/xielaobanyy/causal-imt}.
