Temporal Inversion for Learning Interval Change in Chest X-Rays

Hanbin Ko; Kyeongmin Jeon; Doowoong Choi; Chang Min Park

Temporal Inversion for Learning Interval Change in Chest X-Rays

Hanbin Ko, Kyeongmin Jeon, Doowoong Choi, Chang Min Park

Abstract

Recent advances in vision--language pretraining have enabled strong medical foundation models, yet most analyze radiographs in isolation, overlooking the key clinical task of comparing prior and current images to assess interval change. For chest radiographs (CXRs), capturing interval change is essential, as radiologists must evaluate not only the static appearance of findings but also how they evolve over time. We introduce TILA (Temporal Inversion-aware Learning and Alignment), a simple yet effective framework that uses temporal inversion, reversing image pairs, as a supervisory signal to enhance the sensitivity of existing temporal vision-language models to directional change. TILA integrates inversion-aware objectives across pretraining, fine-tuning, and inference, complementing conventional appearance modeling with explicit learning of temporal order. We also propose a unified evaluation protocol to assess order sensitivity and consistency under temporal inversion, and introduce MS-CXR-Tretrieval, a retrieval evaluation set constructed through a general protocol that can be applied to any temporal CXR dataset. Experiments on public datasets and real-world hospital cohorts demonstrate that TILA consistently improves progression classification and temporal embedding alignment when applied to multiple existing architectures.

Temporal Inversion for Learning Interval Change in Chest X-Rays

Abstract

Temporal Inversion for Learning Interval Change in Chest X-Rays

Abstract

Paper Structure

Table of Contents

Figures (3)