CLARITY: Medical World Model for Guiding Treatment Decisions by Modeling Context-Aware Disease Trajectories in Latent Space
Tianxingjian Ding, Yuanhao Zou, Chen Chen, Mubarak Shah, Yu Tian
TL;DR
<3-5 sentence high-level summary> CLARITY tackles the problem of predicting dynamic, treatment-conditioned disease trajectories in oncology by embedding disease evolution in a structured latent space that is conditioned on rich clinical and temporal context. It introduces a Therapy Policies Agent and a post-treatment latent predictor (Actor) paired with a survival predictor, and couples them through an Inverse Survival Evaluation that iteratively refines therapy proposals into actionable decisions. Unlike diffusion-based image reconstruction approaches, CLARITY focuses on latent dynamics for physiologically faithful trajectories and demonstrates state-of-the-art treatment-planning performance on glioma datasets, with strong survival discrimination and computational efficiency. The framework also shows generalization to breast cancer and includes interpretable latent predictions via a latent-to-MRI decoder, while enforcing safety and validity through constraint-aware feedback loops.</3-5 sentence high-level summary>
Abstract
Clinical decision-making in oncology requires predicting dynamic disease evolution, a task current static AI predictors cannot perform. While world models (WMs) offer a paradigm for generative prediction, existing medical applications remain limited. Existing methods often rely on stochastic diffusion models, focusing on visual reconstruction rather than causal, physiological transitions. Furthermore, in medical domain, models like MeWM typically ignore patient-specific temporal and clinical contexts and lack a feedback mechanism to link predictions to treatment decisions. To address these gaps, we introduce CLARITY, a medical world model that forecasts disease evolution directly within a structured latent space. It explicitly integrates time intervals (temporal context) and patient-specific data (clinical context) to model treatment-conditioned progression as a smooth, interpretable trajectory, and thus generate physiologically faithful, individualized treatment plans. Finally, CLARITY introduces a novel prediction-to-decision framework, translating latent rollouts into transparent, actionable recommendations. CLARITY demonstrates state-of-the-art performance in treatment planning. On the MU-Glioma-Post dataset, our approach outperforms recent MeWM by 12\%, and significantly surpasses all other medical-specific large language models.
