Adaptive Test-Time Training for Predicting Need for Invasive Mechanical Ventilation in Multi-Center Cohorts
Xiaolei Lu, Shamim Nemati
TL;DR
Cross-site distribution shifts in ICU EHR data undermine IMV-need prediction. AdaTTT addresses this with adaptive test-time training that combines dynamic self-supervised learning and prototype-guided Partial Optimal Transport to align test-time representations with source structure. Information-theoretic analysis links auxiliary-task alignment to main-task performance, and experiments across multi-center ICU cohorts show AdaTTT achieving top discrimination and improved calibration versus strong baselines, supporting its potential for real-time clinical risk monitoring. Limitations include potential sensitivity to extreme shifts and added test-time computation, motivating future work on efficiency and broader clinical validation.
Abstract
Accurate prediction of the need for invasive mechanical ventilation (IMV) in intensive care units (ICUs) patients is crucial for timely interventions and resource allocation. However, variability in patient populations, clinical practices, and electronic health record (EHR) systems across institutions introduces domain shifts that degrade the generalization performance of predictive models during deployment. Test-Time Training (TTT) has emerged as a promising approach to mitigate such shifts by adapting models dynamically during inference without requiring labeled target-domain data. In this work, we introduce Adaptive Test-Time Training (AdaTTT), an enhanced TTT framework tailored for EHR-based IMV prediction in ICU settings. We begin by deriving information-theoretic bounds on the test-time prediction error and demonstrate that it is constrained by the uncertainty between the main and auxiliary tasks. To enhance their alignment, we introduce a self-supervised learning framework with pretext tasks: reconstruction and masked feature modeling optimized through a dynamic masking strategy that emphasizes features critical to the main task. Additionally, to improve robustness against domain shifts, we incorporate prototype learning and employ Partial Optimal Transport (POT) for flexible, partial feature alignment while maintaining clinically meaningful patient representations. Experiments across multi-center ICU cohorts demonstrate competitive classification performance on different test-time adaptation benchmarks.
