PULSE-ICU: A Pretrained Unified Long-Sequence Encoder for Multi-task Prediction in Intensive Care Units
Sejeong Jang, Joo Heung Yoon, Hyo Kyung Lee
TL;DR
ICU data are highly irregular and heterogeneous, challenging generalizable prediction. PULSE-ICU introduces a self-supervised foundation approach that encodes ICU events with a unified, multi-embedding representation and a Longformer encoder to model long trajectories without resampling. Pretrained on MIMIC-IV with masked event and value prediction, it is fine-tuned across 18 tasks and demonstrates strong within-domain performance and robust cross-domain transfer to HiRID, eICU, and PhysioNet 2012 with minimal fine-tuning. The results show data-efficient adaptation under limited-variable conditions and highlight the potential of foundation-style ICU representations for scalable, cross-institution decision support.
Abstract
Intensive care unit (ICU) data are highly irregular, heterogeneous, and temporally fragmented, posing challenges for generalizable clinical prediction. We present PULSE-ICU, a self-supervised foundation model that learns event-level ICU representations from large-scale EHR sequences without resampling or manual feature engineering. A unified embedding module encodes event identity, continuous values, units, and temporal attributes, while a Longformer-based encoder enables efficient modeling of long trajectories. PULSE-ICU was fine-tuned across 18 prediction tasks, including mortality, intervention forecasting, and phenotype identification, achieving strong performance across task types. External validation on eICU, HiRID, and P12 showed substantial improvements with minimal fine-tuning, demonstrating robustness to domain shift and variable constraints. These findings suggest that foundation-style modeling can improve data efficiency and adaptability, providing a scalable framework for ICU decision support across diverse clinical environments.
