Assessing Foundation Models' Transferability to Physiological Signals in Precision Medicine
Matthias Christenson, Cove Geary, Brian Locke, Pranav Koirala, Warren Woodrich Pettine
TL;DR
The paper tackles the challenge of transferring foundation models to physiological signal analysis within precision medicine under data scarcity. It introduces a simulation-driven benchmarking pipeline that projects synthetic time-series through a foundation model and evaluates representations along feature disentanglement, temporal dynamics preservation, and scenario discrimination, followed by downstream task validation. Applying the Moirai time-series foundation model reveals systematic distortions in embeddings, including feature entanglement, poor signal reconstruction, and loss of temporal structure, leading to reduced clinical discriminability. These findings suggest that substantial architectural changes or targeted, synthetic-data–driven fine-tuning are required before such models can be deployed clinically, and the authors outline plans to broaden simulations and develop clinically relevant validation tasks and benchmarks.
Abstract
The success of precision medicine requires computational models that can effectively process and interpret diverse physiological signals across heterogeneous patient populations. While foundation models have demonstrated remarkable transfer capabilities across various domains, their effectiveness in handling individual-specific physiological signals - crucial for precision medicine - remains largely unexplored. This work introduces a systematic pipeline for rapidly and efficiently evaluating foundation models' transfer capabilities in medical contexts. Our pipeline employs a three-stage approach. First, it leverages physiological simulation software to generate diverse, clinically relevant scenarios, particularly focusing on data-scarce medical conditions. This simulation-based approach enables both targeted capability assessment and subsequent model fine-tuning. Second, the pipeline projects these simulated signals through the foundation model to obtain embeddings, which are then evaluated using linear methods. This evaluation quantifies the model's ability to capture three critical aspects: physiological feature independence, temporal dynamics preservation, and medical scenario differentiation. Finally, the pipeline validates these representations through specific downstream medical tasks. Initial testing of our pipeline on the Moirai time series foundation model revealed significant limitations in physiological signal processing, including feature entanglement, temporal dynamics distortion, and reduced scenario discrimination. These findings suggest that current foundation models may require substantial architectural modifications or targeted fine-tuning before deployment in clinical settings.
