Multi-Modal AI for Remote Patient Monitoring in Cancer Care
Yansong Liu, Ronnie Stafford, Pramit Khetrapal, Huriye Kocadag, Graça Carvalho, Patricia de Winter, Maryam Imran, Amelia Snook, Adamos Hadjivasiliou, D. Vijay Anand, Weining Lin, John Kelly, Yukun Zhou, Ivana Drobnjak
TL;DR
This paper investigates remote patient monitoring for cancer patients on systemic therapy, targeting the between-visit period when symptoms go unmonitored. It introduces a multi-modal AI framework in which a token-based transformer fuses HALO-X wearable data, QoR-15 surveys, and clinical events to forecast adverse events within rolling 4-week windows, while handling asynchronous sampling and missing-not-at-random (MNAR) data. In a prospective observational trial with 50 patients and over 2.1 million data points across thousands of patient-days, the approach achieved an AUROC of 0.70 and 83.9% accuracy; prior chemotherapy, A&E visits, wellness check-ins, and daily maximum heart rate were identified as key predictors. A case study shows the predicted risk escalating before an event, which supports proactive intervention. The work establishes feasibility and offers a blueprint for broader validation and clinical integration.
Abstract
For patients undergoing systemic cancer therapy, the time between clinic visits carries uncertainty and the risk of unmonitored side effects. To bridge this gap in care, we developed and prospectively trialed a multi-modal AI framework for remote patient monitoring (RPM). The system integrates multi-modal data from the HALO-X platform, including demographics, wearable sensors, daily surveys, and clinical events. Our observational trial is one of the largest of its kind and has collected over 2.1 million data points (6,080 patient-days of monitoring) from 84 patients. We developed and adapted a multi-modal AI model to handle the asynchronous and incomplete nature of real-world RPM data and to forecast a continuous risk of future adverse events. The model achieved an accuracy of 83.9% (AUROC = 0.70). Notably, it identified previous treatments, wellness check-ins, and daily maximum heart rate as key predictive features. A case study demonstrated the model's ability to provide early warning by outputting an escalating risk profile prior to the event. This work establishes the feasibility of multi-modal AI RPM for cancer care and offers a path toward more proactive patient support. (Accepted at Europe NeurIPS 2025 Multimodal Representation Learning for Healthcare Workshop)
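
The rolling 4-week forecasting target mentioned in the TL;DR can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the 28-day horizon, the daily prediction step, the exclusion of same-day events, and the names `label_window` and `rolling_labels` are all hypothetical choices made here, and the dates are invented.

```python
from datetime import date, timedelta

# Assumption: a "4-week window" is taken as 28 calendar days.
HORIZON_DAYS = 28


def label_window(prediction_day, event_days, horizon_days=HORIZON_DAYS):
    """Return 1 if any adverse event falls strictly after prediction_day
    and no later than prediction_day + horizon_days, otherwise 0."""
    window_end = prediction_day + timedelta(days=horizon_days)
    return int(any(prediction_day < d <= window_end for d in event_days))


def rolling_labels(first_day, last_day, event_days, horizon_days=HORIZON_DAYS):
    """Build one binary label per calendar day from first_day to last_day
    (inclusive). Assumption: predictions are made once per day."""
    labels = {}
    day = first_day
    while day <= last_day:
        labels[day] = label_window(day, event_days, horizon_days)
        day += timedelta(days=1)
    return labels


# Hypothetical patient with a single adverse event on 20 March 2025.
events = [date(2025, 3, 20)]
labels = rolling_labels(date(2025, 2, 1), date(2025, 3, 31), events)
```

Under these assumptions, every day from 20 February through 19 March is labelled positive, because the event lies within the following 28 days. A model trained on such labels can raise its predicted risk ahead of the event, which is the kind of early escalation the case study describes.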
