Collaborative learning of common latent representations in routinely collected multivariate ICU physiological signals

Hollan Haule; Ian Piper; Patricia Jones; Tsz-Yan Milly Lo; Javier Escudero

Collaborative learning of common latent representations in routinely collected multivariate ICU physiological signals

Hollan Haule, Ian Piper, Patricia Jones, Tsz-Yan Milly Lo, Javier Escudero

TL;DR

The study tackles ICU phenotyping from routinely collected multivariate time series by learning common latent representations across patients using a collaborative-filtering framework coupled with LSTM embeddings. It introduces a loss that aligns cross-patient latent states via a similarity target and employs collaborative inference by least squares to predict IH-related states, evaluated on a pediatric TBI dataset. Results show strong IH detection performance (IH AUC up to $0.889$ and AP up to $0.725$) and more compact, discriminative latent spaces compared to autoencoder baselines, though cross-channel attention may detract in this setup. The approach demonstrates the feasibility of deriving clinically meaningful, shared representations from high-dimensional ICU signals, with potential to inform patient subgroups and personalized care.

Abstract

In Intensive Care Units (ICU), the abundance of multivariate time series presents an opportunity for machine learning (ML) to enhance patient phenotyping. In contrast to previous research focused on electronic health records (EHR), here we propose an ML approach for phenotyping using routinely collected physiological time series data. Our new algorithm integrates Long Short-Term Memory (LSTM) networks with collaborative filtering concepts to identify common physiological states across patients. Tested on real-world ICU clinical data for intracranial hypertension (IH) detection in patients with brain injury, our method achieved an area under the curve (AUC) of 0.889 and average precision (AP) of 0.725. Moreover, our algorithm outperforms autoencoders in learning more structured latent representations of the physiological signals. These findings highlight the promise of our methodology for patient phenotyping, leveraging routinely collected multivariate time series to improve clinical care practices.

Collaborative learning of common latent representations in routinely collected multivariate ICU physiological signals

TL;DR

and AP up to

) and more compact, discriminative latent spaces compared to autoencoder baselines, though cross-channel attention may detract in this setup. The approach demonstrates the feasibility of deriving clinically meaningful, shared representations from high-dimensional ICU signals, with potential to inform patient subgroups and personalized care.

Abstract

Paper Structure (15 sections, 9 equations, 2 figures, 1 table, 2 algorithms)

This paper contains 15 sections, 9 equations, 2 figures, 1 table, 2 algorithms.

Introduction
Methodology
Core principle and loss function formulation
Training algorithm
Collaborative Inference by Least Squares
LSTM Model for Extracting Embeddings
Self-Attention and Cross-channel attention
Dataset description
Preprocessing
Experiments
Results and discussion
Classification performance
Cross-channel attention ablation
Visualising embeddings with t-SNE
Conclusion

Figures (2)

Figure 1: Distribution of correlation values between the presence of patients in the training set and average AP. This shows that the presence of some patients in the training set strongly affects performance.
Figure 2: Comparative t-SNE 2D projections of patients in a test set.

Collaborative learning of common latent representations in routinely collected multivariate ICU physiological signals

TL;DR

Abstract

Collaborative learning of common latent representations in routinely collected multivariate ICU physiological signals

Authors

TL;DR

Abstract

Table of Contents

Figures (2)