Learning to Diagnose with LSTM Recurrent Neural Networks

Zachary C. Lipton; David C. Kale; Charles Elkan; Randall Wetzel

Learning to Diagnose with LSTM Recurrent Neural Networks

Zachary C. Lipton, David C. Kale, Charles Elkan, Randall Wetzel

TL;DR

This work demonstrates that LSTM RNNs can learn meaningful patterns from irregular multivariate ICU time series to perform multilabel diagnosis. It introduces sequential target replication and auxiliary outputs as regularization strategies, showing that these techniques improve predictive performance over strong baselines, including an MLP with hand-engineered features. The results indicate that LSTMs can outperform traditional approaches on a comprehensive PICU dataset, with ensembles offering additional gains. The study also discusses limitations related to data preprocessing and proposes future directions for early diagnosis and improved handling of missing data and interpretability.

Abstract

Clinical medical data, especially in the intensive care unit (ICU), consist of multivariate time series of observations. For each patient visit (or episode), sensor data and lab test results are recorded in the patient's Electronic Health Record (EHR). While potentially containing a wealth of insights, the data is difficult to mine effectively, owing to varying length, irregular sampling and missing data. Recurrent Neural Networks (RNNs), particularly those using Long Short-Term Memory (LSTM) hidden units, are powerful and increasingly popular models for learning from sequence data. They effectively model varying length sequences and capture long range dependencies. We present the first study to empirically evaluate the ability of LSTMs to recognize patterns in multivariate time series of clinical measurements. Specifically, we consider multilabel classification of diagnoses, training a model to classify 128 diagnoses given 13 frequently but irregularly sampled clinical measurements. First, we establish the effectiveness of a simple LSTM network for modeling clinical data. Then we demonstrate a straightforward and effective training strategy in which we replicate targets at each sequence step. Trained only on raw time series, our models outperform several strong baselines, including a multilayer perceptron trained on hand-engineered features.

Learning to Diagnose with LSTM Recurrent Neural Networks

TL;DR

Abstract

Learning to Diagnose with LSTM Recurrent Neural Networks

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)