Inducing Causal Structure for Interpretable Neural Networks Applied to Glucose Prediction for T1DM Patients

Ana Esponera; Giovanni Cinà

Inducing Causal Structure for Interpretable Neural Networks Applied to Glucose Prediction for T1DM Patients

Ana Esponera, Giovanni Cinà

TL;DR

This work investigates imposing causal structure on neural networks for blood glucose forecasting in Type 1 Diabetes via Interchange Intervention Training (IIT). By adapting the FDA-approved simglucose model into a time-compressed, acyclic causal graph and training an MLP with IIT, the authors demonstrate improved predictive accuracy and enhanced interpretability through counterfactual loss analysis. Key findings show IIT reduces prediction error across short horizons and provides module-level insight into which causal mechanisms are learned well, though gains diminish at longer horizons due to unmodeled factors. The study suggests IIT can yield interpretable, real-time glucose predictions suitable for on-device deployment, while highlighting avenues for future work, including temporal extensions and validation on real patient data.

Abstract

Causal abstraction techniques such as Interchange Intervention Training (IIT) have been proposed to infuse neural network with expert knowledge encoded in causal models, but their application to real-world problems remains limited. This article explores the application of IIT in predicting blood glucose levels in Type 1 Diabetes Mellitus (T1DM) patients. The study utilizes an acyclic version of the simglucose simulator approved by the FDA to train a Multi-Layer Perceptron (MLP) model, employing IIT to impose causal relationships. Results show that the model trained with IIT effectively abstracted the causal structure and outperformed the standard one in terms of predictive performance across different prediction horizons (PHs) post-meal. Furthermore, the breakdown of the counterfactual loss can be leveraged to explain which part of the causal mechanism are more or less effectively captured by the model. These preliminary results suggest the potential of IIT in enhancing predictive models in healthcare by effectively complying with expert knowledge.

Inducing Causal Structure for Interpretable Neural Networks Applied to Glucose Prediction for T1DM Patients

TL;DR

Abstract

Inducing Causal Structure for Interpretable Neural Networks Applied to Glucose Prediction for T1DM Patients

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)