A new approach to data assimilation initialization problems with sparse data using multiple cost functions

David J. Abers; George Hripcsak; Lena Mamykina; Melike Sirlanci; Esteban G. Tabak

A new approach to data assimilation initialization problems with sparse data using multiple cost functions

David J. Abers, George Hripcsak, Lena Mamykina, Melike Sirlanci, Esteban G. Tabak

Abstract

This article develops a novel data assimilation methodology, addressing challenges that are common in real-world settings, such as severe sparsity of observations, lack of reliable models, and non-stationarity of the system dynamics. These challenges often cause identifiability issues and can confound model parameter initialization, both of which can lead to estimated models with unrealistic qualitative dynamics and induce deeper parameter estimation errors. The proposed methodology's objective function is constructed as a sum of components, each serving a different purpose: enforcing point-wise and distribution-wise agreement between data and model output, enforcing agreement of variables and parameters with a model provided, and penalizing unrealistic rapid parameter changes, unless they are due to external drivers or interventions. This methodology was motivated by, developed and evaluated in the context of estimating blood glucose levels in different medical settings. Both simulated and real data are used to evaluate the methodology from different perspectives, such as its ability to estimate unmeasured variables, its ability to reproduce the correct qualitative blood glucose dynamics, how it manages known non-stationarity, and how it performs when given a range of dense and severely sparse data. The results show that a multicomponent cost function can balance the minimization of point-wise errors with global properties, robustly preserving correct qualitative dynamics and managing data sparsity.

A new approach to data assimilation initialization problems with sparse data using multiple cost functions

Abstract

A new approach to data assimilation initialization problems with sparse data using multiple cost functions

Abstract

Paper Structure

Table of Contents

Figures (13)