Safe Active Learning for Gaussian Differential Equations

Leon Glass; Katharina Ensinger; Christoph Zimmer

Safe Active Learning for Gaussian Differential Equations

Leon Glass, Katharina Ensinger, Christoph Zimmer

TL;DR

This paper introduces Safe Active Learning for Gaussian Process ODEs (SAL GPODE), a framework that combines a mutual-information based acquisition with a safety constraint to guide data collection for learning the unknown dynamics g in a GPODE. The method uses decoupled GP sampling to draw trajectory realizations, computes a Monte Carlo estimate of the information gain from potential measurements, and imposes a probabilistic safety constraint via a separate safety GP. Data collection is planned by solving a constrained optimization that selects initial states to maximize information while keeping the probability of unsafe behavior below a user-defined threshold. Experiments on the Van der Pol oscillator and Lotka-Volterra dynamics show SAL GPODE achieves faster, safer model learning than non-active baselines, highlighting its potential for efficient and safe data-driven dynamical modeling.

Abstract

Gaussian Process differential equations (GPODE) have recently gained momentum due to their ability to capture dynamics behavior of systems and also represent uncertainty in predictions. Prior work has described the process of training the hyperparameters and, thereby, calibrating GPODE to data. How to design efficient algorithms to collect data for training GPODE models is still an open field of research. Nevertheless high-quality training data is key for model performance. Furthermore, data collection leads to time-cost and financial-cost and might in some areas even be safety critical to the system under test. Therefore, algorithms for safe and efficient data collection are central for building high quality GPODE models. Our novel Safe Active Learning (SAL) for GPODE algorithm addresses this challenge by suggesting a mechanism to propose efficient and non-safety-critical data to collect. SAL GPODE does so by sequentially suggesting new data, measuring it and updating the GPODE model with the new data. In this way, subsequent data points are iteratively suggested. The core of our SAL GPODE algorithm is a constrained optimization problem maximizing information of new data for GPODE model training constrained by the safety of the underlying system. We demonstrate our novel SAL GPODE's superiority compared to a standard, non-active way of measuring new data on two relevant examples.

Safe Active Learning for Gaussian Differential Equations

TL;DR

Abstract

Safe Active Learning for Gaussian Differential Equations

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)