Treatment Stitching with Schrödinger Bridge for Enhancing Offline Reinforcement Learning in Adaptive Treatment Strategies

Dong-Hee Shin; Deok-Joong Lee; Young-Han Son; Tae-Eui Kam

Treatment Stitching with Schrödinger Bridge for Enhancing Offline Reinforcement Learning in Adaptive Treatment Strategies

Dong-Hee Shin, Deok-Joong Lee, Young-Han Son, Tae-Eui Kam

TL;DR

This work tackles the data scarcity and safety constraints of applying reinforcement learning to adaptive treatment strategies by introducing TreatStitch, a data-augmentation framework that creates clinically valid synthetic trajectories. It combines direct stitching of similar state representations from real trajectories with Schrödinger bridge–based bridging to connect dissimilar states, thereby expanding the offline dataset without violating clinical plausibility. The approach is backed by theoretical guarantees that stitched transitions stay close to the original data distribution, mitigating out-of-distribution risks, and is validated on the EpiCare benchmark and MIMIC-III sepsis data, where it outperforms multiple generative baselines and improves offline RL performance, especially under restricted data. The work offers a practical, model-agnostic augmentation strategy for offline ATS learning, with potential to improve safety and effectiveness of AI-driven clinical decision support systems.

Abstract

Adaptive treatment strategies (ATS) are sequential decision-making processes that enable personalized care by dynamically adjusting treatment decisions in response to evolving patient symptoms. While reinforcement learning (RL) offers a promising approach for optimizing ATS, its conventional online trial-and-error learning mechanism is not permissible in clinical settings due to risks of harm to patients. Offline RL tackles this limitation by learning policies exclusively from historical treatment data, but its performance is often constrained by data scarcity-a pervasive challenge in clinical domains. To overcome this, we propose Treatment Stitching (TreatStitch), a novel data augmentation framework that generates clinically valid treatment trajectories by intelligently stitching segments from existing treatment data. Specifically, TreatStitch identifies similar intermediate patient states across different trajectories and stitches their respective segments. Even when intermediate states are too dissimilar to stitch directly, TreatStitch leverages the Schrödinger bridge method to generate smooth and energy-efficient bridging trajectories that connect dissimilar states. By augmenting these synthetic trajectories into the original dataset, offline RL can learn from a more diverse dataset, thereby improving its ability to optimize ATS. Extensive experiments across multiple treatment datasets demonstrate the effectiveness of TreatStitch in enhancing offline RL performance. Furthermore, we provide a theoretical justification showing that TreatStitch maintains clinical validity by avoiding out-of-distribution transitions.

Treatment Stitching with Schrödinger Bridge for Enhancing Offline Reinforcement Learning in Adaptive Treatment Strategies

TL;DR

Abstract

Treatment Stitching with Schrödinger Bridge for Enhancing Offline Reinforcement Learning in Adaptive Treatment Strategies

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (2)