Reinforcement Learning on Dyads to Enhance Medication Adherence

Ziping Xu; Hinal Jajal; Sung Won Choi; Inbal Nahum-Shani; Guy Shani; Alexandra M. Psihogios; Pei-Yao Hung; Susan Murphy

Reinforcement Learning on Dyads to Enhance Medication Adherence

Ziping Xu, Hinal Jajal, Sung Won Choi, Inbal Nahum-Shani, Guy Shani, Alexandra M. Psihogios, Pei-Yao Hung, Susan Murphy

TL;DR

The paper addresses medication adherence in AYAs after allogeneic hematopoietic cell transplantation by delivering a three-component digital intervention through a novel MARL framework. It encodes domain knowledge via a causal DAG and employs surrogate rewards to handle delayed mediator effects, with three specialized agents operating at two daily timescales and one weekly timescale to maximize the cumulative adherence $\sum_{w=1}^{14}\sum_{d=1}^{7}\sum_{t=1}^{2} R_{w,d,t}^{AYA}$. In Roadmap 2.0–based simulations with 25 dyads over 14 weeks, MARL approaches outperform single-agent and random baselines, and surrogate rewards provide additional gains across standardized treatment effects (STE) of 0.15, 0.3, and 0.5. These results support advancing to the ADAPTS-HCT trial, while acknowledging limitations of the synthetic environment and the need for validation in real-world recruitment dynamics.

Abstract

Medication adherence is critical for the recovery of adolescents and young adults (AYAs) who have undergone hematopoietic cell transplantation (HCT). However, maintaining adherence is challenging for AYAs after hospital discharge, who experience both individual (e.g. physical and emotional symptoms) and interpersonal barriers (e.g., relational difficulties with their care partner, who is often involved in medication management). To optimize the effectiveness of a three-component digital intervention targeting both members of the dyad as well as their relationship, we propose a novel Multi-Agent Reinforcement Learning (MARL) approach to personalize the delivery of interventions. By incorporating the domain knowledge, the MARL framework, where each agent is responsible for the delivery of one intervention component, allows for faster learning compared with a flattened agent. Evaluation using a dyadic simulator environment, based on real clinical data, shows a significant improvement in medication adherence (approximately 3%) compared to purely random intervention delivery. The effectiveness of this approach will be further evaluated in an upcoming trial.

Reinforcement Learning on Dyads to Enhance Medication Adherence

TL;DR

Abstract

Reinforcement Learning on Dyads to Enhance Medication Adherence

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)