Inpatient Overflow Management with Proximal Policy Optimization

Jingjing Sun; Jim Dai; Pengyi Shi

Inpatient Overflow Management with Proximal Policy Optimization

Jingjing Sun, Jim Dai, Pengyi Shi

TL;DR

This work addresses scalable overflow management in inpatient hospital systems by modeling overflow decisions as a time-periodic, long-run average cost MDP with large state/action spaces. The authors introduce atomic actions to decompose multi-patient routing into tractable sequential decisions and couple this with a randomized PPO framework, enhanced by a partially-shared policy network and a queueing-informed linear value function. The approach achieves near-optimal performance on five-, ten-, and twenty-pool systems, matching or outperforming approximate dynamic programming (ADP) while dramatically reducing computation time and data requirements. The combination of domain-specific policy design, batching, and pool-wise value decomposition yields strong sample efficiency and scalability, with practical implications for real-world hospital overflow management. The work demonstrates that tailoring general RL methods to queueing-structured, time-periodic problems can yield substantial gains in both performance and efficiency, enabling deployment in large-scale healthcare operations.

Abstract

Overflowing patients to non-primary wards can effectively alleviate congestion in hospitals, while undesired overflow also leads to issues like mismatched service quality. Therefore, we need to trade off between congestion and undesired overflow. This overflow management problem is modeled as a discrete-time Markov Decision Process with large state and action space. To overcome the curse-of-dimensionality, we decompose the action at each time into a sequence of atomic actions and use an actor-critic algorithm, Proximal Policy Optimization (PPO), to guide the atomic actions. Moreover, we tailor the design of neural network which represents policy to account for the daily periodic pattern of the system flows. Under hospital settings of different scales, the PPO policies consistently outperform commonly used state-of-art policies.

Inpatient Overflow Management with Proximal Policy Optimization

TL;DR

Abstract

Inpatient Overflow Management with Proximal Policy Optimization

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (4)