Table of Contents
Fetching ...

Towards Physiologically Sensible Predictions via the Rule-based Reinforcement Learning Layer

Lingwei Zhu, Zheng Chen, Yukie Nagai, Jimeng Sun

TL;DR

This work addresses physiologically invalid predictions produced by data-driven healthcare models by introducing the Rule-based Reinforcement Learning Layer (RRLL), a lightweight RL layer that learns to reassign labels using a compact set of impossibility rules. RRLL defines an MDP over $K$-class one-hot labels, where states fuse the base predictor's outputs with recent actions and features, and rewards penalize rule-violating transitions. Trained with REINFORCE and an action-change penalty, RRLL can be plugged onto any predictor to yield end-to-end predictions that are more physiologically plausible. Across sleep staging and seizure onset detection, RRLL improves accuracy and dramatically reduces impossible transitions, demonstrating generality across domains and base models. The approach offers a practical path toward deployable, clinically sensible AI systems without requiring extensive domain-specific modeling of disease progression.

Abstract

This paper adds to the growing literature of reinforcement learning (RL) for healthcare by proposing a novel paradigm: augmenting any predictor with Rule-based RL Layer (RRLL) that corrects the model's physiologically impossible predictions. Specifically, RRLL takes as input states predicted labels and outputs corrected labels as actions. The reward of the state-action pair is evaluated by a set of general rules. RRLL is efficient, general and lightweight: it does not require heavy expert knowledge like prior work but only a set of impossible transitions. This set is much smaller than all possible transitions; yet it can effectively reduce physiologically impossible mistakes made by the state-of-the-art predictor models. We verify the utility of RRLL on a variety of important healthcare classification problems and observe significant improvements using the same setup, with only the domain-specific set of impossibility changed. In-depth analysis shows that RRLL indeed improves accuracy by effectively reducing the presence of physiologically impossible predictions.

Towards Physiologically Sensible Predictions via the Rule-based Reinforcement Learning Layer

TL;DR

This work addresses physiologically invalid predictions produced by data-driven healthcare models by introducing the Rule-based Reinforcement Learning Layer (RRLL), a lightweight RL layer that learns to reassign labels using a compact set of impossibility rules. RRLL defines an MDP over -class one-hot labels, where states fuse the base predictor's outputs with recent actions and features, and rewards penalize rule-violating transitions. Trained with REINFORCE and an action-change penalty, RRLL can be plugged onto any predictor to yield end-to-end predictions that are more physiologically plausible. Across sleep staging and seizure onset detection, RRLL improves accuracy and dramatically reduces impossible transitions, demonstrating generality across domains and base models. The approach offers a practical path toward deployable, clinically sensible AI systems without requiring extensive domain-specific modeling of disease progression.

Abstract

This paper adds to the growing literature of reinforcement learning (RL) for healthcare by proposing a novel paradigm: augmenting any predictor with Rule-based RL Layer (RRLL) that corrects the model's physiologically impossible predictions. Specifically, RRLL takes as input states predicted labels and outputs corrected labels as actions. The reward of the state-action pair is evaluated by a set of general rules. RRLL is efficient, general and lightweight: it does not require heavy expert knowledge like prior work but only a set of impossible transitions. This set is much smaller than all possible transitions; yet it can effectively reduce physiologically impossible mistakes made by the state-of-the-art predictor models. We verify the utility of RRLL on a variety of important healthcare classification problems and observe significant improvements using the same setup, with only the domain-specific set of impossibility changed. In-depth analysis shows that RRLL indeed improves accuracy by effectively reducing the presence of physiologically impossible predictions.

Paper Structure

This paper contains 16 sections, 7 equations, 8 figures, 4 tables.

Figures (8)

  • Figure 1: Overview of the proposed rule-based reinforcement learning layer. It takes as input state predictions from the predictor and outputs physiological sensible reassigned labels as actions. Reward function is jointly defined by the mismatch to true labels and the set of impossible transitions.
  • Figure 2: Overview of RRLL workflow. (Left) Given a trained base predictor with frozen weights, we loop through the dataset to obtain its tentative predicted labels $\hat{y}$ and features $\boldsymbol{z}$. The predicted labels and feature vectors, together with the agent's last action constitute states. (Middle) The agent draws an action which is a reassigned label to the input instance. The state-action pair is then evaluated by the set of impossibility $\zeta$. (Right) After RRLL learning, the entire model can be used in an end-to-end manner.
  • Figure 3: Cumulative reward on all base predictor models, averaged over 10 seeds. Model index is the the base predictor model trained after model index epochs. Ribbons show the 95% confidence interval.
  • Figure 4: RRLL is capable of improving the accuracy of all base models, showing the general applicability of RRLL.
  • Figure 5: The percentage of physiologically impossible predictions along RRLL learning. Ribbons show the 95% confidence. The blue dashed lines show the base predictor. Though the base predictor percentage decreases from Model 1 to 100, RRLL can still notably reduce it on Model 100, indicating that the base predictor does not improve accuracy by explicitly minimizing physiologically impossible predictions.
  • ...and 3 more figures