Applications of 0-1 Neural Networks in Prescription and Prediction
Vrishabh Patil, Kara Hoppe, Yonatan Mintz
TL;DR
This work tackles learning personalized treatment policies under limited observational data by proposing Prescriptive Neural Networks (PNNs), shallow 0-1 networks trained with mixed-integer programming. PNNs integrate counterfactual estimation (IPW/DM/DR) into a MILP framework to directly optimize policies while preserving interpretability, and they come with statistical consistency guarantees. Empirical results on simulated data and a postpartum hypertension case study show PNNs can reduce peak SBP and outperform competing prescriptive methods, with SHAP-based insights highlighting clinically plausible feature importance. The framework thereby provides a practical, auditable approach to prescriptive analytics in medicine, with extensions demonstrated for Warfarin dosing and MNIST prediction in the appendix.
Abstract
A key challenge in medical decision making is learning treatment policies for patients with limited observational data. This challenge is particularly evident in personalized healthcare decision-making, where models need to take into account the intricate relationships between patient characteristics, treatment options, and health outcomes. To address this, we introduce prescriptive networks (PNNs), shallow 0-1 neural networks trained with mixed integer programming that can be used with counterfactual estimation to optimize policies in medium data settings. These models offer greater interpretability than deep neural networks and can encode more complex policies than common models such as decision trees. We show that PNNs can outperform existing methods in both synthetic data experiments and in a case study of assigning treatments for postpartum hypertension. In particular, PNNs are shown to produce policies that could reduce peak blood pressure by 5.47 mm Hg (p=0.02) over existing clinical practice, and by 2 mm Hg (p=0.01) over the next best prescriptive modeling technique. Moreover PNNs were more likely than all other models to correctly identify clinically significant features while existing models relied on potentially dangerous features such as patient insurance information and race that could lead to bias in treatment.
