The Best Decisions Are Not the Best Advice: Making Adherence-Aware Recommendations
Julien Grand-Clément, Jean Pauphilet
TL;DR
This work introduces an adherence-aware decision framework for expert-in-the-loop systems. It models the gap between algorithmic recommendations and implemented actions through an adherence parameter $\theta$ and the resulting effective policy $\pi_{\rm eff}(\pi_{\sf alg},\theta)$. The authors prove that an optimal recommendation policy can be chosen stationary and deterministic, and that it can be computed by reduction to a surrogate MDP, using either value iteration or linear programming. They also characterize structural properties of the optimal recommendation, such as monotonicity and piecewise-constant dependence on $\theta$, and establish equivalences with random and adversarial adherence models, including robustness to uncertain baseline policies. Numerical experiments in machine replacement and healthcare illustrate the potentially severe performance deterioration when adherence is ignored, demonstrate the practical value of adherence-robust recommendations, and cover extensions to heterogeneous, uncertain, and time-varying adherence. Overall, the paper offers a rigorous, computation-friendly framework for designing recommendations that remain effective when humans deviate from machine suggestions, with clear implications for deployment in high-stakes settings.
Abstract
Many high-stakes decisions follow an expert-in-the-loop structure in that a human operator receives recommendations from an algorithm but is the ultimate decision maker. Hence, the algorithm's recommendation may differ from the actual decision implemented in practice. However, most algorithmic recommendations are obtained by solving an optimization problem that assumes recommendations will be perfectly implemented. We propose an adherence-aware optimization framework to capture the dichotomy between the recommended and the implemented policy and analyze the impact of partial adherence on the optimal recommendation. We show that overlooking the partial adherence phenomenon, as most recommendation engines currently do, can lead to arbitrarily severe performance deterioration, compared with both the current human baseline performance and what is expected by the recommendation algorithm. Our framework also provides useful tools to analyze the structure of optimal recommendation policies and to compute them; these policies are naturally immune to such human deviations and are guaranteed to improve upon the baseline policy.
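The surrogate-MDP reduction and value-iteration approach mentioned in the TL;DR can be illustrated with a minimal sketch. It rests on assumptions not stated on this page: that in each state the operator implements the recommended action with probability $\theta$ and otherwise follows a known baseline policy, that the objective is a discounted reward to be maximized, and that the MDP is finite. The function name, argument names, and the toy numbers are hypothetical and chosen only for illustration; this is not the authors' implementation.

```python
import numpy as np

def adherence_aware_value_iteration(P, r, pi_base, theta, gamma=0.9,
                                    tol=1e-10, max_iter=10_000):
    """Sketch of value iteration on a surrogate MDP (assumed model, see lead-in).

    P:       (S, A, S) transition probabilities of the original MDP
    r:       (S, A) rewards of the original MDP
    pi_base: (S, A) baseline policy, each row a distribution over actions
    theta:   adherence level in [0, 1]
    Returns (recommended action per state, value under partial adherence).
    """
    # Reward and transition kernel induced by the baseline policy alone.
    r_base = np.einsum("sa,sa->s", pi_base, r)      # shape (S,)
    P_base = np.einsum("sa,sat->st", pi_base, P)    # shape (S, S)

    # Surrogate MDP: recommending action a in state s yields a theta-mixture
    # of "a is implemented" and "the baseline is followed instead".
    r_sur = theta * r + (1.0 - theta) * r_base[:, None]
    P_sur = theta * P + (1.0 - theta) * P_base[:, None, :]

    v = np.zeros(P.shape[0])
    for _ in range(max_iter):
        q = r_sur + gamma * (P_sur @ v)             # shape (S, A)
        v_new = q.max(axis=1)
        converged = np.max(np.abs(v_new - v)) < tol
        v = v_new
        if converged:
            break

    q = r_sur + gamma * (P_sur @ v)
    return q.argmax(axis=1), v

# Hypothetical 2-state, 2-action example (numbers are illustrative only).
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.5, 0.5], [0.1, 0.9]]])
r = np.array([[1.0, 0.0],
              [0.0, 2.0]])
pi_base = np.array([[1.0, 0.0],
                    [1.0, 0.0]])   # baseline always plays action 0

recommendation, value = adherence_aware_value_iteration(P, r, pi_base, theta=0.5)
```

Under these assumptions, setting `theta=1.0` recovers standard value iteration on the original MDP, while `theta=0.0` returns the value of the baseline policy regardless of the recommendation. Solving for intermediate values of `theta` is one way to explore how the recommended actions change with adherence.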
