Concept-driven Off Policy Evaluation
Ritam Majumdar, Jack Teversham, Sonali Parbhoo
TL;DR
This work broadens Off-Policy Evaluation by introducing concept-based estimators that leverage interpretable concepts from Concept Bottleneck Models to reduce variance in batch-data evaluation. It defines Concept-based IS estimators (CIS and CPDIS), proves unbiasedness under known concepts and variance reduction relative to traditional OPE, and extends to unknown concepts via an end-to-end learning framework (PC-OPE) that optimizes concise, diverse concepts and concept-to-policy mappings. The approach enables targeted interventions on concepts for deeper insights into evaluation behavior and robustness, demonstrated on WindyGridworld and MIMIC-III with substantial variance reductions and improved interpretability. Limitations include potential bias and trajectory distribution mismatch when learning concepts, motivating future work on confounding, partial observability, and broader domains.
Abstract
Evaluating off-policy decisions using batch data poses significant challenges due to limited sample sizes leading to high variance. To improve Off-Policy Evaluation (OPE), we must identify and address the sources of this variance. Recent research on Concept Bottleneck Models (CBMs) shows that using human-explainable concepts can improve predictions and provide better understanding. We propose incorporating concepts into OPE to reduce variance. Our work introduces a family of concept-based OPE estimators, proving that they remain unbiased and reduce variance when concepts are known and predefined. Since real-world applications often lack predefined concepts, we further develop an end-to-end algorithm to learn interpretable, concise, and diverse parameterized concepts optimized for variance reduction. Our experiments with synthetic and real-world datasets show that both known and learned concept-based estimators significantly improve OPE performance. Crucially, we show that, unlike other OPE methods, concept-based estimators are easily interpretable and allow for targeted interventions on specific concepts, further enhancing the quality of these estimators.
