To Ask or Not to Ask: Learning to Require Human Feedback
Andrea Pugnana, Giovanni De Toni, Cesare Barbera, Roberto Pellungrini, Bruno Lepri, Andrea Passerini
TL;DR
This work addresses the limitations of Learning to Defer by introducing Learning to Ask (LtA), a framework where an ML model learns not only when to defer to a human but also how to incorporate expert input through an enriched predictor. LtA employs a two-component architecture (f and g) and a budgeted selector s, optimizing the $L^{ask}$ loss with a deferral constraint; it provides a theoretical optimality result (Theorem 1) and a realizable-consistent surrogate for joint training (Theorem 2). The authors present two practical training paradigms, LtA-Seq and LtA-Joint, and validate them on synthetic data and a real X-ray dataset, showing that LtA can outperform traditional LtD, especially with richer forms of expert feedback. This framework advances human-AI collaboration by enabling more flexible, budget-aware querying and integration of expert insights into predictive models.
Abstract
Developing decision-support systems that complement human performance in classification tasks remains an open challenge. A popular approach, Learning to Defer (LtD), allows a Machine Learning (ML) model to pass difficult cases to a human expert. However, LtD treats humans and ML models as mutually exclusive decision-makers, restricting the expert contribution to mere predictions. To address this limitation, we propose Learning to Ask (LtA), a new framework that handles both when and how to incorporate expert input in an ML model. LtA is based on a two-part architecture: a standard ML model and an enriched model trained with additional expert human feedback, with a formally optimal strategy for selecting when to query the enriched model. We provide two practical implementations of LtA: a sequential approach, which trains the models in stages, and a joint approach, which optimises them simultaneously. For the latter, we design surrogate losses with realisable-consistency guarantees. Our experiments with synthetic and real expert data demonstrate that LtA provides a more flexible and powerful foundation for effective human-AI collaboration.
