Expert-Agnostic Learning to Defer

Joshua Strong; Pramit Saha; Yasin Ibrahim; Cheng Ouyang; Alison Noble

Expert-Agnostic Learning to Defer

Joshua Strong, Pramit Saha, Yasin Ibrahim, Cheng Ouyang, Alison Noble

TL;DR

This work tackles the challenge of Learning to Defer (L2D) under unseen experts by introducing Expert-Agnostic Learning to Defer (EA-L2D), a Bayesian framework that builds explicit per-class expert behavioural representations using a Beta-Binomial model. The deferral decision is made by an expert-agnostic rejector that operates on classifier confidence and the expert's quantified competence, enabling robust generalisation to OOD experts and multiple expertise patterns. The approach supports incorporating prior knowledge through informative priors and reduces annotation costs by using a surrogate loss with uncertainty-aware weighting, with theoretical guarantees and empirical validation on four medical-imaging datasets. Across both ID and OOD scenarios, EA-L2D achieves up to 28% relative improvements in deferral performance and demonstrates stronger robustness to distribution shifts than L2D-Pop, highlighting its practical relevance for safe, collaborative AI in high-stakes domains.

Abstract

Learning to Defer (L2D) trains autonomous systems to handle straightforward cases while deferring uncertain ones to human experts. Recent advancements in this field have introduced methods that offer flexibility to unseen experts at test time. However, we find these approaches struggle to generalise to experts with behaviours not seen during training, require extensive human annotation, and lack mechanisms for incorporating prior knowledge of expert capabilities. To address these challenges, we introduce Expert-Agnostic Learning to Defer (EA-L2D), a novel L2D framework that employs a Bayesian approach to model expert behaviour in an \textit{expert-agnostic} fashion. Across benchmark medical imaging datasets (HAM10000, Blood Cells, Retinal OCT, and Liver Tumours), EA-L2D significantly outperforms prior methods on unseen experts, achieving up to a 28\% relative improvement, while also matching or exceeding state-of-the-art performance on seen experts.

Expert-Agnostic Learning to Defer

TL;DR

Abstract

Expert-Agnostic Learning to Defer

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (19)