The Average Patient Fallacy
Alaleh Azhir, Shawn N. Murphy, Hossein Estiri
TL;DR
Problem: Machine learning in medicine is driven by population averages, risking neglect of rare but high-impact cases. Approach: formalizes the average patient fallacy, contrasts standard risk minimization with precision-oriented objectives, and introduces clinically weighted objectives, contextual optimization, and measurable rare-case metrics. Key contributions: definitions of Rare Case Performance Gap (RCPG), Rare-Case Calibration Error (RCCE), the Rarity Index, and a lambda-governed constrained optimization framework grounded in clinical consensus. Significance: provides an auditable pathway to align AI with precision medicine, improving detection and treatment of rare presentations while maintaining ethical and practical safeguards.
Abstract
Machine learning in medicine is typically optimized for population averages. This frequency weighted training privileges common presentations and marginalizes rare yet clinically critical cases, a bias we call the average patient fallacy. In mixture models, gradients from rare cases are suppressed by prevalence, creating a direct conflict with precision medicine. Clinical vignettes in oncology, cardiology, and ophthalmology show how this yields missed rare responders, delayed recognition of atypical emergencies, and underperformance on vision-threatening variants. We propose operational fixes: Rare Case Performance Gap, Rare Case Calibration Error, a prevalence utility definition of rarity, and clinically weighted objectives that surface ethical priorities. Weight selection should follow structured deliberation. AI in medicine must detect exceptional cases because of their significance.
