Navigating Shortcuts, Spurious Correlations, and Confounders: From Origins via Detection to Mitigation
David Steinmann, Felix Divo, Maurice Kraus, Antonia Wüst, Lukas Struppek, Felix Friedrich, Kristian Kersting
TL;DR
This work identifies shortcuts, spurious correlations, and confounders as core failure modes of ML systems and offers a formal unification of these concepts via a comprehensive taxonomy. It links these phenomena to causality, bias, and security, and distinguishes world-induced from sampling-induced shortcuts, providing a formal framework grounded in distributions $P_ ext{gt}(x)$ and $P(x)$. The authors present a two-part taxonomy—detection and mitigation—supported by a structured view of available datasets, and they survey a wide range of methods: model-utility based, perturbation, XAI, and causality-driven detection, along with dataset-, model-, and inference-time mitigation strategies. The work culminates in a synthesis of open challenges, beyond-image-domain extensions, and a roadmap for standardized evaluation and dataset development to advance shortcut robustness in AI systems.
Abstract
Shortcuts, also described as Clever Hans behavior, spurious correlations, or confounders, present a significant challenge in machine learning and AI, critically affecting model generalization and robustness. Research in this area, however, remains fragmented across various terminologies, hindering the progress of the field as a whole. Consequently, we introduce a unifying taxonomy of shortcut learning by providing a formal definition of shortcuts and bridging the diverse terms used in the literature. In doing so, we further establish important connections between shortcuts and related fields, including bias, causality, and security, where parallels exist but are rarely discussed. Our taxonomy organizes existing approaches for shortcut detection and mitigation, providing a comprehensive overview of the current state of the field and revealing underexplored areas and open challenges. Moreover, we compile and classify datasets tailored to study shortcut learning. Altogether, this work provides a holistic perspective to deepen understanding and drive the development of more effective strategies for addressing shortcuts in machine learning.
