On the Quest for Effectiveness in Human Oversight: Interdisciplinary Perspectives
Sarah Sterz, Kevin Baum, Sebastian Biewer, Holger Hermanns, Anne Lauber-Rönsberg, Philip Meinel, Markus Langer
TL;DR
This paper tackles the lack of clarity around when human oversight of high-risk AI is effective. Starting from the claim that the main objective of human oversight is risk mitigation, it proposes a framework based on four conditions (causal power, epistemic access, self-control, and fitting intentions) and argues that an oversight person is effective if and only if they are morally responsible and have fitting intentions, summarized as $\text{Effective} \iff \text{Morally Responsible} \land \text{Fitting Intentions}$. It then maps these conditions onto three practical domains (technical design, individual factors of oversight persons, and environment) and analyzes the EU AI Act, especially Article 14, to assess alignment and gaps. The authors advocate a structured, interdisciplinary approach to designing, regulating, and auditing effective human oversight, with implications for policy implementation, standards development, and future empirical validation.
Abstract
Human oversight is currently discussed as a potential safeguard to counter some of the negative aspects of high-risk AI applications. This prompts a critical examination of the role of, and the conditions necessary for, what is prominently termed effective or meaningful human oversight of these systems. This paper investigates effective human oversight by synthesizing insights from psychological, legal, philosophical, and technical domains. Based on the claim that the main objective of human oversight is risk mitigation, we propose a viable understanding of effectiveness in human oversight: for human oversight to be effective, the oversight person has to have (a) sufficient causal power with regard to the system and its effects, (b) suitable epistemic access to relevant aspects of the situation, (c) self-control, and (d) fitting intentions for their role. Furthermore, we argue that this is equivalent to saying that an oversight person is effective if and only if they are morally responsible and have fitting intentions. Against this backdrop, and striving for practical applicability, we suggest facilitators and inhibitors of effectiveness in human oversight. We discuss factors in three domains, namely, the technical design of the system, individual factors of oversight persons, and the environmental circumstances in which they operate. Finally, this paper scrutinizes the upcoming AI Act of the European Union (in particular Article 14 on Human Oversight) as an exemplary regulatory framework in which we study the practicality of our understanding of effective human oversight. By analyzing the provisions and implications of the European AI Act proposal, we pinpoint the extent to which that proposal aligns with our analyses regarding effective human oversight as well as how it might be enriched by our conceptual understanding of effectiveness in human oversight.
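The two characterizations of effectiveness in the abstract can be written side by side as a short logical sketch. The predicate symbols $C$, $E$, $S$, $I$, $\mathrm{Eff}$, and $\mathrm{MR}$ are notation introduced here for illustration and are not taken from the paper; the last line is one reading under which the two statements coincide, stated as an assumption and not as the authors' definition of moral responsibility.

$$
\begin{aligned}
\mathrm{Eff}(o) &\iff C(o) \land E(o) \land S(o) \land I(o) && \text{(four conditions (a) to (d))} \\
\mathrm{Eff}(o) &\iff \mathrm{MR}(o) \land I(o) && \text{(claimed equivalence)} \\
\mathrm{MR}(o) &\iff C(o) \land E(o) \land S(o) && \text{(assumed reading that makes the two lines agree)}
\end{aligned}
$$

Here $o$ is an oversight person, $C$ stands for sufficient causal power, $E$ for suitable epistemic access, $S$ for self-control, $I$ for fitting intentions, and $\mathrm{MR}$ for moral responsibility. Substituting the third line into the second yields the first, so the assumed reading is sufficient for the equivalence; the abstract itself does not spell it out.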
