Echoes of AI Harms: A Human-LLM Synergistic Framework for Bias-Driven Harm Anticipation

Nicoleta Tantalaki; Sophia Vei; Athena Vakali

TL;DR

ECHO is a proactive, human-centered framework that links AI bias types to potential harms across sociotechnical contexts. By leveraging domain-specific vignettes, dual human-LLM harm annotation, and ethical matrices, it maps bias origins through the AI lifecycle to stakeholder-specific harms in two high-stakes domains (disease diagnosis and hiring). The paper provides descriptive and inferential ethical matrices (dEM and iEM) to reveal robust bias–harm pathways and demonstrates how these insights can guide early design and governance decisions. This approach advances anticipatory governance in AI, offering a generalizable protocol for tracing pathways from biases to harms before a system is developed or deployed, with clear implications for policy and risk management.

Abstract

The growing influence of Artificial Intelligence (AI) systems on decision-making in critical domains has exposed their potential to cause significant harms, often rooted in biases embedded across the AI lifecycle. While existing frameworks and taxonomies document bias or harms in isolation, they rarely establish systematic links between specific bias types and the harms they cause, particularly within real-world sociotechnical contexts. Technical fixes for AI bias are ill-equipped to resolve it and are typically applied after a system has been developed or deployed, offering limited preventive value. We propose ECHO, a novel framework for proactive AI harm anticipation through the systematic mapping of AI bias types to harm outcomes across diverse stakeholder and domain contexts. ECHO follows a modular workflow encompassing stakeholder identification, vignette-based presentation of biased AI systems, and dual (human-LLM) harm annotation, integrated within ethical matrices for structured interpretation. This human-centered approach enables early-stage detection of bias-to-harm pathways, guiding AI design and governance decisions from the outset. We validate ECHO in two high-stakes domains (disease diagnosis and hiring), revealing domain-specific bias-to-harm patterns and demonstrating ECHO's potential to support anticipatory governance of AI systems.
