When Algorithms Meet Artists: Semantic Compression of Artists' Concerns in the Public AI-Art Debate
Ariya Mukherjee-Gandhi, Oliver Muellerklein
TL;DR
This study investigates whether public discourse on AI-generated art proportionally represents artists' concerns. It builds a shared semantic map from 131 public documents (2013–2025) and projects 1,259 artist probes (34 frames across five dimensions: Threat, Utility, Ownership, Transparency, Compensation) into 22 topics, using a consensus-based semantic projection to ensure cross-corpus stability. The results show strong semantic compression: 95% of artist concerns concentrate in just 4 topics, while 14 topics (62% of discourse) contain no artist input, with governance concerns (ownership and transparency) being underrepresented by up to ~7× even after controlling for style. These findings reveal an epistemic marginalization in public AI-art discourse and offer a generalizable auditing method for assessing stakeholder representation in governance debates, with broad implications for policy design and other technology domains.
Abstract
Artists occupy a paradoxical position in generative AI: their work trains the models reshaping creative labor. We tested whether their concerns achieve proportional representation in public discourse shaping AI governance. Analyzing public AI-art discourse (news, podcasts, legal filings, research; 2013--2025) and projecting 1,259 survey-derived artist statements into this semantic space, we find stark compression: 95% of artist concerns cluster in 4 of 22 discourse topics, while 14 topics (62% of discourse) contain no artist perspective. This compression is selective - governance concerns (ownership, transparency) are 7x underrepresented; affective themes (threat, utility) show only 1.4x underrepresentation after style controls. The pattern indicates semantic, not stylistic, marginalization. These findings demonstrate a measurable representational gap: decision-makers relying on public discourse as a proxy for stakeholder priorities will systematically underweight those most affected. We introduce a consensus-based semantic projection methodology that is currently being validated across domains and generalizes to other stakeholder-technology contexts.
