Exploring the Boundaries of Content Moderation in Text-to-Image Generation

Piera Riccio; Georgina Curto; Nuria Oliver

Exploring the Boundaries of Content Moderation in Text-to-Image Generation

Piera Riccio, Georgina Curto, Nuria Oliver

Abstract

This paper analyzes the community safety guidelines of five text-to-image (T2I) generation platforms and audits five T2I models, focusing on prompts related to the representation of humans in areas that might lead to societal stigma. While current research primarily focuses on ensuring safety by restricting the generation of harmful content, our study offers a complementary perspective. We argue that the concept of safety is difficult to define and operationalize, reflected in a discrepancy between the officially published safety guidelines and the actual behavior of the T2I models, and leading at times to over-censorship. Our findings call for more transparency and an inclusive dialogue about the platforms' content moderation practices, bearing in mind their global cultural and social impact.

Exploring the Boundaries of Content Moderation in Text-to-Image Generation

Abstract

Paper Structure (17 sections, 2 figures, 4 tables)

This paper contains 17 sections, 2 figures, 4 tables.

Introduction
Related Work
Safety Guidelines in T2I Systems
Auditing
1. Physical Appearance and Personal Traits
2. Health
3. Reproduction, Women's Health and Romantic Relationships
4. Legal and Illegal Activities
5. Politics and ideologies
6. Artistic Nudity
Discussion
Opacity of Content Moderation in T2I platforms
Image generation is different from Web search
Social and cultural consequences of prompt and content moderation
Artistic nudity as a special case
...and 2 more sections

Figures (2)

Figure 1: (a) Histogram of the types of prompt and content moderation experienced in the auditing process; (b) Percentage of moderated/censored prompts per T2I model.
Figure 2: From left to right: A revisitation of Botticelli's The Birth of Venus by Stable Image Ulta (SIU), A revisitation of Titian’s Venus of Urbino, by Stable Image Core (SIC), A revisitation of Botticelli's The Birth of Venus, by Stable Diffusion 3 (SD3), and A revisitation of Michelangelo's David, by Midjourney (MJ).

Exploring the Boundaries of Content Moderation in Text-to-Image Generation

Abstract

Exploring the Boundaries of Content Moderation in Text-to-Image Generation

Authors

Abstract

Table of Contents

Figures (2)