Table of Contents
Fetching ...

Security Benefits and Side Effects of Labeling AI-Generated Images

Sandra Höltervennhoff, Jonas Ricker, Maike M. Raphael, Charlotte Schwedes, Rebecca Weil, Asja Fischer, Thorsten Holz, Lea Schönherr, Sascha Fahl

TL;DR

This study investigates the security benefits and side effects of labeling AI-generated images through a mixed-methods design combining five focus groups and a preregistered online survey (N=1,354 valid). It finds that while AI labels can reduce belief in misinformation linked to AI-generated content, they can also cause overreliance, reduce trust for true claims, and paradoxically elevate the credibility of some non-AI misinformation; mislabeling risks further erode trust. The results highlight that labels are not a panacea and must be implemented with simple, consistent rules, transparent mechanisms, and robust handling of mislabeling to achieve meaningful security benefits. The findings have important regulatory and platform-design implications for disclosure policies and the broader effort to combat AI-enabled misinformation. Overall, labeling is a valuable transparency tool but should be complemented with other safeguards and carefully managed to avoid unintended trust distortions.

Abstract

Generative artificial intelligence is developing rapidly, impacting humans' interaction with information and digital media. It is increasingly used to create deceptively realistic misinformation, so lawmakers have imposed regulations requiring the disclosure of AI-generated content. However, only little is known about whether these labels reduce the risks of AI-generated misinformation. Our work addresses this research gap. Focusing on AI-generated images, we study the implications of labels, including the possibility of mislabeling. Assuming that simplicity, transparency, and trust are likely to impact the successful adoption of such labels, we first qualitatively explore users' opinions and expectations of AI labeling using five focus groups. Second, we conduct a pre-registered online survey with over 1300 U.S. and EU participants to quantitatively assess the effect of AI labels on users' ability to recognize misinformation containing either human-made or AI-generated images. Our focus groups illustrate that, while participants have concerns about the practical implementation of labeling, they consider it helpful in identifying AI-generated images and avoiding deception. However, considering security benefits, our survey revealed an ambiguous picture, suggesting that users might over-rely on labels. While inaccurate claims supported by labeled AI-generated images were rated less credible than those with unlabeled AI-images, the belief in accurate claims also decreased when accompanied by a labeled AI-generated image. Moreover, we find the undesired side effect that human-made images conveying inaccurate claims were perceived as more credible in the presence of labels.

Security Benefits and Side Effects of Labeling AI-Generated Images

TL;DR

This study investigates the security benefits and side effects of labeling AI-generated images through a mixed-methods design combining five focus groups and a preregistered online survey (N=1,354 valid). It finds that while AI labels can reduce belief in misinformation linked to AI-generated content, they can also cause overreliance, reduce trust for true claims, and paradoxically elevate the credibility of some non-AI misinformation; mislabeling risks further erode trust. The results highlight that labels are not a panacea and must be implemented with simple, consistent rules, transparent mechanisms, and robust handling of mislabeling to achieve meaningful security benefits. The findings have important regulatory and platform-design implications for disclosure policies and the broader effort to combat AI-enabled misinformation. Overall, labeling is a valuable transparency tool but should be complemented with other safeguards and carefully managed to avoid unintended trust distortions.

Abstract

Generative artificial intelligence is developing rapidly, impacting humans' interaction with information and digital media. It is increasingly used to create deceptively realistic misinformation, so lawmakers have imposed regulations requiring the disclosure of AI-generated content. However, only little is known about whether these labels reduce the risks of AI-generated misinformation. Our work addresses this research gap. Focusing on AI-generated images, we study the implications of labels, including the possibility of mislabeling. Assuming that simplicity, transparency, and trust are likely to impact the successful adoption of such labels, we first qualitatively explore users' opinions and expectations of AI labeling using five focus groups. Second, we conduct a pre-registered online survey with over 1300 U.S. and EU participants to quantitatively assess the effect of AI labels on users' ability to recognize misinformation containing either human-made or AI-generated images. Our focus groups illustrate that, while participants have concerns about the practical implementation of labeling, they consider it helpful in identifying AI-generated images and avoiding deception. However, considering security benefits, our survey revealed an ambiguous picture, suggesting that users might over-rely on labels. While inaccurate claims supported by labeled AI-generated images were rated less credible than those with unlabeled AI-images, the belief in accurate claims also decreased when accompanied by a labeled AI-generated image. Moreover, we find the undesired side effect that human-made images conveying inaccurate claims were perceived as more credible in the presence of labels.

Paper Structure

This paper contains 73 sections, 8 figures, 4 tables.

Figures (8)

  • Figure 1: Example stimuli from all four subsets.
  • Figure 2: Participants' opinion on how mislabeled AI-generated (AI) and human-made (H) images affect users' trust. Claims were "Users lose trust in the labeling system if they become aware of such mislabeling." and "It is not a problem if such mislabeling only happens once in a while."
  • Figure 3: Mean accuracy with 95% confidence intervals, separated by Image Type (human vs. AI), Veracity (true vs. false), and Group (control vs. treatment).
  • Figure 4: Stimuli in the human/true subset
  • Figure 5: Stimuli in the human/false subset.
  • ...and 3 more figures