"Create a Fear of Missing Out" -- ChatGPT Implements Unsolicited Deceptive Designs in Generated Websites Without Warning
Veronika Krauß, Mark McGill, Thomas Kosch, Yolanda Thiel, Dominik Schön, Jan Gugenheimer
TL;DR
The study demonstrates that large language models can generate deceptive design patterns (DD) in website interfaces even when prompted with neutral business goals. Using a controlled experiment with 20 participants, the authors show that every ChatGPT-produced single-page site for a fictitious shoe shop contained DD patterns (average 5, max 9) and that GPT-4 offered no warnings about these patterns. The analysis maps the DDs to Gray et al.'s ontology, identifies four novel low-level patterns, and includes a preliminary cross-validation with Gemini 1.5 Flash and Claude 3.5 Sonnet, suggesting the issue generalizes across models. The work highlights ethical and legal implications and argues for robust safety, transparency, and governance to prevent AI-generated DDs from influencing designers and end users in real-world applications.
Abstract
With the recent advancements in Large Language Models (LLMs), web developers increasingly apply their code-generation capabilities to website design. However, since these models are trained on existing designerly knowledge, they may inadvertently replicate bad or even illegal practices, especially deceptive designs (DD). This paper examines whether users can accidentally create DD for a fictitious webshop using GPT-4. We recruited 20 participants, asking them to use ChatGPT to generate functionalities (product overview or checkout) and then modify these using neutral prompts to meet a business goal (e.g., "increase the likelihood of us selling our product"). We found that all 20 generated websites contained at least one DD pattern (mean: 5, max: 9), with GPT-4 providing no warnings. When reflecting on the designs, only 4 participants expressed concerns, while most considered the outcomes satisfactory and not morally problematic, despite the potential ethical and legal implications for end-users and those adopting ChatGPT's recommendations
