Human Perception of LLM-generated Text Content in Social Media Environments
Kristina Radivojevic, Matthew Chou, Karla Badillo-Urquiola, Paul Brenner
TL;DR
This paper investigates how humans perceive and identify LLM-generated text in social media and whether the Uncanny Valley extends to textual content. Using a dataset from a real-world bot-human social-media experiment, the authors administered a static-post survey to 1,095 participants and found that identification accuracy is only $42\%$, with notable gender and age influences on detection and prediction. Latent topics reveal two high-signal patterns in bot perception: emotion/interaction cues and language evolution mechanics, and a robust UV effect appears in perceived discomfort toward bot-like text. The work highlights important security and ethics implications, suggesting educational tools and policy guardrails to mitigate manipulation risks, while acknowledging limitations due to the use of prompt-engineered rather than fine-tuned LLMs. These insights advance understanding of human-computer discourse in online environments and inform future research on robust bot-detection and user education strategies.
Abstract
Emerging technologies, particularly artificial intelligence (AI), and more specifically Large Language Models (LLMs) have provided malicious actors with powerful tools for manipulating digital discourse. LLMs have the potential to affect traditional forms of democratic engagements, such as voter choice, government surveys, or even online communication with regulators; since bots are capable of producing large quantities of credible text. To investigate the human perception of LLM-generated content, we recruited over 1,000 participants who then tried to differentiate bot from human posts in social media discussion threads. We found that humans perform poorly at identifying the true nature of user posts on social media. We also found patterns in how humans identify LLM-generated text content in social media discourse. Finally, we observed the Uncanny Valley effect in text dialogue in both user perception and identification. This indicates that despite humans being poor at the identification process, they can still sense discomfort when reading LLM-generated content.
