Large Language Models Help Reveal Unhealthy Diet and Body Concerns in Online Eating Disorders Communities
Minh Duc Chu, Zihao He, Rebecca Dorn, Kristina Lerman
TL;DR
The paper tackles the challenge of identifying unhealthy online ED communities that use obfuscated language by proposing a framework that aligns open-source LLMs to the linguistic patterns of specific ED communities. It builds a large-scale data pipeline from 2.6 million tweets, detects 402 communities via retweet networks, and selects the 20 largest for detailed analysis. By fine-tuning Llama-3 on community posts, the authors create community proxies that are then evaluated against psychometric instruments (SWED) to reveal varying ED risk across communities, with Pro-ED showing the highest risk and Anti-ED showing lower risk. The approach yields robust cross-validation across classification, toxicity, emotion, embedding similarity, and human judgments, offering a scalable tool for public health monitoring and targeted interventions, while acknowledging limitations around coverage, prompts, biases, and ethical considerations.
Abstract
Eating disorders (ED), a severe mental health condition with high rates of mortality and morbidity, affect millions of people globally, especially adolescents. The proliferation of online communities that promote and normalize ED has been linked to this public health crisis. However, identifying harmful communities is challenging due to the use of coded language and other obfuscations. To address this challenge, we propose a novel framework to surface implicit attitudes of online communities by adapting large language models (LLMs) to the language of the community. We describe an alignment method and evaluate results along multiple dimensions of semantics and affect. We then use the community-aligned LLM to respond to psychometric questionnaires designed to identify ED in individuals. We demonstrate that LLMs can effectively adopt community-specific perspectives and reveal significant variations in eating disorder risks in different online communities. These findings highlight the utility of LLMs to reveal implicit attitudes and collective mindsets of communities, offering new tools for mitigating harmful content on social media.
