What If Moderation Didn't Mean Suppression? A Case for Personalized Content Transformation
Rayhan Rashed, Farnaz Jahanbakhsh
TL;DR
The paper argues that centralized content moderation fails to account for subjective harm and often suppresses valuable content. It introduces DIY-MOD, a browser extension that enables personalized content transformation by selectively modifying distressing elements while preserving meaning, guided by user-defined filters and a two-stage AI-assisted selection pipeline. Through two studies, the authors show increased user agency, safety, and engagement, and extract design principles (e.g., cognitive closure, transparency, and context-aware transformations) for effective interventions. The work advocates platform-integrated personalization as a scalable path while emphasizing privacy-by-design, open collaboration, and careful consideration of authorship and civic discourse implications. Overall, it offers a practical middleware approach that could reshape how individuals navigate harmful content without full disengagement from online communities.
Abstract
Centralized content moderation paradigm both falls short and over-reaches: 1) it fails to account for the subjective nature of harm, and 2) it acts with blunt suppression in response to content deemed harmful, even when such content can be salvaged. We first investigate this through formative interviews, documenting how seemingly benign content becomes harmful due to individual life experiences. Based on these insights, we developed DIY-MOD, a browser extension that operationalizes a new paradigm: personalized content transformation. Operating on a user's own definition of harm, DIY-MOD transforms sensitive elements within content in real-time instead of suppressing the content itself. The system selects the most appropriate transformation for a piece of content from a diverse palette--from obfuscation to artistic stylizing--to match the user's specific needs while preserving the content's informational value. Our two-session user study demonstrates that this approach increases users' sense of agency and safety, enabling them to engage with content and communities they previously needed to avoid.
