AI Feedback Enhances Community-Based Content Moderation through Engagement with Counterarguments

Saeedeh Mohammadi; Taha Yasseri

TL;DR

This study explores an AI-assisted hybrid moderation framework in which participants receive AI-generated feedback (supportive, neutral, or argumentative) on their notes and are asked to revise them accordingly. Incorporating the feedback improves the quality of the notes.

Abstract

Today, social media platforms are major sources of news and political communication, but their role in spreading misinformation has raised serious concerns. In response, these platforms have implemented various content moderation strategies. One such method, Community Notes (formerly Birdwatch) on X (formerly Twitter), relies on crowdsourced fact-checking and has gained traction. However, it faces challenges such as partisan bias and delays in verification. This study explores an AI-assisted hybrid moderation framework in which participants receive AI-generated feedback (supportive, neutral, or argumentative) on their notes and are asked to revise them accordingly. The results show that incorporating feedback improves the quality of notes, with the most substantial gains resulting from argumentative feedback. This underscores the value of diverse perspectives and direct engagement in human-AI collective intelligence. The research contributes to ongoing discussions about AI's role in political content moderation, highlighting the potential of generative AI and the importance of informed design.
