Using RL to Identify Divisive Perspectives Improves LLMs Abilities to Identify Communities on Social Media
Nikhil Mehta, Dan Goldwasser
TL;DR
This work tackles the problem of identifying user information communities on social media under emergent events where test data differ from training data. It introduces a two-LM framework in which a smaller prompt-generating LM is trained via supervised learning and reinforcement learning to produce focus_area prompts that guide a fixed LLM_task to form meaningful communities, improving downstream tasks such as bot detection and news source profiling. Four reward signals plus curriculum learning drive the RL training, yielding consistent gains across Reddit and TwiBot datasets and improved inductive profiling of news sources. The approach generalizes across LLM backbones (ChatGPT, Llama 2) and demonstrates practical impact by enhancing community-aware downstream analytics while highlighting considerations for safe, ethical deployment.
Abstract
The large scale usage of social media, combined with its significant impact, has made it increasingly important to understand it. In particular, identifying user communities, can be helpful for many downstream tasks. However, particularly when models are trained on past data and tested on future, doing this is difficult. In this paper, we hypothesize to take advantage of Large Language Models (LLMs), to better identify user communities. Due to the fact that many LLMs, such as ChatGPT, are fixed and must be treated as black-boxes, we propose an approach to better prompt them, by training a smaller LLM to do this. We devise strategies to train this smaller model, showing how it can improve the larger LLMs ability to detect communities. Experimental results show improvements on Reddit and Twitter data, on the tasks of community detection, bot detection, and news media profiling.
