Prompt-based Personality Profiling: Reinforcement Learning for Relevance Filtering

Jan Hofmann; Cornelia Sindermann; Roman Klinger

Prompt-based Personality Profiling: Reinforcement Learning for Relevance Filtering

Jan Hofmann, Cornelia Sindermann, Roman Klinger

TL;DR

The paper tackles efficient author profiling from long social-media profiles by filtering for only the most relevant posts before prompts are used for zero-shot personality prediction. It introduces RL-Profiler, a two-part system with a Selection Network that learns which posts matter and a Classification Network that predicts Big Five traits via LLM prompting, trained with reinforcement-learning rewards derived from the prompt-based predictor's performance. Across PAN-AP-2015 data, RL-Profiler achieves near-full-data performance with far smaller context and shows substantial improvements in artificially balanced data, illustrating the method's potential to focus LLM attention on informative content. The approach reduces computation and resource usage while enabling robust personality inference, offering a path toward scalable, eco-friendly social-media analysis and broader applicability to other profiling tasks.

Abstract

Author profiling is the task of inferring characteristics about individuals by analyzing content they share. Supervised machine learning still dominates automatic systems that perform this task, despite the popularity of prompting large language models to address natural language understanding tasks. One reason is that the classification instances consist of large amounts of posts, potentially a whole user profile, which may exceed the input length of Transformers. Even if a model can use a large context window, the entirety of posts makes the application of API-accessed black box systems costly and slow, next to issues which come with such "needle-in-the-haystack" tasks. To mitigate this limitation, we propose a new method for author profiling which aims at distinguishing relevant from irrelevant content first, followed by the actual user profiling only with relevant data. To circumvent the need for relevance-annotated data, we optimize this relevance filter via reinforcement learning with a reward function that utilizes the zero-shot capabilities of large language models. We evaluate our method for Big Five personality trait prediction on two Twitter corpora. On publicly available real-world data with a skewed label distribution, our method shows similar efficacy to using all posts in a user profile, but with a substantially shorter context. An evaluation on a version of these data balanced with artificial posts shows that the filtering to relevant posts leads to a significantly improved accuracy of the predictions.

Prompt-based Personality Profiling: Reinforcement Learning for Relevance Filtering

TL;DR

Abstract

Paper Structure (35 sections, 5 equations, 5 figures, 9 tables, 1 algorithm)

This paper contains 35 sections, 5 equations, 5 figures, 9 tables, 1 algorithm.

Introduction
Related Work
Zero-Shot Predictions with Large Language Models
Personality in Psychology
Automatic Personality Prediction from Text
RL-Profiler: Reinforcement Learning by LLM-based Performance Rewards
Selection Network (SelNet)
Training the RL agent
Reward.
Policy Optimization.
Pre-training using Mutual Information
Classification Network (CNet)
Experiments
Experimental Setting and Training Details
Corpora
...and 20 more sections

Figures (5)

Figure 1: Overview on the workflow of the RL-Profiler (RL: Reinforcement Learning; SelNet: Selection Network; CNet: Classification Network; LLM: Large Language Model).
Figure 2: Prompt template used in CNet for predicting a level of extraversion.
Figure 3: Visual representation of macro $\textrm{F}_1$ scores for selection-based models with different settings for top-$N$ on validation data. The x-axis (not true to scale) shows settings for top-$N$, i.e., $N \in \{5,10,20,30,50\}$ (linearly interpolated), while the y-axis shows the corresponding macro $\textrm{F}_1$ scores. If $N$ exceeds the number of available posts in profiles, all models converge to the ALL+CNet system since all systems select all available posts.
Figure 4: Visual representation of macro $\textrm{F}_1$ scores for selection-based models with different settings for top-$N$ on artificially enriched validation data. The x-axis (not true to scale) shows settings for top-$N$, i.e., $N \in \{5,10,20,30,50\}$ (linearly interpolated), while the y-axis shows the corresponding macro $\textrm{F}_1$ scores. If $N$ exceeds the number of available posts in profiles, all models converge to the ALL+CNet system since all systems select all available posts.
Figure 5: Prompt templates for generating artificial posts indicating a high and low level of extraversion.

Prompt-based Personality Profiling: Reinforcement Learning for Relevance Filtering

TL;DR

Abstract

Prompt-based Personality Profiling: Reinforcement Learning for Relevance Filtering

Authors

TL;DR

Abstract

Table of Contents

Figures (5)