Causally-Informed Reinforcement Learning for Adaptive Emotion-Aware Social Media Recommendation
Bhavika Jain, Robert Pitsko, Ananya Drishti, Mahfuza Farooque
TL;DR
This work tackles the problem of emotional well-being in social media recommendations by proposing ESMR, a causally informed, emotion-aware hybrid policy that switches between LightGBM-based engagement and reinforcement learning during prolonged negative states. Emotion states are predicted with a Transformer-based TabTransformer, and causal discovery via DirectLiNGAM identifies top drivers used to shape rewards. The approach demonstrates that incorporating emotion dynamics and causal factors into the reward function can reduce negative emotional trajectories and volatility while maintaining engagement—achieving a more emotionally intelligent recommender. The findings highlight the feasibility and significance of dynamically balancing engagement with emotional stability, offering a path toward healthier and more adaptive content curation in practice.
Abstract
Social media recommendation systems play a central role in shaping users' emotional experiences. However, most systems are optimized solely for engagement metrics, such as click rate, viewing time, or scrolling, without accounting for users' emotional states. Repeated exposure to emotionally charged content has been shown to negatively affect users' emotional well-being over time. We propose an Emotion-aware Social Media Recommendation (ESMR) framework that personalizes content based on users' evolving emotional trajectories. ESMR integrates a Transformer-based emotion predictor with a hybrid recommendation policy: a LightGBM model for engagement during stable periods and a reinforcement learning agent with causally informed rewards when negative emotional states persist. Through behaviorally grounded evaluation over 30-day interaction traces, ESMR demonstrates improved emotional recovery, reduced volatility, and strong engagement retention. ESMR offers a path toward emotionally aware recommendations without compromising engagement performance.
