Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers
Jerry Chongyi Hu, Mohammed Shahid Modi, Boleslaw K. Szymanski
TL;DR
This paper investigates public sentiment dynamics on Chinese social media (Weibo) during the early COVID-19 period, focusing on sarcasm and moderation. It introduces a large-scale sentiment classification pipeline using the instruction-tuned Llama 3 8B with few-shot prompts to categorize millions of posts into positive, negative, neutral, and sarcastic categories, and validates performance against ground-truth baselines. The study compares COVID-19 discourse with the African Swine Fever event, revealing that government messaging amplified positive sentiment during COVID-19 while sarcasm and negative signals tracked surge patterns, informing understanding of online polarization. Despite limitations in sarcasm detection accuracy, the approach demonstrates a scalable, non-English sentiment-analysis workflow with implications for crisis communication and governance.
Abstract
Studying public sentiment during crises is crucial for understanding how opinions and sentiments shift, resulting in polarized societies. We study Weibo, the most popular microblogging site in China, using posts made during the outbreak of the COVID-19 crisis. The study period includes the pre-COVID-19 stage, the outbreak stage, and the early stage of epidemic prevention. We use Llama 3 8B, a Large Language Model, to analyze users' sentiments on the platform by classifying them into positive, negative, sarcastic, and neutral categories. Analyzing sentiment shifts on Weibo provides insights into how social events and government actions influence public opinion. This study contributes to understanding the dynamics of social sentiments during health crises, fulfilling a gap in sentiment analysis for Chinese platforms. By examining these dynamics, we aim to offer valuable perspectives on digital communication's role in shaping society's responses during unprecedented global challenges.
