Can Large Language Models be Effective Online Opinion Miners?
Ryang Heo, Yongsik Seo, Junseong Lee, Dongha Lee
TL;DR
The paper tackles the gap between traditional, extraction-centric opinion mining benchmarks and the complex, multi-source online discourse encountered in real settings. It introduces the Online Opinion Mining Benchmark (OOMB), which provides dual-layer annotations (entity-feature-opinion tuples and opinion-centric insights) and two tasks—Feature-centric Opinion Extraction (FOE) and Opinion-centric Insight Generation (OIG)—to assess both structured extraction and abstractive summarization by large language models. Across ten LLMs, results reveal that while LLMs struggle to precisely extract structured opinions, they show relatively stronger performance in generating cohesive, topic-oriented insights, though still exhibit gaps in capturing implicit intents and nuanced contexts. The work establishes a foundation for LLM-based opinion mining in realistic online environments, demonstrates the value of human-in-the-loop annotation, and outlines future directions such as domain generalization, user-aware annotations, and improved evaluation frameworks.
Abstract
The surge of user-generated online content presents a wealth of insights into customer preferences and market trends. However, the highly diverse, complex, and context-rich nature of such contents poses significant challenges to traditional opinion mining approaches. To address this, we introduce Online Opinion Mining Benchmark (OOMB), a novel dataset and evaluation protocol designed to assess the ability of large language models (LLMs) to mine opinions effectively from diverse and intricate online environments. OOMB provides extensive (entity, feature, opinion) tuple annotations and a comprehensive opinion-centric summary that highlights key opinion topics within each content, thereby enabling the evaluation of both the extractive and abstractive capabilities of models. Through our proposed benchmark, we conduct a comprehensive analysis of which aspects remain challenging and where LLMs exhibit adaptability, to explore whether they can effectively serve as opinion miners in realistic online scenarios. This study lays the foundation for LLM-based opinion mining and discusses directions for future research in this field.
