SPOT: An Annotated French Corpus and Benchmark for Detecting Critical Interventions in Online Conversations
Manon Berriche, Célia Nouri, Chloé Clavel, Jean-Philippe Cointet
TL;DR
SPOT introduces a large, annotated French Facebook corpus to study stopping points—ordinary, context-dependent interventions that pause or redirect online conversations. It operationalizes stopping points as a binary task and benchmarks encoder-based CamemBERT models against instruction-tuned LLMs across multiple prompting strategies, showing supervised encoders outperform LLMs by over 10 F1 points and that contextual metadata further improves performance. The dataset includes 43,305 comments tied to posts, articles, and pages, with robust inter-annotator reliability and a detailed annotation guidelines repository for reproducibility. Findings highlight the importance of conversational and publication context for detecting nuanced social interactions, and they point toward graph-based or hierarchical models and cross-platform extensions as promising directions for future work.
Abstract
We introduce SPOT (Stopping Points in Online Threads), the first annotated corpus translating the sociological concept of stopping point into a reproducible NLP task. Stopping points are ordinary critical interventions that pause or redirect online discussions through a range of forms (irony, subtle doubt or fragmentary arguments) that frameworks like counterspeech or social correction often overlook. We operationalize this concept as a binary classification task and provide reliable annotation guidelines. The corpus contains 43,305 manually annotated French Facebook comments linked to URLs flagged as false information by social media users, enriched with contextual metadata (article, post, parent comment, page or group, and source). We benchmark fine-tuned encoder models (CamemBERT) and instruction-tuned LLMs under various prompting strategies. Results show that fine-tuned encoders outperform prompted LLMs in F1 score by more than 10 percentage points, confirming the importance of supervised learning for emerging non-English social media tasks. Incorporating contextual metadata further improves encoder models F1 scores from 0.75 to 0.78. We release the anonymized dataset, along with the annotation guidelines and code in our code repository, to foster transparency and reproducible research.
