Social Good or Scientific Curiosity? Uncovering the Research Framing Behind NLP Artefacts
Eric Chamoun, Nedjma Ousidhoum, Michael Schlichtkrull, Andreas Vlachos
TL;DR
This work formalizes a cross-domain schema for NLP research framings and develops an automated three-stage framework to infer framings from papers: extract epistemic elements, link them via interpretable rules to candidate framings, and refine predictions with an LLM. Evaluations on automated fact-checking and hate speech detection show the approach outperforms a strong LLM baseline and excels at identifying underspecified framings, revealing trends toward more scientific curiosity and greater justification for human-in-the-loop systems. The authors demonstrate scalability by applying the framework to over a hundred AFC papers, uncovering shifts toward human-assisted fact-checking and richer justification generation. Overall, the work enables scalable, transparent analysis of how NLP artefacts are framed and used, informing responsible research practices and framing transparency.
Abstract
Clarifying the research framing of NLP artefacts (e.g., models, datasets, etc.) is crucial to aligning research with practical applications. Recent studies manually analyzed NLP research across domains, showing that few papers explicitly identify key stakeholders, intended uses, or appropriate contexts. In this work, we propose to automate this analysis, developing a three-component system that infers research framings by first extracting key elements (means, ends, stakeholders), then linking them through interpretable rules and contextual reasoning. We evaluate our approach on two domains: automated fact-checking using an existing dataset, and hate speech detection for which we annotate a new dataset-achieving consistent improvements over strong LLM baselines. Finally, we apply our system to recent automated fact-checking papers and uncover three notable trends: a rise in vague or underspecified research goals, increased emphasis on scientific exploration over application, and a shift toward supporting human fact-checkers rather than pursuing full automation.
