ImpScore: A Learnable Metric For Quantifying The Implicitness Level of Sentence
Yuxin Wang, Xiaomeng Zhu, Weimin Lyu, Saeed Hassanpour, Soroush Vosoughi
TL;DR
ImpScore introduces a reference-free metric $I$ that quantifies sentence implicitness as the divergence between semantic meaning and pragmatic interpretation. Built on a Sentence-BERT backbone, it projects embeddings into separate semantic and pragmatic spaces via projection matrices and a transformation, then trains with pairwise contrastive losses on $112{,}580$ (implicit, explicit) pairs to produce robust scores. Across in-distribution and out-of-distribution evaluations, ImpScore achieves high implicitness and pragmatics accuracy and correlates strongly with human judgments, while revealing LLM limitations on highly implicit content in hate-speech contexts. The approach is lightweight, scalable, and openly accessible, enabling large-scale implicitness analysis for evaluation, data curation, and potential reinforcement-learning signals in language model training.
Abstract
Handling implicit language is essential for natural language processing systems to achieve precise text understanding and facilitate natural interactions with users. Despite its importance, the absence of a metric for accurately measuring the implicitness of language significantly constrains the depth of analysis possible in evaluating models' comprehension capabilities. This paper addresses this gap by developing a scalar metric that quantifies the implicitness level of language without relying on external references. Drawing on principles from traditional linguistics, we define "implicitness" as the divergence between semantic meaning and pragmatic interpretation. To operationalize this definition, we introduce ImpScore, a reference-free metric formulated through an interpretable regression model. This model is trained using pairwise contrastive learning on a specially curated dataset consisting of (implicit sentence, explicit sentence) pairs. We validate ImpScore through a user study that compares its assessments with human evaluations on out-of-distribution data, demonstrating its accuracy and strong correlation with human judgments. Additionally, we apply ImpScore to hate speech detection datasets, illustrating its utility and highlighting significant limitations in current large language models' ability to understand highly implicit content. Our metric is publicly available at https://github.com/audreycs/ImpScore.
