Computational Sentence-level Metrics Predicting Human Sentence Comprehension
Kun Sun, Rong Wang
TL;DR
This work addresses the gap in modeling sentence-level human comprehension by introducing two metrics—sentence surprisal and sentence relevance—computed with multilingual LLMs. Sentence surprisal uses next-sentence probability or chain-rule-based probabilities, while sentence relevance uses an attention-inspired, memory-weighted semantic similarity across surrounding sentences. Evaluated on the Multilingual Eye-tracking Corpus (MECO) with 13 languages, the metrics predict sentence reading speed via Generalized Additive Mixed Models, with combined surprisal and relevance yielding the strongest cross-linguistic predictive power ($\Delta \text{AIC}$ substantially negative). The findings show that sentence-level metrics generalize across languages and offer interpretable insight into discourse-level processing, supporting closer integration of LLM-based representations with cognitive language processing research. These results have potential to inform cross-linguistic models of reading and to enhance NLP systems with discourse-aware, cognitively plausible metrics.
Abstract
The majority of research in computational psycholinguistics has concentrated on the processing of words. This study introduces innovative methods for computing sentence-level metrics using multilingual large language models. The metrics developed sentence surprisal and sentence relevance and then are tested and compared to validate whether they can predict how humans comprehend sentences as a whole across languages. These metrics offer significant interpretability and achieve high accuracy in predicting human sentence reading speeds. Our results indicate that these computational sentence-level metrics are exceptionally effective at predicting and elucidating the processing difficulties encountered by readers in comprehending sentences as a whole across a variety of languages. Their impressive performance and generalization capabilities provide a promising avenue for future research in integrating LLMs and cognitive science.
