Digital Comprehensibility Assessment of Simplified Texts among Persons with Intellectual Disabilities
Andreas Säuberli, Franz Holzknecht, Patrick Haller, Silvana Deilen, Laura Schiffl, Silvia Hansen-Schirra, Sarah Ebling
TL;DR
This study addresses how to evaluate the comprehensibility of text simplifications for primary target groups, specifically persons with intellectual disabilities, using a digital, tablet-based reading platform. It compares unsimplified, manually simplified, and automatically simplified German texts across two reader groups (ID and non-ID) and analyzes multiple measures—comprehension questions, perceived difficulty, response times, and reading speed—within a Bayesian Rasch framework with three facets. Key findings show that manual simplification more reliably improves objective comprehension for the target group, while automatic simplification yields smaller or mixed effects and may be harder for control readers; cognitive and behavioral data reveal substantial heterogeneity and skimming tendencies in the ID group. The work demonstrates the viability of digital, interaction-level assessments for inclusive evaluation of automatic text simplification and underscores the need to tailor evaluation methods to primary target groups, combining subjective, objective, and behavioral metrics to capture comprehensibility more accurately.
Abstract
Text simplification refers to the process of increasing the comprehensibility of texts. Automatic text simplification models are most commonly evaluated by experts or crowdworkers instead of the primary target groups of simplified texts, such as persons with intellectual disabilities. We conducted an evaluation study of text comprehensibility including participants with and without intellectual disabilities reading unsimplified, automatically and manually simplified German texts on a tablet computer. We explored four different approaches to measuring comprehensibility: multiple-choice comprehension questions, perceived difficulty ratings, response time, and reading speed. The results revealed significant variations in these measurements, depending on the reader group and whether the text had undergone automatic or manual simplification. For the target group of persons with intellectual disabilities, comprehension questions emerged as the most reliable measure, while analyzing reading speed provided valuable insights into participants' reading behavior.
