Dynamik: Syntactically-Driven Dynamic Font Sizing for Emphasis of Key Information
Naoto Nishida, Yoshio Ishiguro, Jun Rekiomto, Naomi Yamashita
TL;DR
Dynamik tackles cognitive load in subtitle reading for non-native speakers by dynamically sizing keywords using a simple linguistic criterion that separates content words from function words. Implemented in a Unity-based real-time system with Azure Speech recognition and spaCy morphology/POS tagging, it supports three subtitle modes and is evaluated through crowd-sourced testing with 84 non-native English speakers across CNN news clips. Results indicate that Dynamik reduces mental workload and improves perceived comprehension among lower-English-proficiency participants, while producing no robust differences for native speakers. The approach offers practical potential to reduce subtitle display area and adapt to other languages, while inviting further work on alternative cues, refined keyword extraction, and latency optimization.
Abstract
In today's globalized world, there are increasing opportunities for individuals to communicate using a common non-native language (lingua franca). Non-native speakers often have opportunities to listen to foreign languages, but may not comprehend them as fully as native speakers do. To aid real-time comprehension, live transcription of subtitles is frequently used in everyday life (e.g., during Zoom conversations, watching YouTube videos, or on social networking sites). However, simultaneously reading subtitles while listening can increase cognitive load. In this study, we propose Dynamik, a system that reduces cognitive load during reading by decreasing the size of less important words and enlarging important ones, thereby enhancing sentence contrast. Our results indicate that Dynamik can reduce certain aspects of cognitive load, specifically, participants' perceived performance and effort among individuals with low proficiency in English, as well as enhance the users' sense of comprehension, especially among people with low English ability. We further discuss our methods' applicability to other languages and potential improvements and further research directions.
