Multipath parsing in the brain
Berta Franzluebbers, Donald Dunagan, Miloš Stanojević, Jan Buys, John T. Hale
TL;DR
The paper investigates whether human sentence comprehension maintains multiple syntactic analyses in parallel (multipath parsing) rather than a single path. It introduces an incremental generative dependency parser, enhanced with BLOOM-based encodings and Pfeiffer adapters, and compares single-path ($k=1$) to multipath ($k=5$) surprisal predictions against fMRI data collected while English and Chinese speakers listen to an audiobook. Across languages, syntactic surprisal derived from the five-path model better accounts for brain activity, with robust effects localized to bilateral superior temporal gyrus and related regions, supporting a multipath account of real-time parsing. This work advances neurocognitive modeling by linking beam-like, multi-hypothesis parsing to distributed brain responses, offering cross-linguistic evidence and suggesting avenues for higher-temporal-resolution studies and broader stimulus contexts.
Abstract
Humans understand sentences word-by-word, in the order that they hear them. This incrementality entails resolving temporary ambiguities about syntactic relationships. We investigate how humans process these syntactic ambiguities by correlating predictions from incremental generative dependency parsers with timecourse data from people undergoing functional neuroimaging while listening to an audiobook. In particular, we compare competing hypotheses regarding the number of developing syntactic analyses in play during word-by-word comprehension: one vs more than one. This comparison involves evaluating syntactic surprisal from a state-of-the-art dependency parser with LLM-adapted encodings against an existing fMRI dataset. In both English and Chinese data, we find evidence for multipath parsing. Brain regions associated with this multipath effect include bilateral superior temporal gyrus.
