A Quantum-Inspired Analysis of Human Disambiguation Processes
Daphne Wang
TL;DR
This work develops a quantum-inspired framework to model human disambiguation in English, using category-theoretic tools (sheaves, presheaves, and monoidal categories) and contextuality formalisms to connect lexical/syntactic ambiguities with quantum-like statistics. By analyzing corpus and human-judgment data through Contextuality-by-Default and the sheaf-theoretic approach, the study uncovers quantum-like contextuality in lexical phrases and notable causal structure in verb-driven disambiguation. It also demonstrates that quantum simulations and variational circuits can approximate human disambiguation patterns, offering a pathway to quantum-native NLP methods and potential advantages in certain linguistic tasks. The findings illuminate how contextuality and causality frameworks capture human parsing dynamics and reading-time effects, suggesting new directions for NLP models that leverage quantum-inspired representations and causally structured data.
Abstract
Formal languages are essential for computer programming and are constructed to be easily processed by computers. In contrast, natural languages are much more challenging and instigated the field of Natural Language Processing (NLP). One major obstacle is the ubiquity of ambiguities. Recent advances in NLP have led to the development of large language models, which can resolve ambiguities with high accuracy. At the same time, quantum computers have gained much attention in recent years as they can solve some computational problems faster than classical computers. This new computing paradigm has reached the fields of machine learning and NLP, where hybrid classical-quantum learning algorithms have emerged. However, more research is needed to identify which NLP tasks could benefit from a genuine quantum advantage. In this thesis, we applied formalisms arising from foundational quantum mechanics, such as contextuality and causality, to study ambiguities arising from linguistics. By doing so, we also reproduced psycholinguistic results relating to the human disambiguation process. These results were subsequently used to predict human behaviour and outperformed current NLP methods.
