Language processing in humans and computers

Dusko Pavlovic

Language processing in humans and computers

Dusko Pavlovic

TL;DR

This work surveys language processing in humans and computers, contrasting how meaning is constructed and communicated in biological systems with how language engines operate. It frames language production as predictive inference and situates it within a broad information-theoretic and channel-based perspective, arguing that both human and machine learning rely on language-like structures to build and share meaning. The text traverses syntax (from formal grammars and pregroup grammars to dependency parsing), semantics (static vector-based representations and dynamic, context-sensitive meaning), and learning theory (from pigeons to perceptrons and Kolmogorov-Arnold decompositions), presenting language as a universal learning mechanism. It also highlights practical and philosophical challenges—hallucinations, copyright, grounding, and the evolving role of the Web—while proposing a framework in which future AI systems could safely extend human cognition through grounded, context-aware learning pipelines.

Abstract

Machine-learned language models have transformed everyday life: they steer us when we study, drive, manage money. They have the potential to transform our civilization. But they hallucinate. Their realities are virtual. This note provides a high-level overview of language models and outlines a low-level model of learning machines. It turns out that, after they become capable of recognizing hallucinations and dreaming safely, as humans tend to be, the language-learning machines proceed to generate broader systems of false beliefs and self-confirming theories, as humans tend to do.

Language processing in humans and computers

TL;DR

Abstract

Paper Structure (137 sections, 67 equations, 46 figures, 2 tables)

This paper contains 137 sections, 67 equations, 46 figures, 2 tables.

Preface: On language models and celebrities
Who are chatbots and what are they to you?
Chatbots: Shifting the paradigm of meaning
What just happened?
The paradigm of meaning
Zeno and the aliens
AI: How did we get here and where do we go?
The mind-body problem and solution
Turing and Darwin
From search engines to language engines
What does the Web past say about the AI future?
Afterthoughts: Four elephants in the room with chatbots
The first elephant in the room: The Web
The second elephant in the room: The pocket calculator
The third elephant in the room: Hallucinations
...and 122 more sections

Figures (46)

Figure 1: Transformer architecture revealed
Figure 2: GPT4's response to the prompt: "please summarize in shakespearean verse the paper 'quantum measurements without sums' by bob coecke and dusko pavlovic"
Figure 3: People use words to refer to things, chatbots to extend phrases
Figure 4: DALL-E's view: "As Parmenides argued that movement could not exist, Zeno paced around."
Figure 5: Network of neurons from Turing's 1947 memo on Intelligent Machinery
...and 41 more figures

Language processing in humans and computers

TL;DR

Abstract

Language processing in humans and computers

Authors

TL;DR

Abstract

Table of Contents

Figures (46)