Defining Knowledge: Bridging Epistemology and Large Language Models

Constanza Fierro; Ruchira Dhar; Filippos Stamatiou; Nicolas Garneau; Anders Søgaard

TL;DR

The paper asks what it means for an LLM to know something and argues that NLP research often treats knowledge without a solid epistemological footing. It reviews five standard definitions of knowledge from epistemology (tb-knowledge, j-knowledge, g-knowledge, v-knowledge, and p-knowledge), formalizes an interpretation of each that applies to LLMs, and compares these interpretations with current NLP evaluation practice. A survey of 100 professional philosophers and computer scientists reveals disagreement over which definitions respondents prefer, along with an overall tendency to accept that non-human knowledge is possible, while whether LLMs have empirical knowledge remains contested. The authors propose definition-aligned evaluation protocols and argue that grounding knowledge claims in epistemology can lead to more rigorous, trustworthy assessments of what LLMs know and how to test it.

Abstract

Knowledge claims are abundant in the literature on large language models (LLMs); but can we say that GPT-4 truly "knows" the Earth is round? To address this question, we review standard definitions of knowledge in epistemology and we formalize interpretations applicable to LLMs. In doing so, we identify inconsistencies and gaps in how current NLP research conceptualizes knowledge with respect to epistemological frameworks. Additionally, we conduct a survey of 100 professional philosophers and computer scientists to compare their preferences in knowledge definitions and their views on whether LLMs can really be said to know. Finally, we suggest evaluation protocols for testing knowledge in accordance with the most relevant definitions.

Paper Structure (36 sections, 6 equations, 9 figures, 2 tables)

Introduction
Contributions
Definitions of Knowledge
True beliefs (tb-knowledge)
Justification (j-knowledge)
Sui generis (g-knowledge)
Virtue (v-knowledge)
Predictive accuracy (p-knowledge)
Knowledge in NLP Research
tb-knowledge
j-knowledge
g-knowledge
v-knowledge
p-knowledge
Survey Results
...and 21 more sections

Figures (9)

Figure 1: From our survey (§ Survey Results): Philosophers and computer scientists prefer different definitions of knowledge.
Figure 2: Respondents' understanding of LLMs.
Figure 3: Respondents' understanding of epistemology.
Figure 4: Disagreements on epistemological definitions of knowledge.
Figure 5: Four of the survey questions and their respective answers.
...and 4 more figures

Theorems & Definitions (7)

Definition 2.1: belief
Definition 2.2: belief$^+$
Definition 2.3: tb-knowledge
Definition 2.4: j-knowledge
Definition 2.5: g-knowledge
Definition 2.6: v-knowledge
Definition 2.7: p-knowledge
