Large Language Models Preserve Semantic Isotopies in Story Continuations
Marc Cavazza
TL;DR
This work investigates whether large language models preserve semantic isotopies—structural semantic threads—when continuing stories. By constructing a 10,000-story ROCStories-based pipeline across five LLMs and using GPT-4o to extract isotopies, the authors quantify isotopy coverage, density, and spread in completed texts and analyze semantic properties like relevance and label distributions. They find that isotopies persist through completion, with high coverage, comparable density to human benchmarks, and near-even spread, while exhibiting topic convergence yet lexical variation across models. The study advances the case for a textual-semantic lens on LLM behavior and suggests practical uses for assessing cohesion and guiding training, while acknowledging horizon-length and interpretative limitations. Overall, the results provide initial empirical support that LLM-generated text maintains meaningful isotopic structure consistent with structural semantics.
Abstract
In this work, we explore the relevance of textual semantics to Large Language Models (LLMs), extending previous insights into the connection between distributional semantics and structural semantics. We investigate whether LLM-generated texts preserve semantic isotopies. We design a story continuation experiment using 10,000 ROCStories prompts completed by five LLMs. We first validate GPT-4o's ability to extract isotopies from a linguistic benchmark, then apply it to the generated stories. We then analyze structural (coverage, density, spread) and semantic properties of isotopies to assess how they are affected by completion. Results show that LLM completion within a given token horizon preserves semantic isotopies across multiple properties.
