Table of Contents
Fetching ...

Geospatial distributions reflect rates of evolution of features of language

Henri Kauhanen, Deepthi Gopal, Tobias Galla, Ricardo Bermúdez-Otero

Abstract

Quantifying the speed of linguistic change is challenging due to the fact that the historical evolution of languages is sparsely documented. Consequently, traditional methods rely on phylogenetic reconstruction. In this paper, we propose a model-based approach to the problem through the analysis of language change as a stochastic process combining vertical descent, spatial interactions, and mutations in both dimensions. A notion of linguistic temperature emerges naturally from this analysis as a dimensionless measure of the propensity of a linguistic feature to undergo change. We demonstrate how temperatures of linguistic features can be inferred from their present-day geospatial distributions, without recourse to information about their phylogenies. Thus the evolutionary dynamics of language, operating across thousands of years, leaves a measurable geospatial signature. This signature licenses inferences about the historical evolution of languages even in the absence of longitudinal data.

Geospatial distributions reflect rates of evolution of features of language

Abstract

Quantifying the speed of linguistic change is challenging due to the fact that the historical evolution of languages is sparsely documented. Consequently, traditional methods rely on phylogenetic reconstruction. In this paper, we propose a model-based approach to the problem through the analysis of language change as a stochastic process combining vertical descent, spatial interactions, and mutations in both dimensions. A notion of linguistic temperature emerges naturally from this analysis as a dimensionless measure of the propensity of a linguistic feature to undergo change. We demonstrate how temperatures of linguistic features can be inferred from their present-day geospatial distributions, without recourse to information about their phylogenies. Thus the evolutionary dynamics of language, operating across thousands of years, leaves a measurable geospatial signature. This signature licenses inferences about the historical evolution of languages even in the absence of longitudinal data.

Paper Structure

This paper contains 18 sections, 88 equations, 3 figures.

Figures (3)

  • Figure S1: Correlations between quantities predicted by theory [Eqs. (\ref{['eq:steady-rho-full']}), (\ref{['eq:tau-full']}) and (\ref{['eq:sigma-full']})] and measured from numerical simulations ($\rho$: feature frequency, $\sigma$: isogloss density, $\tau$: temperature). One hundred independent features on a $50\times 50$ lattice at 25,000,000 iterations.
  • Figure S2: Left: Temperature estimates computed with varying neighbourhood sizes correlated against the temperature estimates of the main analysis (neighbourhood size $k = 10$). Right: Distribution (Gaussian kernel estimate) of distance to $k$th neighbour ($k=1, 10, 50$).
  • Figure S3: Correlations between quantities predicted by theory [Eqs. (\ref{['eq:steady-rho-full']}), (\ref{['eq:tau-full']}) and (\ref{['eq:sigma-full']})] and measured from numerical simulations on an adjacency graph inferred from the WALS atlas ($\rho$: feature frequency, $\sigma$: isogloss density, $\tau$: temperature). One hundred independent features at 25,000,000 iterations.