Swap distance minimization in SOV languages. Cognitive and mathematical foundations
Ramon Ferrer-i-Cancho, Savithry Namboodiripad
TL;DR
This study formalizes swap distance minimization as a gradient constraint on word order by introducing word order rotation and a permutation-ring framework. It tests three flexible SOV languages—Korean, Malayalam, and Sinhalese—using acceptability, reaction-time, error, and corpus data, analyzed through Kendall $\tau$ correlations and a Monte Carlo global analysis across conditions. The results show robust swap-distance effects in Korean and Malayalam, with weaker but detectable evidence in Sinhalese, and reveal that swap distance minimization often dominates a simple canonical-order preference while co-occurring with verb-predictability pressures. Together, these findings support a gradient, language-specific view of word-order optimization and provide a quantitative framework for cross-linguistic variation in syntactic ordering and its cognitive correlates.
Abstract
Distance minimization is a general principle of language. A special case of this principle in the domain of word order is swap distance minimization. This principle predicts that variations from a canonical order that are reached by fewer swaps of adjacent constituents are lest costly and thus more likely. Here we investigate the principle in the context of the triple formed by subject (S), object (O) and verb (V). We introduce the concept of word order rotation as a cognitive underpinning of that prediction. When the canonical order of a language is SOV, the principle predicts SOV < SVO, OSV < VSO, OVS < VOS, in order of increasing cognitive cost. We test the prediction in three flexible order SOV languages: Korean (Koreanic), Malayalam (Dravidian), and Sinhalese (Indo-European). Evidence of swap distance minimization is found in all three languages, but it is weaker in Sinhalese. Swap distance minimization is stronger than a preference for the canonical order in Korean and especially Malayalam.
