On convexity and efficiency in semantic systems
Nathaniel Imel, Noga Zaslavasky
TL;DR
This paper investigates the relationship between convexity and efficiency in human semantic category systems, using color naming as a focal domain. It combines analytical proofs within the Information Bottleneck (IB) framework and empirical analyses to show that convexity and efficiency are distinct properties, though IB-optimal color naming is largely convex; the IB objective is $\mathcal{F}_{\beta}[q] = I_q(M;W) - \beta I_q(W;U)$ and the deviation from optimality is $\varepsilon[q]$. Efficiency proves to be a stronger predictor than convexity for distinguishing attested color naming from rotated or hypothetical variants. Beyond empirical fit, efficiency captures phenomena that convexity cannot, such as adaptation to environmental communicative needs and language evolution. Overall, efficiency provides a more comprehensive account of semantic typology, with convexity offering only partial explanatory power and not a universal constraint.
Abstract
There are two widely held characterizations of human semantic category systems: (1) they form convex partitions of conceptual spaces, and (2) they are efficient for communication. While prior work observed that convexity and efficiency co-occur in color naming, the analytical relation between them and why they co-occur have not been well understood. We address this gap by combining analytical and empirical analyses that build on the Information Bottleneck (IB) framework for semantic efficiency. First, we show that convexity and efficiency are distinct in the sense that neither entails the other: there are convex systems which are inefficient, and optimally-efficient systems that are non-convex. Crucially, however, the IB-optimal systems are mostly convex in the domain of color naming, explaining the main empirical basis for the convexity approach. Second, we show that efficiency is a stronger predictor for discriminating attested color naming systems from hypothetical variants, with convexity adding negligible improvement on top of that. Finally, we discuss a range of empirical phenomena that convexity cannot account for but efficiency can. Taken together, our work suggests that while convexity and efficiency can yield similar structural observations, they are fundamentally distinct, with efficiency providing a more comprehensive account of semantic typology.
