Beyond Cosine Similarity
Xinbo Ai
TL;DR
This paper reexamines semantic similarity beyond the conventional cosine metric by deriving a tighter dot-product bound via the Rearrangement Inequality and introducing recos, a similarity measure that normalizes by the sorted–and–reordered dot product $|\mathbf{u}^{\uparrow} \cdot \mathbf{v}^{\updownarrow}|$, thereby capturing ordinal concordance in addition to angular information. The authors formalize a hierarchy of bounds, define three metrics (recos, cos, decos) with distinct saturation conditions, and prove that recos has the broadest capture range. Empirically, recos consistently improves correlation with human judgments on seven STS benchmarks across 11 embedding models, with statistically robust gains, especially for complex and universal embeddings like CLIP-ViT, DPR, and SPECTER. The work demonstrates that ordinal patterns across embedding dimensions carry meaningful semantic signals and can complement traditional angular metrics, offering a principled alternative with practical retrieval implications while noting computational overhead and avenues for scalable approximations.
Abstract
Cosine similarity, the standard metric for measuring semantic similarity in vector spaces, is mathematically grounded in the Cauchy-Schwarz inequality, which inherently limits it to capturing linear relationships--a constraint that fails to model the complex, nonlinear structures of real-world semantic spaces. We advance this theoretical underpinning by deriving a tighter upper bound for the dot product than the classical Cauchy-Schwarz bound. This new bound leads directly to recos, a similarity metric that normalizes the dot product by the sorted vector components. recos relaxes the condition for perfect similarity from strict linear dependence to ordinal concordance, thereby capturing a broader class of relationships. Extensive experiments across 11 embedding models--spanning static, contextualized, and universal types--demonstrate that recos consistently outperforms traditional cosine similarity, achieving higher correlation with human judgments on standard Semantic Textual Similarity (STS) benchmarks. Our work establishes recos as a mathematically principled and empirically superior alternative, offering enhanced accuracy for semantic analysis in complex embedding spaces.
