Is Polarization an Inevitable Outcome of Similarity-Based Content Recommendations? -- Mathematical Proofs and Computational Validation
Minhyeok Lee
TL;DR
This work analyzes polarization arising from similarity-based recommendations using a minimal geometric model where users and content are represented as points in $\mathbb{R}^d$ and updates move toward the median of locally recommended items, formalized by $\mathbf{u}_i^{(t+1)} = \mathbf{u}_i^{(t)} + \alpha(\mathbf{m}_i^{(t)} - \mathbf{u}_i^{(t)}) + \boldsymbol{\eta}_i^{(t)}$ with content weights $w(\tau)=e^{-\lambda\tau}$. In a one-dimensional setting with $M$ fixed content creators at $c_1<\dots<c_M$, the authors prove a monotone contraction leading to at most $M$ clusters around attractors, while simulations in 2D show robust cluster formation under varied parameters. The results indicate that the geometry of proximity-based retrieval in latent spaces can drive fragmentation even absent explicit ideological cues, highlighting the non-neutral nature of recommendation systems. These insights have implications for designing interventions that promote exposure to diverse viewpoints and for understanding the structural forces shaping online discourse.
Abstract
The increasing reliance on digital platforms shapes how individuals understand the world, as recommendation systems direct users toward content "similar" to their existing preferences. While this process simplifies information retrieval, there is concern that it may foster insular communities, so-called echo chambers, reinforcing existing viewpoints and limiting exposure to alternatives. To investigate whether such polarization emerges from fundamental principles of recommendation systems, we propose a minimal model that represents users and content as points in a continuous space. Users iteratively move toward the median of locally recommended items, chosen by nearest-neighbor criteria, and we show mathematically that they naturally coalesce into distinct, stable clusters without any explicit ideological bias. Computational simulations confirm these findings and explore how population size, adaptation rates, content production probabilities, and noise levels modulate clustering speed and intensity. Our results suggest that similarity-based retrieval, even in simplified scenarios, drives fragmentation. While we do not claim all systems inevitably cause polarization, we highlight that such retrieval is not neutral. Recognizing the geometric underpinnings of recommendation spaces may inform interventions, policies, and critiques that address unintended cultural and ideological divisions.
