Synchronization on circles and spheres with nonlinear interactions
Christopher Criscitiello, Quentin Rebjock, Andrew D. McRae, Nicolas Boumal
TL;DR
The paper analyzes gradient-flow dynamics for $n$ points on the unit sphere interacting through $\varphi$ of their inner products, connecting to transformer-inspired models. It shows that for $d\ge3$ synchronization holds on connected graphs under increasing convex $\varphi$, and it introduces a new condition on the Taylor coefficients of $\varphi'$ that ensures circle synchronization ($d=2$). It also constructs a real-analytic $\varphi$ that yields spurious non-synchronizing local maxima for all $n\ge5$, illustrating fundamental obstacles on the circle, while showing benign behavior for small $\beta$ and providing large-$\beta$ criteria. Together, these results clarify the circle-sphere dichotomy and address open problems posed by Geshkovski et al. (2024) in the context of transformer-inspired gradient landscapes on manifolds.
Abstract
We consider the dynamics of $n$ points on a sphere in $\mathbb{R}^d$ ($d \geq 2$) which attract each other according to a function $\varphi$ of their inner products. When $\varphi$ is linear ($\varphi(t) = t$), the points converge to a common value (i.e., synchronize) in various connectivity scenarios: this is part of classical work on Kuramoto oscillator networks. When $\varphi$ is exponential ($\varphi(t) = e^{βt}$), these dynamics correspond to a limit of how idealized transformers process data, as described by Geshkovski et al. (2024). Accordingly, they ask whether synchronization occurs for exponential $\varphi$. In the context of consensus for multi-agent control, Markdahl et al. (2018) show that for $d \geq 3$ (spheres), if the interaction graph is connected and $\varphi$ is increasing and convex, then the system synchronizes. What is the situation on circles ($d=2$)? First, we show that $\varphi$ being increasing and convex is no longer sufficient. Then we identify a new condition (that the Taylor coefficients of $\varphi'$ are decreasing) under which we do have synchronization on the circle. In so doing, we provide some answers to the open problems posed by Geshkovski et al. (2024).
