Computing Optimal Manipulations in Cryptographic Self-Selection Proof-of-Stake Protocols
Matheus V. X. Ferreira, Aadityan Ganesh, Jack Hourigan, Hannah Huh, S. Matthew Weinberg, Catherine Yu
TL;DR
The paper addresses the question of how much a strategic actor can manipulate leadership in cryptographic self-selection for Proof-of-Stake protocols. It introduces a principled, computational approach that recasts the problem as finding fixed points of nonlinear sampling operators and uses controlled truncation, discretization, and inflation/deflation to bound errors with provable guarantees. The authors develop a LinearCSSPA variant, propose an ideal fixed-point simulation, and provide practical algorithms to approximate the optimal adversarial reward and strategy, including finite-sample methods and precomputation techniques. Their experimental results tighten prior bounds (e.g., for a 10% stake, 0.1–well-connected adversaries lead at most about 10.15% of rounds) and highlight the crucial role of network connectivity in the profitability of manipulation, with broader implications for designing less manipulable leader selection schemes.
Abstract
Cryptographic Self-Selection is a paradigm employed by modern Proof-of-Stake consensus protocols to select a block-proposing "leader." Algorand [Chen and Micali, 2019] proposes a canonical protocol, and Ferreira et al. [2022] establish bounds $f(α,β)$ on the maximum fraction of rounds a strategic player can lead as a function of their stake $α$ and a network connectivity parameter $β$. While both their lower and upper bounds are non-trivial, there is a substantial gap between them (for example, they establish $f(10\%,1) \in [10.08\%, 21.12\%]$), leaving open the question of how significant of a concern these manipulations are. We develop computational methods to provably nail $f(α,β)$ for any desired $(α,β)$ up to arbitrary precision, and implement our method on a wide range of parameters (for example, we confirm $f(10\%,1) \in [10.08\%, 10.15\%]$). Methodologically, estimating $f(α,β)$ can be phrased as estimating to high precision the value of a Markov Decision Process whose states are countably-long lists of real numbers. Our methodological contributions involve (a) reformulating the question instead as computing to high precision the expected value of a distribution that is a fixed-point of a non-linear sampling operator, and (b) provably bounding the error induced by various truncations and sampling estimations of this distribution (which appears intractable to solve in closed form). One technical challenge, for example, is that natural sampling-based estimates of the mean of our target distribution are \emph{not} unbiased estimators, and therefore our methods necessarily go beyond claiming sufficiently-many samples to be close to the mean.
