Almost Optimal Algorithms for Token Collision in Anonymous Networks
Sirui Bai, Xinyu Fu, Xudong Wu, Penghui Yao, Chaodong Zheng
TL;DR
Token collision in anonymous networks is analyzed under the CONGEST model. The authors present near-optimal deterministic algorithms with time $\tilde{O}(D+kL/\log n)$ and a pipeline extension for large $L$, plus a near-optimal randomized algorithm, with matching lower bounds and impossibility results. The core technique builds BFS trees using identifier-based leadership and uses an identifier-induced forest to convergecast tokens and detect collisions, with a pipeline variant for large tokens. These results illuminate symmetry-breaking limits in anonymous networks and have implications for randomized unique identifiers, set-disjointness reductions, and practical distributed verification.
Abstract
In distributed systems, situations often arise where some nodes each holds a collection of tokens, and all nodes collectively need to determine whether all tokens are distinct. For example, if each token represents a logged-in user, the problem corresponds to checking whether there are duplicate logins. Similarly, if each token represents a data object or a timestamp, the problem corresponds to checking whether there are conflicting operations in distributed databases. In distributed computing theory, unique identifiers generation is also related to this problem: each node generates one token, which is its identifier, then a verification phase is needed to ensure all identifiers are unique. In this paper, we formalize and initiate the study of token collision. In this problem, a collection of $k$ tokens, each represented by some length-$L$ bit string, are distributed to $n$ nodes of an anonymous CONGEST network in an arbitrary manner. The nodes need to determine whether there are tokens with an identical value. We present near optimal deterministic algorithms for the token collision problem with $\tilde{O}(D+k\cdot L/\log{n})$ round complexity, where $D$ denotes the network diameter. Besides high efficiency, the prior knowledge required by our algorithms is also limited. For completeness, we further present a near optimal randomized algorithm for token collision.
