Universal Asymptotics for Jensen--Shannon Divergence under Shuffling
Alex Shvets
TL;DR
The paper analyzes the Jensen--Shannon divergence between transcript distributions in the shuffle model under neighboring datasets, establishing a universal two-term asymptotic expansion as the number of users $n$ grows. By representing the log-likelihood ratio through an additive statistic $U_n$ and expanding a pointwise functional $D(1+U_n)$, the authors derive $\mathrm{JSD}(T_n\|T'_n) = \frac{\chi^2(W_1\|W_0)}{8n} + \frac{1}{n^2}\big[\frac{7}{64}\chi^4(W_1\|W_0) - \frac{1}{16}\mu_3(W)\big] + O(n^{-3})$ under a mild positivity assumption. They further specialize the result to binary and $k$-ary randomized responses and extend to multi-message protocols by independent repetition, where the leading term scales with $\big(1+\chi^2\big)^m-1$. The work provides explicit remainder control and yields precise privacy-utility guarantees for shuffle-based privacy amplification, with clear guidance for protocol design. The universality across fixed local channels makes the results broadly applicable to a range of practical privacy settings.
Abstract
We study the Jensen--Shannon divergence (JSD) between transcript distributions induced by neighboring datasets in the shuffle model when each user applies a fixed local randomizer and a trusted shuffler releases the output histogram. Under a mild positivity assumption, we prove an explicit two-term asymptotic expansion where the leading term is chi-squared divergence divided by 8n. Binary randomized response and k-ary randomized response follow as corollaries. For multi-message protocols based on independent repetition, the leading coefficient becomes (1 + chi-squared)^m - 1. A fully explicit remainder control is provided in the appendix.
