Partition-theoretic model of prime distribution
Aidan Botkin, Madeline L. Dawsey, David J. Hemmer, Matthew R. Just, Robert Schneider
TL;DR
This work introduces a deterministic, partition-theoretic model of prime distribution centered on the formula $p_n = 1 + 2 \sum_{j=1}^{n-1} \left\lceil \frac{d(j)}{2}\right\rceil + \varepsilon(n)$, with $d$ the divisor function and $\varepsilon(n)$ negligible, bridging partition theory and multiplicative number theory via norm and supernorm statistics. The authors demonstrate that Model 1 naturally yields the prime-number theorem regime $p_n \sim n\log n$ and posits a mechanism for twin primes through the case $d(n)=2$, while also predicting local prime-gap variations; they subsequently refine the model (Model 2, Model 2*) by incorporating $\pi_2(p_{n-1})$ corrections to produce highly accurate estimates for $\pi(n)$ up to $n=10^4$ and beyond. Extensive computational testing validates qualitative alignment with known prime-distribution patterns and reveals both promising predictions and notable limitations, particularly for larger $n$ and for twin-prime behavior. The work presents a complementary, highly structured framework to the probabilistic models of primes, offering new avenues for testing partition-theoretic influences on prime gaps and local fluctuations.
Abstract
We make an application of ideas from partition theory to a problem in multiplicative number theory. We propose a deterministic model of prime number distribution, from first principles related to properties of integer partitions, that naturally predicts the prime number theorem as well as the twin prime conjecture. The model posits that, for $n\geq 2$, $$p_{n}\ =\ 1\ +\ 2\sum_{j=1}^{n-1}\left\lceil \frac{d(j)}{2}\right\rceil\ +\ \varepsilon(n),$$ where $p_k$ is the $k$th prime number, $d(k)$ is the divisor function, and $\varepsilon(k)$ is an explicit error term that is negligible asymptotically; both the main term and error term represent enumerative functions in our conceptual model. We refine the error term to give numerical estimates of $π(n)$ similar to those provided by the logarithmic integral, and much more accurate than $\operatorname{li}(n)$ up to $n=10{,}000$ where the estimates are {\it almost exact}. We then perform computational tests of unusual predictions of the model, finding limited evidence of predictable variations in prime gaps.
