Exploiting Parallelism for Fast Feynman Diagrammatics
John Sturt, Evgeny Kozik
TL;DR
This paper tackles the computational bottleneck of high-order Feynman diagram sums in Diagrammatic Monte Carlo by introducing GPU-accelerated CoS, which encodes the sum over diagrams as a directed graph and evaluates it with a factorised, parallelizable structure. The CoS approach reduces the apparent factorial complexity to $\mathcal{O}(n^2 2^n)$ operations and, when mapped onto GPUs via graph flattening and cooperative thread blocks, yields orders-of-magnitude speed-ups over CPU implementations. The authors demonstrate performance gains across consumer to server GPUs, achieving acceleration up to about $10^3$ times and enabling studies of strong-correlation physics previously out of reach. They discuss limitations such as memory bandwidth and synchronization, and outline future directions including cross-platform accelerator support, mixed precision, and tensor-train representations. Overall, the work lowers the practical barrier to high-order DiagMC and broadens access to nonperturbative quantum many-body phenomena.
Abstract
Diagrammatic expansions are a paradigmatic and powerful tool of quantum many-body theory. Their evaluation to high order, e.g., by the Diagrammatic Monte Carlo technique, can provide unbiased results in strongly correlated and challenging regimes. However, calculating a factorial number of terms to acceptable precision remains very costly even for state-of-the-art methods. We achieve a dramatic acceleration of evaluating Feynman's diagrammatic series by use of specialised hardware architecture within the recently introduced combinatorial summation (CoS) framework. We present how exploiting the massive parallelism and concurrency available from GPUs leads to orders of magnitude improvement in computation time even on consumer-grade hardware. This provides a platform for making probes of novel phenomena of strong correlations much more accessible.
