Optimal parallelisation strategies for flat histogram Monte Carlo sampling

Hubert J. Naguszewski; Christopher D. Woodgate; David Quigley

TL;DR

The paper addresses how to parallelise flat-histogram Monte Carlo methods efficiently, specifically Wang-Landau (WL) sampling, when computing the phase behaviour of lattice models. It benchmarks several parallelisation strategies: non-uniform energy-domain decomposition, dynamic load balancing, replica exchange, and varying numbers of walkers per sub-domain. The test case is a fixed-lattice model of the AlTiCrMo high-entropy alloy, with observables derived from the density of states (DOS). The key finding is that non-uniform energy-domain decomposition yields the largest speedups, with dynamic load balancing providing additional, though smaller, gains. Replica exchange leaves efficiency largely unchanged, and 1-2 walkers per sub-domain are typically sufficient. The study offers concrete recommendations for accelerating WL simulations in materials science and in similar flat-histogram frameworks, enabling higher-throughput exploration of phase diagrams and thermodynamics.

Abstract

Flat histogram methods, such as Wang-Landau sampling, provide a means for high-throughput calculation of phase diagrams of atomistic and lattice model systems. Many parallelisation schemes of varying complexity have been proposed to accelerate such sampling simulations. In this study, several widely used schemes are benchmarked, both in isolation and in combination, to establish best practice. The schemes studied include energy domain decomposition with static sizing of energy sub-domains, as well as a dynamic sub-domain sizing scheme which we propose. We also assess the benefits both of replica exchange and of including multiple random walkers per sub-domain, to determine which factors have the largest impact on parallel efficiency. Additionally, the influence of energy sub-domain overlap regions is discussed. As an illustrative test case, we implement and apply these strategies to a lattice-based model describing the internal energies of the AlTiCrMo refractory high-entropy superalloy, which is understood to order crystallographically into a B2 (CsCl) structure with decreasing temperature. We find that, while all of the proposed strategies confer a non-negligible speedup, parallelisation across energy domains which are non-uniform in size offers the most appreciable performance improvements. This work offers concrete recommendations for which parallelisation strategies should be prioritised to optimally accelerate flat-histogram Monte Carlo simulations.
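To make the idea of energy domain decomposition with overlap regions concrete, the following is a minimal Python sketch of how an energy range might be split into overlapping sub-domains of uniform or non-uniform width. It is an illustration only, not the authors' implementation: the function name `split_energy_range`, the `weights` parameter, and the convention that the overlap is a fraction of each sub-domain's own width are all assumptions made for this example.

```python
def split_energy_range(e_min, e_max, weights, overlap):
    """Split [e_min, e_max] into overlapping energy windows.

    Hypothetical helper for illustration only.
    weights: one positive number per window; core widths are proportional
             to these (equal weights give a uniform decomposition).
    overlap: fraction of a window's own core width by which it extends
             into each neighbour (an assumed convention).
    Returns a list of (low, high) tuples, clamped to [e_min, e_max].
    """
    if e_max <= e_min:
        raise ValueError("e_max must exceed e_min")
    if not weights or any(w <= 0 for w in weights):
        raise ValueError("weights must be a non-empty list of positive numbers")
    if not 0 <= overlap < 1:
        raise ValueError("overlap must lie in [0, 1)")

    span = e_max - e_min
    total = sum(weights)
    windows = []
    low = e_min
    for w in weights:
        width = span * w / total      # core width of this window
        high = low + width
        pad = overlap * width         # extension into each neighbour
        windows.append((max(e_min, low - pad), min(e_max, high + pad)))
        low = high                    # next core starts where this one ends
    return windows


# Uniform decomposition: four equal-width windows.
uniform = split_energy_range(0.0, 100.0, [1, 1, 1, 1], overlap=0.25)

# Non-uniform decomposition: the weights here are arbitrary placeholders.
non_uniform = split_energy_range(0.0, 100.0, [4, 2, 1, 1], overlap=0.25)
```

In a parallel run, each window would be assigned to one or more random walkers, and the overlap regions are where neighbouring pieces of the density of states can be matched or where replica-exchange moves can be attempted. How the window widths should actually be chosen is the subject of the paper; the weights above carry no such information.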
