Revisiting Monte Carlo Strength Evaluation
Martin Stanek
TL;DR
The paper tackles efficient password strength evaluation by ranking passwords under a probabilistic model using the Monte Carlo estimator. It adds three concrete improvements—probability-interval interpolation, reduced probability overlap via sampling, and precomputation-based speedups—and shows that sampling improves precision while interpolation offers mixed benefits; bin-based precomputation yields notable speedups at modest memory costs. The experimental results using a PCFG model trained on RockYou demonstrate meaningful reductions in estimation error with sampling and substantial runtime savings with binning, though gains depend on model characteristics and configuration. Collectively, the work advances practical, scalable strength estimation for password models, enabling faster model comparisons and user feedback in security workflows.
Abstract
The Monte Carlo method, proposed by Dell'Amico and Filippone, estimates a password's rank within a probabilistic model for password generation, i.e., it determines the password's strength according to this model. We propose several ideas to improve the precision or speed of the estimation. Through experimental tests, we demonstrate that improved sampling can yield slightly better precision. Moreover, additional precomputation results in faster estimations with a modest increase in memory usage.
