MapReplay: Trace-Driven Benchmark Generation for Java HashMap

Filippo Schiavio; Andrea Rosà; Júnior Löff; Lubomír Bulej; Petr Tůma; Walter Binder

MapReplay: Trace-Driven Benchmark Generation for Java HashMap

Filippo Schiavio, Andrea Rosà, Júnior Löff, Lubomír Bulej, Petr Tůma, Walter Binder

Abstract

Hash-based maps, particularly java.util.HashMap, are pervasive in Java applications and the JVM, making their performance critical. Evaluating optimizations is challenging because performance depends on factors such as operation patterns, key distributions, and resizing behavior. Microbenchmarks are fast and repeatable but often oversimplify workloads, failing to capture the realistic usage patterns. Application benchmarks (e.g., DaCapo, Renaissance) provide realistic usages but are more expensive to run, prone to variability, and dominated by non-HashMap computations, making map-related performance changes difficult to observe. To address this challenge, we propose MapReplay, a benchmarking methodology that combines the realism of application benchmarks with the efficiency of microbenchmarks. MapReplay traces HashMap API usages generating a replay workload that reproduces the same operation sequence while faithfully reconstructing internal map states. This enables realistic and efficient evaluation of alternative implementations under realistic usage patterns. Applying MapReplay to DaCapo-Chopin and Renaissance, the resulting suite, MapReplayBench, reproduces application-level performance trends while reducing experimentation time and revealing insights difficult to obtain from full benchmarks.

MapReplay: Trace-Driven Benchmark Generation for Java HashMap

Abstract

Paper Structure (66 sections, 3 figures, 3 tables)

This paper contains 66 sections, 3 figures, 3 tables.

Introduction
Background: HashMap
Design and Architecture
Terminology and MapReplay Architecture
Tracer.
Offline trace post-processor.
Replay infrastructure and replay workload.
Goals
Faithful execution paths.
Equivalent map state.
Design Decisions
Operation categorization.
Substitution of application keys, minimized replay of callbacks.
Preserving hash collisions.
Simplified equality semantics.
...and 51 more sections

Figures (3)

Figure 1: MapReplay architecture.
Figure 2: Microbenchmark performance for different operations, map sizes, and DIC configurations. Vertical gray lines indicate thresholds where resizing occurs.
Figure 3: Correlation between relative speedups (ratios) in application benchmarks and replay workloads. Different symbols indicate statistical significance of the combinations of speedups. Rectangles indicate the 99% bootstrap confidence intervals of the corresponding speedup ratios for weakly discordant changes plotted using the $\circ$ symbol.

MapReplay: Trace-Driven Benchmark Generation for Java HashMap

Abstract

MapReplay: Trace-Driven Benchmark Generation for Java HashMap

Authors

Abstract

Table of Contents

Figures (3)