Characterizing the Impact of Active Queue Management on Speed Test Measurements
Siddhant Ray, Taveesh Sharma, Jonatas Marques, Paul Schmitt, Francesco Bronzino, Nick Feamster
TL;DR
Traditional speed tests emphasize peak throughput and therefore fail to reflect user-perceived responsiveness under load; this paper addresses that gap. Through a controlled lab study across multiple AQM schemes (CoDel, FQ-CoDel, SFQ), with and without burst shaping and competing traffic, it shows that speed-test throughput and latency distributions vary significantly with the AQM policy. End-to-end results and instantaneous-throughput analyses reveal that aggregated metrics can mask important dynamics, especially for latency under load (LUL) and under cross-traffic conditions. The findings highlight the need to calibrate speed-test platforms and enrich them with AQM-aware metrics, so that results better reflect real user experience and can reliably inform policy and regulatory outcomes.
Abstract
Present-day speed test tools measure peak throughput, but often fail to capture the user-perceived responsiveness of a network connection under load. Recently, platforms such as NDT, Ookla Speedtest and Cloudflare Speed Test have introduced metrics such as "latency under load" or "working latency" to fill this gap. Yet the sensitivity of these metrics to basic network configurations such as Active Queue Management (AQM) remains poorly understood. In this work, we conduct an empirical study of the impact of AQM on speed test measurements in a laboratory setting. Using controlled experiments, we compare the distributions of throughput and latency measurements under different load conditions across AQM schemes, including CoDel, FQ-CoDel and Stochastic Fair Queuing (SFQ). Compared against a standard drop-tail baseline, measurements show high variance across AQM schemes and load conditions. These results highlight the critical role of AQM in shaping how emerging latency metrics should be interpreted, and underscore the need for careful calibration of speed test platforms before their results are used to guide policy or regulatory outcomes.
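
To make the notion of a latency-under-load distribution concrete, the following is a minimal Python sketch of how one might summarize round-trip-time samples collected while a link is idle versus while it is saturated. It is not the paper's measurement pipeline: the function name `latency_under_load_summary`, the choice of median and 95th percentile, and the sample values are all assumptions made for illustration, and the numbers are placeholders rather than results from the study.

```python
from statistics import median, quantiles


def latency_under_load_summary(idle_rtts_ms, loaded_rtts_ms):
    """Summarize RTT inflation under load (hypothetical helper, not from the paper).

    idle_rtts_ms:   RTT samples (ms) taken with no speed test running.
    loaded_rtts_ms: RTT samples (ms) taken while the link is saturated.
    """
    idle_median = median(idle_rtts_ms)
    loaded_median = median(loaded_rtts_ms)
    # quantiles(n=100) returns 99 cut points; index 94 is the 95th percentile.
    loaded_p95 = quantiles(loaded_rtts_ms, n=100, method="inclusive")[94]
    return {
        "idle_median_ms": idle_median,
        "loaded_median_ms": loaded_median,
        "loaded_p95_ms": loaded_p95,
        # How far the typical loaded RTT sits above the idle baseline.
        "median_inflation_ms": loaded_median - idle_median,
    }


if __name__ == "__main__":
    # Placeholder samples for two unnamed queue configurations (illustrative only).
    idle = [10, 11, 10, 12, 11]
    loaded_by_queue = {
        "queue_a": [14, 15, 13, 18, 16],
        "queue_b": [80, 120, 150, 200, 95],
    }
    for name, loaded in loaded_by_queue.items():
        print(name, latency_under_load_summary(idle, loaded))
```

Reporting a tail percentile alongside the median is one way to keep a single aggregate from hiding the spread of the distribution; a real comparison would apply the same summary to each AQM scheme and to the drop-tail baseline.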
