Estimating the Number of HTTP/3 Responses in QUIC Using Deep Learning
Barak Gahtan, Robert J. Shahla, Reuven Cohen, Alex M. Bronstein
TL;DR
This work tackles estimating the number of HTTP/3 responses in a QUIC connection from a passive observer. It introduces DecQUIC, which converts QUIC traces into RGB images of time-windowed histograms and trains a CNN-GRU-attention model using a novel discrete regression loss $L = \alpha\mathrm{FL} + (1-\alpha)(\beta\mathrm{ORL} + (1-\beta)\mathrm{DBL})$ to predict per-window response counts. On a large-scale dataset of over $10^5$ traces and more than $7$ million labeled images, the method achieves up to $97\%$ accuracy in known/unknown server settings and $92.6\%$ accuracy for total per-trace counts, demonstrating robust generalization and practical utility for tasks like load balancing and flood-attack detection. The results highlight the benefit of fine-grained, directional, and density features from RGB representations and the importance of window length on per-window versus per-trace accuracy, offering a flexible solution for real-world QUIC traffic analysis.
Abstract
QUIC, a new and increasingly used transport protocol, enhances TCP by offering improved security, performance, and stream multiplexing. These features, however, also impose challenges for network middle-boxes that need to monitor and analyze web traffic. This paper proposes a novel method to estimate the number of HTTP/3 responses in a given QUIC connection by an observer. This estimation reveals server behavior, client-server interactions, and data transmission efficiency, which is crucial for various applications such as designing a load balancing solution and detecting HTTP/3 flood attacks. The proposed scheme transforms QUIC connection traces into image sequences and uses machine learning (ML) models, guided by a tailored loss function, to predict response counts. Evaluations on more than seven million images-derived from 100,000 traces collected across 44,000 websites over four months-achieve up to 97% accuracy in both known and unknown server settings and 92% accuracy on previously unseen complete QUIC traces.
