Enhancing Resiliency of Sketch-based Security via LSB Sharing-based Dynamic Late Merging
Seungsam Yang, Seyed Mohammad Mehdi Mirnajafizadeh, Sian Kim, Rhongho Jang, DaeHun Nyang
TL;DR
The paper tackles the vulnerability of small-counter, sketch-based traffic measurement to sketch pollution attacks. It proposes SC_LSB, a Siamese Counter that employs Least Significant Bit sharing and late merging to extend counter capacity while preserving multiple independent counters, thereby blending static isolation with dynamic recycling. The authors provide theoretical analyses of error bounds and demonstrate through extensive experiments that SC_LSB significantly improves accuracy under pollution attacks across flow-size estimation, heavy-hitter detection, change detection, entropy estimation, and related security tasks, with only modest throughput overhead. This work offers a practical path toward more resilient sketch-based security in cloud and data-plane environments, where memory efficiency and robustness against adversarial manipulation are critical.
Abstract
With the exponentially growing Internet traffic, sketch data structure with a probabilistic algorithm has been expected to be an alternative solution for non-compromised (non-selective) security monitoring. While facilitating counting within a confined memory space, the sketch's memory efficiency and accuracy were further pushed to their limit through finer-grained and dynamic control of constrained memory space to adapt to the data stream's inherent skewness (i.e., Zipf distribution), namely small counters with extensions. In this paper, we unveil a vulnerable factor of the small counter design by introducing a new sketch-oriented attack, which threatens a stream of state-of-the-art sketches and their security applications. With the root cause analyses, we propose Siamese Counter with enhanced adversarial resiliency and verified feasibility with extensive experimental and theoretical analyses. Under a sketch pollution attack, Siamese Counter delivers 47% accurate results than a state-of-the-art scheme, and demonstrates up to 82% more accurate estimation under normal measurement scenarios.
