WiFinger: Fingerprinting Noisy IoT Event Traffic Using Packet-level Sequence Matching
Ronghua Li, Shinan Liu, Haibo Hu, Qingqing Ye, Nick Feamster
TL;DR
WiFinger addresses the privacy risk posed by encrypted IoT traffic by reframing Wi‑Fi event fingerprinting as a robust packet-level subsequence matching problem. It introduces NT-LCS-based matching with anchor references and segmentation (AFMLCS) and a three-stage fingerprint extraction process, enabling accurate tracking under noisy wireless conditions with limited training data. Empirical results show WiFinger delivering high recall and precision, significantly outperforming state-of-the-art ML-based and packet-level approaches in multi-event tracking, while maintaining efficiency. The work highlights practical defense trade-offs and broad applicability to other wireless sensing contexts, underscoring the need for careful countermeasures in IoT deployments.
Abstract
IoT environments such as smart homes are susceptible to privacy inference attacks, where attackers can analyze patterns of encrypted network traffic to infer the state of devices and even the activities of people. While most existing attacks exploit ML techniques for discovering such traffic patterns, they underperform on wireless traffic, especially Wi-Fi, due to its heavy noisiness and the packet loss of wireless sniffing. In addition, these approaches commonly target distinguishing chunked IoT event traffic samples, and they fail at effectively tracking multiple events simultaneously. In this work, we propose WiFinger, a fine-grained multi-IoT event fingerprinting approach against noisy traffic. WiFinger turns the traffic pattern classification task into a subsequence matching problem and introduces novel techniques to account for the high time complexity while maintaining high accuracy. In addition, its reliance on training sample volumes reduces efforts for any future fingerprint updates. Experiments demonstrate that WiFinger outperforms existing approaches under practical threat models, with an average recall of 89% (v.s. 49% and 46% respectively) and almost zero false positives for various IoT events.
