CSI-Bench: A Large-Scale In-the-Wild Dataset for Multi-task WiFi Sensing
Guozhen Zhu, Yuqian Hu, Weihang Gao, Wei-Hsiang Wang, Beibei Wang, K. J. Ray Liu
TL;DR
CSI-Bench addresses the lack of large-scale real-world WiFi sensing data by providing 461 hours of amplitude CSI collected from 35 users across 26 environments and 16 device types in-the-wild. It supports seven single-task and three multi-task objectives, with standardized splits and baseline results, enabling robust evaluation of both single-task and multi-task models, including parameter-efficient adapters. The experiments reveal strong performance of transformer-based architectures but reveal notable generalization gaps under out-of-distribution conditions (cross-user, cross-environment, cross-device), underscoring the need for domain-adaptive methods. The dataset also demonstrates practical benefits for edge deployment, with multi-task learning reducing model size and training time while maintaining accuracy. By releasing code and data, CSI-Bench provides a scalable platform for privacy-preserving health sensing and broader human-centric WiFi applications.
Abstract
WiFi sensing has emerged as a compelling contactless modality for human activity monitoring by capturing fine-grained variations in Channel State Information (CSI). Its ability to operate continuously and non-intrusively while preserving user privacy makes it particularly suitable for health monitoring. However, existing WiFi sensing systems struggle to generalize in real-world settings, largely due to datasets collected in controlled environments with homogeneous hardware and fragmented, session-based recordings that fail to reflect continuous daily activity. We present CSI-Bench, a large-scale, in-the-wild benchmark dataset collected using commercial WiFi edge devices across 26 diverse indoor environments with 35 real users. Spanning over 461 hours of effective data, CSI-Bench captures realistic signal variability under natural conditions. It includes task-specific datasets for fall detection, breathing monitoring, localization, and motion source recognition, as well as a co-labeled multitask dataset with joint annotations for user identity, activity, and proximity. To support the development of robust and generalizable models, CSI-Bench provides standardized evaluation splits and baseline results for both single-task and multi-task learning. CSI-Bench offers a foundation for scalable, privacy-preserving WiFi sensing systems in health and broader human-centric applications.
