Bluetooth Low Energy Dataset Using In-Phase and Quadrature Samples for Indoor Localization
Samuel G. Leitch, Qasim Zeeshan Ahmed, Ben Van Herbruggen, Mathias Baert, Jaron Fontaine, Eli De Poorter, Adnan Shahid, Pavlos I. Lazaridis
TL;DR
This work develops a BLE indoor localization framework by collecting raw IQ samples with TI BOOSTXL-AOA hardware and automating ground-truth labeling via a MoCap system. It provides a rich dataset across multiple tag heights and distances while incorporating obstacle scenarios, enabling robust learning for AoA and distance estimation. Validation against the TI PDoA baseline shows competitive angle estimates, with region-specific MAE around $15.42^\circ$ in $[-50^\circ,50^\circ]$, and a Gaussian Process Regression model achieving distance MAE of $0.174$m. The dataset thus offers a practical, reusable resource for advancing BLE-based indoor localization and establishing baselines for AoA and range estimation methods in realistic environments.
Abstract
One significant challenge in research is to collect a large amount of data and learn the underlying relationship between the input and the output variables. This paper outlines the process of collecting and validating a dataset designed to determine the angle of arrival (AoA) using Bluetooth low energy (BLE) technology. The data, collected in a laboratory setting, is intended to approximate real-world industrial scenarios. This paper discusses the data collection process, the structure of the dataset, and the methodology adopted for automating sample labeling for supervised learning. The collected samples and the process of generating ground truth (GT) labels were validated using the Texas Instruments (TI) phase difference of arrival (PDoA) implementation on the data, yielding a mean absolute error (MAE) at one of the heights without obstacles of $25.71^\circ$. The distance estimation on BLE was implemented using a Gaussian Process Regression algorithm, yielding an MAE of $0.174$m.
