LoRaCompass: Robust Reinforcement Learning to Efficiently Search for a LoRa Tag

Tianlang He; Zhongming Lin; Tianrui Jiang; S.-H. Gary Chan

TL;DR

LoRaCompass tackles the problem of locating a LoRa tag from RSSI readings with a mobile sensor in large, varied environments. It introduces a robust exploitation mechanism built on a receptive-field RSSI feature extractor and a policy distillation loss, together with a closed-form, UCB-inspired exploration function to reduce decision uncertainty. The approach is trained in a realistic simulator and validated on ground and drone platforms across more than $80\,\text{km}^2$ of unseen environments, achieving a success rate above $90\%$ within $100\,\text{m}$ and better search efficiency than the baselines, with a path length that grows near-linearly with the initial distance. The work demonstrates practical deployability with a single training site and shows potential for multi-sensor collaboration to further boost search performance.

Abstract

The Long-Range (LoRa) protocol, known for its extensive range and low power consumption, has increasingly been adopted in tags worn by mentally incapacitated persons (MIPs) and others at risk of going missing. We study the sequential decision-making process by which a mobile sensor locates a periodically broadcasting LoRa tag with the fewest moves (hops) in general, unknown environments, guided by the received signal strength indicator (RSSI). While existing methods leverage reinforcement learning for search, they remain vulnerable to domain shift and signal fluctuation, resulting in cascading decision errors that culminate in substantial localization inaccuracies. To bridge this gap, we propose LoRaCompass, a reinforcement learning model designed to achieve robust and efficient search for a LoRa tag. For exploitation under domain shift and signal fluctuation, LoRaCompass learns a robust spatial representation from RSSI to maximize the probability of moving closer to a tag, via a spatially-aware feature extractor and a policy distillation loss function. It further introduces an exploration function inspired by the upper confidence bound (UCB) that guides the sensor toward the tag with increasing confidence. We have validated LoRaCompass in ground-based and drone-assisted scenarios within diverse unseen environments covering an area of over 80 km². It has demonstrated a high success rate (>90%) in locating the tag within 100 m (a 40% improvement over existing methods) and high efficiency, with a search path length (in hops) that scales linearly with the initial distance.
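
To make the exploration idea concrete, the following is a minimal sketch of the textbook UCB1 rule applied to choosing a movement direction. It is not the closed-form exploration function of LoRaCompass, which the abstract does not specify. The function name `ucb_select`, the constant `c`, and the four-direction action set are assumptions made for illustration only.

```python
import math


def ucb_select(values, counts, c=1.0):
    """Return the index of the action with the highest UCB1 score.

    values: estimated payoff of each action (hypothetical stand-in for a
            policy's estimate that a move brings the sensor closer to the tag).
    counts: how many times each action has been tried so far.
    c:      exploration weight (assumed hyperparameter).
    """
    # Try every action once before comparing confidence bounds.
    for i, n in enumerate(counts):
        if n == 0:
            return i
    total = sum(counts)
    # Score = estimate + bonus; the bonus shrinks as an action is tried more often.
    scores = [
        v + c * math.sqrt(math.log(total) / n)
        for v, n in zip(values, counts)
    ]
    return max(range(len(scores)), key=scores.__getitem__)


# Hypothetical usage: four candidate directions with equal estimates.
directions = ["north", "east", "south", "west"]
choice = ucb_select([0.5, 0.5, 0.5, 0.5], [10, 1, 10, 10])
print(directions[choice])
```

With equal estimates, the rule favours the least-tried direction because its uncertainty bonus is largest. As counts grow the bonus decays and the choice is driven by the estimates themselves, which is the general sense in which a UCB-style rule acts with increasing confidence.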
