SPAR: Self-supervised Placement-Aware Representation Learning for Distributed Sensing
Yizhuo Chen, Tianchen Wang, You Lyu, Yanlan Hu, Jinyang Li, Tomoyoshi Kimura, Hongjue Zhao, Yigong Hu, Denizhan Kara, Tarek Abdelzaher
TL;DR
SPAR addresses the placement-sensitivity gap in self-supervised learning for distributed sensing by coupling explicit spatial and structural embeddings with dual reconstruction objectives. Through a unified transformer-based architecture, it learns context-aware representations that reflect the duality between observer placements and observations. The work provides information-theoretic and occlusion-invariant analyses to justify the design and demonstrates superior generalization across vehicle, HAR, and seismic localization tasks, including unseen layouts and constrained communications. The results indicate SPAR's potential to enable robust, data-efficient sensing in real-world, multi-modal, multi-node deployments.
Abstract
We present SPAR, a framework for self-supervised placement-aware representation learning in distributed sensing. Distributed sensing spans applications where multiple spatially distributed and multimodal sensors jointly observe an environment, from vehicle monitoring to human activity recognition and earthquake localization. A central challenge shared by this wide spectrum of applications is that observed signals are inseparably shaped by sensor placements, including their spatial locations and structural characteristics. However, existing pretraining methods remain largely placement-agnostic. SPAR addresses this gap through a unifying principle: the duality between signals and positions. Guided by this principle, SPAR introduces spatial and structural positional embeddings together with dual reconstruction objectives, explicitly modeling how observing positions and observed signals shape each other. Placement is thus treated not as auxiliary metadata but as intrinsic to representation learning. SPAR is theoretically supported by analyses from information theory and occlusion-invariant learning. Extensive experiments on three real-world datasets show that SPAR achieves superior robustness and generalization across various modalities, placements, and downstream tasks.
