Towards Privacy-preserved Pre-training of Remote Sensing Foundation Models with Federated Mutual-guidance Learning
Jieyi Tan, Chengwei Zhang, Bo Dang, Yansheng Li
TL;DR
The paper tackles privacy-preserving pre-training of Remote Sensing Foundation Models by introducing FedSense, a federated self-supervised learning framework that couples Server-to-Clients Guidance (SCG) with Clients-to-Server Guidance (CSG) to break the cycle between data heterogeneity and communication overhead. SCG enforces global-flatness and aligns client representations with a universal encoder, while CSG uses low-bit quantization with error-feedback and server-side similarity distillation on public data to reduce communication without sacrificing performance. The approach supports both contrastive and masked-image-modeling SSL backbones and demonstrates superior performance and communication efficiency across four downstream RS tasks, including effective 1-bit quantization. These results indicate FedSense enables scalable, privacy-preserving collaboration among institutions for pre-training RSFMs with real-world data heterogeneity. The work lays a foundation for extending privacy-preserved, multi-modal RSFM pre-training in future work.
Abstract
Traditional Remote Sensing Foundation models (RSFMs) are pre-trained with a data-centralized paradigm, through self-supervision on large-scale curated remote sensing data. For each institution, however, pre-training RSFMs with limited data in a standalone manner may lead to suboptimal performance, while aggregating remote sensing data from multiple institutions for centralized pre-training raises privacy concerns. Seeking for collaboration is a promising solution to resolve this dilemma, where multiple institutions can collaboratively train RSFMs without sharing private data. In this paper, we propose a novel privacy-preserved pre-training framework (FedSense), which enables multiple institutions to collaboratively train RSFMs without sharing private data. However, it is a non-trivial task hindered by a vicious cycle, which results from model drift by remote sensing data heterogeneity and high communication overhead. To break this vicious cycle, we introduce Federated Mutual-guidance Learning. Specifically, we propose a Server-to-Clients Guidance (SCG) mechanism to guide clients updates towards global-flatness optimal solutions. Additionally, we propose a Clients-to-Server Guidance (CSG) mechanism to inject local knowledge into the server by low-bit communication. Extensive experiments on four downstream tasks demonstrate the effectiveness of our FedSense in both full-precision and communication-reduced scenarios, showcasing remarkable communication efficiency and performance gains.
