Secure Visual Data Processing via Federated Learning
Pedro Santos, Tânia Carvalho, Filipe Magalhães, Luís Antunes
TL;DR
This work tackles privacy in large-scale visual data processing by proposing a framework that fuses object detection, federated learning, and anonymization. It trains a shared detector across distributed data sources using a Flower-based FL setup with YOLOv8 and applies Gaussian blur to detected sensitive regions, evaluated on Open Images-derived data. The study shows a modest accuracy trade-off in FL but substantial privacy gains, supported by systematic analyses of epochs, rounds, and aggregation methods. This framework enables privacy-preserving visual data management suitable for privacy-sensitive domains such as surveillance, healthcare, and cross-institutional analytics.
Abstract
As the demand for privacy in visual data management grows, safeguarding sensitive information has become a critical challenge. This paper addresses the need for privacy-preserving solutions in large-scale visual data processing by leveraging federated learning. Although there have been developments in this field, previous research has mainly focused on integrating object detection with either anonymization or federated learning. However, these pairs often fail to address complex privacy concerns. On the one hand, object detection with anonymization alone can be vulnerable to reverse techniques. On the other hand, federated learning may not provide sufficient privacy guarantees. Therefore, we propose a new approach that combines object detection, federated learning and anonymization. Combining these three components aims to offer a robust privacy protection strategy by addressing different vulnerabilities in visual data. Our solution is evaluated against traditional centralized models, showing that while there is a slight trade-off in accuracy, the privacy benefits are substantial, making it well-suited for privacy sensitive applications.
