Formal Logic Enabled Personalized Federated Learning Through Property Inference
Ziyan An, Taylor T. Johnson, Meiyi Ma
TL;DR
FedSTL addresses the absence of symbolic temporal reasoning in federated learning by automatically inferring temporal logic properties for each client and dynamically clustering clients based on property alignment. It introduces a teacher-student mechanism to correct predictions toward the inferred STL properties and employs a hierarchical update scheme that preserves personalization while enabling cluster-level learning. Across real-world highway traffic and SUMO-based smart-city data, FedSTL achieves substantial improvements in predictive accuracy and property satisfaction, including up to a 54% reduction in MSE and a 100% property-satisfaction rate for teacher corrections. This approach enables more trustworthy, explainable AIoT systems by integrating domain-specific temporal reasoning into privacy-preserving distributed learning.
Abstract
Recent advancements in federated learning (FL) have greatly facilitated the development of decentralized collaborative applications, particularly in the domain of Artificial Intelligence of Things (AIoT). However, a critical aspect missing from the current research landscape is the ability to enable data-driven client models with symbolic reasoning capabilities. Specifically, the inherent heterogeneity of participating client devices poses a significant challenge, as each client exhibits unique logic reasoning properties. Failing to consider these device-specific specifications can result in critical properties being missed in the client predictions, leading to suboptimal performance. In this work, we propose a new training paradigm that leverages temporal logic reasoning to address this issue. Our approach involves enhancing the training process by incorporating mechanically generated logic expressions for each FL client. Additionally, we introduce the concept of aggregation clusters and develop a partitioning algorithm to effectively group clients based on the alignment of their temporal reasoning properties. We evaluate the proposed method on two tasks: a real-world traffic volume prediction task consisting of sensory data from fifteen states and a smart city multi-task prediction utilizing synthetic data. The evaluation results exhibit clear improvements, with performance accuracy improved by up to 54% across all sequential prediction models.
