Federated Learning of Binary Neural Networks: Enabling Low-Cost Inference

Nitin Priyadarshini Shankar; Soham Lahiri; Sheetal Kalyani; Saurav Prakash

Federated Learning of Binary Neural Networks: Enabling Low-Cost Inference

Nitin Priyadarshini Shankar, Soham Lahiri, Sheetal Kalyani, Saurav Prakash

Abstract

Federated Learning (FL) preserves privacy by distributing training across devices. However, using DNNs is computationally intensive at the low-powered edge during inference. Edge deployment demands models that simultaneously optimize memory footprint and computational efficiency, a dilemma where conventional DNNs fail by exceeding resource limits. Traditional post-training binarization reduces model size but suffers from severe accuracy loss due to quantization errors. To address these challenges, we propose FedBNN, a rotation-aware binary neural network framework that learns binary representations directly during local training. By encoding each weight as a single bit $\{+1, -1\}$ instead of a $32$-bit float, FedBNN shrinks the model footprint, significantly reducing runtime (during inference) FLOPs and memory requirements in comparison to federated methods using real models. Evaluations across multiple benchmark datasets demonstrate that FedBNN significantly reduces resource consumption while performing similarly to existing federated methods using real-valued models.

Federated Learning of Binary Neural Networks: Enabling Low-Cost Inference

Abstract

instead of a

-bit float, FedBNN shrinks the model footprint, significantly reducing runtime (during inference) FLOPs and memory requirements in comparison to federated methods using real models. Evaluations across multiple benchmark datasets demonstrate that FedBNN significantly reduces resource consumption while performing similarly to existing federated methods using real-valued models.

Paper Structure (27 sections, 23 equations, 13 figures, 6 tables, 1 algorithm)

This paper contains 27 sections, 23 equations, 13 figures, 6 tables, 1 algorithm.

Introduction
Preliminaries
Federated Learning
Binarized Neural Network
Proposed Method - FedBNN
Proposed Rotation-Aware Client Training
Trainable rotation weight with global memory
Adjustable rotated weight vector for federated learning with global memory
Training aware approximation for federated learning
Proposed Aggregation Strategy
Runtime computation savings for binary models
Experimental Evaluation
Setup
Datasets and Partitioning
Simulation results
...and 12 more sections

Figures (13)

Figure 1: FedBNN overall architecture.
Figure 2: Performance comparison between FedAvg and FedBNN on FMNIST using the CNN4 architecture under IID, Non-IID1 (Dirichlet $\alpha=0.3$), and Non-IID2 (label-skew) data partitions.
Figure 3: Performance comparison between FedAvg and FedBNN on SVHN using the CNN4 architecture under IID, Non-IID1 (Dirichlet $\alpha=0.3$), and Non-IID2 (label-skew) data partitions.
Figure 4: Performance comparison between FedAvg and FedBNN on CIFAR10 using the ResNet10 architecture under IID, Non-IID1 (Dirichlet $\alpha=0.3$), and Non-IID2 (label-skew) data partitions.
Figure 5: Performance comparison between FedAvg and FedBNN on CIFAR10 using the ConvNeXt-Tiny architecture under IID, Non-IID1 (Dirichlet $\alpha=0.3$), and Non-IID2 (label-skew) data partitions.
...and 8 more figures

Federated Learning of Binary Neural Networks: Enabling Low-Cost Inference

Abstract

Federated Learning of Binary Neural Networks: Enabling Low-Cost Inference

Authors

Abstract

Table of Contents

Figures (13)