Fast-Convergent and Communication-Alleviated Heterogeneous Hierarchical Federated Learning in Autonomous Driving

Wei-Bin Kou; Qingfeng Lin; Ming Tang; Rongguang Ye; Shuai Wang; Guangxu Zhu; Yik-Chung Wu

Fast-Convergent and Communication-Alleviated Heterogeneous Hierarchical Federated Learning in Autonomous Driving

Wei-Bin Kou, Qingfeng Lin, Ming Tang, Rongguang Ye, Shuai Wang, Guangxu Zhu, Yik-Chung Wu

TL;DR

The paper addresses inter-city domain shifts in Street Scene Semantic Understanding (TriSU) for autonomous driving and the slow convergence of Hierarchical Federated Learning (HFL) under non-i.i.d. city data. It introduces FedGau, a Gaussian-distribution-based weighting scheme, and AdapRS, a performance-aware adaptive resource scheduler, to accelerate convergence and reduce communication. FedGau models both per-image and per-dataset statistics as Gaussians and uses Bhattacharyya distance $D_B$ to quantify distributional similarity for weighting, achieving substantial speedups and accuracy gains; AdapRS dynamically tunes edge-cloud communication intervals to cut bandwidth use by about 29.65% without sacrificing performance. The framework demonstrates improved generalization and efficiency on Cityscapes and CamVid across multiple backbones, with potential applicability to broader privacy-preserving distributed learning tasks in autonomous driving.

Abstract

Street Scene Semantic Understanding (denoted as TriSU) is a complex task for autonomous driving (AD). However, inference model trained from data in a particular geographical region faces poor generalization when applied in other regions due to inter-city data domain-shift. Hierarchical Federated Learning (HFL) offers a potential solution for improving TriSU model generalization by collaborative privacy-preserving training over distributed datasets from different cities. Unfortunately, it suffers from slow convergence because data from different cities are with disparate statistical properties. Going beyond existing HFL methods, we propose a Gaussian heterogeneous HFL algorithm (FedGau) to address inter-city data heterogeneity so that convergence can be accelerated. In the proposed FedGau algorithm, both single RGB image and RGB dataset are modelled as Gaussian distributions for aggregation weight design. This approach not only differentiates each RGB image by respective statistical distribution, but also exploits the statistics of dataset from each city in addition to the conventionally considered data volume. With the proposed approach, the convergence is accelerated by 35.5\%-40.6\% compared to existing state-of-the-art (SOTA) HFL methods. On the other hand, to reduce the involved communication resource, we further introduce a novel performance-aware adaptive resource scheduling (AdapRS) policy. Unlike the traditional static resource scheduling policy that exchanges a fixed number of models between two adjacent aggregations, AdapRS adjusts the number of model aggregation at different levels of HFL so that unnecessary communications are minimized. Extensive experiments demonstrate that AdapRS saves 29.65\% communication overhead compared to conventional static resource scheduling policy while maintaining almost the same performance.

Fast-Convergent and Communication-Alleviated Heterogeneous Hierarchical Federated Learning in Autonomous Driving

TL;DR

to quantify distributional similarity for weighting, achieving substantial speedups and accuracy gains; AdapRS dynamically tunes edge-cloud communication intervals to cut bandwidth use by about 29.65% without sacrificing performance. The framework demonstrates improved generalization and efficiency on Cityscapes and CamVid across multiple backbones, with potential applicability to broader privacy-preserving distributed learning tasks in autonomous driving.

Abstract

Paper Structure (33 sections, 31 equations, 11 figures, 8 tables, 3 algorithms)

This paper contains 33 sections, 31 equations, 11 figures, 8 tables, 3 algorithms.

Introduction
Related Work
Hierarchical Federated Learning (HFL)
Communication Resource Scheduling
Street Scene Semantic Understanding (TriSU)
Methodology
HFL Formulation
Vehicle Update
Edge Aggregation
Cloud Aggregation
FedGau
Step I: Distribution Estimation of Single RGB Image
Step II: Distribution Estimation of RGB Dataset
Step III: Distance Calculation between RGB Datasets
Step IV: FedGau Weights Calculation
...and 18 more sections

Figures (11)

Figure 1: Illustration of HFL in inter-city setting. $\mathcal{M}$ is the set of participating cities.
Figure 2: (a) Gaussian distribution of a single RGB image's pixel values. (b) Gaussian distribution of RGB dataset estimated by averaging Gaussian distributions of all included RGB images. $n, \mu, \delta^2$ represent the dataset size, mean and variance of dataset Gaussian distribution, respectively.
Figure 3: Overview of FedGau. In FedGau, datasets on vehicles or covered by edge servers and cloud server, are all modelled as Gaussian distributions, which are subsequently utilized to measure data heterogeneity and to accelerate HFL model convergence.
Figure 4: Histograms of pixel densities. The first raw represents histograms of one RGB image from CamVid dataset brostow2008segmentation. The second raw represents histograms of one RGB image from Cityscapes dataset Cordts2016Cityscapes. The last raw represents histograms of one RGB image from Internet.
Figure 5: Illustration of two normalized histograms and the corresponding estimated probability density functions (pdfs). These two pdfs are estimated by our proposed scheme. For example, the mean and variance of "RGB Sample #1" are 121.97 and 55.54.
...and 6 more figures

Fast-Convergent and Communication-Alleviated Heterogeneous Hierarchical Federated Learning in Autonomous Driving

TL;DR

Abstract

Fast-Convergent and Communication-Alleviated Heterogeneous Hierarchical Federated Learning in Autonomous Driving

Authors

TL;DR

Abstract

Table of Contents

Figures (11)