Distributed Zero-Shot Learning for Visual Recognition

Zhi Chen; Yadan Luo; Zi Huang; Jingjing Li; Sen Wang; Xin Yu

Distributed Zero-Shot Learning for Visual Recognition

Zhi Chen, Yadan Luo, Zi Huang, Jingjing Li, Sen Wang, Xin Yu

TL;DR

This work introduces DistZSL, a distributed zero-shot learning framework that learns from decentralized data with partial class-conditional distributions without sharing raw data. It couples a cross-device attribute regularizer, derived from Graphical Lasso semantic similarities and KL alignment, with a global attribute-to-visual consensus that enforces a bilateral semantic-visual reconstruction to stabilize cross-device mappings. By replacing per-client classifiers with shared semantic anchors and jointly optimizing semantic and visual pathways, DistZSL mitigates local optima and biases due to data heterogeneity. Theoretical analysis provides alignment and reconstruction guarantees, while experiments on five ZSL datasets demonstrate consistent improvements over federated baselines across i.i.d., non-i.i.d., and p.c.c.d. settings, with comprehensive ablations validating each component's contribution.

Abstract

In this paper, we propose a Distributed Zero-Shot Learning (DistZSL) framework that can fully exploit decentralized data to learn an effective model for unseen classes. Considering the data heterogeneity issues across distributed nodes, we introduce two key components to ensure the effective learning of DistZSL: a cross-node attribute regularizer and a global attribute-to-visual consensus. Our proposed cross-node attribute regularizer enforces the distances between attribute features to be similar across different nodes. In this manner, the overall attribute feature space would be stable during learning, and thus facilitate the establishment of visual-to-attribute(V2A) relationships. Then, we introduce the global attribute-tovisual consensus to mitigate biased V2A mappings learned from individual nodes. Specifically, we enforce the bilateral mapping between the attribute and visual feature distributions to be consistent across different nodes. Thus, the learned consistent V2A mapping can significantly enhance zero-shot learning across different nodes. Extensive experiments demonstrate that DistZSL achieves superior performance to the state-of-the-art in learning from distributed data.

Distributed Zero-Shot Learning for Visual Recognition

TL;DR

Abstract

Distributed Zero-Shot Learning for Visual Recognition

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (11)

Theorems & Definitions (15)