Class Distribution Shifts in Zero-Shot Learning: Learning Robust Representations
Yuli Slavutsky, Yuval Benjamini
TL;DR
This work tackles the challenge of class distribution shifts in zero-shot learning when the shifting attribute is unknown. It develops a parametric model showing that ERM can fail even when $P(z|c)$ is unchanged, and introduces a robust learning framework that builds diverse synthetic environments via hierarchical sampling and enforces cross-environment balance using a performance-based penalty (VarAUC). Empirical results on simulations and real datasets demonstrate improved generalization to shifted class distributions without sacrificing in-distribution performance, with statistically significant gains on CelebA and ETHEC. The approach reframes class distribution shifts as an OOD environment-balancing problem in zero-shot settings, offering a practical route to more robust open-world verification systems.
Abstract
Zero-shot learning methods typically assume that the new, unseen classes encountered during deployment come from the same distribution as the the classes in the training set. However, real-world scenarios often involve class distribution shifts (e.g., in age or gender for person identification), posing challenges for zero-shot classifiers that rely on learned representations from training classes. In this work, we propose and analyze a model that assumes that the attribute responsible for the shift is unknown in advance. We show that in this setting, standard training may lead to non-robust representations. To mitigate this, we develop an algorithm for learning robust representations in which (a) synthetic data environments are constructed via hierarchical sampling, and (b) environment balancing penalization, inspired by out-of-distribution problems, is applied. We show that our algorithm improves generalization to diverse class distributions in both simulations and experiments on real-world datasets.
