Towards Practical Non-Adversarial Distribution Matching
Ziyu Gong, Ben Usman, Han Zhao, David I. Inouye
TL;DR
The paper tackles the instability of adversarial distribution matching by introducing a non-adversarial, VAE-based matching framework (VAUB) that provides upper bounds on the Generalized Jensen–Shannon Divergence while remaining model-agnostic. By relaxing invertibility and incorporating a mutual information–preserving reconstruction term, VAUB enables plug-and-play replacement of adversarial losses in standard pipelines, such as domain adaptation and fairness models. The authors further extend the approach with Noisy Jensen–Shannon Divergence (NJSD) and corresponding noisy upper bounds (NAUB, NVAUB) to mitigate vanishing gradients and local minima, and they connect these methods to fairness literature through a nuanced analysis of priors and MI terms. Empirical results on toy and benchmark datasets show improved stability and competitive performance when replacing adversarial losses with VAUB, highlighting the practical impact for robust invariant representation learning across domains.
Abstract
Distribution matching can be used to learn invariant representations with applications in fairness and robustness. Most prior works resort to adversarial matching methods but the resulting minimax problems are unstable and challenging to optimize. Non-adversarial likelihood-based approaches either require model invertibility, impose constraints on the latent prior, or lack a generic framework for distribution matching. To overcome these limitations, we propose a non-adversarial VAE-based matching method that can be applied to any model pipeline. We develop a set of alignment upper bounds for distribution matching (including a noisy bound) that have VAE-like objectives but with a different perspective. We carefully compare our method to prior VAE-based matching approaches both theoretically and empirically. Finally, we demonstrate that our novel matching losses can replace adversarial losses in standard invariant representation learning pipelines without modifying the original architectures -- thereby significantly broadening the applicability of non-adversarial matching methods.
