Complex variational autoencoders admit Kähler structure

Andrew Gracyk

Complex variational autoencoders admit Kähler structure

Andrew Gracyk

TL;DR

<3-5 sentence high-level summary> The paper investigates complex variational autoencoders (VAEs) and shows that complex latent spaces admit a Kähler geometric structure linked to the Fisher information metric. It derives the complex Fisher metric under a complex Gaussian decoder and establishes that the Hessian of the KL divergence serves as a Kähler potential, enabling a principled geometric interpretation of latent representations. To make this geometry computationally practical, it introduces two Kähler potentials: an exact form tied to the KL Hessian and a scalable log-sum-exp surrogate that preserves plurisubharmonicity. The authors also propose curvature-aware sampling and a regularization term involving the determinant of the metric, demonstrating smoother representations and fewer semantic outliers while maintaining sampling efficiency.

Abstract

It has been discovered that latent-Euclidean variational autoencoders (VAEs) admit, in various capacities, Riemannian structure. We adapt these arguments but for complex VAEs with a complex latent stage. We show that complex VAEs reveal to some level Kähler geometric structure. Our methods will be tailored for decoder geometry. We derive the Fisher information metric in the complex case under a latent complex Gaussian with trivial relation matrix. It is well known from statistical information theory that the Fisher information coincides with the Hessian of the Kullback-Leibler (KL) divergence. Thus, the metric Kähler potential relation is exactly achieved under relative entropy. We propose a Kähler potential derivative of complex Gaussian mixtures that acts as a rough proxy to the Fisher information metric while still being faithful to the underlying Kähler geometry. Computation of the metric via this potential is efficient, and through our potential, valid as a plurisubharmonic (PSH) function, large scale computational burden of automatic differentiation is displaced to small scale. Our methods leverage the law of total covariance to bridge behavior between our potential and the Fisher metric. We show that we can regularize the latent space with decoder geometry, and that we can sample in accordance with a weighted complex volume element. We demonstrate these strategies, at the exchange of sample variation, yield consistently smoother representations and fewer semantic outliers.

Complex variational autoencoders admit Kähler structure

TL;DR

Abstract

Complex variational autoencoders admit Kähler structure

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)