Fréchet Wavelet Distance: A Domain-Agnostic Metric for Image Generation
Lokesh Veeramacheneni, Moritz Wolter, Hildegard Kuehne, Juergen Gall
TL;DR
FID-based metrics rely on pretrained backbones and exhibit domain bias when evaluating non-ImageNet datasets. Fréchet Wavelet Distance (FWD) addresses this by projecting images into packet space via a wavelet-packet transform and computing a per-packet Fréchet distance, then averaging across packets with $FWD = \frac{1}{P}\sum_{p=1}^{P} d(\mathcal{N}(\mu_{r_p}, \Sigma_{r_p}), \mathcal{N}(\mu_{g_p}, \Sigma_{g_p}))^2$. The approach is domain-agnostic, interpretable, and computationally efficient, demonstrated across diverse datasets and generators, with strong robustness to corruptions and domain shifts and alignment with human judgments. By providing per-packet insights and avoiding ImageNet-dependent biases, FWD offers a reliable complement to existing metrics for cross-domain image synthesis evaluation.
Abstract
Modern metrics for generative learning like Fréchet Inception Distance (FID) and DINOv2-Fréchet Distance (FD-DINOv2) demonstrate impressive performance. However, they suffer from various shortcomings, like a bias towards specific generators and datasets. To address this problem, we propose the Fréchet Wavelet Distance (FWD) as a domain-agnostic metric based on the Wavelet Packet Transform ($W_p$). FWD provides a sight across a broad spectrum of frequencies in images with a high resolution, preserving both spatial and textural aspects. Specifically, we use $W_p$ to project generated and real images to the packet coefficient space. We then compute the Fréchet distance with the resultant coefficients to evaluate the quality of a generator. This metric is general-purpose and dataset-domain agnostic, as it does not rely on any pre-trained network, while being more interpretable due to its ability to compute Fréchet distance per packet, enhancing transparency. We conclude with an extensive evaluation of a wide variety of generators across various datasets that the proposed FWD can generalize and improve robustness to domain shifts and various corruptions compared to other metrics.
