Latent Diffusion U-Net Representations Contain Positional Embeddings and Anomalies
Jonas Loos, Lorenz Linhardt
TL;DR
This work analyzes latent diffusion U-Net representations in Stable Diffusion to assess their suitability as robust features for downstream tasks. By applying representational similarity analyses and token-norm assessments, it uncovers three phenomena: a linearly extractable positional embedding, corner tokens with abnormally high similarity, and high-norm anomalies in up-sampling blocks. The findings hold across SD-1.5, SD-2.1, and SD-Turbo on a subset of ImageNet, highlighting potential pitfalls for tasks requiring spatial locality or reliable feature norms. The work motivates caution and further study of diffusion-model representations before deploying them for robust downstream applications.
Abstract
Diffusion models have demonstrated remarkable capabilities in synthesizing realistic images, spurring interest in using their representations for various downstream tasks. To better understand the robustness of these representations, we analyze popular Stable Diffusion models using representational similarity and norms. Our findings reveal three phenomena: (1) the presence of a learned positional embedding in intermediate representations, (2) high-similarity corner artifacts, and (3) anomalous high-norm artifacts. These findings underscore the need to further investigate the properties of diffusion model representations before considering them for downstream tasks that require robust features. Project page: https://jonasloos.github.io/sd-representation-anomalies
