LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping
Pascal Chang, Sergio Sancho, Jingwei Tang, Markus Gross, Vinicius C. Azevedo
TL;DR
LookingGlass introduces a latent-space, feed-forward approach to generating high-quality anamorphoses by marrying latent rectified flows with a Laplacian Pyramid Warping (LPW) framework. By operating in latent spaces and encoding transformations in image space, the method preserves details while supporting arbitrary view projections beyond simple 2D transformations. The LPW pipeline blends multi-view information across pyramid levels to reduce artifacts from extreme distortions, while VAE encoding/decoding and a residual correction minimize reconstruction errors. Quantitative and qualitative results show improved fidelity, CLIP alignment, and user-preferred outcomes over prior diffusion-based and pixel-space approaches, with a practical runtime on contemporary GPUs. The work enables robust, interpretable, and extensible generative anamorphoses with potential applications in generative texture mapping, panorama synthesis, and advanced perceptual illusions.
Abstract
Anamorphosis refers to a category of images that are intentionally distorted, making them unrecognizable when viewed directly. Their true form only reveals itself when seen from a specific viewpoint, which can be through some catadioptric device like a mirror or a lens. While the construction of these mathematical devices can be traced back to as early as the 17th century, they are only interpretable when viewed from a specific vantage point and tend to lose meaning when seen normally. In this paper, we revisit these famous optical illusions with a generative twist. With the help of latent rectified flow models, we propose a method to create anamorphic images that still retain a valid interpretation when viewed directly. To this end, we introduce Laplacian Pyramid Warping, a frequency-aware image warping technique key to generating high-quality visuals. Our work extends Visual Anagrams (arXiv:2311.17919) to latent space models and to a wider range of spatial transforms, enabling the creation of novel generative perceptual illusions.
