LATENTPATCH: A Non-Parametric Approach for Face Generation and Editing
Benjamin Samuth, Julien Rabin, David Tschumperlé, Frédéric Jurie
TL;DR
LatentPatch addresses few-shot face generation by avoiding fine-tuning large generative models and instead exploiting a non-parametric, patch-based sampling in a latent space derived from a pre-trained auto-encoder. The method constructs a universal latent representation, compresses it with PCA, and synthesizes new latent codes by sequentially sampling latent patches from the source distribution across multiple scales, without any model training. It supports conditional generation and attribute-guided editing, offering near photo-realistic results with a tiny parameter budget. Experiments on face datasets show competitive quality and flexibility, highlighting practical applicability when data and compute are scarce.
Abstract
This paper presents LatentPatch, a new method for generating realistic images from a small dataset of only a few images. We use a lightweight model with only a few thousand parameters. Unlike traditional few-shot generation methods that finetune pre-trained large-scale generative models, our approach is computed directly on the latent distribution by sequential feature matching, and is explainable by design. Avoiding large models based on transformers, recursive networks, or self-attention, which are not suitable for small datasets, our method is inspired by non-parametric texture synthesis and style transfer models, and ensures that generated image features are sampled from the source distribution. We extend previous single-image models to work with a few images and demonstrate that our method can generate realistic images, as well as enable conditional sampling and image editing. We conduct experiments on face datasets and show that our simplistic model is effective and versatile.
