MegaPortrait: Revisiting Diffusion Control for High-fidelity Portrait Generation

Han Yang; Sotiris Anagnostidis; Enis Simsar; Thomas Hofmann

MegaPortrait: Revisiting Diffusion Control for High-fidelity Portrait Generation

Han Yang, Sotiris Anagnostidis, Enis Simsar, Thomas Hofmann

TL;DR

The proposed MegaPortrait is an innovative system for creating personalized portrait images in computer vision with three modules: Identity Net, Shading Net, and Harmonization Net, which is better than state-of-the-art AI portrait products in identity preservation and image fidelity.

Abstract

We propose MegaPortrait. It's an innovative system for creating personalized portrait images in computer vision. It has three modules: Identity Net, Shading Net, and Harmonization Net. Identity Net generates learned identity using a customized model fine-tuned with source images. Shading Net re-renders portraits using extracted representations. Harmonization Net fuses pasted faces and the reference image's body for coherent results. Our approach with off-the-shelf Controlnets is better than state-of-the-art AI portrait products in identity preservation and image fidelity. MegaPortrait has a simple but effective design and we compare it with other methods and products to show its superiority.

MegaPortrait: Revisiting Diffusion Control for High-fidelity Portrait Generation

TL;DR

Abstract

MegaPortrait: Revisiting Diffusion Control for High-fidelity Portrait Generation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)