Multi-Directional Subspace Editing in Style-Space
Chen Naveh, Yacov Hel-Or
TL;DR
MDSE addresses the challenge of disentangled, multi-attribute editing in StyleGAN by decomposing the latent space $\mathcal{W^+}$ into orthogonal subspaces, each tied to a specific attribute. It introduces an orthogonality loss and a mixing loss to support multi-directional edits within each subspace while preserving other attributes, and it demonstrates superior disentanglement and identity preservation compared with leading baselines. Quantitative metrics and ablation studies confirm the importance of explicit subspace orthogonality for reducing entanglement and artifacts during sequential edits. The work has practical impact for controllable, high-fidelity face editing and suggests directions for integrating such structure into generator training for even stronger disentanglement.
Abstract
This paper describes a new technique for finding disentangled semantic directions in the latent space of StyleGAN. Our method identifies meaningful orthogonal subspaces that allow editing of one human face attribute, while minimizing undesired changes in other attributes. Our model is capable of editing a single attribute in multiple directions, resulting in a range of possible generated images. We compare our scheme with three state-of-the-art models and show that our method outperforms them in terms of face editing and disentanglement capabilities. Additionally, we suggest quantitative measures for evaluating attribute separation and disentanglement, and exhibit the superiority of our model with respect to those measures.
