TANGLED: Generating 3D Hair Strands from Images with Arbitrary Styles and Viewpoints
Pengyu Long, Zijun Zhao, Min Ouyang, Qingcheng Zhao, Qixuan Zhang, Wei Yang, Lan Xu, Jingyi Yu
TL;DR
TANGLED tackles the challenge of generating realistic 3D hair strands from images across arbitrary styles and viewpoints by introducing a MultiHair dataset, a lineart-conditioned latent diffusion model, and a parametric braid inpainting module. The approach encodes hair strands as polylines mapped to scalp UV space and denoises a 2D latent representation conditioned on multi-view lineart features via cross-attention, enabling robust generation from sparse or diverse inputs. A dedicated braid inpainting stage imposes parametric braid constraints to preserve coherence in complex braided styles. Experiments show improved geometric and semantic fidelity over baselines, with strong user preference for the generated hairstyles, highlighting potential for culturally inclusive avatars and sketch-based editing in animation and AR contexts.
Abstract
Hairstyles are intricate and culturally significant with various geometries, textures, and structures. Existing text or image-guided generation methods fail to handle the richness and complexity of diverse styles. We present TANGLED, a novel approach for 3D hair strand generation that accommodates diverse image inputs across styles, viewpoints, and quantities of input views. TANGLED employs a three-step pipeline. First, our MultiHair Dataset provides 457 diverse hairstyles annotated with 74 attributes, emphasizing complex and culturally significant styles to improve model generalization. Second, we propose a diffusion framework conditioned on multi-view linearts that can capture topological cues (e.g., strand density and parting lines) while filtering out noise. By leveraging a latent diffusion model with cross-attention on lineart features, our method achieves flexible and robust 3D hair generation across diverse input conditions. Third, a parametric post-processing module enforces braid-specific constraints to maintain coherence in complex structures. This framework not only advances hairstyle realism and diversity but also enables culturally inclusive digital avatars and novel applications like sketch-based 3D strand editing for animation and augmented reality.
