MeshUp: Multi-Target Mesh Deformation via Blended Score Distillation
Hyunwoo Kim, Itai Lang, Noam Aigerman, Thibault Groueix, Vladimir G. Kim, Rana Hanocka
TL;DR
MeshUp addresses the problem of deforming a 3D mesh toward multiple target concepts with region-specific control. It introduces Blended Score Distillation (BSD), which fuses activation maps from parallel diffusion branches for each target into a single denoising pathway, guided by Score Distillation Sampling (SDS). Localization is achieved by constructing a 3D Region of Interest on the mesh via self-attention maps and 3D ROI masks that constrain where each concept expresses, with Jacobian-based deformation $J_i$ optimized to realize the mix. The method supports text, image, and mesh inputs, allows arbitrary numbers of targets, and enables precise regional blending, demonstrated through multi-target, localized, and texture-transfer results, along with ablations and user studies. Overall, MeshUp provides a scalable, diffusion-guided framework for high-fidelity, semantically controlled mesh deformation with practical implications for creative 3D modeling.
Abstract
We propose MeshUp, a technique that deforms a 3D mesh towards multiple target concepts, and intuitively controls the region where each concept is expressed. Conveniently, the concepts can be defined as either text queries, e.g., "a dog" and "a turtle," or inspirational images, and the local regions can be selected as any number of vertices on the mesh. We can effectively control the influence of the concepts and mix them together using a novel score distillation approach, referred to as the Blended Score Distillation (BSD). BSD operates on each attention layer of the denoising U-Net of a diffusion model as it extracts and injects the per-objective activations into a unified denoising pipeline from which the deformation gradients are calculated. To localize the expression of these activations, we create a probabilistic Region of Interest (ROI) map on the surface of the mesh, and turn it into 3D-consistent masks that we use to control the expression of these activations. We demonstrate the effectiveness of BSD empirically and show that it can deform various meshes towards multiple objectives. Our project page is at https://threedle.github.io/MeshUp.
