MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance

Hidir Yesiltepe; Tuna Han Salih Meral; Connor Dunlop; Pinar Yanardag

MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance

Hidir Yesiltepe, Tuna Han Salih Meral, Connor Dunlop, Pinar Yanardag

TL;DR

MotionShop presents Mixture of Score Guidance (MSG), a theoretically grounded, training-free framework for zero-shot motion transfer in diffusion-based video models. By decomposing conditional scores into motion and content components and interpreting score mixing as a mixture of potential energies, MSG links motion transfer to stabilized Langevin dynamics and enables faithful transfer across single/multi-object and complex camera motions. The approach operates directly on pre-trained video diffusion models, avoiding fine-tuning, and is supported by extensive qualitative and quantitative experiments alongside MotionBench, a new 200-source, 1,000-transfer-motion dataset. MotionShop demonstrates superior motion fidelity and temporal consistency while preserving scene content, with a principled trade-off against text alignment that favors robust motion transfer. The work advances practical motion editing in diffusion-based video generation and provides a standardized benchmark to evaluate future motion-transfer methods.

Abstract

In this work, we propose the first motion transfer approach in diffusion transformer through Mixture of Score Guidance (MSG), a theoretically-grounded framework for motion transfer in diffusion models. Our key theoretical contribution lies in reformulating conditional score to decompose motion score and content score in diffusion models. By formulating motion transfer as a mixture of potential energies, MSG naturally preserves scene composition and enables creative scene transformations while maintaining the integrity of transferred motion patterns. This novel sampling operates directly on pre-trained video diffusion models without additional training or fine-tuning. Through extensive experiments, MSG demonstrates successful handling of diverse scenarios including single object, multiple objects, and cross-object motion transfer as well as complex camera motion transfer. Additionally, we introduce MotionBench, the first motion transfer dataset consisting of 200 source videos and 1000 transferred motions, covering single/multi-object transfers, and complex camera motions.

MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance

TL;DR

Abstract

MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)