I2VControl: Disentangled and Unified Video Motion Synthesis Control

Wanquan Feng; Tianhao Qi; Jiawei Liu; Mingzhen Sun; Pengqi Tu; Tianxiang Ma; Fei Dai; Songtao Zhao; Siyu Zhou; Qian He

I2VControl: Disentangled and Unified Video Motion Synthesis Control

Wanquan Feng, Tianhao Qi, Jiawei Liu, Mingzhen Sun, Pengqi Tu, Tianxiang Ma, Fei Dai, Songtao Zhao, Siyu Zhou, Qian He

TL;DR

This work tackles the challenge of controllable video synthesis by addressing the fragmentation of motion controls across camera movement, object dragging, and motion brush. It proposes I2VControl, a unified framework that represents all controls as dense point trajectories, partitions the input into motion units, and employs an adapter network to integrate with pretrained diffusion models. Key contributions include consistent representation, spatial partitioning, a dual-task data pipeline, and training an adapter that enables conflict-free multi-type control, achieving strong performance across tasks and enabling creative combinations. The approach is shown to generalize across different base models, highlighting practical impact for flexible and user-driven video production.

Abstract

Motion controllability is crucial in video synthesis. However, most previous methods are limited to single control types, and combining them often results in logical conflicts. In this paper, we propose a disentangled and unified framework, namely I2VControl, to overcome the logical conflicts. We rethink camera control, object dragging, and motion brush, reformulating all tasks into a consistent representation based on point trajectories, each managed by a dedicated formulation. Accordingly, we propose a spatial partitioning strategy, where each unit is assigned to a concomitant control category, enabling diverse control types to be dynamically orchestrated within a single synthesis pipeline without conflicts. Furthermore, we design an adapter structure that functions as a plug-in for pre-trained models and is agnostic to specific model architectures. We conduct extensive experiments, achieving excellent performance on various control tasks, and our method further facilitates user-driven creative combinations, enhancing innovation and creativity. Project page: https://wanquanf.github.io/I2VControl .

I2VControl: Disentangled and Unified Video Motion Synthesis Control

TL;DR

Abstract

I2VControl: Disentangled and Unified Video Motion Synthesis Control

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)