O-DisCo-Edit: Object Distortion Control for Unified Realistic Video Editing

Yuqing Chen; Junjie Wang; Lin Liu; Ruihang Chu; Xiaopeng Zhang; Qi Tian; Yujiu Yang

O-DisCo-Edit: Object Distortion Control for Unified Realistic Video Editing

Yuqing Chen, Junjie Wang, Lin Liu, Ruihang Chu, Xiaopeng Zhang, Qi Tian, Yujiu Yang

TL;DR

The paper tackles the difficulty of controllable video editing with diffusion models by introducing O-DisCo-Edit, a unified framework that uses a noise-based object distortion signal (O-DisCo) to encompass diverse editing cues. It pairs O-DisCo with a Copy-Form Preservation module and an Identity Preservation module to maintain unedited regions and object identity, respectively. A training-time random distorter (R-O-DisCo) and an inference-time adaptive distorter (A-O-DisCo) enable multi-granularity control during editing. Through extensive experiments across eight tasks and thorough ablations, the approach achieves state-of-the-art results on most benchmarks and demonstrates improved efficiency over prior multi-task and specialized models. This work proposes a new paradigm where a single unified control signal can drive flexible, high-fidelity video editing with reduced resource demands.

Abstract

Diffusion models have recently advanced video editing, yet controllable editing remains challenging due to the need for precise manipulation of diverse object properties. Current methods require different control signal for diverse editing tasks, which complicates model design and demands significant training resources. To address this, we propose O-DisCo-Edit, a unified framework that incorporates a novel object distortion control (O-DisCo). This signal, based on random and adaptive noise, flexibly encapsulates a wide range of editing cues within a single representation. Paired with a "copy-form" preservation module for preserving non-edited regions, O-DisCo-Edit enables efficient, high-fidelity editing through an effective training paradigm. Extensive experiments and comprehensive human evaluations consistently demonstrate that O-DisCo-Edit surpasses both specialized and multitask state-of-the-art methods across various video editing tasks. https://cyqii.github.io/O-DisCo-Edit.github.io/

O-DisCo-Edit: Object Distortion Control for Unified Realistic Video Editing

TL;DR

Abstract

O-DisCo-Edit: Object Distortion Control for Unified Realistic Video Editing

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (16)