Inversion-Free Style Transfer with Dual Rectified Flows
Yingying Deng, Xiangyu He, Fan Tang, Weiming Dong, Xucheng Yin
TL;DR
This work tackles the inefficiency and distortion introduced by inversion-based training-free style transfer methods by proposing an inversion-free framework built on dual Rectified Flows that operate entirely in forward ODEs. Content and style trajectories are predicted in parallel and fused through a midpoint interpolation, with a carefully designed velocity field that accounts for evolving stylized content and style distributions. An attention injection module further guides style integration, yielding improved visual fidelity and content preservation across diverse styles. Extensive experiments demonstrate that the approach delivers high-quality stylization with better fidelity and generalization, while offering faster inference than diffusion-based baselines. The proposed method provides a practical, scalable pipeline for inversion-free style transfer suitable for real-world creative tools.
Abstract
Style transfer, a pivotal task in image processing, synthesizes visually compelling images by seamlessly blending realistic content with artistic styles, enabling applications in photo editing and creative design. While mainstream training-free diffusion-based methods have greatly advanced style transfer in recent years, their reliance on computationally inversion processes compromises efficiency and introduces visual distortions when inversion is inaccurate. To address these limitations, we propose a novel \textit{inversion-free} style transfer framework based on dual rectified flows, which tackles the challenge of finding an unknown stylized distribution from two distinct inputs (content and style images), \textit{only with forward pass}. Our approach predicts content and style trajectories in parallel, then fuses them through a dynamic midpoint interpolation that integrates velocities from both paths while adapting to the evolving stylized image. By jointly modeling the content, style, and stylized distributions, our velocity field design achieves robust fusion and avoids the shortcomings of naive overlays. Attention injection further guides style integration, enhancing visual fidelity, content preservation, and computational efficiency. Extensive experiments demonstrate generalization across diverse styles and content, providing an effective and efficient pipeline for style transfer.
