VSF: Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip

Wenqi Guo; Shan Du

VSF: Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip

Wenqi Guo, Shan Du

TL;DR

VSF addresses the persistent challenge of enforcing negative prompts in fast, few-step diffusion and flow-matching models by dynamically flipping the sign of negative-prompt attention contributions at the token level. By combining adaptive attention with careful duplication/masking of negative embeddings, VSF achieves stronger negative-content avoidance while preserving positive prompt fidelity and image quality, even in extremely fast generation regimes. Empirical results on NegGenBench and multiple baselines show VSF outperforming NASA, NAG, and CFG in negative adherence, with competitive or superior quality, and qualitative attention analyses corroborate the mechanism. The approach is practical, computationally efficient, and broadly compatible with contemporary transformer-based diffusion architectures, offering a straightforward path to safer and more controllable image and video generation.

Abstract

We introduce Value Sign Flip (VSF), a simple and efficient method for incorporating negative prompt guidance in few-step diffusion and flow-matching image generation models. Unlike existing approaches such as classifier-free guidance (CFG), NASA, and NAG, VSF dynamically suppresses undesired content by flipping the sign of attention values from negative prompts. Our method requires only small computational overhead and integrates effectively with MMDiT-style architectures such as Stable Diffusion 3.5 Turbo, as well as cross-attention-based models like Wan. We validate VSF on challenging datasets with complex prompt pairs and demonstrate superior performance in both static image and video generation tasks. Experimental results show that VSF significantly improves negative prompt adherence compared to prior methods in few-step models, and even CFG in non-few-step models, while maintaining competitive image quality. Code and ComfyUI node are available in https://github.com/weathon/VSF/tree/main.

VSF: Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip

TL;DR

Abstract

VSF: Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (19)

Theorems & Definitions (1)