YINYANG-ALIGN: Benchmarking Contradictory Objectives and Proposing Multi-Objective Optimization based DPO for Text-to-Image Alignment
Amitava Das, Yaswanth Narsupalli, Gurpreet Singh, Vinija Jain, Vasu Sharma, Suranjana Trivedy, Aman Chadha, Amit Sheth
TL;DR
YinYangAlign tackles the problem of robust Text-to-Image alignment by introducing six inherently conflicting objectives and a holistic benchmark to evaluate how T2I models balance them. It proposes Contradictory Alignment Optimization (CAO), a multi-objective extension of Direct Preference Optimization (DPO) that couples per-axiom losses with a global synergy term and a Synergy Jacobian to stabilize gradient interactions, enabling Pareto-aware optimization. The framework is illustrated with a detailed annotation pipeline and a rich loss design that includes Artistic Freedom, Faithfulness to Prompt, Emotional Impact, Originality vs Referentiality, Verifiability, and Cultural Sensitivity, with Sinkhorn regularization and CLIP-based references underpinning key components. Empirical results show single-axiom DPO degrades other objectives, while CAO achieves balanced improvements across all six axes and exposes Pareto-front trade-offs, demonstrating improved multi-objective alignment for practical, ethical, and creative T2I applications. The YinYangAlign benchmark and CAO framework thus provide a scalable, interpretable pathway toward ethically aware, user-tailored, and high-fidelity text-to-image generation in real-world settings.
Abstract
Precise alignment in Text-to-Image (T2I) systems is crucial to ensure that generated visuals not only accurately encapsulate user intents but also conform to stringent ethical and aesthetic benchmarks. Incidents like the Google Gemini fiasco, where misaligned outputs triggered significant public backlash, underscore the critical need for robust alignment mechanisms. In contrast, Large Language Models (LLMs) have achieved notable success in alignment. Building on these advancements, researchers are eager to apply similar alignment techniques, such as Direct Preference Optimization (DPO), to T2I systems to enhance image generation fidelity and reliability. We present YinYangAlign, an advanced benchmarking framework that systematically quantifies the alignment fidelity of T2I systems, addressing six fundamental and inherently contradictory design objectives. Each pair represents fundamental tensions in image generation, such as balancing adherence to user prompts with creative modifications or maintaining diversity alongside visual coherence. YinYangAlign includes detailed axiom datasets featuring human prompts, aligned (chosen) responses, misaligned (rejected) AI-generated outputs, and explanations of the underlying contradictions.
