Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models
Ding Huang, Ting Li, Jian Huang
TL;DR
This work introduces Bayesian Power Steering (BPS), a Bayesian fine-tuning framework for domain adaptation of diffusion models that converts learning from a large probability space to a task-specific small probability space. By modeling the posterior denoise function $\bar{\boldsymbol{\epsilon}}^*(\mathbf{z}_t,t,\mathbf{c}_{text},\mathbf{c}_{add})$ as a combination of the pretrained denoise and a time-dependent steering term $M$, BPS implements learnable, time-aware interventions across a multi-scale feature hierarchy (CP, TP, EP) while keeping the base model frozen. The method demonstrates data-efficient, high-fidelity adaptation across layout-to-image, segmentation-to-image, artistic drawing, and sketch-to-image tasks, achieving state-of-the-art or competitive results (e.g., FID $=10.49$ on COCO17 sketch) and robust performance under data scarcity. This approach enables practical deployment of diffusion models for user-specific conditioning with limited labeled data, by leveraging structured, task-aware conditioning and a lightweight, discriminative integration strategy. Overall, BPS offers a compact, fast-converging, and versatile framework for domain adaptation in diffusion-based generative systems.
Abstract
We propose a Bayesian framework for fine-tuning large diffusion models with a novel network structure called Bayesian Power Steering (BPS). We clarify the meaning behind adaptation from a \textit{large probability space} to a \textit{small probability space} and explore the task of fine-tuning pre-trained models using learnable modules from a Bayesian perspective. BPS extracts task-specific knowledge from a pre-trained model's learned prior distribution. It efficiently leverages large diffusion models, differentially intervening different hidden features with a head-heavy and foot-light configuration. Experiments highlight the superiority of BPS over contemporary methods across a range of tasks even with limited amount of data. Notably, BPS attains an FID score of 10.49 under the sketch condition on the COCO17 dataset.
