Habitizing Diffusion Planning for Efficient and Effective Decision Making
Haofei Lu, Yifei Shen, Dongsheng Li, Junliang Xing, Dongqi Han
TL;DR
This work tackles the slow inference of diffusion planning for decision making by introducing Habi, a two-stage framework that habitizes diffusion planners into fast habitual policies. Through a Bayesian behavior formulation, Habi learns a lightweight prior over actions and aligns it with the diffusion planner's posterior via an ELBO-based objective, augmented with an adaptive KL weighting and a critic to supervise habitual decisions. The resulting Habitual Inference (HI) runs at over 800 Hz on CPU while achieving competitive or superior performance on the D4RL offline RL benchmarks, and provides robust analyses of prior/posterior alignment, action-sampling effects, and distribution visualizations. This approach bridges cognitive-inspired habit formation with engineering efficiency, enabling real-time, probabilistic decision making in real-world tasks. Practical impact includes enabling diffusion-based decision-making to operate at real-time frequencies on modest hardware without sacrificing performance.
Abstract
Diffusion models have shown great promise in decision-making, also known as diffusion planning. However, the slow inference speeds limit their potential for broader real-world applications. Here, we introduce Habi, a general framework that transforms powerful but slow diffusion planning models into fast decision-making models, which mimics the cognitive process in the brain that costly goal-directed behavior gradually transitions to efficient habitual behavior with repetitive practice. Even using a laptop CPU, the habitized model can achieve an average 800+ Hz decision-making frequency (faster than previous diffusion planners by orders of magnitude) on standard offline reinforcement learning benchmarks D4RL, while maintaining comparable or even higher performance compared to its corresponding diffusion planner. Our work proposes a fresh perspective of leveraging powerful diffusion models for real-world decision-making tasks. We also provide robust evaluations and analysis, offering insights from both biological and engineering perspectives for efficient and effective decision-making.
