CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering

Hang Lv; Sheng Liang; Hao Wang; Hongchao Gu; Yaxiong Wu; Wei Guo; Defu Lian; Yong Liu; Enhong Chen

CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering

Hang Lv, Sheng Liang, Hao Wang, Hongchao Gu, Yaxiong Wu, Wei Guo, Defu Lian, Yong Liu, Enhong Chen

TL;DR

CoSteer is proposed, a collaborative framework that enables tuning-free, real-time personalization via decoding-time adaptation, and generates high-quality personalized content, ensuring both effectiveness and computational efficiency.

Abstract

Personalization has become crucial for adapting models to the diverse and evolving needs of users across cultural, temporal, and contextual dimensions. While existing methods often rely on centralized fine-tuning or static preference alignment within a single model, they struggle to achieve both real-time and high-quality personalization under the resource and privacy constraints of personal devices. To address this challenge, we propose CoSteer, a collaborative framework that enables tuning-free, real-time personalization via decoding-time adaptation. By leveraging logit differences between context-aware and context-agnostic local small models, CoSteer steers cloud-based large models, ensuring effective personalization while preserving the large model's capabilities. Personalization is handled locally, with only final tokens sent to the cloud, maintaining both user context and system efficiency. Through extensive experiments across a wide range of tasks, we demonstrate that CoSteer generates high-quality personalized content, ensuring both effectiveness and computational efficiency. Our results highlight its robustness across models and environments, confirming its practical applicability in real-world scenarios.

CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering

TL;DR

Abstract

CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)