Why Thinking Hurts? Diagnosing and Rectifying the Reasoning Shift in Foundation Recommender Models
Luankang Zhang, Yonghao Huang, Hang Lv, Mingjia Yin, Liangyue Li, Zulong Chen, Hao Wang, Enhong Chen
TL;DR
By compressing reasoning chains and applying bias-subtracted contrastive decoding, this approach mitigates ungrounded textual drift and effectively calibrates inference, allowing foundation models to leverage reasoning without sacrificing ID-grounded accuracy.
Abstract
Integrating Chain-of-Thought (CoT) reasoning into Semantic ID-based recommendation foundation models (such as OpenOneRec) often paradoxically degrades recommendation performance. We identify the root cause as textual inertia from the General Subspace, where verbose reasoning dominates inference and causes the model to neglect critical Semantic ID. To address this, we propose a training-free Inference-Time Subspace Alignment framework. By compressing reasoning chains and applying bias-subtracted contrastive decoding, our approach mitigates ungrounded textual drift. Experiments show this effectively calibrates inference, allowing foundation models to leverage reasoning without sacrificing ID-grounded accuracy.
