Why is Normalization Necessary for Linear Recommenders?
Seongmin Park, Mincheol Yoon, Hye-young Kim, Jongwuk Lee
TL;DR
This work tackles popularity and neighborhood biases in linear autoencoder (LAE)–based recommenders by examining existing normalization methods and introducing Data-Adaptive Normalization (DAN). DAN provides item- and user-adaptive normalization that adjusts to dataset-specific skewness and homophily, with a closed-form solution that preserves eigenstructure while enabling denoising-like benefits. Empirical results across six benchmark datasets show that DAN-equipped LAEs achieve significant gains, particularly for long-tail items and unbiased evaluations, while maintaining computational efficiency. The approach is model-agnostic within LAEs and lays a path for extending adaptive normalization to neural models, offering practical impact for scalable, accurate recommendation in diverse data regimes.
Abstract
Despite their simplicity, linear autoencoder (LAE)-based models have shown comparable or even better performance with faster inference speed than neural recommender models. However, LAEs face two critical challenges: (i) popularity bias, which tends to recommend popular items, and (ii) neighborhood bias, which overly focuses on capturing local item correlations. To address these issues, this paper first analyzes the effect of two existing normalization methods for LAEs, i.e., random-walk and symmetric normalization. Our theoretical analysis reveals that normalization highly affects the degree of popularity and neighborhood biases among items. Inspired by this analysis, we propose a versatile normalization solution, called Data-Adaptive Normalization (DAN), which flexibly controls the popularity and neighborhood biases by adjusting item- and user-side normalization to align with unique dataset characteristics. Owing to its model-agnostic property, DAN can be easily applied to various LAE-based models. Experimental results show that DAN-equipped LAEs consistently improve existing LAE-based models across six benchmark datasets, with significant gains of up to 128.57% and 12.36% for long-tail items and unbiased evaluations, respectively. Refer to our code in https://github.com/psm1206/DAN.
