From Entity Reliability to Clean Feedback: An Entity-Aware Denoising Framework Beyond Interaction-Level Signals
Ze Liu, Xianquan Wang, Shuochen Liu, Jie Ma, Huibo Xu, Yupeng Han, Kai Zhang, Jun Zhou
TL;DR
The paper tackles noise in implicit-feedback for recommender systems by shifting focus from interaction-level to entity-level reliability. It introduces EARD, a lightweight, model-agnostic framework that assigns weights to training instances using both interaction-level loss-based signals and entity-level reliability estimated from average losses, via an ECDF-based base weighting and a rank-based mapping to entity reputations. The final weight for each (user, item) pair is the product of base, user, and item weights, enabling effective denoising with minimal hyperparameter tuning ($\alpha,\beta$). Extensive experiments across Yelp, Amazon-Book, and ML-1M on GMF, NeuMF, and CDAE show significant improvements over state-of-the-art baselines, with favorable efficiency compared to heavy denoising methods. The work demonstrates the practical value of entity-aware reliability modeling for robust recommendations in noisy, large-scale settings.
Abstract
Implicit feedback is central to modern recommender systems but is inherently noisy, often impairing model training and degrading user experience. At scale, such noise can mislead learning processes, reducing both recommendation accuracy and platform value. Existing denoising strategies typically overlook the entity-specific nature of noise while introducing high computational costs and complex hyperparameter tuning. To address these challenges, we propose \textbf{EARD} (\textbf{E}ntity-\textbf{A}ware \textbf{R}eliability-\textbf{D}riven Denoising), a lightweight framework that shifts the focus from interaction-level signals to entity-level reliability. Motivated by the empirical observation that training loss correlates with noise, EARD quantifies user and item reliability via their average training losses as a proxy for reputation, and integrates these entity-level factors with interaction-level confidence. The framework is \textbf{model-agnostic}, \textbf{computationally efficient}, and requires \textbf{only two intuitive hyperparameters}. Extensive experiments across multiple datasets and backbone models demonstrate that EARD yields substantial improvements over state-of-the-art baselines (e.g., up to 27.01\% gain in NDCG@50), while incurring negligible additional computational cost. Comprehensive ablation studies and mechanism analyses further confirm EARD's robustness to hyperparameter choices and its practical scalability. These results highlight the importance of entity-aware reliability modeling for denoising implicit feedback and pave the way for more robust recommendation research.
