One for Dozens: Adaptive REcommendation for All Domains with Counterfactual Augmentation
Huishi Luo, Yiwen Chen, Yiqing Wu, Fuzhen Zhuang, Deqing Wang
TL;DR
This paper addresses multi-domain recommendation with dozens of domains by introducing AREAD, a framework that combines a Hierarchical Expert Integration (HEI) with Hierarchical Expert Mask Pruning (HEMP) and a popularity-based counterfactual augmenter. HEI enables cross-domain knowledge sharing across multiple granularities using a compact hierarchical network, while HEMP learns domain-specific masks to identify effective transfers, guided by the Lottery Ticket Hypothesis. The counterfactual augmenter augments data for minor domains by leveraging cross-domain genuine-interest signals from popular items, mitigating data sparsity. Across two public datasets with over twenty domains, AREAD delivers consistent gains, especially for minor domains, while maintaining reasonable training and maintenance costs and providing interpretable insights into domain relationships and expert usage.
Abstract
Multi-domain recommendation (MDR) aims to enhance recommendation performance across various domains. However, real-world recommender systems in online platforms often need to handle dozens or even hundreds of domains, far exceeding the capabilities of traditional MDR algorithms, which typically focus on fewer than five domains. Key challenges include a substantial increase in parameter count, high maintenance costs, and intricate knowledge transfer patterns across domains. Furthermore, minor domains often suffer from data sparsity, leading to inadequate training in classical methods. To address these issues, we propose Adaptive REcommendation for All Domains with counterfactual augmentation (AREAD). AREAD employs a hierarchical structure with a limited number of expert networks at several layers, to effectively capture domain knowledge at different granularities. To adaptively capture the knowledge transfer pattern across domains, we generate and iteratively prune a hierarchical expert network selection mask for each domain during training. Additionally, counterfactual assumptions are used to augment data in minor domains, supporting their iterative mask pruning. Our experiments on two public datasets, each encompassing over twenty domains, demonstrate AREAD's effectiveness, especially in data-sparse domains. Source code is available at https://github.com/Chrissie-Law/AREAD-Multi-Domain-Recommendation.
