Explainable Recommender with Geometric Information Bottleneck
Hanqi Yan, Lin Gui, Menghan Wang, Kun Zhang, Yulan He
TL;DR
GIANT introduces a geometric information bottleneck framework for explainable recommendations by coupling a text-based variational model with a geometry-derived prior from the user-item interaction graph. The geometric prior is obtained via clustering of graph embeddings using a Gaussian kernel, yielding cluster-probability priors that regularize the latent factors learned from reviews through a transferable information bottleneck objective $O_{IB} = I(\bar{X};Z) - \beta \cdot I(X;Z)$. This approach enables sentence-level explanations by selecting reviews from like-minded clusters and produces semantically coherent latent topics, while maintaining competitive rating prediction and improved interpretability across BeerAdvocate, Digital Music, and Office Products datasets. The method demonstrates that global community structure can be incorporated into text-based encoders to enhance interpretability without sacrificing predictive performance, offering practical benefits for transparent and user-trustworthy recommender systems.
Abstract
Explainable recommender systems can explain their recommendation decisions, enhancing user trust in the systems. Most explainable recommender systems either rely on human-annotated rationales to train models for explanation generation or leverage the attention mechanism to extract important text spans from reviews as explanations. The extracted rationales are often confined to an individual review and may fail to identify the implicit features beyond the review text. To avoid the expensive human annotation process and to generate explanations beyond individual reviews, we propose to incorporate a geometric prior learnt from user-item interactions into a variational network which infers latent factors from user-item reviews. The latent factors from an individual user-item pair can be used for both recommendation and explanation generation, which naturally inherit the global characteristics encoded in the prior knowledge. Experimental results on three e-commerce datasets show that our model significantly improves the interpretability of a variational recommender using the Wasserstein distance while achieving performance comparable to existing content-based recommender systems in terms of recommendation behaviours.
