Efficient Learning of Sparse Representations from Interactions

Vojtěch Vančura; Martin Spišák; Rodrigo Alves; Ladislav Peška

Efficient Learning of Sparse Representations from Interactions

Vojtěch Vančura, Martin Spišák, Rodrigo Alves, Ladislav Peška

TL;DR

This work tackles the challenge of deploying compact yet expressive item representations in large-scale recommender systems by training sparse, high‑dimensional embeddings. It introduces Compressed ELSA, which enforces row‑wise sparsity via gradual pruning and enables efficient inference with CSC/SpMV, while producing interpretable item segments through dominant latent factors and semantic merging. The approach achieves near‑dense accuracy with up to 100x compression, outperforms post‑hoc sparse methods, and provides segment‑level explainability that can guide unified item and segment recommendations. The practical impact lies in enabling scalable, interpretable retrieval pipelines without sacrificing retrieval quality, with code and demos available for deployment and exploration.

Abstract

Behavioral patterns captured in embeddings learned from interaction data are pivotal across various stages of production recommender systems. However, in the initial retrieval stage, practitioners face an inherent tradeoff between embedding expressiveness and the scalability and latency of serving components, resulting in the need for representations that are both compact and expressive. To address this challenge, we propose a training strategy for learning high-dimensional sparse embedding layers in place of conventional dense ones, balancing efficiency, representational expressiveness, and interpretability. To demonstrate our approach, we modified the production-grade collaborative filtering autoencoder ELSA, achieving up to 10x reduction in embedding size with no loss of recommendation accuracy, and up to 100x reduction with only a 2.5% loss. Moreover, the active embedding dimensions reveal an interpretable inverted-index structure that segments items in a way directly aligned with the model's latent space, thereby enabling integration of segment-level recommendation functionality (e.g., 2D homepage layouts) within the candidate retrieval model itself. Source codes, additional results, as well as a live demo are available at https://github.com/zombak79/compressed_elsa

Efficient Learning of Sparse Representations from Interactions

TL;DR

Abstract

Paper Structure (14 sections, 6 equations, 2 figures, 1 table)

This paper contains 14 sections, 6 equations, 2 figures, 1 table.

Introduction
Related Work
Contributions
Method
Compressed ELSA
Pruning strategies
Inference with Sparse Layers
Interpretable Segments from Sparse Latents.
Experiments
Baselines
Accuracy vs. Compression
Pruning Schedules and Embedding Width
Segment Interpretability in Practice.
Conclusions and Limitations

Figures (2)

Figure 1: Ablation study: (a) Comparison of gradual pruning strategies; (b) Performance under different pruning strategies with and without training restarts; (c) Effect of initial embedding dimensionality. All results report nDCG@100 on the Goodbooks-10k dataset.
Figure 2: Agreement between user activations and segment-specific latent dimensions. Gray bars show the user’s sparse latent-factor values, and colored arrows mark dimensions linked to semantic segments recommended to this user. The arrows consistently align with the user’s activation values, indicating a match in segment-level preferences. (Figure extracted from our online demo.)

Efficient Learning of Sparse Representations from Interactions

TL;DR

Abstract

Efficient Learning of Sparse Representations from Interactions

Authors

TL;DR

Abstract

Table of Contents

Figures (2)