Theoretically informed selection of latent activation in autoencoder based recommender systems

Aviad Susman

Theoretically informed selection of latent activation in autoencoder based recommender systems

Aviad Susman

TL;DR

This work identifies three key mathematical properties that the encoder in an autoencoder should exhibit to improve recommendation accuracy and demonstrates that common activation functions, such as ReLU and tanh, cannot fulfill these properties jointly within a generalizable framework.

Abstract

Autoencoders may lend themselves to the design of more accurate and computationally efficient recommender systems by distilling sparse high-dimensional data into dense lower-dimensional latent representations. However, designing these systems remains challenging due to the lack of theoretical guidance. This work addresses this by identifying three key mathematical properties that the encoder in an autoencoder should exhibit to improve recommendation accuracy: (1) dimensionality reduction, (2) preservation of similarity ordering in dot product comparisons, and (3) preservation of non-zero vectors. Through theoretical analysis, we demonstrate that common activation functions, such as ReLU and tanh, cannot fulfill these properties jointly within a generalizable framework. In contrast, sigmoid-like activations emerge as suitable choices for latent activations. This theoretically informed approach offers a more systematic method for hyperparameter selection, enhancing the efficiency of model design.

Theoretically informed selection of latent activation in autoencoder based recommender systems

TL;DR

Abstract

Theoretically informed selection of latent activation in autoencoder based recommender systems

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (4)