Table of Contents
Fetching ...

Content-based Recommendation Engine for Video Streaming Platform

Puskal Khadka, Prabhav Lamichhane

TL;DR

This paper addresses the challenge of presenting relevant video recommendations in large streaming catalogs by adopting a content-based approach that relies on item metadata. It constructs TF-IDF representations from features such as Genre, Cast, and Overview, and uses cosine similarity to quantify item likeness for personalized suggestions. A user-centric evaluation on a small dataset yields precision ~0.91, recall ~0.83, and F1 ~0.87, demonstrating the method's effectiveness when user data is sparse. The approach offers fast, personalized recommendations without requiring cross-user data, and the authors discuss extending it with hybrid signals and deep language embeddings to further improve performance.

Abstract

Recommendation engines suggest content, products, or services to the user by using machine learning algorithms. This paper proposes a content-based recommendation engine that provides personalized video suggestions based on users' previous interactions and preferences. The engine uses TF-IDF (Term Frequency-Inverse Document Frequency) text vectorization technique to evaluate the relevance of words in video descriptions, followed by the computation of cosine similarity between content items to determine their degree of similarity. The system's performance is evaluated using precision, recall, and F1-score metrics. Experimental results demonstrate the effectiveness of content-based filtering in delivering relevant and personalized video recommendations to users. This approach can enhance user engagement on video streaming platforms and reduce search time, providing a more intuitive, preference-based viewing experience.

Content-based Recommendation Engine for Video Streaming Platform

TL;DR

This paper addresses the challenge of presenting relevant video recommendations in large streaming catalogs by adopting a content-based approach that relies on item metadata. It constructs TF-IDF representations from features such as Genre, Cast, and Overview, and uses cosine similarity to quantify item likeness for personalized suggestions. A user-centric evaluation on a small dataset yields precision ~0.91, recall ~0.83, and F1 ~0.87, demonstrating the method's effectiveness when user data is sparse. The approach offers fast, personalized recommendations without requiring cross-user data, and the authors discuss extending it with hybrid signals and deep language embeddings to further improve performance.

Abstract

Recommendation engines suggest content, products, or services to the user by using machine learning algorithms. This paper proposes a content-based recommendation engine that provides personalized video suggestions based on users' previous interactions and preferences. The engine uses TF-IDF (Term Frequency-Inverse Document Frequency) text vectorization technique to evaluate the relevance of words in video descriptions, followed by the computation of cosine similarity between content items to determine their degree of similarity. The system's performance is evaluated using precision, recall, and F1-score metrics. Experimental results demonstrate the effectiveness of content-based filtering in delivering relevant and personalized video recommendations to users. This approach can enhance user engagement on video streaming platforms and reduce search time, providing a more intuitive, preference-based viewing experience.
Paper Structure (16 sections, 7 equations, 2 figures, 8 tables)

This paper contains 16 sections, 7 equations, 2 figures, 8 tables.

Figures (2)

  • Figure 1: Heatmap showing cosine similarity between movies
  • Figure 2: Venn diagram illustrating the relationship between precision and recall