Skip to content

mohammedar95/NLP-content-based-recommendation-engine_pt2

Repository files navigation

TMDB Open Source API to apply NLP Content-Based Recommendation System | Streaming App Data (Part 2)

Publication:

https://medium.com/@malrashidan95/nlp-content-based-recommendation-system-streaming-app-data-part-2-2e60b2d6b0fc

Why Part 2?

If we noticed from the data we have from the previous post (Recommendation System Using (Rank, User-User, Matrix Factorization) on Streaming App). We have limited advantages from the dataset. Some of the examples we lack of is:

  • movie/tv year released to understand years' trends
  • full-content details such as overviews of the content would help us in applying content-based recommendations
  • more tags of genre because we know each content can have more genre
  • metrics of each content such as ratings, votes, and reviews. These can help identify which content has more value. Since STC did not add their ratings, we cannot conclude local popularity by using ratings because these ratings were collected globally from TMDB. Therefore, we can add ratings globally. For example, "Most Voted Movies Globally"

Business Understanding

In the previous article, I had planned to add a content-based recommendation system but unfortunately, that was not possible due to a lack of data points from STC that can help us to apply the content-based recommendation. In this article, our main goal is to make that happen by feeding our data with another source of data using TMDB OpenSource API. This will help us fetch required data points such as release date, overview, and genre.

Objective

Our goal here is simple, we will try to get as much information as possible to feed our dataset. We have a list of Movies/TVs from STC JAWWY, and we want to use TMDB API to get more information about STC movies. Steps:

  1. Identfy Movies and TV Shows Names from STC datasets
  2. Search these the Movies and TV Shows in TMDB API
  3. Merge new variables into our main df
  4. Build content-based recommendation system based on cosine similarity

How to Run:

In the project's root directory:

  • Simply download the data from the data folder
  • Run the notebook and make sure to install required libraries (TMDB, SKLEARN, PANDAS)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published