Movie Recommendation System

A movie recommendation system that combines three approaches: popularity-based, content-based, and collaborative recommendation.

@@ Technologies - ML, NLP, Matrix Factorization @@
@@ Prerequisites - Python, ML, NLP, Linear Algebra @@
@@ Domain - Entertainment @@

About the Dataset

https://www.kaggle.com/rounakbanik/movie-recommender-systems/data

The dataset consists of metadata for all 45,000 movies listed in the Full MovieLens Dataset, covering movies released on or before July 2017. Data points include cast, crew, plot keywords, budget, revenue, posters, release dates, languages, production companies, countries, TMDB vote counts and vote averages.

This dataset consists of the following files:

movies_metadata.csv: The main Movies Metadata file. Contains information on 45,000 movies featured in the Full MovieLens dataset. Features include posters, backdrops, budget, revenue, release dates, languages, production countries and companies.

keywords.csv: Contains the movie plot keywords for our MovieLens movies. Available in the form of a stringified JSON Object.

credits.csv: Consists of Cast and Crew Information for all our movies. Available in the form of a stringified JSON Object.

links.csv: The file that contains the TMDB and IMDB IDs of all the movies featured in the Full MovieLens dataset.

links_small.csv: Contains the TMDB and IMDB IDs of a small subset of 9,000 movies of the Full Dataset.

ratings_small.csv: A subset of 100,000 ratings from 700 users on 9,000 movies.
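Since keywords.csv and credits.csv store their values as stringified JSON objects, each cell has to be parsed before use. The sketch below is one possible way to do that, not necessarily what the notebooks in this repository do. It assumes the cells use Python-literal quoting (single quotes), in which case `json.loads` would fail and `ast.literal_eval` is a common workaround. The helper name `parse_names` and the sample string are made up for illustration.

```python
import ast


def parse_names(cell):
    """Return the 'name' field of each entry in a stringified list of dicts.

    Hypothetical helper: returns an empty list for missing or malformed cells.
    """
    try:
        items = ast.literal_eval(cell)
    except (ValueError, SyntaxError, TypeError):
        # Covers NaN cells, empty strings, and text that is not a literal.
        return []
    if not isinstance(items, list):
        return []
    return [item["name"] for item in items
            if isinstance(item, dict) and "name" in item]


# Illustrative cell in the assumed format (not copied from the dataset).
sample = "[{'id': 931, 'name': 'jealousy'}, {'id': 4290, 'name': 'toy'}]"
names = parse_names(sample)
print(names)
```

With pandas, a helper like this could be applied to a whole column, for example `df["keywords"].apply(parse_names)`, assuming the column is named `keywords`.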

Project Plan

● Merging all the given datasets (credits.csv, keywords.csv, links.csv, links_small.csv, movies_metadata.csv, ratings_small.csv)

○ credits and keywords, credits and movies_metadata on id

○ ratings_small and links on movieId

○ links and credits on tmdbId

● Data cleaning

● Exploratory Data Analysis (Data Visualisations)

● Data Preprocessing

● Save the DataFrame as a CSV file, which is then used by the popularity-based, content-based, and collaborative recommendation systems

● Model Building

○ Weighted Rating for Popularity based Recommendation systems

○ TF-IDF (term frequency - inverse document frequency) for Content Based Recommendation system

○ KNN (K Nearest Neighbors) for Collaborative Based Recommendation system

● All three models are deployed in a single app and can be seen through the nav bar

● Model serialisation and deserialisation

● Deployment using Streamlit, where the user selects or types a movie title to render the output on screen
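The plan above names a weighted rating for the popularity-based model without giving the formula. A minimal sketch follows, assuming the widely used IMDb-style weighting WR = (v / (v + m)) * R + (m / (v + m)) * C, where v is a movie's vote count, R its vote average, m a minimum-votes threshold, and C the mean vote average across all movies. Whether this repository uses exactly this formula is an assumption. The function name, the threshold of 100, and the toy movies are hypothetical.

```python
def weighted_rating(vote_count, vote_average, min_votes, mean_vote):
    """IMDb-style weighted rating (assumed formula, see lead-in).

    Movies with few votes are pulled toward the global mean, so a handful
    of very high ratings cannot outrank a well-established title.
    Requires vote_count + min_votes > 0.
    """
    v, r, m, c = vote_count, vote_average, min_votes, mean_vote
    return (v / (v + m)) * r + (m / (v + m)) * c


# Toy data, made up for illustration only.
movies = [
    {"title": "Movie A", "vote_count": 1000, "vote_average": 8.0},
    {"title": "Movie B", "vote_count": 5, "vote_average": 9.5},
    {"title": "Movie C", "vote_count": 500, "vote_average": 6.0},
    {"title": "Movie D", "vote_count": 300, "vote_average": 5.0},
]

mean_vote = sum(mv["vote_average"] for mv in movies) / len(movies)
min_votes = 100  # hypothetical threshold; often chosen as a vote-count quantile

ranked = sorted(
    movies,
    key=lambda mv: weighted_rating(
        mv["vote_count"], mv["vote_average"], min_votes, mean_vote
    ),
    reverse=True,
)
ranked_titles = [mv["title"] for mv in ranked]
print(ranked_titles)
```

In this toy data, Movie B has the highest raw average but only five votes, so the weighting ranks Movie A above it.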

To see the interface of the app

See how it looks at laptop and mobile resolutions

+ Go to the Deployed app video folder

Click on this to see the interface of the app, recorded as mp4 files
