Predicting user ratings on unseen movie which can be used in recommending movies to users
Netflix provided a lot of anonymous rating data, and a prediction accuracy bar that is 10% better than what Cinematch can do on the same training data set. (Accuracy is a measurement of how closely predicted ratings of movies match subsequent actual ratings.)Sources https://www.netflixprize.com/rules.html https://www.kaggle.com/netflix-inc/netflix-prize-data
Objectives: Predict the rating that a user would give to a movie that he ahs not yet rated. Minimize the difference between predicted and actual rating (RMSE and MAPE)
Constraints: Some form of interpretability.
combined_data_1.txt combined_data_2.txt combined_data_3.txt combined_data_4.txt movie_titles.csv
Refer : https://www.kaggle.com/netflix-inc/netflix-prize-data/data
- Sparse matrices (https://docs.scipy.org/doc/scipy/reference/sparse.html)
- SVD and Truncated SVD (https://scikit-learn.org/stable/modules/generated/sklearn.decomposition.TruncatedSVD.html)
- Surprise baseline models http://surpriselib.com/