CMPT-353 Project

Project code for CMPT 353, Summer 2019. The project description can be found at: https://coursys.sfu.ca/2019su-cmpt-353-d1/pages/Project

WikiData Movie

Description: Data analysis and machine learning on WikiData movie data.

Objective: Predict audience average ratings

Required Libraries

sys
pandas
numpy
sklearn
matplotlib
scipy
statsmodels

Usage

The first thing you should run:

$ python3 preproc.py

This imports the data and creates the CSV file ml.csv.
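
Because later steps read ml.csv, it can help to confirm the file exists before running them. The following is a minimal sketch using only the standard library; the helper name `missing_inputs` is hypothetical and not part of this repository.

```python
from pathlib import Path


def missing_inputs(directory=".", required=("ml.csv",)):
    """Return the names of required files that are absent from `directory`."""
    base = Path(directory)
    return [name for name in required if not (base / name).is_file()]


if __name__ == "__main__":
    missing = missing_inputs()
    if missing:
        print("Run preproc.py first; missing:", ", ".join(missing))
    else:
        print("ml.csv found; the scripts that read it can be run.")
```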

The remaining scripts can be run in any order.

Machine Learning: Reads ml.csv into a dataframe and runs ML analysis to try to predict audience averages. Results are printed to the screen.

$ python3 ml.py
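
As a rough illustration of this step, the sketch below takes a dataframe, splits it into training and validation sets, and fits a regressor to predict the audience average. It is not the code in ml.py: the column names (`critic_average`, `critic_percent`, `audience_average`), the choice of `LinearRegression`, and the small made-up table standing in for ml.csv are all assumptions.

```python
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split


def fit_audience_model(df, features, target="audience_average"):
    """Fit a linear model on `features` and return (model, validation R^2)."""
    # Drop rows with missing values in any column the model uses.
    data = df.dropna(subset=features + [target])
    X_train, X_valid, y_train, y_valid = train_test_split(
        data[features], data[target], test_size=0.25, random_state=0
    )
    model = LinearRegression().fit(X_train, y_train)
    return model, model.score(X_valid, y_valid)


# Made-up rows standing in for pd.read_csv("ml.csv"); column names are hypothetical.
demo = pd.DataFrame({
    "critic_average": [5.0, 6.5, 7.2, 8.1, 4.3, 9.0, 6.0, 7.8],
    "critic_percent": [40, 62, 75, 88, 30, 95, 55, 80],
    "audience_average": [2.9, 3.3, 3.6, 4.0, 2.6, 4.4, 3.1, 3.8],
})
model, r2 = fit_audience_model(demo, ["critic_average", "critic_percent"])
print(f"validation R^2: {r2:.3f}")
```

With the real file, `demo` would be replaced by the dataframe read from ml.csv and the feature list by whichever columns that file actually contains.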

Graphs: Reads ml.csv into a dataframe and saves charts and graphs to the directory. The images are displayed in the report. You can optionally comment out the plt.savefig() line and use plt.show() instead.

$ python3 graphs.py
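
The choice between saving and displaying a figure can be sketched as follows. The function name `save_or_show`, its `show` flag, and the scatter plot itself are hypothetical; only `plt.savefig()` and `plt.show()` come from the description above (the sketch calls the equivalent `fig.savefig()`).

```python
import matplotlib.pyplot as plt


def save_or_show(x, y, title, out_path, show=False):
    """Draw a scatter plot, then either display it or save it to `out_path`."""
    fig, ax = plt.subplots()
    ax.scatter(x, y)
    ax.set_title(title)
    if show:
        plt.show()             # opens an interactive window; nothing is written
    else:
        fig.savefig(out_path)  # writes the image file to disk
    plt.close(fig)             # release the figure's memory
```

For example, `save_or_show([1, 2, 3], [2, 3, 5], "demo", "demo.png")` would write demo.png to the current directory.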

Stats: Reads omdb-data.json.gz, rotten-tomatoes.json.gz, wikidata-movies.json.gz, and genres.json.gz into a dataframe and saves the results of the statistical analysis to genre_pvalue.csv, actor_pvalue.csv, and director_pvalue.csv.

$ python3 stats2.py
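
Reading one of the compressed inputs can be sketched with the standard library alone. This assumes each .json.gz file holds one JSON object per line, which this README does not state. The helper name `read_json_lines` is hypothetical, and stats2.py may load the data differently.

```python
import gzip
import json


def read_json_lines(path):
    """Return a list of dicts from a gzip-compressed, line-delimited JSON file."""
    records = []
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines
                records.append(json.loads(line))
    return records
```

Under the same assumption, passing the returned list to `pandas.DataFrame(...)` would give a dataframe with one row per record.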
