I worked with provision a Spark cluster on AWS EMR, connect it to a Jupyter Notebook and then run a series of queries (in python with DataFrame API or Spark SQL) that answer a few simple questions about the IMDB Data available.
Syedhossain3/Apache_SPARK_AWS_EMR_Analyzing_IMDB_Datasets
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|