Skip to content

OpenClassrooms - Formation Data Scientist - Projet 8

License

Notifications You must be signed in to change notification settings

ahonestla/OC_DS_P8

Repository files navigation

🍓 Classification of fruits images

OPENCLASSROOMS - Data Scientist - Project 8

This repository contains files for a Big Data project that featurize images with a MobileNetV2 model using Google Cloud Platform.

📊 Data

The dataset used for this project is the Fruits 360, which includes more than tens of thousands images of fruits (size 100x100 pixels).

📁 Files

  • notebook_local.ipynb : Images processing using PySpark (local instance)
  • notebook_cloud.ipynb : Images processing using PySpark (notebook uploaded on GCP)
  • pyspark_script.py : Images processing using PySpark (script run on GCP)
  • projet8_presentation.pdf: Final presentation of the project

🛠️ Tools

  • Python 3.x
  • Jupyter Notebook
  • NumPy
  • Pandas
  • Tensorflow
  • PySpark
  • GCP : Storage, Dataproc

About

OpenClassrooms - Formation Data Scientist - Projet 8

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published