Skip to content

Latest commit

 

History

History
20 lines (19 loc) · 973 Bytes

File metadata and controls

20 lines (19 loc) · 973 Bytes

Project3

  1. About Dataset- This dataset contains the 2022 and 2023 global mart sales datasets.
  2. Used Python version 3.10.10 and Jupyter version 7.2.1 for this project.
  3. Used Kaggle API to Download Data from Kaggle.
  4. Used Jupyter Notebook to read data into Pandas dataframe, and transform data using Python.
  5. Transformation includes- renaming column , changing data type, adding calculated columns dropping unnecessary columns
  6. Used SQLalchemy to load data into df_orders table into prebuilt Database project4python_sql.
  7. Used window authentication to load data into SSMS and ODBC driver for connection.
  8. Analysis of Data in SSMS includes - find the top 10 highest revenue-generating product find the top 5 products in each region by highest revenue-generating find month-over-month growth for years 2022 and 2023 for each category which month has the highest sales which subcategory has the highest YoY growth