Skip to content

Data Wrangling & Visualisation for Data Scientist

Awantik Das edited this page Aug 2, 2019 · 2 revisions

Introduction to Data Wrangling

  1. The journey of a Data Scientist
  2. Why Python becomes the default choice
  3. Tools vs Libraries
  4. Popular libraries
  5. Objectives of Data Wrangling
  6. Challenges of Big Data Ecosystem

Gathering Raw Data

Revision: Concept of python iterator

  1. SQL Databases
  2. NoSQL Databases
  3. Parse XML - Simple & Semi-structured
  4. CSV Parser
  5. From REST Interface
  6. Parsing HTML
  7. JSON parsing
  8. Connecting with Twitter

Essential NumPy

Loading data into DataFrame

Data Cleaning

Shaping & Structuring

Manipulation of Text Data

Storing cleaned data for Data Analysis

Clone this wiki locally