We use this workshop to learn about data science using real-life examples.
We use Python as our main programming language for this purpose.

Link to mindmap: (https://bubbl.us/010902974515863474)
Github allows us to collaborate on projects.
Getting started guide
Pythontutor allows us to visualize our code step-by-step
Travis-CI - for unit testing (comparing output of code versions, check differences)
pandas - A powerful data analysis module for python
Data sets:
- Worldbank data
- Data sets for data science
- Berkeley Earth - Global temperature data
- Open Refine - edit/cleanup data sets
- Plotly - Plot/visualize data
CSV
- ETL - extract, transform, load
Here is how to write easy documentation in Markdown.
We use markdown editor SDingus and markdown converter pandoc.
- Jacobs, J., & Rudis, B. (2014). Data-Driven Security: Analysis, Visualization and Dashboards. John Wiley & Sons.
See to-do list in issues.
See latest homework issue.
Reading a file with csv.reader returns strings - need to cast as int
"Off by one" -> index starts at 1 in csv