Skip to content

Commit 310ded7

Browse files
authored
Add web and slides form template generated version (#170)
* Add web and slides form template generated version * Remove unused files
1 parent 835226f commit 310ded7

37 files changed

+10748
-793
lines changed

CONTRIBUTING.md

Lines changed: 0 additions & 46 deletions
This file was deleted.
Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
title: Data manipulation, analysis and visualisation in Python
2-
logo:
2+
logo:
33
description: Specialist course Doctoral schools of Ghent University
44
show_downloads: true
5-
theme: jekyll-theme-minimal
5+
theme: jekyll-theme-minimal

docs/contributing.md

Lines changed: 46 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,46 @@
1+
---
2+
layout: default
3+
---
4+
5+
# Contributing guide
6+
7+
First of all, thanks for considering contributing to the course! 👍
8+
9+
## How you can contribute
10+
11+
There are several ways you can contribute to this course.
12+
13+
### Share the love ❤️
14+
15+
Think this course is useful? Let others discover it, by telling them in person, via Twitter or a blog post.
16+
17+
### Ask a question ⁉️
18+
19+
Trying out the material and got stuck? Post your question as an [issue on GitHub](https://github.com/jorisvandenbossche/course-python-data/issues). While we cannot offer user support, we'll try to do our best to address it, as questions often lead to the discovery of bugs.
20+
21+
Want to ask a question in private? Contact the course maintainer by [email]([email protected]).
22+
23+
### Propose an idea 💡
24+
25+
Have an idea for to improve the course? Take a look at the [issue list](https://github.com/jorisvandenbossche/course-python-data/issues) to see if it isn't included or suggested yet. If not, suggest your idea as an [issue on GitHub](https://github.com/jorisvandenbossche/course-python-data/issues/new).
26+
27+
### Report a bug 🐛
28+
29+
Using the course and discovered a bug or a typo? That's annoying! Don't let others have the same experience and report it as an [issue on GitHub](https://github.com/jorisvandenbossche/Have an idea for to improve the course? Take a look at the [issue list](https://github.com/jorisvandenbossche/course-python-data/issues) to see if it isn't included or suggested yet. If not, suggest your idea as an [issue on GitHub](https://github.com/jorisvandenbossche/course-python-data/issues/new).
30+
/issues/new) so we can fix it. A good bug report makes it easier for us to do so, so please include:
31+
32+
* Your operating system name and version (e.g. Mac OS 10.13.6).
33+
* Any details about your local setup that might be helpful in troubleshooting.
34+
* Detailed steps to reproduce the bug.
35+
36+
### Contribute code 📝
37+
38+
Care to fix issues or typo's? Awesome! 👏
39+
40+
Some notes to take into account:
41+
42+
- The course material is developed in the [course-python-data](https://github.com/jorisvandenbossche/course-python-data) repository. When updating course material, edit the notebooks in the [course-python-data](https://github.com/jorisvandenbossche/course-python-data) repository, the other ones (the ones used in the tutorial) are generated automatically.
43+
- the exercises are cleared using the `nbtutor` notebook extension: <https://github.com/jorisvandenbossche/nbtutor>
44+
45+
46+

docs/index.md

Lines changed: 65 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,65 @@
1+
# Data manipulation, analysis and visualisation in Python
2+
3+
## Introduction
4+
5+
The handling of data is a recurring task for data analysts. Reading in experimental data, checking its properties,
6+
and creating visualisations may become tedious tasks. Hence, increasing the efficiency in this process is beneficial for many professionals
7+
handling data. Spreadsheet-based software lacks the ability to properly support this process, due to the lack of automation and repeatability.
8+
The usage of a high-level scripting language such as Python is ideal for these tasks.
9+
10+
This course trains participants to use Python effectively to do these tasks. The course focuses on data manipulation and cleaning of tabular data,
11+
explorative analysis and visualisation using important packages such as Pandas, Numpy, Matplotlib and Seaborn.
12+
13+
The course does not cover statistics, data mining, machine learning, or predictive modelling. It aims to provide participants the means to effectively
14+
tackle commonly encountered data handling tasks in order to increase the overall efficiency. These skills are both useful for data cleaning as well as
15+
feature engineering.
16+
17+
The course has been developed as a course for the Specialist course Doctoral schools of Ghent University, but can be taught to others upon request.
18+
19+
## Course info
20+
21+
### Aim & scope
22+
23+
This course is intended for researchers that have at least basic programming skills. A basic (scientific) programming course that is part of
24+
the regular curriculum should suffice. For those who have experience in another programming language (e.g. Matlab, R, ...), following a Python
25+
tutorial prior to the course is advised.
26+
27+
The course is intended for professionals who wish to enhance their general data manipulation and visualization skills in Python, with a specific
28+
focus on tabular data. The course is NOT intended to be a course on statistics or machine learning.
29+
30+
### Program
31+
32+
After setting up the programming environment with the required packages using the conda package manager and an introduction of the Jupyter
33+
notebook environment, the data analysis package Pandas and the plotting packages Matplotlib and Seaborn are introduced. Advanced usage of Pandas
34+
for different data cleaning and manipulation tasks is taught and the acquired skills will immediately be brought into practice to handle real-world
35+
data sets. Applications include time series handling, categorical data, merging data, tidy data,...
36+
37+
The course closes with a discussion on the scientific Python ecosystem and the visualisation landscape learning
38+
participants to create interactive charts.
39+
40+
## Getting started
41+
42+
The course uses Python 3 and some data analysis packages such as Pandas, Seaborn, Numpy and Matplotlib. To install the required libraries,
43+
we recommend Anaconda or miniconda ([https://www.anaconda.com/download/](https://www.anaconda.com/download/)) or another Python distribution that
44+
includes the scientific libraries (this recommendation applies to all platforms, so for both Window, Linux and Mac).
45+
46+
For detailed instructions to get started on your local machine, see the [setup instructions](./setup.html).
47+
48+
In case you do not want to install everything and just want to try out the course material, use the environment setup by
49+
Binder [![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/jorisvandenbossche/DS-python-data-analysis/HEAD) and open de notebooks
50+
rightaway (inside the `notebooks` directory).
51+
52+
## Slides
53+
54+
For the course slides, click [here](https://jorisvandenbossche.github.io/DS-python-data-analysis/slides.html).
55+
56+
## Contributing
57+
58+
Found any typo or have a suggestion, see [how to contribute](./contributing.html).
59+
60+
## Meta
61+
Authors: Joris Van den Bossche, Stijn Van Hoey
62+
63+
<img src="./static/img/logo_flanders+richtingmorgen.png" width="79%">
64+
<img src="./static/img/doctoralschoolsprofiel_hq_rgb_web.png" width="20%">
65+

0 commit comments

Comments
 (0)