|
59 | 59 | "source": [
|
60 | 60 | "The data were previously available on the open data portal of the city, and we downloaded them in the `CSV` format, and provided the original file as `data/fietstellingencoupure.csv`.\n",
|
61 | 61 | "\n",
|
62 |
| - "This data set contains the historical data of the bike counters, and consists of the following columns:\n", |
| 62 | + "This dataset contains the historical data of the bike counters, and consists of the following columns:\n", |
63 | 63 | "\n",
|
64 | 64 | "- The first column `datum` is the date, in `dd/mm/yy` format\n",
|
65 | 65 | "- The second column `tijd` is the time of the day, in `hh:mm` format\n",
|
|
633 | 633 | "cell_type": "markdown",
|
634 | 634 | "metadata": {},
|
635 | 635 | "source": [
|
636 |
| - "However, when we already know the format of the dates (and if this is consistent throughout the full data set), we can use this information to interpret the dates:" |
| 636 | + "However, when we already know the format of the dates (and if this is consistent throughout the full dataset), we can use this information to interpret the dates:" |
637 | 637 | ]
|
638 | 638 | },
|
639 | 639 | {
|
|
673 | 673 | "cell_type": "markdown",
|
674 | 674 | "metadata": {},
|
675 | 675 | "source": [
|
676 |
| - "### Write the data set cleaning as a function\n", |
| 676 | + "### Write the dataset cleaning as a function\n", |
677 | 677 | "\n",
|
678 | 678 | "In order to make it easier to reuse the code for the pre-processing we have implemented, let's convert the code to a Python function:"
|
679 | 679 | ]
|
|
817 | 817 | "cell_type": "markdown",
|
818 | 818 | "metadata": {},
|
819 | 819 | "source": [
|
820 |
| - "### Store our collected data set as an interim data product" |
| 820 | + "### Store our collected dataset as an interim data product" |
821 | 821 | ]
|
822 | 822 | },
|
823 | 823 | {
|
|
847 | 847 | "cell_type": "markdown",
|
848 | 848 | "metadata": {},
|
849 | 849 | "source": [
|
850 |
| - "We now have a cleaned-up data set of the bike counts at Coupure Links in Ghent (Belgium). Next, we want to get an impression of the characteristics and properties of the data" |
| 850 | + "We now have a cleaned-up dataset of the bike counts at Coupure Links in Ghent (Belgium). Next, we want to get an impression of the characteristics and properties of the data" |
851 | 851 | ]
|
852 | 852 | },
|
853 | 853 | {
|
|
1549 | 1549 | "cell_type": "code",
|
1550 | 1550 | "execution_count": 32,
|
1551 | 1551 | "metadata": {
|
1552 |
| - "clear_cell": false, |
1553 |
| - "collapsed": false, |
1554 |
| - "jupyter": { |
1555 |
| - "outputs_hidden": false |
1556 |
| - } |
| 1552 | + "clear_cell": true, |
| 1553 | + "tags": [] |
1557 | 1554 | },
|
1558 | 1555 | "outputs": [
|
1559 | 1556 | {
|
|
1813 | 1810 | "\n",
|
1814 | 1811 | "**EXERCISE**\n",
|
1815 | 1812 | "\n",
|
1816 |
| - "- Select a subset of the data set from 2013-12-31 12:00:00 until 2014-01-01 12:00:00 and assign the result to a new variable `newyear`\n", |
| 1813 | + "- Select a subset of the dataset from 2013-12-31 12:00:00 until 2014-01-01 12:00:00 and assign the result to a new variable `newyear`\n", |
1817 | 1814 | "- Plot the selected data `newyear`.\n",
|
1818 | 1815 | "- Use a `rolling` function with a window of 10 values (check documentation of the function) to smooth the data of this period and make a plot of the smoothed version.\n",
|
1819 | 1816 | "\n",
|
|
0 commit comments