3-Data-Visualization/09-visualization-quantities/README.md (8 additions, 8 deletions)
@@ -52,7 +52,7 @@ Let's start by plotting some of the numeric data using a basic line plot. Suppos
wingspan = birds['MaxWingspan']
wingspan.plot()
```

What do you notice immediately? There seems to be at least one outlier - that's quite a wingspan! A 2300 centimeter wingspan equals 23 meters - are there Pterodactyls roaming Minnesota? Let's investigate.
@@ -72,7 +72,7 @@ plt.plot(x, y)
plt.show()
```

Even with the rotation of the labels set to 45 degrees, there are too many to read. Let's try a different strategy: label only those outliers and set the labels within the chart. You can use a scatter chart to make more room for the labeling:
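The labeling strategy described above can be sketched roughly as follows. This is a minimal illustration, not the lesson's own code: the sample rows, the `Name` column, and the 500 cm threshold are all assumptions made so the snippet runs on its own.

```python
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical sample rows standing in for the lesson's birds dataset.
birds = pd.DataFrame({
    "Name": ["Bald eagle", "Prairie falcon", "Mallard", "Ruby-throated hummingbird"],
    "MaxWingspan": [2300.0, 1100.0, 95.0, 11.0],
})

THRESHOLD = 500  # assumed cutoff (cm) above which a wingspan looks suspicious

fig, ax = plt.subplots(figsize=(8, 5))
ax.scatter(birds.index, birds["MaxWingspan"])

# Hide the bottom tick marks and labels, since there are too many names to read.
ax.tick_params(axis="x", which="both", bottom=False, labelbottom=False)

# Write the bird's name next to each point that exceeds the threshold.
outliers = birds[birds["MaxWingspan"] > THRESHOLD]
for idx, row in outliers.iterrows():
    ax.annotate(row["Name"], (idx, row["MaxWingspan"]),
                xytext=(5, 5), textcoords="offset points")

ax.set_ylabel("Wingspan (cm)")
ax.set_title("Max wingspan with outliers labeled")
plt.show()
```

Only the points above the threshold get a label, so the chart stays readable however many birds the dataset contains.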
@@ -94,7 +94,7 @@ What's going on here? You used `tick_params` to hide the bottom labels and then
What did you discover?

## Filter your data
Both the Bald Eagle and the Prairie Falcon, while probably very large birds, appear to be mislabeled, with an extra `0` added to their maximum wingspan. It's unlikely that you'll meet a Bald Eagle with a 25 meter wingspan, but if so, please let us know! Let's create a new dataframe without those two outliers:
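One way to build such a filtered dataframe is a boolean mask on the wingspan column. The sketch below uses made-up sample rows and an assumed cutoff of 500 cm; the `Name` column and the variable names are hypothetical, so adjust them to match the real dataset.

```python
import pandas as pd

# Hypothetical sample rows standing in for the lesson's birds dataset.
birds = pd.DataFrame({
    "Name": ["Bald eagle", "Prairie falcon", "Mallard", "Ruby-throated hummingbird"],
    "MaxWingspan": [2300.0, 1100.0, 95.0, 11.0],
})

MAX_PLAUSIBLE_WINGSPAN = 500  # assumed cutoff in centimeters

# Keep only the rows whose wingspan falls below the cutoff.
filtered_birds = birds[birds["MaxWingspan"] < MAX_PLAUSIBLE_WINGSPAN]

print(filtered_birds)
```

The original `birds` dataframe is left untouched, so you can still go back and inspect the suspicious rows later.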
@@ -114,7 +114,7 @@ plt.show()
With the outliers filtered out, your data is now more cohesive and understandable.

Now that we have a cleaner dataset at least in terms of wingspan, let's discover more about these birds.
@@ -140,7 +140,7 @@ birds.plot(x='Category',
title='Birds of Minnesota')
```

This bar chart, however, is unreadable because there is too much non-grouped data. You need to select only the data that you want to plot, so let's look at the length of birds based on their category.


This bar chart shows a good view of the number of birds in each category. In a blink of an eye, you see that the largest number of birds in this region are in the Ducks/Geese/Waterfowl category. Minnesota is the 'land of 10,000 lakes' so this isn't surprising!
Nothing is surprising here: hummingbirds have the smallest MaxLength compared with Pelicans or Geese. It's good when data makes logical sense!
@@ -189,7 +189,7 @@ plt.show()
```
In this plot, you can see the range of Minimum Length and Maximum Length for each bird category. The data suggests that larger birds tend to have a wider length range. Fascinating!
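To check that impression numerically, you could compute the range per category before plotting it. The following is a rough sketch under stated assumptions: the sample rows are invented, and the `MinLength` column name is assumed to mirror `MaxLength`.

```python
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical sample rows; MinLength is an assumed column name.
birds = pd.DataFrame({
    "Category": ["Ducks/Geese/Waterfowl", "Ducks/Geese/Waterfowl",
                 "Hummingbirds", "Pelicans"],
    "MinLength": [40.0, 50.0, 7.0, 127.0],
    "MaxLength": [60.0, 110.0, 9.0, 165.0],
})

# Smallest minimum and largest maximum length seen in each category.
ranges = birds.groupby("Category").agg(
    MinLength=("MinLength", "min"),
    MaxLength=("MaxLength", "max"),
)
ranges["Range"] = ranges["MaxLength"] - ranges["MinLength"]

# Draw the maximum first, then overlay the minimum so both remain visible.
fig, ax = plt.subplots()
ax.barh(ranges.index, ranges["MaxLength"], label="MaxLength")
ax.barh(ranges.index, ranges["MinLength"], label="MinLength")
ax.set_xlabel("Length (cm)")
ax.legend()
plt.show()
```

Sorting `ranges` by the `Range` column is a quick way to see whether the categories with the longest birds also show the widest spread.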