@@ -1168,6 +1168,18 @@ In a bar plot, the height of the bar represents the value of a summary statistic
1168
1168
They are particularly useful for comparing summary statistics between different
1169
1169
groups of a categorical variable.
1170
1170
1171
+
1172
+ Here, we have a data frame of Earth's landmasses,
1173
+ and are trying to compare their sizes.
1174
+ The right type of visualization to answer this question is a bar plot.
1175
+ In a bar plot, the height of each bar represents the value of an * amount*
1176
+ (a size, count, proportion, percentage, etc).
1177
+ They are particularly useful for comparing counts or proportions across different
1178
+ groups of a categorical variable. Note, however, that bar plots should generally not be
1179
+ used to display mean or median values, as they hide important information about
1180
+ the variation of the data. Instead it's better to show the distribution of
1181
+ all the individual data points, e.g., using a histogram, which we will discuss further in {numref}` histogramsviz ` .
1182
+
1171
1183
``` {index} altair; mark_bar
1172
1184
```
1173
1185
@@ -1292,6 +1304,7 @@ visualization for answering our original questions. Landmasses are organized by
1292
1304
their size, and continents are colored differently than other landmasses,
1293
1305
making it quite clear that all the seven largest landmasses are continents.
1294
1306
1307
+ (histogramsviz)=
1295
1308
### Histograms: the Michelson speed of light data set
1296
1309
1297
1310
``` {index} Michelson speed of light
0 commit comments