Commit 7f147d0

intro bugs from py issue
1 parent 64087e1 commit 7f147d0

1 file changed: +7 -5 lines changed

source/intro.Rmd

Lines changed: 7 additions & 5 deletions
@@ -89,7 +89,7 @@ tongues in Canada, and how many people speak each of them?*
 Every good data analysis begins with a *question*—like the
 above—that you aim to answer using data. As it turns out, there
 are actually a number of different *types* of question regarding data:
-descriptive, exploratory, inferential, predictive, causal, and mechanistic,
+descriptive, exploratory, predictive, inferential, causal, and mechanistic,
 all of which are defined in Table \@ref(tab:questions-table).
 Carefully formulating a question as early as possible in your analysis—and
 correctly identifying which type of question it is—will guide your overall approach to
@@ -174,10 +174,12 @@ Since we are using R for data analysis in this book, the first step for us is to
 load the data into R. When we load tabular data into
 R, it is represented as a *data frame* object\index{data frame!overview}. Figure
 \@ref(fig:img-spreadsheet-vs-dataframe) shows that an R data frame is very similar
-to a spreadsheet. We refer to the rows as \index{observation} **observations**; these are the things that we
-collect the data on, e.g., voters, cities, etc. We refer to the columns as \index{variable}
-**variables**; these are the characteristics of those observations, e.g., voters' political
-affiliations, cities' populations, etc.
+to a spreadsheet. We refer to the rows as \index{observation} **observations**;
+these are the individual objects
+for which we collect data. In Figure \@ref(fig:img-spreadsheet-vs-dataframe), the observations are
+languages. We refer to the columns as **variables**; these are the characteristics of each
+observation. In Figure \@ref(fig:img-spreadsheet-vs-dataframe), the variables are the the
+language's category, its name, the number of mother tongue speakers, etc.
 
 ```{r img-spreadsheet-vs-dataframe, echo = FALSE, message = FALSE, warning = FALSE, fig.align = "center", fig.cap = "A spreadsheet versus a data frame in R.", out.width="100%", fig.retina = 2}
 knitr::include_graphics("img/intro/spreadsheet_vs_dataframe.png")
