Skip to content

Commit de2e0b8

Browse files
predictive to predictive question (and similar change elsewhere)
1 parent 9b8ce27 commit de2e0b8

File tree

2 files changed

+11
-11
lines changed

2 files changed

+11
-11
lines changed

intro.Rmd

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -77,12 +77,12 @@ all of which are defined in Table \@ref(tab:questions-table).
7777
Carefully formulating a question as early as possible in your analysis—and
7878
correctly identifying which type of question it is—will guide your overall approach to
7979
the analysis as well as the selection of appropriate tools.\index{question!data analysis}
80-
\index{descriptive!definition}
81-
\index{exploratory!definition}
82-
\index{predictive!definition}
83-
\index{inferential!definition}
84-
\index{causal!definition}
85-
\index{mechanistic!definition}
80+
\index{descriptive question!definition}
81+
\index{exploratory question!definition}
82+
\index{predictive question!definition}
83+
\index{inferential question!definition}
84+
\index{causal question!definition}
85+
\index{mechanistic question!definition}
8686

8787
Table: (\#tab:questions-table) Types of data analysis question. From [What is the question?](https://science.sciencemag.org/content/347/6228/1314) [@leek2015question] and [The Art of Data Science](https://leanpub.com/artofdatascience) [@peng2015art].
8888

@@ -157,9 +157,9 @@ Since we are using R for data analysis in this book, the first step for us is to
157157
load the data into R. When we load tabular data into
158158
R, it is represented as a *data frame* object\index{data frame!overview}. Figure
159159
\@ref(fig:img-spreadsheet-vs-dataframe) shows that an R data frame is very similar
160-
to a spreadsheet. We refer to the rows as **observations** \index{observation}; these are the things that we
161-
collect the data on, e.g., voters, cities, etc. We refer to the columns as
162-
**variables** \index{variable}; these are the characteristics of those observations, e.g., voters' political
160+
to a spreadsheet. We refer to the rows as \index{observation} **observations**; these are the things that we
161+
collect the data on, e.g., voters, cities, etc. We refer to the columns as \index{variable}
162+
**variables**; these are the characteristics of those observations, e.g., voters' political
163163
affiliations, cities' populations, etc.
164164

165165

@@ -479,7 +479,7 @@ ggplot(ten_lang, aes(x = language, y = mother_tongue)) +
479479
> time, a single expression in R must be contained in a single line of code.
480480
> However, there *are* a small number of situations in which you can have a
481481
> single R expression span multiple lines. Above is one such case: here, R knows that a line cannot
482-
> end with a `+` symbol, and so it keeps reading the next line to figure out
482+
> end with a `+` symbol, \index{plussymb@$+$} and so it keeps reading the next line to figure out
483483
> what the right hand side of the `+` symbol should be. We could, of course,
484484
> put all of the added layers on one line of code, but splitting them across
485485
> multiple lines helps a lot with code readability. \index{multi-line expression}

regression1.Rmd

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -32,7 +32,7 @@ By the end of the chapter, readers will be able to:
3232

3333
## The regression problem
3434

35-
Regression, like classification, is a predictive \index{predictive} problem setting where we want
35+
Regression, like classification, is a predictive \index{predictive question} problem setting where we want
3636
to use past information to predict future observations. But in the case of
3737
regression, the goal is to predict *numerical* values instead of *categorical* values.
3838
The variable that you want to predict is often called the *response variable*. \index{response variable}

0 commit comments

Comments
 (0)