Skip to content

Commit fc3bf2a

Browse files
Merge pull request #555 from UBC-DSCI/dataset-data-set
Fix dataset -> data set
2 parents edac32a + b0bebda commit fc3bf2a

File tree

2 files changed

+3
-3
lines changed

2 files changed

+3
-3
lines changed

source/clustering.Rmd

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -902,7 +902,7 @@ to those in the earlier classification and regression chapters.
902902
We will begin by loading the `tidyclust`\index{tidyclust} library, which contains the necessary
903903
functionality, and then read in the
904904
original (i.e., unstandardized) subset of 18 observations
905-
from the penguins dataset.
905+
from the penguins data set.
906906

907907
```{r 10-get-unscaled-data, echo = FALSE, message = FALSE, warning = FALSE}
908908
unstandardized_data <- read_csv("data/penguins_toy.csv") |>
@@ -1188,5 +1188,5 @@ and guidance that the worksheets provide will function as intended.
11881188
clustering for when you expect there to be subgroups, and then subgroups within
11891189
subgroups, etc., in your data. In the realm of more general unsupervised
11901190
learning, it covers *principal components analysis (PCA)*, which is a very
1191-
popular technique for reducing the number of predictors in a dataset.
1191+
popular technique for reducing the number of predictors in a data set.
11921192

source/regression1.Rmd

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -66,7 +66,7 @@ By the end of the chapter, readers will be able to do the following:
6666
* Recognize situations where a simple regression analysis would be appropriate for making predictions.
6767
* Explain the K-nearest neighbor (KNN) regression algorithm and describe how it differs from KNN classification.
6868
* Interpret the output of a KNN regression.
69-
* In a dataset with two or more variables, perform K-nearest neighbor regression in R using a `tidymodels` workflow.
69+
* In a data set with two or more variables, perform K-nearest neighbor regression in R using a `tidymodels` workflow.
7070
* Execute cross-validation in R to choose the number of neighbors.
7171
* Evaluate KNN regression prediction accuracy in R using a test data set and the root mean squared prediction error (RMSPE).
7272
* In the context of KNN regression, compare and contrast goodness of fit and prediction properties (namely RMSE vs RMSPE).

0 commit comments

Comments
 (0)