Skip to content

Commit a298b3f

Browse files
stars to dashes lobjs
1 parent aaf4928 commit a298b3f

File tree

5 files changed

+54
-54
lines changed

5 files changed

+54
-54
lines changed

source/clustering.Rmd

Lines changed: 9 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -61,16 +61,16 @@ including techniques to choose the number of clusters.
6161
## Chapter learning objectives
6262
By the end of the chapter, readers will be able to do the following:
6363

64-
* Describe a situation in which clustering is an appropriate technique to use,
64+
- Describe a situation in which clustering is an appropriate technique to use,
6565
and what insight it might extract from the data.
66-
* Explain the K-means clustering algorithm.
67-
* Interpret the output of a K-means analysis.
68-
* Differentiate between clustering, classification, and regression.
69-
* Identify when it is necessary to scale variables before clustering, and do this using R.
70-
* Perform K-means clustering in R using `tidymodels` workflows.
71-
* Use the elbow method to choose the number of clusters for K-means.
72-
* Visualize the output of K-means clustering in R using colored scatter plots.
73-
* Describe the advantages, limitations and assumptions of the K-means clustering algorithm.
66+
- Explain the K-means clustering algorithm.
67+
- Interpret the output of a K-means analysis.
68+
- Differentiate between clustering, classification, and regression.
69+
- Identify when it is necessary to scale variables before clustering, and do this using R.
70+
- Perform K-means clustering in R using `tidymodels` workflows.
71+
- Use the elbow method to choose the number of clusters for K-means.
72+
- Visualize the output of K-means clustering in R using colored scatter plots.
73+
- Describe the advantages, limitations and assumptions of the K-means clustering algorithm.
7474

7575
## Clustering
7676
Clustering \index{clustering} is a data analysis technique

source/inference.Rmd

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -37,16 +37,16 @@ populations and then introduce two common techniques in statistical inference:
3737
## Chapter learning objectives
3838
By the end of the chapter, readers will be able to do the following:
3939

40-
* Describe real-world examples of questions that can be answered with statistical inference.
41-
* Define common population parameters (e.g., mean, proportion, standard deviation) that are often estimated using sampled data, and estimate these from a sample.
42-
* Define the following statistical sampling terms: population, sample, population parameter, point estimate, and sampling distribution.
43-
* Explain the difference between a population parameter and a sample point estimate.
44-
* Use R to draw random samples from a finite population.
45-
* Use R to create a sampling distribution from a finite population.
46-
* Describe how sample size influences the sampling distribution.
47-
* Define bootstrapping.
48-
* Use R to create a bootstrap distribution to approximate a sampling distribution.
49-
* Contrast the bootstrap and sampling distributions.
40+
- Describe real-world examples of questions that can be answered with statistical inference.
41+
- Define common population parameters (e.g., mean, proportion, standard deviation) that are often estimated using sampled data, and estimate these from a sample.
42+
- Define the following statistical sampling terms: population, sample, population parameter, point estimate, and sampling distribution.
43+
- Explain the difference between a population parameter and a sample point estimate.
44+
- Use R to draw random samples from a finite population.
45+
- Use R to create a sampling distribution from a finite population.
46+
- Describe how sample size influences the sampling distribution.
47+
- Define bootstrapping.
48+
- Use R to create a bootstrap distribution to approximate a sampling distribution.
49+
- Contrast the bootstrap and sampling distributions.
5050

5151
## Why do we need sampling?
5252
We often need to understand how quantities we observe in a subset

source/regression1.Rmd

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -63,15 +63,15 @@ however that is beyond the scope of this book.
6363
## Chapter learning objectives
6464
By the end of the chapter, readers will be able to do the following:
6565

66-
* Recognize situations where a regression analysis would be appropriate for making predictions.
67-
* Explain the K-nearest neighbors (K-NN) regression algorithm and describe how it differs from K-NN classification.
68-
* Interpret the output of a K-NN regression.
69-
* In a data set with two or more variables, perform K-nearest neighbors regression in R.
70-
* Evaluate K-NN regression prediction quality in R using the root mean squared prediction error (RMSPE).
71-
* Estimate the RMSPE in R using cross-validation or a test set.
72-
* Choose the number of neighbors in K-nearest neighbors regression by minimizing estimated cross-validation RMSPE.
66+
- Recognize situations where a regression analysis would be appropriate for making predictions.
67+
- Explain the K-nearest neighbors (K-NN) regression algorithm and describe how it differs from K-NN classification.
68+
- Interpret the output of a K-NN regression.
69+
- In a data set with two or more variables, perform K-nearest neighbors regression in R.
70+
- Evaluate K-NN regression prediction quality in R using the root mean squared prediction error (RMSPE).
71+
- Estimate the RMSPE in R using cross-validation or a test set.
72+
- Choose the number of neighbors in K-nearest neighbors regression by minimizing estimated cross-validation RMSPE.
7373
- Describe underfitting and overfitting, and relate it to the number of neighbors in K-nearest neighbors regression.
74-
* Describe the advantages and disadvantages of K-nearest neighbors regression.
74+
- Describe the advantages and disadvantages of K-nearest neighbors regression.
7575

7676
## The regression problem
7777

source/regression2.Rmd

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -51,10 +51,10 @@ predictor.
5151
## Chapter learning objectives
5252
By the end of the chapter, readers will be able to do the following:
5353

54-
* Use R to fit simple and multivariable linear regression models on training data.
55-
* Evaluate the linear regression model on test data.
56-
* Compare and contrast predictions obtained from K-nearest neighbors regression to those obtained using linear regression from the same data set.
57-
* Describe how linear regression is affected by outliers and multicollinearity.
54+
- Use R to fit simple and multivariable linear regression models on training data.
55+
- Evaluate the linear regression model on test data.
56+
- Compare and contrast predictions obtained from K-nearest neighbors regression to those obtained using linear regression from the same data set.
57+
- Describe how linear regression is affected by outliers and multicollinearity.
5858

5959
## Simple linear regression
6060

source/wrangling.Rmd

Lines changed: 23 additions & 23 deletions
Original file line numberDiff line numberDiff line change
@@ -21,29 +21,29 @@ application, providing more practice working through a whole case study.
2121

2222
By the end of the chapter, readers will be able to do the following:
2323

24-
- Define the term "tidy data".
25-
- Discuss the advantages of storing data in a tidy data format.
26-
- Define what vectors, lists, and data frames are in R, and describe how they relate to
27-
each other.
28-
- Describe the common types of data in R and their uses.
29-
- Use the following functions for their intended data wrangling tasks:
30-
- `c`
31-
- `pivot_longer`
32-
- `pivot_wider`
33-
- `separate`
34-
- `select`
35-
- `filter`
36-
- `mutate`
37-
- `summarize`
38-
- `map`
39-
- `group_by`
40-
- `across`
41-
- `rowwise`
42-
- Use the following operators for their intended data wrangling tasks:
43-
- `==`, `!=`, `<`, `<=`, `>`, and `>=`
44-
- `%in%`
45-
- `!`, `&`, and `|`
46-
- `|>` and `%>%`
24+
- Define the term "tidy data".
25+
- Discuss the advantages of storing data in a tidy data format.
26+
- Define what vectors, lists, and data frames are in R, and describe how they relate to
27+
each other.
28+
- Describe the common types of data in R and their uses.
29+
- Use the following functions for their intended data wrangling tasks:
30+
- `c`
31+
- `pivot_longer`
32+
- `pivot_wider`
33+
- `separate`
34+
- `select`
35+
- `filter`
36+
- `mutate`
37+
- `summarize`
38+
- `map`
39+
- `group_by`
40+
- `across`
41+
- `rowwise`
42+
- Use the following operators for their intended data wrangling tasks:
43+
- `==`, `!=`, `<`, `<=`, `>`, and `>=`
44+
- `%in%`
45+
- `!`, `&`, and `|`
46+
- `|>` and `%>%`
4747

4848
## Data frames, vectors, and lists
4949

0 commit comments

Comments
 (0)