Skip to content

Commit 0fd7a9c

Browse files
committed
editing writing
1 parent 82dd3a3 commit 0fd7a9c

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

classification2.Rmd

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -715,7 +715,7 @@ it takes to run the analysis. So when you do cross-validation, you need to
715715
consider the size of the data, and the speed of the algorithm (e.g., $K$-nearest
716716
neighbor) and the speed of your computer. In practice, this is a
717717
trial-and-error process, but typically $C$ is chosen to be either 5 or 10. Here we use 10-fold cross-validation rather
718-
than 5-fold:
718+
than 5-fold and we see we get a lower standard error:
719719

720720
```{r 06-10-fold}
721721
cancer_vfold <- vfold_cv(cancer_train, v = 10, strata = Class)
@@ -731,7 +731,7 @@ vfold_metrics
731731

732732
Increasing the number of folds will usually result in a lower standard error, though this is
733733
not always the case. Due to random noise, sometimes we might get a higher value. In this example,
734-
the standard error went down slightly, but not by a lot.
734+
the standard error decreased slightly, but not by a lot.
735735

736736
```{r 06-50-fold-seed, echo = FALSE, warning = FALSE, message = FALSE}
737737
# hidden seed
@@ -753,7 +753,7 @@ vfold_metrics_50 <- workflow() |>
753753
vfold_metrics_50
754754
```
755755

756-
In practice, we usually have a lot of data and setting $C$ to such a large number often takes too long to run, so we usually stick to 5 or 10 folds.
756+
In practice, we usually have a lot of data and setting $C$ to such a large number often takes a long time to run, so we usually stick to 5 or 10 folds.
757757

758758
### Parameter value selection
759759

0 commit comments

Comments
 (0)