Skip to content

Commit 097dea8

Browse files
authored
MAINT synchronize quizzes (#665)
1 parent c3e371b commit 097dea8

File tree

1 file changed

+8
-11
lines changed

1 file changed

+8
-11
lines changed

jupyter-book/evaluation/evaluation_wrap_up_quiz.md

Lines changed: 8 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -180,9 +180,8 @@ In the previous cross-validation, we made the choice of using a `ShuffleSplit`
180180
cross-validation strategy. It means that randomly selected samples were selected
181181
as testing set ignoring any time dependency between the lines of the dataframe.
182182

183-
We would like to have a cross-validation strategy that evaluates the capacity of
184-
our model to predict on a completely new bike ride: the samples in the
185-
validation set should only come from rides not present in the training set.
183+
We would like to have a cross-validation strategy that takes into account the
184+
groups defined by each individual date. Each group corresponds to a bike ride.
186185

187186
```{admonition} Question
188187
How many bike rides are stored in the dataframe `data`? Do not hesitate to
@@ -204,14 +203,12 @@ Hint: You can access to the date and time of a `DatetimeIndex` using
204203

205204
+++
206205

207-
Instead of using the naive `ShuffleSplit` strategy, we will use a strategy that
208-
takes into account the group defined by each individual date. It corresponds to
209-
a bike ride. We would like to have a cross-validation strategy that evaluates
210-
the capacity of our model to predict on a completely new bike ride: the samples
211-
in the validation set should only come from rides not present in the training
212-
set. Therefore, we can use a `LeaveOneGroupOut` strategy: at each iteration of
213-
the cross-validation, we will keep a bike ride for the evaluation and use all
214-
other bike rides to train our model.
206+
We would like to have a cross-validation strategy that evaluates the capacity of
207+
our model to predict on a completely new bike ride: the samples in the
208+
validation set should only come from rides not present in the training set.
209+
Therefore, we can use a `LeaveOneGroupOut` strategy: at each iteration of the
210+
cross-validation, we will keep a bike ride for the evaluation and use all other
211+
bike rides to train our model.
215212

216213
Thus, you concretely need to:
217214

0 commit comments

Comments
 (0)