Skip to content
Open
Show file tree
Hide file tree
Changes from 4 commits
Commits
Show all changes
81 commits
Select commit Hold shift + click to select a range
2ec9530
WIP Add module about clustering
May 22, 2025
8ab8ff8
Iter on Kmeans exercise
May 27, 2025
d828ada
Synch exercise notebooks
May 27, 2025
d94eb14
Add notebooks on hdbscan and feature engineering
May 27, 2025
fd41577
Reworked the k-means intro notebook to use penguins dataset
ogrisel May 28, 2025
90bdfed
Rerender the first notebook
ogrisel May 28, 2025
8ccd658
Add some missing cell markers
ogrisel May 28, 2025
0a2dfa3
Rerender the first notebook
ogrisel May 28, 2025
83bc291
More missing markers
ogrisel May 28, 2025
60d32eb
Rerender the first notebook
ogrisel May 28, 2025
2621cec
Improve phrasing / fix typos
ogrisel May 28, 2025
57fafec
Typo
ogrisel May 28, 2025
51a0d13
Rerender the first notebook
ogrisel May 28, 2025
6b26222
Iter on Olivier's work
May 30, 2025
fe58a3e
General rewording
Jun 2, 2025
7374ae5
Apply suggestions from code review
ArturoAmorQ Jun 2, 2025
fd03b1a
Rephrasing in cluster_kmeans_sol_01.py
ogrisel Jun 3, 2025
da9a3ec
Resynchronize exercise and fix CI
Jun 3, 2025
d4ad40c
Wording
Jun 3, 2025
8ab3e29
Use MAE to score predicted house prices
Jun 4, 2025
52f244a
Solve plotly DeprecationWarning
Jun 4, 2025
dec3453
Prefer make_column_transformer as per #831
Jun 4, 2025
00c41a7
Iter on hdbscan notebook
Jun 5, 2025
ffe1855
Remove redundant paragraph
Jun 5, 2025
26cd3d2
Rename exercise and solution
Jun 5, 2025
50e9bc0
Add exercise and solution using AMI
Jun 5, 2025
d7e03e6
Fix exercise
Jun 6, 2025
900f0da
Small improvements to the solution of exercise 02
ogrisel Jun 6, 2025
87d438f
Add the skrub dependency
ogrisel Jun 6, 2025
166868c
Expand analysis a bit
ogrisel Jun 6, 2025
74245d3
Improvements in the HDBSCAN notebook
ogrisel Jun 6, 2025
9bfe2f2
Reworded analysis of the BBC text clustering notebook + use cross-val…
ogrisel Jun 6, 2025
a765e53
Improvements in the supervised metrics notebook
ogrisel Jun 6, 2025
024545f
Add discussion on silhouette for hdbscan
Jun 9, 2025
2647274
Fix warning and plot not rendering
Jun 9, 2025
bd32b87
Add intro, overview and sections
Jun 10, 2025
27f4e52
Iter discussion on silhouette for hdbscan
Jun 10, 2025
b8f6646
Tweaks
Jun 10, 2025
d328631
Add first quiz on clustering and related images
Jun 11, 2025
c32eabe
Wording tweaks
Jun 11, 2025
89110ea
Add second quiz on clustering
Jun 20, 2025
d4b975e
Apply suggestions from code review
ArturoAmorQ Jun 24, 2025
b016d8b
Update jupyter-book/clustering/clustering_module_take_away.md
ArturoAmorQ Jun 24, 2025
a72be00
Synchronize quizzes from review
Jun 24, 2025
60a58c4
Merge branch 'clustering_module' of github.com:ArturoAmorQ/scikit-lea…
Jun 24, 2025
aff2dba
Synchronize notebooks
Jun 24, 2025
0c161ee
Add clustering wrap-up quiz
Jul 15, 2025
54ff951
Fix bug in wrap-up quiz
Jul 15, 2025
3fa71e9
Add wrap-up quiz to toc
Jul 15, 2025
5c7b28c
Fix a couple of bugs
Aug 12, 2025
83fc894
Feature branch to update to 1.6 (#813)
ogrisel May 27, 2025
49b4474
FIX Penguin figures not rendering (#828)
ArturoAmorQ May 27, 2025
c7009bc
Missing notebook sync (#838)
ogrisel May 27, 2025
9a3dd25
MAINT Changed the use of ColumnTransformer to make_column_transformer…
SebastienMelo May 27, 2025
110f5cc
Fix typos (#839)
omahs Jun 2, 2025
36ca1c5
minor improvements in wording and import statement order (#841)
rouk1 Jun 2, 2025
8ca0182
Minor typos fixups (#842)
davidjsonn Jun 9, 2025
4f9b633
MTN px parallel render fix (#843)
SebastienMelo Jun 12, 2025
4aa8c13
MAINT Update matplotlib to v3.10.3 (#846)
brospars Jun 17, 2025
1210659
MTN Heat map explanation (#833)
SebastienMelo Jul 9, 2025
ff94dea
MTN Bias variance quizz (#849)
SebastienMelo Jul 9, 2025
e1a5f07
MNT Add info about the estimators html diagram (#844)
ArturoAmorQ Jul 15, 2025
a7df7ad
Update notebooks
Jul 15, 2025
c962329
Add cross-validation diagram to GridSearchCV notebook (#847)
student-ChestaVashishtha Jul 30, 2025
00ddb9b
MTN Synchronized the quizzes for module 1 and 7
SebastienMelo Aug 7, 2025
271555a
MTN Fix the parallel plots
SebastienMelo Aug 7, 2025
0df2714
MTN Hyperparameter tuning with grid search
SebastienMelo Aug 13, 2025
d263961
MTN Made the distinction between predictor and transformer clearer (#…
SebastienMelo Aug 13, 2025
090c715
MTN Added model state to the glossary (#857)
SebastienMelo Aug 13, 2025
e5d4bf6
MTN Proposal for explantation of what are iterations (#859)
SebastienMelo Sep 10, 2025
b943bed
FIX HistGradientBoosting fitting time too long (#860)
SebastienMelo Sep 17, 2025
2c8b60a
Update README and Adding License txt file for Bike Rides Dataset (#858)
student-ChestaVashishtha Sep 26, 2025
29ad752
Remove introduction paragraph meant for maintainers (#863)
SebastienMelo Oct 2, 2025
a4bd6ea
Improve wording in definition of numerical features (#861)
SebastienMelo Oct 2, 2025
e39860f
Explicit that fitting time is measured in seconds (#862)
SebastienMelo Oct 2, 2025
a516bdc
Add dataset credits and licenses (#864)
SebastienMelo Oct 9, 2025
22bb873
Update notebooks
Oct 9, 2025
8e3fe4b
Change BBC news to Wikinews
Oct 23, 2025
da12c87
Merge branch 'INRIA:main' into clustering_module
ArturoAmorQ Oct 23, 2025
d8fe672
Update corresponding notebook
Oct 23, 2025
b235eaa
Add credits to newly added datasets
Oct 23, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1,251 changes: 1,251 additions & 0 deletions datasets/bbc_news.csv

Large diffs are not rendered by default.

201 changes: 201 additions & 0 deletions datasets/mall_customers.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,201 @@
Age,Annual Income (k$),Spending Score (1-100),Genre
19,15,39,Male
21,15,81,Male
20,16,6,Female
23,16,77,Female
31,17,40,Female
22,17,76,Female
35,18,6,Female
23,18,94,Female
64,19,3,Male
30,19,72,Female
67,19,14,Male
35,19,99,Female
58,20,15,Female
24,20,77,Female
37,20,13,Male
22,20,79,Male
35,21,35,Female
20,21,66,Male
52,23,29,Male
35,23,98,Female
35,24,35,Male
25,24,73,Male
46,25,5,Female
31,25,73,Male
54,28,14,Female
29,28,82,Male
45,28,32,Female
35,28,61,Male
40,29,31,Female
23,29,87,Female
60,30,4,Male
21,30,73,Female
53,33,4,Male
18,33,92,Male
49,33,14,Female
21,33,81,Female
42,34,17,Female
30,34,73,Female
36,37,26,Female
20,37,75,Female
65,38,35,Female
24,38,92,Male
48,39,36,Male
31,39,61,Female
49,39,28,Female
24,39,65,Female
50,40,55,Female
27,40,47,Female
29,40,42,Female
31,40,42,Female
49,42,52,Female
33,42,60,Male
31,43,54,Female
59,43,60,Male
50,43,45,Female
47,43,41,Male
51,44,50,Female
69,44,46,Male
27,46,51,Female
53,46,46,Male
70,46,56,Male
19,46,55,Male
67,47,52,Female
54,47,59,Female
63,48,51,Male
18,48,59,Male
43,48,50,Female
68,48,48,Female
19,48,59,Male
32,48,47,Female
70,49,55,Male
47,49,42,Female
60,50,49,Female
60,50,56,Female
59,54,47,Male
26,54,54,Male
45,54,53,Female
40,54,48,Male
23,54,52,Female
49,54,42,Female
57,54,51,Male
38,54,55,Male
67,54,41,Male
46,54,44,Female
21,54,57,Female
48,54,46,Male
55,57,58,Female
22,57,55,Female
34,58,60,Female
50,58,46,Female
68,59,55,Female
18,59,41,Male
48,60,49,Male
40,60,40,Female
32,60,42,Female
24,60,52,Male
47,60,47,Female
27,60,50,Female
48,61,42,Male
20,61,49,Male
23,62,41,Female
49,62,48,Female
67,62,59,Male
26,62,55,Male
49,62,56,Male
21,62,42,Female
66,63,50,Female
54,63,46,Male
68,63,43,Male
66,63,48,Male
65,63,52,Male
19,63,54,Female
38,64,42,Female
19,64,46,Male
18,65,48,Female
19,65,50,Female
63,65,43,Female
49,65,59,Female
51,67,43,Female
50,67,57,Female
27,67,56,Male
38,67,40,Female
40,69,58,Female
39,69,91,Male
23,70,29,Female
31,70,77,Female
43,71,35,Male
40,71,95,Male
59,71,11,Male
38,71,75,Male
47,71,9,Male
39,71,75,Male
25,72,34,Female
31,72,71,Female
20,73,5,Male
29,73,88,Female
44,73,7,Female
32,73,73,Male
19,74,10,Male
35,74,72,Female
57,75,5,Female
32,75,93,Male
28,76,40,Female
32,76,87,Female
25,77,12,Male
28,77,97,Male
48,77,36,Male
32,77,74,Female
34,78,22,Female
34,78,90,Male
43,78,17,Male
39,78,88,Male
44,78,20,Female
38,78,76,Female
47,78,16,Female
27,78,89,Female
37,78,1,Male
30,78,78,Female
34,78,1,Male
30,78,73,Female
56,79,35,Female
29,79,83,Female
19,81,5,Male
31,81,93,Female
50,85,26,Male
36,85,75,Female
42,86,20,Male
33,86,95,Female
36,87,27,Female
32,87,63,Male
40,87,13,Male
28,87,75,Male
36,87,10,Male
36,87,92,Male
52,88,13,Female
30,88,86,Female
58,88,15,Male
27,88,69,Male
59,93,14,Male
35,93,90,Male
37,97,32,Female
32,97,86,Female
46,98,15,Male
29,98,88,Female
41,99,39,Female
30,99,97,Male
54,101,24,Female
28,101,68,Male
41,103,17,Female
36,103,85,Female
34,103,23,Female
32,103,69,Female
33,113,8,Male
38,113,91,Female
47,120,16,Female
35,120,79,Female
45,126,28,Female
32,126,74,Male
32,137,18,Male
30,137,83,Male
Loading
Loading