-
Notifications
You must be signed in to change notification settings - Fork 589
WIP Add module about clustering #836
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
ArturoAmorQ
wants to merge
81
commits into
INRIA:main
Choose a base branch
from
ArturoAmorQ:clustering_module
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Changes from 4 commits
Commits
Show all changes
81 commits
Select commit
Hold shift + click to select a range
2ec9530
WIP Add module about clustering
8ab8ff8
Iter on Kmeans exercise
d828ada
Synch exercise notebooks
d94eb14
Add notebooks on hdbscan and feature engineering
fd41577
Reworked the k-means intro notebook to use penguins dataset
ogrisel 90bdfed
Rerender the first notebook
ogrisel 8ccd658
Add some missing cell markers
ogrisel 0a2dfa3
Rerender the first notebook
ogrisel 83bc291
More missing markers
ogrisel 60d32eb
Rerender the first notebook
ogrisel 2621cec
Improve phrasing / fix typos
ogrisel 57fafec
Typo
ogrisel 51a0d13
Rerender the first notebook
ogrisel 6b26222
Iter on Olivier's work
fe58a3e
General rewording
7374ae5
Apply suggestions from code review
ArturoAmorQ fd03b1a
Rephrasing in cluster_kmeans_sol_01.py
ogrisel da9a3ec
Resynchronize exercise and fix CI
d4ad40c
Wording
8ab3e29
Use MAE to score predicted house prices
52f244a
Solve plotly DeprecationWarning
dec3453
Prefer make_column_transformer as per #831
00c41a7
Iter on hdbscan notebook
ffe1855
Remove redundant paragraph
26cd3d2
Rename exercise and solution
50e9bc0
Add exercise and solution using AMI
d7e03e6
Fix exercise
900f0da
Small improvements to the solution of exercise 02
ogrisel 87d438f
Add the skrub dependency
ogrisel 166868c
Expand analysis a bit
ogrisel 74245d3
Improvements in the HDBSCAN notebook
ogrisel 9bfe2f2
Reworded analysis of the BBC text clustering notebook + use cross-val…
ogrisel a765e53
Improvements in the supervised metrics notebook
ogrisel 024545f
Add discussion on silhouette for hdbscan
2647274
Fix warning and plot not rendering
bd32b87
Add intro, overview and sections
27f4e52
Iter discussion on silhouette for hdbscan
b8f6646
Tweaks
d328631
Add first quiz on clustering and related images
c32eabe
Wording tweaks
89110ea
Add second quiz on clustering
d4b975e
Apply suggestions from code review
ArturoAmorQ b016d8b
Update jupyter-book/clustering/clustering_module_take_away.md
ArturoAmorQ a72be00
Synchronize quizzes from review
60a58c4
Merge branch 'clustering_module' of github.com:ArturoAmorQ/scikit-lea…
aff2dba
Synchronize notebooks
0c161ee
Add clustering wrap-up quiz
54ff951
Fix bug in wrap-up quiz
3fa71e9
Add wrap-up quiz to toc
5c7b28c
Fix a couple of bugs
83fc894
Feature branch to update to 1.6 (#813)
ogrisel 49b4474
FIX Penguin figures not rendering (#828)
ArturoAmorQ c7009bc
Missing notebook sync (#838)
ogrisel 9a3dd25
MAINT Changed the use of ColumnTransformer to make_column_transformer…
SebastienMelo 110f5cc
Fix typos (#839)
omahs 36ca1c5
minor improvements in wording and import statement order (#841)
rouk1 8ca0182
Minor typos fixups (#842)
davidjsonn 4f9b633
MTN px parallel render fix (#843)
SebastienMelo 4aa8c13
MAINT Update matplotlib to v3.10.3 (#846)
brospars 1210659
MTN Heat map explanation (#833)
SebastienMelo ff94dea
MTN Bias variance quizz (#849)
SebastienMelo e1a5f07
MNT Add info about the estimators html diagram (#844)
ArturoAmorQ a7df7ad
Update notebooks
c962329
Add cross-validation diagram to GridSearchCV notebook (#847)
student-ChestaVashishtha 00ddb9b
MTN Synchronized the quizzes for module 1 and 7
SebastienMelo 271555a
MTN Fix the parallel plots
SebastienMelo 0df2714
MTN Hyperparameter tuning with grid search
SebastienMelo d263961
MTN Made the distinction between predictor and transformer clearer (#…
SebastienMelo 090c715
MTN Added model state to the glossary (#857)
SebastienMelo e5d4bf6
MTN Proposal for explantation of what are iterations (#859)
SebastienMelo b943bed
FIX HistGradientBoosting fitting time too long (#860)
SebastienMelo 2c8b60a
Update README and Adding License txt file for Bike Rides Dataset (#858)
student-ChestaVashishtha 29ad752
Remove introduction paragraph meant for maintainers (#863)
SebastienMelo a4bd6ea
Improve wording in definition of numerical features (#861)
SebastienMelo e39860f
Explicit that fitting time is measured in seconds (#862)
SebastienMelo a516bdc
Add dataset credits and licenses (#864)
SebastienMelo 22bb873
Update notebooks
8e3fe4b
Change BBC news to Wikinews
da12c87
Merge branch 'INRIA:main' into clustering_module
ArturoAmorQ d8fe672
Update corresponding notebook
b235eaa
Add credits to newly added datasets
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,201 @@ | ||
| Age,Annual Income (k$),Spending Score (1-100),Genre | ||
| 19,15,39,Male | ||
| 21,15,81,Male | ||
| 20,16,6,Female | ||
| 23,16,77,Female | ||
| 31,17,40,Female | ||
| 22,17,76,Female | ||
| 35,18,6,Female | ||
| 23,18,94,Female | ||
| 64,19,3,Male | ||
| 30,19,72,Female | ||
| 67,19,14,Male | ||
| 35,19,99,Female | ||
| 58,20,15,Female | ||
| 24,20,77,Female | ||
| 37,20,13,Male | ||
| 22,20,79,Male | ||
| 35,21,35,Female | ||
| 20,21,66,Male | ||
| 52,23,29,Male | ||
| 35,23,98,Female | ||
| 35,24,35,Male | ||
| 25,24,73,Male | ||
| 46,25,5,Female | ||
| 31,25,73,Male | ||
| 54,28,14,Female | ||
| 29,28,82,Male | ||
| 45,28,32,Female | ||
| 35,28,61,Male | ||
| 40,29,31,Female | ||
| 23,29,87,Female | ||
| 60,30,4,Male | ||
| 21,30,73,Female | ||
| 53,33,4,Male | ||
| 18,33,92,Male | ||
| 49,33,14,Female | ||
| 21,33,81,Female | ||
| 42,34,17,Female | ||
| 30,34,73,Female | ||
| 36,37,26,Female | ||
| 20,37,75,Female | ||
| 65,38,35,Female | ||
| 24,38,92,Male | ||
| 48,39,36,Male | ||
| 31,39,61,Female | ||
| 49,39,28,Female | ||
| 24,39,65,Female | ||
| 50,40,55,Female | ||
| 27,40,47,Female | ||
| 29,40,42,Female | ||
| 31,40,42,Female | ||
| 49,42,52,Female | ||
| 33,42,60,Male | ||
| 31,43,54,Female | ||
| 59,43,60,Male | ||
| 50,43,45,Female | ||
| 47,43,41,Male | ||
| 51,44,50,Female | ||
| 69,44,46,Male | ||
| 27,46,51,Female | ||
| 53,46,46,Male | ||
| 70,46,56,Male | ||
| 19,46,55,Male | ||
| 67,47,52,Female | ||
| 54,47,59,Female | ||
| 63,48,51,Male | ||
| 18,48,59,Male | ||
| 43,48,50,Female | ||
| 68,48,48,Female | ||
| 19,48,59,Male | ||
| 32,48,47,Female | ||
| 70,49,55,Male | ||
| 47,49,42,Female | ||
| 60,50,49,Female | ||
| 60,50,56,Female | ||
| 59,54,47,Male | ||
| 26,54,54,Male | ||
| 45,54,53,Female | ||
| 40,54,48,Male | ||
| 23,54,52,Female | ||
| 49,54,42,Female | ||
| 57,54,51,Male | ||
| 38,54,55,Male | ||
| 67,54,41,Male | ||
| 46,54,44,Female | ||
| 21,54,57,Female | ||
| 48,54,46,Male | ||
| 55,57,58,Female | ||
| 22,57,55,Female | ||
| 34,58,60,Female | ||
| 50,58,46,Female | ||
| 68,59,55,Female | ||
| 18,59,41,Male | ||
| 48,60,49,Male | ||
| 40,60,40,Female | ||
| 32,60,42,Female | ||
| 24,60,52,Male | ||
| 47,60,47,Female | ||
| 27,60,50,Female | ||
| 48,61,42,Male | ||
| 20,61,49,Male | ||
| 23,62,41,Female | ||
| 49,62,48,Female | ||
| 67,62,59,Male | ||
| 26,62,55,Male | ||
| 49,62,56,Male | ||
| 21,62,42,Female | ||
| 66,63,50,Female | ||
| 54,63,46,Male | ||
| 68,63,43,Male | ||
| 66,63,48,Male | ||
| 65,63,52,Male | ||
| 19,63,54,Female | ||
| 38,64,42,Female | ||
| 19,64,46,Male | ||
| 18,65,48,Female | ||
| 19,65,50,Female | ||
| 63,65,43,Female | ||
| 49,65,59,Female | ||
| 51,67,43,Female | ||
| 50,67,57,Female | ||
| 27,67,56,Male | ||
| 38,67,40,Female | ||
| 40,69,58,Female | ||
| 39,69,91,Male | ||
| 23,70,29,Female | ||
| 31,70,77,Female | ||
| 43,71,35,Male | ||
| 40,71,95,Male | ||
| 59,71,11,Male | ||
| 38,71,75,Male | ||
| 47,71,9,Male | ||
| 39,71,75,Male | ||
| 25,72,34,Female | ||
| 31,72,71,Female | ||
| 20,73,5,Male | ||
| 29,73,88,Female | ||
| 44,73,7,Female | ||
| 32,73,73,Male | ||
| 19,74,10,Male | ||
| 35,74,72,Female | ||
| 57,75,5,Female | ||
| 32,75,93,Male | ||
| 28,76,40,Female | ||
| 32,76,87,Female | ||
| 25,77,12,Male | ||
| 28,77,97,Male | ||
| 48,77,36,Male | ||
| 32,77,74,Female | ||
| 34,78,22,Female | ||
| 34,78,90,Male | ||
| 43,78,17,Male | ||
| 39,78,88,Male | ||
| 44,78,20,Female | ||
| 38,78,76,Female | ||
| 47,78,16,Female | ||
| 27,78,89,Female | ||
| 37,78,1,Male | ||
| 30,78,78,Female | ||
| 34,78,1,Male | ||
| 30,78,73,Female | ||
| 56,79,35,Female | ||
| 29,79,83,Female | ||
| 19,81,5,Male | ||
| 31,81,93,Female | ||
| 50,85,26,Male | ||
| 36,85,75,Female | ||
| 42,86,20,Male | ||
| 33,86,95,Female | ||
| 36,87,27,Female | ||
| 32,87,63,Male | ||
| 40,87,13,Male | ||
| 28,87,75,Male | ||
| 36,87,10,Male | ||
| 36,87,92,Male | ||
| 52,88,13,Female | ||
| 30,88,86,Female | ||
| 58,88,15,Male | ||
| 27,88,69,Male | ||
| 59,93,14,Male | ||
| 35,93,90,Male | ||
| 37,97,32,Female | ||
| 32,97,86,Female | ||
| 46,98,15,Male | ||
| 29,98,88,Female | ||
| 41,99,39,Female | ||
| 30,99,97,Male | ||
| 54,101,24,Female | ||
| 28,101,68,Male | ||
| 41,103,17,Female | ||
| 36,103,85,Female | ||
| 34,103,23,Female | ||
| 32,103,69,Female | ||
| 33,113,8,Male | ||
| 38,113,91,Female | ||
| 47,120,16,Female | ||
| 35,120,79,Female | ||
| 45,126,28,Female | ||
| 32,126,74,Male | ||
| 32,137,18,Male | ||
| 30,137,83,Male | ||
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.