@@ -171,19 +171,21 @@ multiple different clusterings. This does not engender much confidence
171171in any individual clustering that may result.
172172
173173So, in summary, here's how K-Means seems to stack up against out
174- desiderata:
175- - **Don't be wrong! **: K-means is going to throw points
176- into clusters whether they belong or not; it also assumes your clusters
177- are globular. K-Means scores very poorly on this point.
178- - **Intuitive parameters **: If you have a good intuition for how many clusters the
179- dataset your exploring has then great, otherwise you might have a
180- problem.
181- - **Stability **: Hopefully the clustering is stable for your
182- data. Best to have many runs and check though.
183- - **Performance **: This is K-Means big win. It's a simple algorithm and with the right tricks
184- and optimizations can be made exceptionally efficient. There are few
185- algorithms that can compete with K-Means for performance. If you have
186- truly huge data then K-Means might be your only option.
174+ desiderata:
175+
176+ - **Don't be wrong! **: K-means is going to throw points into clusters
177+ whether they belong or not; it also assumes your clusters are globular.
178+ K-Means scores very poorly on this point.
179+ - **Intuitive parameters **: If you have a good intuition for how many
180+ clusters the dataset your exploring has then great, otherwise you might
181+ have a problem.
182+ - **Stability **: Hopefully the clustering is stable for your data. Best
183+ to have many runs and check though.
184+ - **Performance **: This is K-Means big win. It's a simple algorithm and
185+ with the right tricks and optimizations can be made exceptionally
186+ efficient. There are few algorithms that can compete with K-Means for
187+ performance. If you have truly huge data then K-Means might be your only
188+ option.
187189
188190But enough opinion, how does K-Means perform on our test dataset? Let's
189191have look. We'll be generous and use our knowledge that there are six
0 commit comments