@@ -47,6 +47,17 @@ Despite the generative model having clearly different "clusters", without more
 data we simply cannot differentiate between these models, and hence no
 density based clustering will manage to cluster these according to the model.

+Q: I am not getting the claimed performance. Why not?
+-----------------------------------------------------
+
+The most likely explanation is the dimensionality of your input data. While
+HDBSCAN can perform well on low- to medium-dimensional data, performance tends
+to decrease significantly as the number of dimensions increases. In general,
+HDBSCAN does well on data with up to around 50 or 100 dimensions, but
+performance can degrade significantly beyond that. Of course, a lot also
+depends on the dataset, so you can still get good performance on
+high-dimensional data, but it is no longer guaranteed.
+
 Q: I want to predict the cluster of a new unseen point. How do I do this?
 -------------------------------------------------------------------------
