File tree Expand file tree Collapse file tree 1 file changed +11
-0
lines changed Expand file tree Collapse file tree 1 file changed +11
-0
lines changed Original file line number Diff line number Diff line change @@ -47,6 +47,17 @@ Despite the generate model having clearly different "clusters", without more
47
47
data we simply cannot differentiate between these models, and hence no
48
48
density based clustering will manage cluster these according to the model.
49
49
50
+ Q: I am not getting the claimed performance. Why not?
51
+ -----------------------------------------------------
52
+
53
+ The most likely explanation is to do with the dimensionality of your input data.
54
+ While HDBSCAN can perform well on low to medium dimensional data the performance
55
+ tends to decrease significantly as dimension increases. In general HDBSCAN can do
56
+ well on up to around 50 or 100 dimensional data, but performance can see
57
+ significant decreases beyond that. Of course a lot is also dataset dependent, so
58
+ you can still get good performance even on high dimensional data, but it
59
+ is no longer guaranteed.
60
+
50
61
Q: I want to predict the cluster of a new unseen point. How do I do this?
51
62
-------------------------------------------------------------------------
52
63
You can’t perform that action at this time.
0 commit comments