ClusterData crashes with large graphs representing point clouds #2610
Replies: 2 comments 4 replies
-
Nevermind, it seems the problem was coming from the features tensor of the graph (due to a wrong manipulation, I had x=[3] instead of x=[number of nodes, 3] in the Data object) causing the process to hang then crash |
Beta Was this translation helpful? Give feedback.
-
I am reopening this discussion as I am facing more or less the same problem now I can indeed partition some large clouds points (with 6 000 000 nodes, 16 neighboors for the knn graph et more or les 16 features + XYZ in 2000 parts with a batch size of 2 or 10) however the metis partitionning is sometimes extremely inconsistent and can crash on points clouds smaller (800 000). This is a bit hard to reproduce as it doesnt seem to be systematic but the same point cloud with same parameters can crash 2 times and work the 3rd time. What happens is the ClusterData shows
calculate for a bit then stops completely. On jupyter it can be seen by the star showing it is running for some seconds then disappears. On Colab (on which it seems to crash the most to me), it displays that the kernel crashed. Spyder will just say it encountered a problem Is there any way to avoid this issue ? If not, is there any alternative ? |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
I have a problem with the ClusterData class which crashes each time I work with large datasets.
I am working on points clouds, and thought of using it to custer my data in order to have a reasonable computation time, but as soon as I work with large points clouds (here 2,6 million points), the kernel crashes everytime on the METIS partitionning unless I use a very low num_parts argument (less than 10).
In the paper on which the class is based : https://arxiv.org/pdf/1905.07953.pdf : the authors partition the Amazon3M dataset in 15000 partitions for instance, which is way more than what I am trying to do; hence my confusion.
Is there a problem with the way I'm using ClusterData ?
Thx in advance
Beta Was this translation helpful? Give feedback.
All reactions