I have a large graph (12M nodes, 66M edges) and my task is to do link prediction with a GNN. I successfully trained the model with GraphSAINT; however, the learned embedding is too big to be stored in memory. Is there any partition/clustering algorithm that preserves all the edges, so that I can use it during testing? Thanks
Sadly no, and this is one of the disadvantages of subgraph sampling, since you typically want to do inference on the full graph. There is a clever workaround, though, which allows you to create node embeddings on CPU in a layer-wise fashion: it creates all node embeddings for the first layer, then uses those to create the node embeddings for the second layer, and so on. Here is an example of how to do so.
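To make the layer-wise idea concrete, here is a minimal NumPy sketch (not the linked PyG example): a hypothetical mean-aggregation GNN where each layer's embeddings are fully materialized for all nodes before the next layer runs, so no edge is ever dropped. The weight matrices, batch size, and ReLU activation are all illustrative assumptions.

```python
import numpy as np

def layerwise_inference(edge_index, x, weights, batch_size=1024):
    """Compute final node embeddings one layer at a time.

    Instead of sampling subgraphs, each layer's embeddings are
    materialized for *all* nodes (e.g. on CPU) before moving on,
    so every edge contributes exactly once per layer.
    """
    num_nodes = x.shape[0]
    src, dst = edge_index  # edges point src -> dst
    h = x
    for W in weights:  # one (hypothetical) weight matrix per layer
        out = np.zeros((num_nodes, W.shape[1]))
        # Process target nodes in batches to bound peak memory.
        for start in range(0, num_nodes, batch_size):
            for n in range(start, min(start + batch_size, num_nodes)):
                neigh = src[dst == n]  # 1-hop neighbours of node n
                agg = h[neigh].mean(axis=0) if len(neigh) else h[n]
                out[n] = np.maximum(agg @ W, 0)  # ReLU(mean-agg @ W)
        h = out  # this layer's embeddings feed the next layer
    return h
```

In real PyG code the inner loop would be a mini-batched GPU forward pass over 1-hop neighborhoods, with `out` kept on CPU; the key point is only the loop structure: finish layer `k` for every node before starting layer `k+1`.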