Testing on a huge graph. #4538
JiaruiWang asked this question in Q&A (unanswered)
Replies: 1 comment
-
You are right that one cannot do full-batch inference on the CPU on such a giant graph. The alternative is to also use neighbor sampling during inference. To reduce the variance due to sampling/dropout of edges, you can either try to sample all neighbors in the inference loader (num_neighbors=[-1] for every layer), or run inference layer-wise, computing the representations of all nodes one GNN layer at a time; see the sketches below.
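A minimal sketch of the first option, assuming a PyG Data object named `data` with a `test_mask` and an already trained two-layer GNN; the names `model`, `num_layers`, and the batch size are illustrative, not from the thread:

```python
import torch
from torch_geometric.loader import NeighborLoader

num_layers = 2  # must match the depth of the trained GNN

# num_neighbors=-1 keeps *every* neighbor at each hop, so there is no
# sampling variance; only the 2M test nodes act as seeds.
test_loader = NeighborLoader(
    data,                             # the full 60M-node Data object (assumed)
    num_neighbors=[-1] * num_layers,
    input_nodes=data.test_mask,
    batch_size=1024,
    shuffle=False,
)

@torch.no_grad()
def evaluate(model, loader, device):
    model.eval()
    correct = total = 0
    for batch in loader:
        batch = batch.to(device)
        out = model(batch.x, batch.edge_index)
        # Seed (target) nodes always come first in a NeighborLoader batch:
        pred = out[:batch.batch_size].argmax(dim=-1)
        correct += int((pred == batch.y[:batch.batch_size]).sum())
        total += batch.batch_size
    return correct / total
```

Note that with very high-degree nodes, keeping every neighbor across several hops can itself exhaust memory, which is where the layer-wise variant below comes in.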
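And a sketch of the second option, layer-wise inference, following the pattern used in PyG's large-graph examples: propagate over all nodes one layer at a time with exact 1-hop neighborhoods, so the multi-hop neighborhood explosion never materializes. The attribute names (`model.convs`, `data.x`) are assumptions about the model's structure:

```python
import torch
from torch_geometric.loader import NeighborLoader

@torch.no_grad()
def layerwise_inference(model, data, device, batch_size=4096):
    model.eval()
    # One-hop loader over *all* nodes, keeping every neighbor:
    loader = NeighborLoader(data, num_neighbors=[-1], input_nodes=None,
                            batch_size=batch_size, shuffle=False)
    x = data.x  # current representations, kept on the CPU
    for i, conv in enumerate(model.convs):  # assumes a `convs` ModuleList
        outs = []
        for batch in loader:
            h = x[batch.n_id].to(device)             # gather subgraph inputs
            h = conv(h, batch.edge_index.to(device))
            if i < len(model.convs) - 1:
                h = h.relu()
            outs.append(h[:batch.batch_size].cpu())  # keep seed-node rows only
        x = torch.cat(outs, dim=0)  # becomes the input of the next layer
    return x  # final logits/embeddings for all nodes
```

Since input_nodes=None and shuffle=False, the seed nodes of consecutive batches enumerate all nodes in order, so the concatenated outputs line up with the original node indexing.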
-
Original question (JiaruiWang):
My task is a node classification problem on a graph with 60M nodes: 20M nodes are labeled and 40M are unlabeled. Out of the 20M labeled nodes, I created a dataset with a 16M-node training mask, a 2M-node validation mask, and a 2M-node test mask.
I feed the 16M training nodes into NeighborLoader to generate the data loader for training, and the sampled subgraphs fit into GPU memory; a sketch of this setup follows.
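For reference, a minimal sketch of that training setup, assuming the 60M-node graph lives in a single PyG Data object called `data`; the fanouts, hidden size, and the choice of GraphSAGE are illustrative stand-ins, not details from the thread:

```python
import torch
import torch.nn.functional as F
from torch_geometric.loader import NeighborLoader
from torch_geometric.nn import GraphSAGE

device = torch.device('cuda')
num_classes = int(data.y.max()) + 1  # `data` is the 60M-node Data object (assumed)

model = GraphSAGE(in_channels=data.num_features, hidden_channels=128,
                  num_layers=2, out_channels=num_classes).to(device)
optimizer = torch.optim.Adam(model.parameters(), lr=0.01)

# Seed only the 16M training nodes; each mini-batch is a small sampled
# subgraph that fits on the GPU.
train_loader = NeighborLoader(
    data,
    num_neighbors=[15, 10],       # per-layer fanout (illustrative)
    input_nodes=data.train_mask,
    batch_size=1024,
    shuffle=True,
)

model.train()
for batch in train_loader:
    batch = batch.to(device)
    optimizer.zero_grad()
    out = model(batch.x, batch.edge_index)
    # The loss is computed on the seed nodes only (the first batch_size rows):
    loss = F.cross_entropy(out[:batch.batch_size], batch.y[:batch.batch_size])
    loss.backward()
    optimizer.step()
```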
In the evaluation step, however, all the examples pass the whole graph through the model and then take output[test_mask]. This causes a GPU OOM, and computing the evaluation on the CPU instead is too slow.
What's the best practice for this?
Thank you very much.