Node embedding training on large graph #5912

garyhsu29 · 2022-11-06T07:34:59Z

Hello, I have a large dataset with around 200,000,000 nodes and 3,000,000,000 edges. It is a single large graph, and the graph is dynamic. My goal is to learn node embeddings for a downstream classification task, or to use clustering algorithms to find communities of nodes.

I have two questions here:

1. The graph is too large to load into RAM, so I am wondering how to split it into multiple subgraphs without losing much information about its topology.
2. Which algorithm would be suitable in my case? Do you think GraphSAGE with an edge reconstruction loss could learn the embeddings properly?

rusty1s · 2022-11-07T07:25:15Z

Maintainer

1. Wow, this is a giant graph. We are currently working on letting PyG scale to out-of-memory datasets via the FeatureStore and GraphStore abstractions (these will be part of the next release). For now, you would need to rely on external tools to split your graph before feeding it into PyG.
2. Yes, that sounds good. GraphSAGE is always a strong baseline.
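
To make the two ideas from the question a bit more concrete, here is a minimal sketch in plain Python of GraphSAGE-style neighbor sampling (so each training step only touches a small subgraph around a few seed nodes) and of an edge reconstruction loss. This is illustrative only: the function names `sample_subgraph` and `edge_reconstruction_loss`, the toy adjacency dict, and the dot-product scoring are assumptions made for this example, not PyG API and not something stated in the thread. For a graph of the size described, the adjacency would have to be served from an external store instead of an in-memory dict.

```python
import math
import random


def sample_subgraph(adj, seeds, fanouts, rng):
    """Sample a multi-hop neighborhood around `seeds`, GraphSAGE style.

    adj:     dict mapping node id -> list of neighbor ids (toy, in-memory; an assumption)
    fanouts: maximum number of neighbors to sample per node at each hop
    Returns the set of visited nodes and the list of sampled (src, dst) edges.
    """
    nodes = set(seeds)
    frontier = list(seeds)
    edges = []
    for fanout in fanouts:
        next_frontier = []
        for node in frontier:
            neighbors = adj.get(node, [])
            picked = rng.sample(neighbors, min(fanout, len(neighbors)))
            for nbr in picked:
                edges.append((nbr, node))  # messages flow from neighbor to node
                if nbr not in nodes:
                    nodes.add(nbr)
                    next_frontier.append(nbr)
        frontier = next_frontier
    return nodes, edges


def edge_reconstruction_loss(emb, pos_edges, neg_edges):
    """Binary cross-entropy on dot-product scores.

    Observed edges (pos_edges) should score high, randomly drawn
    non-edges (neg_edges) should score low.
    emb: dict mapping node id -> list of floats (the node embedding)
    """
    def dot(u, v):
        return sum(a * b for a, b in zip(emb[u], emb[v]))

    def log_sigmoid(x):
        # Numerically stable log(sigmoid(x)).
        if x >= 0:
            return -math.log1p(math.exp(-x))
        return x - math.log1p(math.exp(x))

    loss = 0.0
    for u, v in pos_edges:
        loss -= log_sigmoid(dot(u, v))
    for u, v in neg_edges:
        loss -= log_sigmoid(-dot(u, v))
    return loss / (len(pos_edges) + len(neg_edges))


if __name__ == "__main__":
    rng = random.Random(0)
    toy_adj = {0: [1, 2, 3], 1: [0, 2], 2: [0, 1, 3], 3: [0, 2], 4: []}
    nodes, edges = sample_subgraph(toy_adj, seeds=[0], fanouts=[2, 2], rng=rng)
    print("sampled nodes:", sorted(nodes))
    print("sampled edges:", edges)
```

In an actual training loop the embeddings would come from a GraphSAGE encoder applied to the sampled subgraph, and the loss would be minimized with a gradient-based optimizer; the sketch only shows the sampling and the objective.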

1 reply

garyhsu29 Nov 7, 2022
Author

Thank you so much!
