Hello, I have a large dataset with around 200,000,000 nodes and 3,000,000,000 edges. It is a single large graph, and the graph is dynamic. My goal is to learn node embeddings for a downstream classification task, or to use a clustering algorithm to find communities of nodes. I have two questions here:
Replies: 1 comment 1 reply
Wow, this is a giant graph. We are currently working on letting PyG scale to out-of-memory datasets via `FeatureStore` and `GraphStore` abstractions (will be part of the next release). Currently, you would need to rely on external sources to split your graph before inputting into PyG.

Yes, that sounds good. `GraphSAGE` is always a strong baseline.
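
To illustrate why GraphSAGE tends to be a practical baseline at this scale, here is a minimal pure-Python sketch of its core idea: each node aggregates features from a fixed-size sample of neighbors, so the cost per node is bounded by the fan-out rather than by the node's degree. This is not PyG code; the helper names (`sample_neighbors`, `mean_aggregate`), the toy graph, and the fan-out value are assumptions made for illustration, and the learned weight matrix and nonlinearity of a real GraphSAGE layer are omitted.

```python
import random


def sample_neighbors(adj, node, fanout, rng):
    """Return at most `fanout` neighbors of `node`, sampled without replacement."""
    neighbors = adj.get(node, [])
    if len(neighbors) <= fanout:
        return list(neighbors)
    return rng.sample(neighbors, fanout)


def mean_aggregate(features, adj, node, fanout, rng):
    """One GraphSAGE-style step (hypothetical helper, no learned weights):
    concatenate the node's own features with the mean of the features
    of its sampled neighbors."""
    sampled = sample_neighbors(adj, node, fanout, rng)
    dim = len(features[node])
    if sampled:
        mean = [sum(features[n][i] for n in sampled) / len(sampled)
                for i in range(dim)]
    else:
        # Isolated node: fall back to a zero vector for the neighbor part.
        mean = [0.0] * dim
    return list(features[node]) + mean


# Toy star graph (assumed data): node 0 is connected to nodes 1, 2 and 3.
adj = {0: [1, 2, 3], 1: [0], 2: [0], 3: [0]}
features = {0: [1.0, 0.0], 1: [0.0, 2.0], 2: [0.0, 4.0], 3: [0.0, 6.0]}

rng = random.Random(0)
h0 = mean_aggregate(features, adj, 0, fanout=3, rng=rng)
h1 = mean_aggregate(features, adj, 1, fanout=2, rng=rng)
```

In PyG itself, the corresponding building blocks are `torch_geometric.loader.NeighborLoader` for sampled mini-batches and `torch_geometric.nn.SAGEConv` for the layer; the sketch above only shows the sampling and aggregation logic they rely on.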