How to structure multiple sequences of graphs #3048

petergroth · 2021-08-30T08:12:11Z

Hi,

I have a dataset which consists of sequences of graphs that describe the state of different systems evolving over time. Let's assume that the edge structure is static, that the number of nodes in each sequence is constant, and that only the node features change from time step to time step. What would be a suitable way of handling the Data() objects and creating a PyTorch Geometric Dataset?

A simple approach would be to create a Data() object for each graph at each time step and then set shuffle to False when creating the dataloader. This might lead to batching issues, however, as it seems inflexible. Alternatively, since I'm (currently at least) assuming a static edge structure, I can create a single Data() object for each sequence and simply have the time step as an additional dimension of the node attributes.

What is the suggested approach to handling multiple sequences of graphs while retaining the flexibility of the Dataset and DataLoader classes?

rusty1s · 2021-08-30T10:43:18Z

Maintainer

If your edge structure is static and only the node features evolve over time, you can save your node features with shape [num_nodes, num_timestamps, num_features] inside Data (as you suggested). This should support mini-batching out of the box. For edge structures that change over time, the current DataLoader sadly cannot handle sequences, which I want to fix in an upcoming release. This should allow you to do the following:

data.edge_index = [edge_index_1, edge_index_2, ...]

There is also PyTorch Geometric Temporal, which might already fit your use case.

6 replies

rusty1s Oct 29, 2021
Maintainer

PyG 2.0.* does support this already :)

MCiurletti Oct 29, 2021

Oh wow, I guess I didn't look close enough. Thanks!

MCiurletti Oct 29, 2021

Just one more question: since you recommended the [num_nodes, num_timestamps, num_features] node feature shape, how would you handle nodes that appear or disappear over time? The same goes for edge features. Intuitively, I would say that num_timestamps makes more sense as the first dimension.

rusty1s Oct 30, 2021
Maintainer

Yes, it makes more sense as the first dimension, but it shouldn't make much of a difference.

Adding or removing nodes is still challenging. I suggest that, for a given mini-batch, you simply use dummy nodes (zero features, no graph connectivity) before a certain node appears. Does that work in your case?
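
The dummy-node idea could be sketched like this in plain PyTorch. It is one possible reading of the suggestion, not a confirmed recipe: `first_seen` (the time step at which each node appears) and the helper `edges_at` are hypothetical names, and the features are all ones only to make the masking visible.

```python
import torch

num_nodes, num_timestamps, num_features = 4, 3, 2  # illustrative sizes

# Placeholder features for every node at every time step.
x = torch.ones(num_nodes, num_timestamps, num_features)

# Hypothetical: time step at which each node first appears.
first_seen = torch.tensor([0, 0, 1, 2])

# present[i, t] is True once node i exists at time step t.
t = torch.arange(num_timestamps)
present = t.unsqueeze(0) >= first_seen.unsqueeze(1)  # [num_nodes, num_timestamps]

# Zero features for dummy nodes (before the node appears).
x = x * present.unsqueeze(-1).to(x.dtype)

# Full connectivity once all nodes exist.
edge_index = torch.tensor([[0, 1, 2, 3], [1, 2, 3, 0]])


def edges_at(step):
    """Keep only edges whose endpoints both exist at the given time step."""
    src, dst = edge_index
    keep = present[src, step] & present[dst, step]
    return edge_index[:, keep]
```

Here `edges_at(step)` leaves dummy nodes without any connectivity, so they neither send nor receive messages until they appear. The `present` mask can also be reused to exclude dummy nodes from the loss.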

MCiurletti Oct 31, 2021

Okay, something like that was my idea too. Thanks for the help!
