How to convert node with multimodal data (image and feature vector) to node features #4390

srg9000 · 2022-03-31T16:34:42Z


Hello, I am new to graph networks and I am working on a graph classification problem. Each of my graphs is a set with a variable number of nodes, and each node consists of one 32x32x1 image and a 1x29 feature vector. What I would like to do is convert the image to a one-dimensional feature vector by passing it through 2-3 conv2d layers, and then combine it with the 1x29 feature vector, for example by passing both through a linear layer or by concatenating them directly. After this, I plan on attaching other standard layers such as GATConv.

I would like to know how I can accomplish this. Do I create a custom message passing layer that does this in its forward pass? What would the inputs to my init constructor be?

Thank you

Answered by rusty1s


rusty1s · 2022-04-01T11:17:38Z

Maintainer

In general, you would first apply a CNN to your images, and then use the embeddings produced by the CNN as input to your GNN. Given images of shape [num_nodes, num_channels, height, width], you can do:

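# img holds one image per node; feature_vector: [num_nodes, 29]
img = CNN(img)                                # per-node feature maps
img = img.view(num_nodes, -1)                 # flatten to [num_nodes, emb_dim]
x = torch.cat([img, feature_vector], dim=-1)  # [num_nodes, emb_dim + 29]
out = GNN(x, edge_index)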

Keep in mind that this joint approach will not scale well for large graphs: the CNN and the GNN are trained together, so the image of every node has to pass through the CNN on each forward pass. An alternative is to use a pre-trained CNN, compute the node embeddings once, and afterwards use them as detached input to your GNN.
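The pre-computation alternative could look roughly like the sketch below. The `encoder` here is a randomly initialised stand-in for whatever pre-trained CNN you would actually use, and `precompute_node_features` and `chunk_size` are hypothetical names introduced for illustration.

```python
import torch
import torch.nn as nn

# Stand-in for a pre-trained image encoder (randomly initialised here;
# in practice you would load trained weights).
encoder = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    nn.Conv2d(8, 16, kernel_size=3, padding=1), nn.ReLU(), nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),  # -> [num_nodes, 16]
)


@torch.no_grad()  # no gradients are tracked, so the result is detached
def precompute_node_features(encoder, img, feature_vector, chunk_size=256):
    """Embed all node images once, in chunks, and append the feature vectors."""
    encoder.eval()
    chunks = [encoder(img[i:i + chunk_size]) for i in range(0, img.size(0), chunk_size)]
    img_emb = torch.cat(chunks, dim=0)
    return torch.cat([img_emb, feature_vector], dim=-1)


img = torch.randn(10, 1, 32, 32)
feature_vector = torch.randn(10, 29)
x = precompute_node_features(encoder, img, feature_vector, chunk_size=4)
# x can now be stored (for example as data.x) and fed to the GNN directly.
```

Because the embeddings are computed once and carry no gradient, only the GNN is trained afterwards. The trade-off is that the image encoder is no longer adapted to the graph task.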
