GAT has the same cost as self-attention even for a sparse adjacency matrix #6424
Unanswered
MehdiZouitine asked this question in Q&A
Replies: 1 comment
Which implementations are you comparing here? I want to understand how you arrived at this conclusion. Also, this doc talks about how GNNs use up memory and how that can be improved for certain GNNs.
Hello,
I am wondering why graph attention (GAT) is as expensive in memory as self-attention, even for a very sparse adjacency matrix. I understand that both GAT and self-attention involve many matrix multiplications and dot products, but I would expect GAT to be more memory efficient since it operates on sparse graphs.
Can anyone explain the reason for this, or offer suggestions for how to reduce the memory requirements of GAT on sparse graphs?
Thank you for your help.
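For illustration, here is a minimal sketch (assuming a PyTorch-style implementation; the helper names `dense_gat_scores` and `sparse_gat_scores` are hypothetical) of why the memory cost depends on how GAT is implemented: a dense, mask-based formulation materializes an N x N score matrix just like full self-attention, while an edge-wise formulation stores only one score per edge.

```python
import torch
import torch.nn.functional as F

N, D, E = 10_000, 64, 50_000        # nodes, feature dim, edges (sparse graph)
h = torch.randn(N, D)               # node features after the shared linear projection
a_src = torch.randn(D)              # source half of the attention vector
a_dst = torch.randn(D)              # destination half of the attention vector


def dense_gat_scores(h, adj_mask):
    # Dense formulation: builds an N x N score matrix and masks non-edges.
    # Memory is O(N^2), the same asymptotic cost as full self-attention
    # (~400 MB of fp32 scores for N = 10_000).
    e = (h @ a_src).unsqueeze(1) + (h @ a_dst).unsqueeze(0)   # (N, N)
    e = F.leaky_relu(e).masked_fill(~adj_mask, float("-inf"))
    return torch.softmax(e, dim=1)


def sparse_gat_scores(h, edge_index):
    # Edge-wise formulation: computes a score only for existing edges.
    # Memory is O(E), which is what you would expect for a sparse graph.
    src, dst = edge_index                                     # each of shape (E,)
    e = F.leaky_relu(h[src] @ a_src + h[dst] @ a_dst)         # (E,)
    # A per-destination softmax (e.g. a scatter-softmax) would follow here;
    # it is omitted to keep the memory comparison in focus.
    return e


edge_index = torch.randint(0, N, (2, E))
scores = sparse_gat_scores(h, edge_index)                     # ~E floats instead of ~N*N
```

Whether a given GAT matches self-attention in memory likely comes down to which of these two formulations the implementation uses; message-passing libraries typically use the edge-wise form.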