Questions about Transconv's attention design #7409
HelloWorldLTY started this conversation in General
Hi, I am trying to replace TransformerConv's attention mechanism with a current fast-attention design. However, the speed becomes slower, and my output is quite different.

The original code is:

My design is:
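For context, PyG's `TransformerConv` computes attention per edge and normalizes it within each destination node's neighborhood. A minimal sketch of that step (simplified from the library's `message` function; the standalone helper `transformer_conv_attention` below is a hypothetical rewrite, not the library code):

```python
import math

import torch
from torch_geometric.utils import softmax


def transformer_conv_attention(query, key, value, edge_index):
    """Sketch of TransformerConv's per-edge attention.

    query, key, value: [num_nodes, heads, out_channels]
    edge_index: [2, num_edges], messages flow src -> dst
    """
    src, dst = edge_index
    out_channels = query.size(-1)
    # Dot-product score for every edge, one per head.
    alpha = (query[dst] * key[src]).sum(dim=-1) / math.sqrt(out_channels)
    # Normalize over each destination node's incoming edges only.
    alpha = softmax(alpha, index=dst, num_nodes=query.size(0))
    # Scale each source message by its attention coefficient.
    return value[src] * alpha.unsqueeze(-1)
```

If a fast-attention replacement instead computes a dense `softmax(QK^T)V` over all node pairs, it no longer restricts attention to the graph's edges, which could explain both the different outputs and, on a sparse graph, the slower runtime.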
Replies: 2 comments

-
I would suggest running the script with the PyTorch profiler to see which part of the execution is taking so long.
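A minimal profiling sketch along these lines (the toy `conv`, `x`, and `edge_index` are stand-ins for your own model and data; add `ProfilerActivity.CUDA` and sort by `cuda_time_total` when running on a GPU):

```python
import torch
from torch.profiler import ProfilerActivity, profile, record_function
from torch_geometric.nn import TransformerConv

# Stand-in model and graph; swap in your own.
conv = TransformerConv(in_channels=16, out_channels=16, heads=2)
x = torch.randn(100, 16)
edge_index = torch.randint(0, 100, (2, 400))

# Record the forward pass and break the runtime down per operator.
with profile(activities=[ProfilerActivity.CPU], record_shapes=True) as prof:
    with record_function("conv_forward"):
        out = conv(x, edge_index)

print(prof.key_averages().table(sort_by="cpu_time_total", row_limit=10))
```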
-
Also answered you on Slack. I think @akihironitta is right. From looking at the code, I am not sure your formula is correct. It looks like you will compute attention over all edges rather than over local neighborhoods. I am also not sure …
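To illustrate the neighborhood point with a toy graph (the sizes here are hypothetical; `torch_geometric.utils.softmax` groups the per-edge scores by its `index` argument):

```python
import torch
from torch_geometric.utils import softmax

# Toy graph: 4 nodes, 6 directed edges, 2 attention heads.
num_nodes, heads = 4, 2
edge_index = torch.tensor([[0, 1, 2, 3, 0, 2],
                           [1, 1, 1, 2, 2, 3]])
alpha = torch.randn(edge_index.size(1), heads)  # one score per edge and head

# Global softmax: normalizes over *all* edges in the graph at once.
alpha_global = torch.softmax(alpha, dim=0)

# Neighborhood softmax (what TransformerConv uses): each destination
# node's incoming edges sum to 1 separately.
alpha_local = softmax(alpha, index=edge_index[1], num_nodes=num_nodes)
```

With a global softmax, scores on edges pointing at unrelated nodes compete with each other, so the aggregation no longer matches what `TransformerConv` computes.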