A problem in the PositionalEncoding model code

Hi,
Thank you for developing C.Origami!
I ran into a problem while using corigami to train a model on my own data.

![image](https://github.com/tanjimin/C.Origami/assets/87375686/cf5e85d7-7220-4463-8749-069736c5d946)
After the encoder step, the transposed matrix is passed into attn here.
At this point the matrix x is a Tensor of shape [batch_size, seq_length, embedding_dim], not a Tensor of shape [seq_length, batch_size, embedding_dim]!
![image](https://github.com/tanjimin/C.Origami/assets/87375686/747c10db-2be6-4fe5-9317-b26d03fcb1c0)
With that layout, the step `x = x + self.pe[:x.size(0)]` returns wrong positional information, because `x.size(0)` is the batch size rather than the sequence length.

I think the transpose applied after the encoder may be the mistake.
![63062fd2a7490cbc735a0d4005c233f](https://github.com/tanjimin/C.Origami/assets/87375686/dc0679b8-9e90-4316-995b-ba91d10f5f81)
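
To make the concern concrete, here is a minimal sketch of the shape mismatch. It is not the C.Origami code (that is only visible in the screenshots above). It assumes a sinusoidal `pe` buffer of shape [max_len, 1, d_model], as in the PyTorch transformer tutorial; the helper `sinusoidal_pe` and all sizes are hypothetical and chosen only for illustration.

```python
import math
import torch


def sinusoidal_pe(max_len: int, d_model: int) -> torch.Tensor:
    """Hypothetical helper: sinusoidal table, assumed shape [max_len, 1, d_model]."""
    position = torch.arange(max_len).unsqueeze(1)
    div_term = torch.exp(torch.arange(0, d_model, 2) * (-math.log(10000.0) / d_model))
    pe = torch.zeros(max_len, 1, d_model)
    pe[:, 0, 0::2] = torch.sin(position * div_term)
    pe[:, 0, 1::2] = torch.cos(position * div_term)
    return pe


batch_size, seq_len, d_model = 2, 5, 8  # illustrative sizes only
pe = sinusoidal_pe(max_len=16, d_model=d_model)

# Zero inputs make the added encoding directly visible in the output.
x_seq_first = torch.zeros(seq_len, batch_size, d_model)
x_batch_first = torch.zeros(batch_size, seq_len, d_model)

# Sequence-first input: the slice is [seq_len, 1, d_model], broadcast over the batch.
out_seq_first = x_seq_first + pe[:x_seq_first.size(0)]

# Batch-first input: the slice is [batch_size, 1, d_model], broadcast over positions.
out_batch_first = x_batch_first + pe[:x_batch_first.size(0)]

# Sequence-first: different positions receive different encodings.
position_varies = not torch.allclose(out_seq_first[0, 0], out_seq_first[1, 0])
# Batch-first: every position inside one sample receives the same vector ...
positions_collapsed = torch.allclose(out_batch_first[0, 0], out_batch_first[0, 1])
# ... while different samples receive different vectors.
samples_differ = not torch.allclose(out_batch_first[0, 0], out_batch_first[1, 0])

# One possible fix (an assumption, not the maintainers' patch): move the sequence
# dimension to the front before adding pe, then restore the batch-first layout.
out_fixed = (x_batch_first.transpose(0, 1) + pe[:seq_len]).transpose(0, 1)

print(position_varies, positions_collapsed, samples_differ)
```

Under these assumptions, the batch-first call runs without any error because broadcasting still succeeds, which would explain why the problem is easy to miss: the encoding ends up indexed by sample instead of by position.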

Best wishes,
Kirtio

