Open
Description
I have a question about tensor use in the transformer. In the PyTorch version of the code, the encoder's q and k are the same tensor, produced by the previous layer, but in this code q and k are obtained through a dense API. Is something wrong? Besides that, the PyTorch version uses resblocks in the transformer, and I can't find them in this code.
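To make the q/k part of the question concrete, here is a minimal plain-Python sketch. It assumes (this is not confirmed by either codebase) that the PyTorch code passes one tensor as both query and key to an attention module in the style of `torch.nn.MultiheadAttention`, which keeps a packed input projection and applies a separate slice of it to each input. Under that assumption, explicit dense layers for q and k would compute the same thing. All function names below are hypothetical, and biases and multiple heads are left out.

```python
import random

def matmul_t(x, w):
    """Return x @ w^T for row-major lists of lists."""
    return [[sum(a * b for a, b in zip(row, w_row)) for w_row in w] for row in x]

def packed_self_attention_inputs(x, in_proj_weight, dim):
    """Hypothetical packed projection: one (3*dim, dim) weight split into slices."""
    w_q = in_proj_weight[:dim]
    w_k = in_proj_weight[dim:2 * dim]
    # The same tensor x is passed in as both query and key ...
    query, key = x, x
    # ... but each one is multiplied by its own slice of the weight.
    return matmul_t(query, w_q), matmul_t(key, w_k)

def dense_self_attention_inputs(x, w_q, w_k):
    """Hypothetical explicit dense layers for q and k (bias omitted)."""
    return matmul_t(x, w_q), matmul_t(x, w_k)

random.seed(0)
dim, seq_len = 4, 3
x = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(seq_len)]
in_proj_weight = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(3 * dim)]

q_packed, k_packed = packed_self_attention_inputs(x, in_proj_weight, dim)
q_dense, k_dense = dense_self_attention_inputs(
    x, in_proj_weight[:dim], in_proj_weight[dim:2 * dim]
)

print(q_packed == q_dense, k_packed == k_dense)
```

If the assumption holds, identical q and k inputs on the PyTorch side do not imply identical projected q and k, so the dense layers here may be a different spelling of the same computation rather than a bug. Whether the weights line up between the two versions would still need to be checked against the actual code.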