-
Hi, In the (f) part Why do we have (x,x) as a residual connection for embedding net but not in (g) for output energy creation? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 2 replies
-
@njzjz Could you please help me clear the above doubt? |
Beta Was this translation helpful? Give feedback.
-
No issue, I think I figured it out it is for making dimensions equal to output for addition. As the output is going to be a matrix for (f) and in (g) part only single dimension output so we can directly add input value of the single dimension. Once again Thanks for attending to my question. |
Beta Was this translation helpful? Give feedback.
No issue, I think I figured it out it is for making dimensions equal to output for addition. As the output is going to be a matrix for (f) and in (g) part only single dimension output so we can directly add input value of the single dimension.
Once again Thanks for attending to my question.
@wanghan-iapcm my explanation is correct?