Replies: 1 comment 1 reply
-
I assume you are talking about the vision transformers. The position embeddings are trainable position representations instead of hard-coded position indices. cc @ahatamiz
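For concreteness, here is a minimal PyTorch-style sketch of what "trainable position representations" means in practice. This is not the exact implementation in this repo; the class name, shapes, and initialization below are illustrative. The point is that the position embedding is an `nn.Parameter` learned with the rest of the network, rather than a fixed function of position indices, and it is added to the patch tokens because self-attention by itself is permutation-invariant:

```python
import torch
import torch.nn as nn


class PatchTokensWithLearnedPos(nn.Module):
    """Minimal sketch: learnable (trainable) position embeddings for ViT-style patch tokens."""

    def __init__(self, num_patches: int, embed_dim: int):
        super().__init__()
        # One trainable vector per patch position; optimized together with the rest
        # of the network, instead of being computed from hard-coded position indices.
        self.pos_embed = nn.Parameter(torch.zeros(1, num_patches, embed_dim))
        nn.init.trunc_normal_(self.pos_embed, std=0.02)

    def forward(self, patch_tokens: torch.Tensor) -> torch.Tensor:
        # patch_tokens: (batch, num_patches, embed_dim), e.g. flattened 2D/3D image patches.
        # Adding pos_embed tells each token where its patch came from, since the
        # attention layers that follow have no built-in notion of token order.
        return patch_tokens + self.pos_embed


# Hypothetical usage: 216 patches from a 3D volume split into 6 x 6 x 6 patches, embed dim 768.
tokens = torch.randn(2, 216, 768)
out = PatchTokensWithLearnedPos(num_patches=216, embed_dim=768)(tokens)
print(out.shape)  # torch.Size([2, 216, 768])
```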
1 reply
-
I can't understand what the point of adding position_embeddings to the sequence is. For image patches, when 2D or 3D images are transformed into a 1D sequence (or a 1D sequence is transformed back into 2D or 3D images), the operation is carried out in strict order, and the order of the sequence is not disrupted afterwards, so it seems that position embeddings for the sequence are no longer needed. In addition, I don't understand how the position_embeddings are calculated. Could you give me a general explanation?