Vision encoder output dimension does not match #1

Hi, thanks for your excellent work! I'm trying to run `bash eval_calvin.sh`.
When execution reaches FeedbackPolicy/models/policy.py, the `vision_x` input to `vision_encoder` has a spatial size of 192 × 192, which does not match the 224 × 224 input size the model expects.
So I interpolated `vision_x` to 224 × 224. The output of `vision_encoder` then has shape 8 × 768, which does not match the dimensions expected by the rearrange operation: `vision_x = rearrange(vision_x, "(b T) d h w -> b T (h w) d", b=b, T=T)`
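To make the mismatch concrete, here is a minimal shape-only sketch in plain Python. It does not call the repository's code or einops; `rearranged_shape` is a hypothetical helper that only mirrors what the pattern `"(b T) d h w -> b T (h w) d"` requires of its input. The split `b=1, T=8` and the 14 × 14 spatial grid are assumptions chosen for illustration, not values taken from the project.

```python
def rearranged_shape(shape, b, T):
    """Return the shape that "(b T) d h w -> b T (h w) d" would produce.

    Hypothetical helper for illustration: it only checks shapes and
    raises ValueError when the input cannot match the pattern.
    """
    if len(shape) != 4:
        raise ValueError(
            f"pattern needs 4 axes '(b T) d h w', got {len(shape)}: {shape}"
        )
    bt, d, h, w = shape
    if bt != b * T:
        raise ValueError(f"leading axis {bt} != b * T = {b * T}")
    return (b, T, h * w, d)


# Assumed split of the leading axis of size 8 (not confirmed by the source).
b, T = 1, 8

# The shape reported above: one 768-dim vector per frame, no spatial axes.
try:
    rearranged_shape((8, 768), b, T)
    mismatch = None
except ValueError as err:
    mismatch = str(err)

# A feature map with spatial axes (14 x 14 is an assumed grid) would fit.
ok = rearranged_shape((8, 768, 14, 14), b, T)

print(mismatch)
print(ok)
```

If this reading is right, the 8 × 768 output has no `h` and `w` axes left for the pattern to flatten, which may mean the encoder is returning one pooled vector per frame rather than a spatial feature map. That is a guess from the shapes alone and would need checking against the encoder actually used.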

