I am building an image classification workflow on top of CoCa captions and embeddings. The only downside is that it is slow (about 100 images per minute on Google Colab).
So two related questions:
- Is it possible to extract the embeddings calculated within model.generate()? Currently I call encode_image separately on the same images, which essentially duplicates the image encoding.
- Are there settings that speed up model.generate at the expense of accuracy? In my current workflow I only need the top characteristic words from the captions of images that belong to the same cluster. I'm not entirely clear on how beam search works.
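
For context, here is a minimal sketch of what I mean by "top characteristic words" per cluster. It is only an illustration of the downstream step, not part of the library: the function name `top_cluster_words`, the input layout (a dict mapping a cluster id to its list of captions), and the TF-IDF-style scoring are all assumptions made for this example.

```python
import math
import re
from collections import Counter


def top_cluster_words(captions_by_cluster, k=3):
    """Return the k highest-scoring words per cluster (hypothetical helper).

    Score = term frequency within the cluster * log(n_clusters / clusters containing the word),
    so words that occur in every cluster (e.g. "a", "the") score zero.
    """
    # Lowercase and split every caption into alphabetic tokens, grouped by cluster.
    tokens = {
        cluster: [w for cap in caps for w in re.findall(r"[a-z]+", cap.lower())]
        for cluster, caps in captions_by_cluster.items()
    }

    # Count in how many clusters each word appears at least once.
    cluster_freq = Counter()
    for words in tokens.values():
        cluster_freq.update(set(words))

    n_clusters = len(tokens)
    result = {}
    for cluster, words in tokens.items():
        term_freq = Counter(words)
        scores = {
            w: count * math.log(n_clusters / cluster_freq[w])
            for w, count in term_freq.items()
        }
        # Sort by descending score, then alphabetically so ties are deterministic.
        ranked = sorted(scores, key=lambda w: (-scores[w], w))
        result[cluster] = ranked[:k]
    return result
```

Since only the dominant words survive this step, small differences in caption wording would not matter much, which is why I am happy to trade caption accuracy for speed.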