-
Notifications
You must be signed in to change notification settings - Fork 566
Closed
Labels
Description
Originally posted by @KennethEnevoldsen in #3837
Code for calculation:
If the model can be loaded from transformers/senetence-transformers then load the model and use below code:
import numpy as np
from transformers import AutoModel
model = AutoModel.from_pretrained(model_name, trust_remote_code=True)
emb = model[0].auto_model.get_input_embeddings()
meta.n_embedding_parameters = int(np.prod(emb.weight.shape))
If the model loaded using CrossEncoder then:
import numpy as np
from sentence_transformers import CrossEncoder
model = CrossEncoder("cross-encoder/ms-marco-MiniLM-L6-v2")
emb = model.model.get_input_embeddings()
n_embedding_parameters = int(np.prod(emb.weight.shape))
List of models for which n_embedding_parameters is to be evaluated is as follows:
- ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-3B-v1
- ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-7B-v1
- BAAI/bge-visualized-base
- BAAI/bge-visualized-m3
- ByteDance/ListConRanker
- Bytedance/Seed1.6-embedding
- GritLM/GritLM-8x7B
- NovaSearch/stella_en_400M_v5
- QuanSun/EVA02-CLIP-B-16
- QuanSun/EVA02-CLIP-L-14
- QuanSun/EVA02-CLIP-bigE-14
- QuanSun/EVA02-CLIP-bigE-14-plus
- Salesforce/SFR-Embedding-Code-2B_R
- Salesforce/blip2-opt-2.7b
- Salesforce/blip2-opt-6.7b-coco
- Snowflake/snowflake-arctic-embed-m-long
- TencentBAC/Conan-embedding-v2
- TomoroAI/tomoro-colqwen3-embed-4b
- TomoroAI/tomoro-colqwen3-embed-8b
- ai-sage/Giga-Embeddings-instruct
- baseline/bm25s
- consciousAI/cai-stellaris-text-embeddings
- deepvk/USER2-base
- deepvk/USER2-small
- facebook/SONAR
- facebook/dinov2-base
- facebook/dinov2-giant
- facebook/dinov2-large
- facebook/dinov2-small
- facebook/webssl-dino1b-full2b-224
- facebook/webssl-dino2b-full2b-224
- facebook/webssl-dino2b-heavy2b-224
- facebook/webssl-dino2b-light2b-224
- facebook/webssl-dino300m-full2b-224
- facebook/webssl-dino3b-full2b-224
- facebook/webssl-dino3b-heavy2b-224
- facebook/webssl-dino3b-light2b-224
- facebook/webssl-mae1b-full2b-224
- facebook/webssl-mae300m-full2b-224
- ibm-granite/granite-vision-3.3-2b-embedding
- intfloat/mmE5-mllama-11b-instruct
- jinaai/jina-clip-v1
- jinaai/jina-colbert-v2
- jinaai/jina-reranker-v2-base-multilingual
- jxm/cde-small-v1
- jxm/cde-small-v2
- kakaobrain/align-base
- laion/CLIP-ViT-B-16-DataComp.XL-s13B-b90K
- laion/CLIP-ViT-B-32-DataComp.XL-s13B-b90K
- laion/CLIP-ViT-B-32-laion2B-s34B-b79K
- laion/CLIP-ViT-H-14-laion2B-s32B-b79K
- laion/CLIP-ViT-L-14-DataComp.XL-s13B-b90K
- laion/CLIP-ViT-L-14-laion2B-s32B-b82K
- laion/CLIP-ViT-bigG-14-laion2B-39B-b160k
- laion/CLIP-ViT-g-14-laion2B-s34B-b88K
- malenia1/ternary-weight-embedding
- microsoft/LLM2CLIP-Openai-B-16
- microsoft/LLM2CLIP-Openai-L-14-224
- microsoft/LLM2CLIP-Openai-L-14-336
- nomic-ai/colnomic-embed-multimodal-3b
- nomic-ai/colnomic-embed-multimodal-7b
- nomic-ai/modernbert-embed-base
- nomic-ai/nomic-embed-text-v1
- nomic-ai/nomic-embed-text-v1-ablated
- nomic-ai/nomic-embed-text-v1-unsupervised
- nomic-ai/nomic-embed-text-v1.5
- nomic-ai/nomic-embed-text-v2-moe
- nomic-ai/nomic-embed-vision-v1.5
- nyu-visionx/moco-v3-vit-b
- nyu-visionx/moco-v3-vit-l
- openai/clip-vit-base-patch16
- openai/clip-vit-base-patch32
- openai/clip-vit-large-patch14
- samaya-ai/promptriever-llama2-7b-v1
- samaya-ai/promptriever-llama3.1-8b-instruct-v1
- samaya-ai/promptriever-llama3.1-8b-v1
- vidore/colSmol-256M
- vidore/colSmol-500M
- vidore/colpali-v1.1
- vidore/colpali-v1.2
- vidore/colpali-v1.3
- vidore/colqwen2-v1.0
- vidore/colqwen2.5-v0.2
- voyageai/voyage-multimodal-3
Reactions are currently unavailable